Subscribe to Angel Blog Reviews Subscribe to Angel Blog Reviews's comments

Posts tagged ‘search-engine’

Whatever Google’s reason for threatening to leave China , the latest numbers from StatCounter , a free online stat service, show that it certainly isn’t for lack of opportunity. Google has been steadily gaining ground on China’s premier search service, Baidu, since last August. Sponsor According to StatCounter , Google held just 28% of China’s search market back in August and since has climbed to nearly 43%. That’s a 15-percentage-point gain in just four or five months. Yahoo and Bing account for just over 1% of China’s search engine market. While estimates put Google’s projected 2010 income from business in China at around 2% of the company’s entire revenue, the long-term implications of pulling out of the country are much larger . If any company should just give up the ghost and get out of China before making any more PR (and human rights) gaffes , maybe it’s Yahoo. Discuss

a0367be0d0200902.jpg Google Gained Ground on Baidu in 2009

Visit link:
Google Gained Ground on Baidu in 2009

One hour ago, three emergency vehicles responded to a report of an unconscious person at the world headquarters of Nike Inc. in Portland, Oregon. How do I know? An automated form-pumping robot from startup company Nozzl Media told me. Nozzl Media today unveiled a demonstration of its first product, a widget intended for newspaper websites seeking to display real-time local information derived from Twitter messages, blog posts and automatically extracted public records like restaurant health inspections, building reports and public safety emergency responses. It’s like a little robot reporter and the company plans on offering it as a mobile app in the future as well. Nozzl raises questions, though, about what constitutes news and whether or not human reporters are expendable in the news process. Sponsor Nozzl was founded by a team of ex-newspaper reporters and engineers. They got out when it was clear the newspaper industry was in trouble, but now they aim to give something back by bringing together the real-time, programmatic possibilities of the web with the reporting of the newspapers. Reporters have for decades written programming scripts that repeat database queries over and over again to extract public data for bulk analysis. The Nozzl team has taken that to the next level and combined it with new social media. The company put up a demo page for Portland, Oregon news that anyone can look at today. The public records streams are the big value-add and are fully customizable per newspaper. Visitors can then type live filter terms into the box at the bottom of the widget to zero in on topics of interest to them. That’s the nozzle in Nozzl Media. User Experience is Hard For Robots Unfortunately, there are two big issues here. First, the flow of Tweets is overwhelming and undifferentiated. On the demonstration site you see almost nothing else unless you can think of something to filter for. If automated Twitter feeds hold value for local news, they will probably require some smart pre-proccessing before being presented to the reading public. There have always been people who like to listen to police radio scanners. Myself, I like to read restaurant health inspection reports, building permit applications and liquor license applications. If Nozzl had some categories I could choose between, that would be very helpful. The second, and more interesting, problem is that the public records that are extracted are exciting in theory but relatively unreadable in practice. The truth is, Nozzl didn’t exactly tell me that there was an unconscious person reported at Nike HQ today – it told me with code from a form that there was an UNCONS/UNRESPONSIVE report at 1 SW Bowerman Drive, in Portland. A little Mad-Libs style transformation of forms into human-readable sentences and some pre-fetching of names associated with addresses could go a long way. Run the name associated with that address through a News search engine and tell me if its an entity that’s been reported on in the past – if so then it’s probably high-priority news to push live again. The company needs to put these machine-readable pages it displays into coherent English sentences, or find some other solution. Don’t Forget the Humans Ambulances to Nike’s Headquarters to help someone unconscious today? That sounds like it could be news. Even if the technology presented the information this clearly – it may take a human eye to pick this out of a list of automatically captured ambulance reports. Having a human available to pick up a phone, call Nike HQ and ask who was found unconscious there this afternoon would add another element of value to this data – but that’s not what Nozzl is looking to do. The company is serving up raw data to news consumers. In the end, human reporters and raw robot feeds sound like a great combination. That appears to be what Nozzl is aiming to create by offering its widgets to established news organizations. The company says that a mobile application could be in its future, too. That’s something I’m very excited about. Be it a widget or a mobile app, Nozzl’s robot reporters need more polish before they are ready to win back the hearts of fast-leaving newspaper readers. As a picture of the future, though – Nozzl is very inspiring. Interested in what companies like Nozzl Media mean for the future of the web? Check out our profile of Nozzl and ten other case study companies in our recent research report The Real-Time Web and Its Future . Discuss

 Welcome to the Age of Robot Reporters

View original post here:
Welcome to the Age of Robot Reporters

If you’ve ever believed that semantic search is meant exclusively for researchers, then Meaning Tool might prove you wrong. Through Popego , the semantic search engine allows you to add your online profile and interests such as “gadgets” or “current news”. From here, Meaning Tool serves you entertaining content from across your social graph. ReadWriteWeb took a look at how the tool works and how it just might bridge the gap between scholars and social media junkies. Sponsor Meaningtool – Demo from Popego on Vimeo . Meaning Tool is a semantic engine that offers users a chance to extract concepts from text using specific semantic trees. As mentioned, you define your categories of interest by creating search parameters and training them with related websites or RSS feeds. Similar to Open Calais , the service appears to use the linked data standard to retrieve data via dereferenceable URIs on the web . From there you can search text in any romantic language to produce relevant words and categories. Categories such as “technology” and “security” are then shown in a pie chart to represent the percentage of relevancy the text has to these key categories. The system also offers a tag cloud of relevant keywords and key concepts. And finally, Meaning Tool extracts entities such as mentioned companies, people and places. Unlike many other semantic search services, your satisfaction with results as a researcher, marketer or general consumer weigh heavily on how you train the system. To find out more about the semantic web, check out For more info on the semantic web, check out our article on semantic search’s myths and realities . To add some of your own interests to Meaining Tool visit

meaningtool logo dec09 Meaning Tool: Training Semantic Search With Feeds

Continue reading here:
Meaning Tool: Training Semantic Search With Feeds

German company apnoti.com has today launched what it claims is the world’s first mobile price search engine with integrated barcode scanner. It’s an iPhone app called iBARCODE , which enables users to scan product barcodes and find the best prices. It currently only works in the U.S., U.K, Germany and France. To scan, the user simply holds the iPhone over a product’s barcode. The integrated camera in the iPhone essentially takes a photograph of the barcode, from which the app recognizes the product and finds the best offers on the Internet for it. Sponsor These are located via the recently launched real-time price search engine smart.apnoti.com , which the company says has more than 65 million product offerings from more than 10,000 online shops worldwide. The mobile application recognises the following kinds of barcodes: UPC8, UPC12, EAN and ISBN. These barcodes can be found on a range of products, including cameras, audio and hi-fi devices, computers, furniture, books, DVDs and medicines. The iPhone app also enables users to store and send product watchlists. Last year we ran a series entitled The Scannable World: Barcode Scanning In The Real World . But we’re only now beginning to see user-friendly consumer applications that allow scanning of barcodes via mobile phone. Next page: screenshots of iBARCODE Discuss

ibarcode logo iBARCODE: Real Time Price Search Engine Launches

Read the rest here:
iBARCODE: Real-Time Price Search Engine Launches

The real-time web was hot this year and it’s likely to become a standard expectation on sites all around the world next year. We’ve tracked this trend extensively with a face-to-face summit of industry leaders and an 84-page research report on The Real-Time Web and Its Future . Who were the big movers and shakers in real time this year? Check out our list of the top 10 below and let us know if there are any important ones we missed. Sponsor ReadWriteWeb’s Best Products of 2009: Pubsubhubbub Pubsubhubbub , created as a 20% project by Googlers Brett Slatkin and Brad Fitzpatrick, is described as “a simple, open, server-to-server web-hook-based pubsub (publish/subscribe) protocol as an extension to Atom and RSS.” It delivers updated content in real-time from a pinged hub server out to all subscribers that have requested updates. Real-time PubSubHubbub feeds are already being published by FeedBurner, Blogger, LiveJournal, LiveDoor, Google Alerts, Feedoor and the feed republishing service Superfeedr. Facebook’s FriendFeed, LazyFeed and the newest version of Netvibes are consuming Hubbub feeds so far, as are a number of small sites and services that are using the feeds for machine-to-machine communication. Hubbub consuming applications are reporting server traffic savings of up to 85% and engineers love it . RSSCloud RSSCloud is a technology that’s been a part of the RSS 2.0 spec for years but got a new burst of development energy this year when creator Dave Winer began working on it in part as a way to create a decentralized Twitter experience. RSSCloud is similar to Hubbub, is often implemented in conjunction with it but doesn’t deliver full content updates with the notification of changes to a feed. The first major move to adopt RSSCloud was by blog publisher WordPress . The latest addition to the technology is a new feature called CloudPipe , which will enable delivery of real-time feeds to desktop and mobile clients, even behind a firewall. Creator Dave Winer has been a key figure in an incredible number of the most important technologies of the read/write era of the web. He created the first popular blogging software (Radio Userland), was the first to enable podcast delivery in an RSS feed visa-vi the now standard method of enclosures, he built the web’s leading blog ping server (weblogs.com), he ushered RSS into the mainstream, he created the format for sharing bundles of RSS feeds and other outlines (OPML), he wrote the XML-RPC framework (predecessor of SOAP) and the MetaWeblog API for remote blog management. Now Dave Winer is working on real-time web technology and we’d be fools to not watch what he’s doing. Facebook Facebook, for all its shortcomings, has turned more than 200 million new people on to real-time streams of content pushed to their browsers in 2009. If you think this paradigm is important, Facebook deserves a medal. Google Real-Time Search Honorable Mentions Echo – real-time comment aggregation Evri – real-time semantic news tracker Lazyfeed – topical discovery engine Netvibes – now probably the most popular real-time consuming feed reader in the world Just this week the Big G showed of its new real-time search feature . It kills what Bing and Yahoo are doing. It’s simple but elegant and effective. For certain search queries, real time web pages, Twitter updates, Facebook content, MySpace updates and more will appear in a subtle, streaming box in your results page, with a pause button. It’s not live on the public site yet, it’s just a demo, but it’s going to be very, very big next year. Big enough that it belongs on the list this year just for being demoed. Twitter search Whether you’re watching brand mentions for your work or participating in a semi-obscene public ritual of riffs on a trending meme – millions of people now regularly watch the real-time updates on Twitter search results pages. Twitter bought a search engine called Summize in July of 2008, built by a group of former AOL scientists and originally intended to be a sentiment analysis technology. It has become incredibly important this year. When the site’s new GeoLocation API gets put to more substantive use, that search engine is going to become all the more important – in ways that could change our day-to-day lives. Next page: Top 10 Real-Time Technologies of 2009 6-10 Superfeedr Julien Genestoux’s Superfeedr is a service that pulls in content feeds from around the Web and then offers updates for those feeds in XMPP or PubSubHubbub format. It’s like FeedBurner for the real-time web and in fact just added publisher analytics ala FeedBurner today . Superfeedr is a key enabler for other applications and if you want an interesting view into the nitty gritty of the real-time web, you should go subscribe to the Superfeedr company blog right now. Genestoux says the companies using his service so far include SixApart, Adobe, Twitterfeed, StatusNet and a number of small services such as Webwag, EventVue, Quub, AppNotifications, Excla.im and SmackSale. That’s an impressive list and your company could well be on it by next year. Tornado This September, Facebook open-sourced the newly acquired FriendFeed’s real-time infrastructure. It’s a fast, relatively easy way to add real-time flow to your application and developers around the world are excited about it. We’re all about the potential here at ReadWriteWeb and we think Tornado has a lot of it. We hope to see big things from this project next year. Breaking News Online’s iPhone App Breaking News Online is an international news organization founded by now 19 year old Netherlands native Michael van Poppel. Van Poppel somehow sold a video of Ossama Bin Laden to Reuters two years ago and has since built up the fastest, smallest news organization on the planet. The American Red Cross watches BNO closely for notices of new natural disasters. MSNBC paid what have been a hefty sum for control over the Breaking News Online Twitter account this month, but the organization’s iPhone app lives on in the hands of the original organization. It’s a simple app but one that will keep you on top of world events around the clock like nothing else. It’s a great use of the iPhone’s new Push feature, implemented this year. Aardvark Aardvark is a social search engine that combines artificial intelligence, natural-language processing and presence data to create what the company calls “the real-time Web of people.” It’s got some heavy engineering behind it and this author uses it almost every day. Google is reportedly in the process of trying to buy it. Cliqset We love a good technical standard and stream reader startup Cliqset is blazing new trails with its new real-time ActivityStreams feed normalization API . The API means activities from 70 different social services can be read in a common language and 3rd party services can slice and dice them to create new user experiences. Several high-profile applications have already begun consuming activity feeds republished through Cliqset and the company says many more consumers are in the works. This is the stuff that distributed, interoperable platforms are built on, where small innovators have access to economies of scale. Those are our picks! Check them out, let us know who we missed and get ready for a coming time when most of the web will be running in real time! Discuss

77c7374ce911350.png Top 10 Real Time Technologies of 2009

View original post here:
Top 10 Real-Time Technologies of 2009

The rapid growth of the mobile web is a force that could be disruptive to Google, a company that built its search engine for a desktop-based world. On the handheld, all bets are off. Anyone with an innovative concept for improving mobile search could gain ground, possibly even overtaking Google as the top search provider for mobile devices. But don’t worry – Google hasn’t been ignoring this trend. The company has been busy prepping various initiatives designed to get people googling from their mobile phones. From scannable barcodes to an innovative visual search app that lets you perform searches by taking photos, Google is slowly revealing how they plan to dominate search in the real world too. Sponsor Google Does Barcodes (Again) Google hasn’t given up on barcode scanning just yet. Although a failed Print Ad program featuring barcodes for newspapers was shut down at the beginning of the year, that hasn’t stopped the company from giving barcodes another go. This time, the venue isn’t the old-fashioned newspaper, but local businesses. Through Google’s Favorite Places program, over 100,000 of the U.S.’s most popular local businesses will receive stickers sporting Google’s logo, a scannable barcode and a message reading “we’re a favorite place on Google.” Business owners can post these decals to their store windows to show off their respectability and popularity – and you can bet many will. Customers scanning the barcode will be taken to that store’s “place page” which reveals various details about the business including hours of operation, reviews, photos, directions, phone numbers, brands carried, menus (if a restaurant), and even mobile coupons if available. In addition, users can “star” (rate) the establishment and submit their own review, if desired, turning Google Local Businesses into a Yelp -like user-generated reviews service. While this initiative has a better chance for success in introducing barcode-scanning to the U.S. market than the Print Ad program did, there’s still going to be some confusion on the part of consumers as to how to get started. Google notes in their Favorite Places FAQ that many modern smartphones including the iPhone, Blackberry, Droid, and other Android devices offer barcode scanning applications, but no links or suggestions are provided. This leaves consumers with having to figure it out on their own. In addition, feature phone owners whose more basic devices include cameras may also wonder if there is software for their phones, too. In some cases there is , but the less tech-savvy mainstream user base has no way of discovering that without taking the time to do some research on the topic. Perhaps Google should have introduced a cross-platform barcode-scanning application of their own? If they had, it could have definitely helped push the technology adoption forward. It’s almost surprising that they haven’t yet done so especially considering that their latest search rival, Microsoft, has. With Microsoft Tag , for example, you can create your own barcode-like “Tag images” as well as download mobile Tag-reading software. Mobile Coupons As mentioned above, the Favorite Places’ barcodes will link to pages that support mobile coupons, assuming the business chooses to offer them. However, these coupons aren’t limited to “favorite” businesses – any business listed on Google Local Search can use this feature. Announced late last month , Google introduced the mobile coupon feature to their Google Local Business Center program which lets any company offer coupons that consumers can access right from their mobile phone. At checkout, the shopper just needs to show the coupon on their mobile’s screen to receive the discount. Visual Search via Mobile Photos Google Visual Search is an upcoming technology still in development which was revealed on CNBC’s ” Inside the Mind of Google ” segment on December 3rd. This innovative mobile application aims to provide an even more intuitive way for interacting with the real world via your mobile phone. With Visual Search, users with phones running Google’s own mobile operating system “Android” will be able to take a photo of their location and use that to trigger a Google search. In order for this to work, advanced algorithms have to match the photo with those stored in a massive database on the backend. Initially, this service could be used to provide information about various landmarks, businesses, or other notable locales, but really the possibilities are endless. Eventually, the same technology that recognizes landmarks could recognize other objects, too, like products on store shelves, billboard ads, or street intersections. It could even

About 18 months ago, we wrote about an obscure search startup from Germany called FAROO . We believed that its radical alternative, using peer-to-peer (P2P) technology, had a shot at being a real disruptive force. Today, it has made some progress, has raised some money and is getting out into the market. (Disclosure: FAROO is currently a ReadWriteWeb sponsor). FAROO is wisely underplaying P2P in its marketing, preferring more fashionable terms such as “real-time search” and “social discovery.” But the P2P technology drives it. Sponsor So, we decided to invite someone who understands P2P at a technical level to interview Wolf Garbe, FAROO’s founder. Our tech expert, Kiril Pertsev, of Agily Networks , has already written about P2P for us in the past . Kiril: Why .NET? Did you already have development resources or did you make this choice because you consider it a better option for networked desktop applications? Would you make this choice again? And if you’re not satisfied with .NET, what would your platform of choice be, given all of your experience over the past few years? Wolf: I come from Delphi (Object Pascal). So, the choice of C#/.NET was a dedicated decision for a new platform, not driven by legacy. When I started to work on the first prototype in 2004, Delphi moved towards .NET. I preferred to go with the original, especially because the development of C# was led by Anders Hejlsberg, the designer of Borland’s Turbo Pascal (which Delphi derived from). Of course, I also looked into Java, which I found quite similar, both from the language perspective (C# vs. Java) and the JIT Runtime environment (Java Virtual Machine vs. .NET Runtime). The decision for C# was based on the dominating desktop market share of Windows and the assumption that embedding the .NET framework into the OS would ensure fast penetration of .NET. This only partially came true, partly due to the limited success of Vista, which was the first Windows version with .NET pre-installed. Kiril: Doesn’t this choice hinder your ability to move to Mac and Linux platforms. Wolf: We were betting on Mono for platform compatibility. Unfortunately, Mac OS X still has no Mono application launcher, other than starting with the terminal, which is not feasible for a mass market. With the increasing importance of the Mac OS X platform, I expect this to change. Silverlight today is already natively available for Mac. For the ultimate platform independence, we are also continually observing the diverse RIA developments (AJAX, AIR, Silverlight, Mozilla Prism, HTML 5 persistent web storage, Mozilla’s DOM storage, Google Gears and Flash persistent storage), which could one day allow us to remove the download and installation step for P2P. But so far, no solution meets all of the requirements: out-of-browser capability, permanent background operation, auto-start option, tray icon support, cross-domain connection support, persistent storage, accepting an incoming connection and receiving data and NAT traversal. Kiril: If you become dissatisfied with .NET, what would be your next platform of choice. Wolf: Although not everything went as expected, I still believe that .NET is a very powerful platform, and C# as a language is evolving at a much faster and broader pace than Java. Today, we have a good .NET penetration rate in the US and Europe. With Windows 7, I expect that to increase in Asia as well. Kiril: I see that you’re using a pretty simple P2P communication technology instead of sophisticated Hamachi-like NAT traversal using UDP hole punching. Wolf: I suppose you are referring to the transport layer, which is HTTP over TCP/IP. The real P2P overlay protocol on top of that is not that simple anymore. Because our distributed search engine system architecture breaks with almost all legacy paradigms, we thought it would be a good idea that it be at least based on proven and widely used standards wherever possible. There are several reasons for this: It reduces complexity and development time. It improves compatibility (there is probably no protocol more widely used than HTTP over Port 80). It’s unlikely that this connectivity will break anytime soon by changes in protocols, OS, drivers or hardware. Behaving like a standard browser from the protocol view makes the application less vulnerable to filtering, blocking or traffic shaping and ensures that it even works in most corporate environments. NAT traversal is the most critical issue for every P2P application. It’s really a shame that although the Internet is built on a distributed foundation, end-to-end connectivity between users in a decentralized way is completely broken . We are using several NAT traversal techniques: Manual Port Forwarding, Automatic Port Forwarding via UPnP and Teredo. Teredo is a IPv6 Tunneling technology , standardized according to RFC4380 . Teredo is part of Windows XP, Vista, and Windows 7; with Miredo , there is also an open-source implementation for Linux and Mac OS X available. Microsoft reports that with Teredo, the chance of a connection between two peers increases from 15% to 84% (PDF link). Our observations are somewhere between 60% and 70%. Teredo is quite sophisticated technology and is a more universal approach. It provides connectivity at the OS level, in contrast to having several applications in use, where each uses its own proprietary traversal technology. Kiril: Could you please elaborate on choosing network technology, having achieved a substantial number of users and collecting usage statistics. Do you know how many active and passive peers you have at any given time? What is the ratio? Wolf: We have solid insight into the state of our P2P network. We know the number of active and passive peers on any given day (using the log from our update server). The active peer ratio is between 60 and 70%. We are also currently working on an improved distributed intraday statistic. (The distributed statistic currently built into the P2P client is not valid anymore for the increased network size. For scalability, every peer has only a limited view of the whole network, which requires more advanced methods for calculating the actual network size.) Kiril: Your search index essentially is a distributed storage system with DHT addressing, right? Wolf: Yes. Kiril: Have you thought about other uses of such technology, beyond search: back-up, private distributed storage, file-sharing, etc. (like Wuala )? In a publication from 2001 (in German), in which I also outlined the idea of a peer-to-peer search engine, this was part of an integrated solution with a P2P Web server, P2P file-sharing and a P2P anonymizer. Due to various legal copyright issues, we are currently not looking into file-sharing. But from a technological standpoint, a distributed storage system is quite universal, from storing a search engine index to attention data, Web pages, instant messages, social network profiles, micro-blogging messages, back-ups and files. Kiril: Could you please share your vision of the mythical “P2P operating system,” now that we already have P2P networking, P2P processing, P2P storage and P2P applications (like search). Wolf: P2P and distributed architectures are a universal principle that the whole Internet is built upon. Unfortunately, distributed technologies like Mail, IRC, Usenet and even independent Web servers are being increasingly replaced by centralized solutions (the cloud, Google Wave, etc.). Despite the obvious short-term convenience, this leads to long-term monopolies and dependencies and makes the Internet infrastructure more vulnerable in terms of reliability and political influence. I believe that a solid, standardized P2P stack integrated in the operating system can fix the broken end-to-end connectivity between users, enabling the use of an endless amount of latent storage, memory, processor cycles and bandwidth. Distributed storage is certainly a core component, as is distributed processing to make more sense of all of the data. On top of this, there should be a distributed programming framework, which enables the development of distributed applications and the distribution and aggregations of tasks in a standardized manner (e.g. the distributed version of MapReduce/Hadoop is part of this). A distributed attention data repository, shared by all applications, but under full user control. There should be resource management that puts the user in full control of the amount of resources she or he would like to dedicate to a particular distributed project—possibly combined with a ratio system and/or virtual currency to maintain a healthy usage to contribution ratio. Distributed identity management and authentication, authorization and access control. This could replace most of the centralized cloud solutions by delivering the same convenience and scalability in a decentralized way. BOINC (the universal distributed processing platform where seti@home runs today) goes partially in that direction. This is partially because the peers contribute, by taking tasks from a centralized server and providing results back to this server. But this system is not fully distributed, nor are the results intended to be used by the peers themselves. Kiril: Do you encounter scalability issues? Do you have any single point of failure resources in your P2P network? How reliable is it—meaning, what percentage of the network could you lose without seriously degrading search quality and performance? Wolf: We have scaled the P2P network in a controlled way. While we have made some scalability-related adjustments to our P2P protocol, the core algorithms proved that there are no inherent scalability limits. Due to our fully distributed architecture, we have no single point of failure. We have twenty-fold redundancy of each item, which replicates automatically if peers leave the network. Only if all 20 copies of the item are removed at the same time would this piece of information be lost. This leads to a mean information lifetime of 120 years under realistic churn (i.e. the peers randomly joining and abandoning the network temporarily or permanently). This is more than sufficient for search, where 50% of the information changes during the year (and is therefore refreshed anyway at a much higher rate). Kiril: Do you think that “mobile P2P” is feasible? What would you say about implementing P2P search (or any other application) on, say, the iPhone? Are mobile terminals ready for P2P? Are cell networks ready? Do they have enough CPU power, etc? Wolf: Today, we distinguish between mobile connectivity and landlines. But I believe this separation will fade away. Device performance, bandwidth and flat-rate pricing structures will become close. While today, processor cycles, memory and bandwidth in mobile phones are too precious for wide use of P2P applications, this will change. Even “walling off” tendencies and restrictive App Store policies will be liberated by regulation or user demand. But much more interesting than bringing file-sharing to the iPhone will be P2P applications that use mobility, possibly combined with GPS, distributed camera/augmented reality and RFID. This will bring P2P technology into completely new application fields. Think of distributed traffic control (peers could be users with iPhones in cars or the cars themselves) or applications to lead crowds of people at large public events or in disaster zones, as well as gaming, distributed weather and earthquake prognosis . Bluetooth could even make this independent of cell networks. Global communication between peers would be asynchronous through moving people . Also, cell network and Bluetooth mashups would be possible. In the near future, we will provide Web access to our P2P Web search for mobile users. They will just be passive users of the resources contributed by active PC users. Kiril: What is your vision of the P2P road map? Apparently, the first “killer P2P application” was file-sharing, Kazaa, then BitTorrent. Given that the next one is search, what would be the next after that? Wolf: As I mentioned, instead of another isolated P2P application, I would like to see P2P built into the OS and Internet stack in a standardized manner. So that an application can benefit from P2P without any specific effort, in the same easy and natural way that applications today use the Internet (HTTP, AJAX and JSON). Then, P2P technology would become ubiquitous and part of almost every application. Every application that uses cloud services today could benefit from such P2P technology. An example would be a distributed platform for micro-blogging services and social networks, heralding the end of walled gardens. But my personal vision is to combine P2P with the next thing after search. Twenty years ago, I wrote a small expert system on my C64 (today, a C64 emulator is on the iPhone!), using Predicate Logic and an Eliza -style natural-language interface. So, you could tell the system, ” All cats have claws. All tigers are cats. ” And then you could ask the system, ” Do tigers have claws? ” And it would answer, ” Yes! ” You could retrieve information and relationships that were not explicitly stored (or that anyone was even aware existed). At that time I had to enter every bit of information manually. Today, almost all information on earth is accessible on the Internet, together with comments and conversation streams. Predicate logic would be supported by fuzzy logic, statistical machine learning and more. Today, known translations are used to translate untranslated text. But this could be much more universal: using known connections in one field to explain unknown correlations in another . Such a system could autonomously formulate queries, combine facts, fill in the missing link in a theory to prove or falsify it. So, I think the next step after search will be reasoning; and in combination with P2P technology and distributed processing, this may bring us a kind of global brain. A brain that not only stores and retrieves information but that is capable of cognition and conclusion at a giant scale. It will discover hidden correlations and unknown facts and will answer questions with answers that cannot be found in any document. You can see, this is much more HAL than the next Google. Kiril: What technical feature of your technology are you most proud of? 100+ patents must something. Wolf: Most crucial has been to ensure both a quick response time and complete results for queries with multiple terms and phrases (only 15% of searches are single keywords) in a completely distribute P2P architecture. For queries with multiple keywords, we eliminated the need for the intersection of huge posting lists across different peers. While we had to invent a lot of things—just because they hadn’t been done before in a way that was required for distributed search—they are not all patented (so, we don’t own 100+ patents). Prior to funding, this would have been impossible financially. Kiril: How is your real-time search related to the P2P search? Does it also run on a distributed network? If so, then how do peers communicate results to the front page of the FAROO website? Wolf: Currently, we use a hybrid architecture. While we are building up our P2P network and use it for general Web search, in parallel we use a central index for the real-time data. The focus on the most recent and popular Web pages keeps the costs moderate. Attention data collected by FAROO peers serves also for real-time discovery and ranking (in addition to analyzing the Twitter stream). But we believe in a holistic approach. Our real-time search will evolve into an integral part of Web search and be fully based on our P2P architecture . There will still be a gateway/proxy server that enables Web access to our P2P network for those users not able or ready to install a P2P client (e.g. for mobile). Discuss

faroo logo Technical Q&A With FAROO Founder

See original here:
Technical Q&A With FAROO Founder