Web indexing provides new content more quickly – Google Caffeine launch


Caffeine, Google's new web indexing system, went live Wednesday. Announcing the global launch, Google said that its evolving search engine technology makes more freshly published web content accessible and delivers that new material faster than before. Nobody will need to change the way they use Google, but links to a broader range of relevant content now appear much sooner after the content is published. The Caffeine overhaul of the web indexing technology should also give Google more flexibility to keep pace with a web that is evolving at an accelerating rate.

Article Resource: Google Caffeine launch - web indexing delivers new content faster By Personal Money Store

Speed isn’t everything - Google Caffeine launch

Google said that the Caffeine launch delivers at least 50 percent fresher search results. That feature alone may be hard to translate into an advantage for the average Google user. PCWorld compared the two web indexing systems while Caffeine was in development and found that results took 0.15 seconds on the regular Google search and 0.09 seconds on Caffeine. No one can repeat that test now, since Caffeine is the regular Google search. And 0.06 seconds probably won't make much of a difference for searchers. However, what shows up 0.06 seconds faster will make a difference for those who work in content publishing.

Content publishing in real time

The average Google user will see the immediate benefit of fresher content. Google's Matt Cutts explained to Search Engine Land that "Caffeine benefits both searchers and content owners because it means that all content (and not just content deemed 'real time') could be searchable within seconds after it is crawled." As outlined by Search Engine Land, the old Google would crawl a set of pages, process those pages and add them to the index. The whole batch had to move through together rather than one page at a time. Now pages are processed individually.
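The difference between the two approaches can be illustrated with a small Python sketch. This is a toy model and not Google's implementation; the function names (`process`, `batch_index`, `incremental_index`) and the word-to-URL index structure are assumptions made up for illustration.

```python
from typing import Dict, Iterable, Iterator, List, Set, Tuple

# Toy index (hypothetical structure): word -> set of page URLs containing it.
Index = Dict[str, Set[str]]
Page = Tuple[str, str]  # (url, text)


def process(url: str, text: str) -> List[Tuple[str, str]]:
    """Turn one crawled page into (word, url) postings."""
    return [(word.lower(), url) for word in text.split()]


def batch_index(pages: Iterable[Page]) -> Index:
    """Batch style: no page is searchable until the whole batch is processed."""
    postings: List[Tuple[str, str]] = []
    for url, text in pages:
        postings.extend(process(url, text))
    index: Index = {}
    for word, url in postings:
        index.setdefault(word, set()).add(url)
    return index  # the batch is published all at once


def incremental_index(index: Index, pages: Iterable[Page]) -> Iterator[str]:
    """Per-page style: each page joins the live index right after processing."""
    for url, text in pages:
        for word, page_url in process(url, text):
            index.setdefault(word, set()).add(page_url)
        yield url  # this page is searchable now, before later pages are touched
```

In the batch version a page crawled first still waits for every other page in its batch. In the per-page version the live index already contains the first page while the rest are still being processed, which is the behavior the article attributes to Caffeine.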

Caffeine - huge storage capacity

Eliminating the delay between when Caffeine finds a page and when it makes that page available to the public calls for an astronomical amount of storage. On the official Google blog, Carrie Grimes said Caffeine indexes web pages on an enormous scale. Caffeine processes hundreds of thousands of pages in parallel every second. Paper pages processed at that rate would stack 3 miles high each second. Caffeine takes up nearly 100 million gigabytes of storage in one database and adds new information at a rate of hundreds of thousands of gigabytes per day. To store that much information, you would need 625,000 of the largest iPods; laid end-to-end they would stretch for more than 40 miles. The bill from Apple would be $155,625,000.
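The iPod comparison can be sanity-checked with a few lines of Python. The inputs are the figures quoted in this article; the per-iPod capacity, price and length are only what those figures imply, not numbers taken from Apple.

```python
# Figures quoted in the article.
TOTAL_GB = 100_000_000        # "nearly 100 million gigabytes"
IPOD_COUNT = 625_000          # "625,000 of the largest iPods"
TOTAL_COST_USD = 155_625_000  # "The bill from Apple"
STACK_MILES = 40              # "more than 40 miles" end-to-end

CM_PER_MILE = 160_934.4       # exact: 1 mile = 1,609.344 m

# Implied per-device values (derived here, not stated in the article).
gb_per_ipod = TOTAL_GB / IPOD_COUNT
usd_per_ipod = TOTAL_COST_USD / IPOD_COUNT
cm_per_ipod = STACK_MILES * CM_PER_MILE / IPOD_COUNT

print(f"{gb_per_ipod:.0f} GB per iPod")   # prints "160 GB per iPod"
print(f"${usd_per_ipod:.0f} per iPod")    # prints "$249 per iPod"
print(f"about {cm_per_ipod:.1f} cm per iPod")
```

The numbers are internally consistent: they imply a 160 GB device priced at $249 and roughly 10 cm long.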

Keeping up with Caffeine

The Google Caffeine launch doesn't change how people search the web or publish content. But Resource Shelf points out a significant detail: what a searcher finds can change from day to day. This is because pages are being refreshed much more frequently, and the cache is being updated more frequently as well. If a searcher needs the content of a page the way it looked at noon on Wednesday, it’s a good idea to save a copy with something like Zotero, a Firefox extension, because by 12:15 p.m. on Wednesday the content might have changed when the cache was updated.

Read more on this topic here

PC World



Search Engine Land: searchengineland.com/googles-new-indexing-infrastructure-caffeine-now-live-43891

Official Google Blog