« Perils of Strong Copyright | Main | Mark Up and Mark Down »

The Secret Source of Google's Power

Speculation about Gmail has inspired Rick Skrenta's blog essay on Google. They don't just have the world's best search engine, they have the world's largest and most scalable platform for developing huge web-based applications. [Via Simon Willison]

Google has taken the last 10 years of systems software research out of university labs, and built their own proprietary, production quality system. What is this platform that Google is building? It's a distributed computing platform that can manage web-scale datasets on 100,000 node server clusters. It includes a petabyte, distributed, fault tolerant filesystem, distributed RPC code, probably network shared memory and process migration. And a datacenter management system which lets a handful of ops engineers effectively run 100,000 servers. Any of these projects could be the sole focus of a startup. ...

Google is a company that has built a single very large, custom computer. It's running their own cluster operating system. They make their big computer even bigger and faster each month, while lowering the cost of CPU cycles. It's looking more like a general purpose platform than a cluster optimized for a single application.

While competitors are targeting the individual applications Google has deployed, Google is building a massive, general purpose computing platform for web-scale programming.

Posted by Tom on April 12, 2004