Hadoop, a Free Software Program, Finds Uses Beyond Search
New York Times (03/17/09) P. B3; Vance, Ashlee
Hadoop software has quickly become widely used by the top search engines and other Web sites to analyze and access the unprecedented amounts of data created by the Internet. The free program maps information over thousands of computers and offers a simpler method for writing analytical queries, thus enabling users to explore data by simply asking a question. "It's a breakthrough," says Lawrence Livermore National Laboratory's Mark Seager. "I think this type of technology will solve a whole new class of problems and open new services."

Hadoop is based on MapReduce technology developed by Google. MapReduce, when paired with the file management technology Google uses to catalog the Web, can be used to index the entire Internet on a regular basis and analyze the vast amounts of information to determine the quality of search results and how people use the company's various services. MapReduce makes it possible to break large sets of data into small pieces, which can be spread across thousands of computers, ask the computers questions, and then receive cohesive answers. Google has largely kept the MapReduce technology a secret, but the company published papers on some of the underlying techniques, which software consultant Doug Cutting used to create Hadoop.

Hadoop can track people's behavior to see what types of stories and content they view, and then match ads with that content. Microsoft uses Hadoop to improve its search system, and Facebook uses the program to determine how closely linked people are based on who appears in users' photographs.
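
The break-apart, ask, and combine pattern described above can be illustrated with a small word-count sketch in Python. This is not Hadoop's API or Google's implementation; it runs in a single process, and the function names (`split_into_chunks`, `map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical names chosen for illustration. Each chunk stands in for the slice of data one computer would handle.

```python
from collections import defaultdict


def split_into_chunks(records, num_chunks):
    """Deal records round-robin into num_chunks pieces.

    Each piece is a stand-in for the data one machine would receive.
    """
    chunks = [[] for _ in range(num_chunks)]
    for i, record in enumerate(records):
        chunks[i % num_chunks].append(record)
    return chunks


def map_phase(chunk):
    """Emit a (word, 1) pair for every word in one chunk of text lines."""
    pairs = []
    for line in chunk:
        for word in line.lower().split():
            pairs.append((word, 1))
    return pairs


def shuffle(all_pairs):
    """Group the emitted values by key so each word's counts sit together."""
    grouped = defaultdict(list)
    for key, value in all_pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Combine each word's partial counts into one total."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(records, num_chunks=3):
    """Run split -> map -> shuffle -> reduce sequentially in one process."""
    chunks = split_into_chunks(records, num_chunks)
    mapped = [pair for chunk in chunks for pair in map_phase(chunk)]
    return reduce_phase(shuffle(mapped))


if __name__ == "__main__":
    # Made-up sample lines, used only to exercise the sketch.
    docs = [
        "hadoop maps data",
        "data spread over computers",
        "hadoop answers questions",
    ]
    print(word_count(docs))
```

In a real cluster the map step would run on many machines at once and the grouped pairs would travel over the network; the sketch keeps only the logical flow of pieces, per-piece answers, and a combined result.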
Tuesday, March 17, 2009