CMU Research Finds Regional Dialects Are Alive and Well on Twitter
Carnegie Mellon News (PA) (01/06/11) Byron Spice
Regional slang is evident in Twitter postings, but such dialects appear to be evolving in social media, as determined by a Twitter word usage analysis method developed by Jacob Eisenstein and colleagues in Carnegie Mellon University's (CMU's) Machine Learning Department. Eisenstein says Twitter offers a new means of examining regional lexicon, since tweets are informal and conversational, while tweeters using cell phones can opt to tag their messages with global positioning system coordinates. The CMU researchers collected seven days' worth of Twitter messages in March 2010, and chose geotagged messages from Twitter users who wrote at least 20 messages. From this they generated a database of 9,500 users and 380,000 messages. The team used a statistical model to identify regional variation in word use and topics that could predict the geographical whereabouts of a microblogger in the continental United States with a median error of approximately 300 miles. The researchers can only speculate on the profiles of the microbloggers, and Eisenstein says it is reasonable to assume that users who send many tweets via cell phone are younger than average Twitter users--and this appears to be mirrored by the subjects these users tend to discuss. Through automated analysis of Twitter message streams, linguists can observe the real-time evolution of regional dialects.
Thursday, January 6, 2011
Blog: CMU Research Finds Regional Dialects Are Alive and Well on Twitter
Subscribe to:
Post Comments (Atom)
Blog Archive
-
►
2012
(35)
- ► April 2012 (13)
- ► March 2012 (16)
- ► February 2012 (3)
- ► January 2012 (3)
-
▼
2011
(118)
- ► December 2011 (9)
- ► November 2011 (11)
- ► October 2011 (7)
- ► September 2011 (13)
- ► August 2011 (7)
- ► April 2011 (8)
- ► March 2011 (11)
- ► February 2011 (12)
-
▼
January 2011
(15)
- Blog: Testable System Administration
- Blog: I, Algorithm: A New Dawn for Artificial Inte...
- Blog: How Watson Works: A Conversation With Eric B...
- Blog: A Clearer Picture of Vision
- Blog: Review: My Amazon Kindle Single publishing e...
- Blog: Cloud Robotics: Connected to the Cloud, Robo...
- Blog: Challenging the Limits of Learning [... lang...
- Blog: Beating the Competition [a single new connec...
- Blog: IBM Computer Gets a Buzz on for Charity Jeop...
- Blog: Fruit Fly Nervous System Provides New Soluti...
- Blog: CMU Research Finds Regional Dialects Are Ali...
- Blog: Apache Object-Oriented Data Project Goes Top...
- Blog: U.Va. Computer Scientists Look to Biological...
- Blog: Mathematical Model Shows How Groups Split In...
- Blog: The Surprising Usefulness of Sloppy Arithmetic
-
►
2010
(183)
- ► December 2010 (16)
- ► November 2010 (15)
- ► October 2010 (15)
- ► September 2010 (25)
- ► August 2010 (19)
- ► April 2010 (21)
- ► March 2010 (7)
- ► February 2010 (6)
- ► January 2010 (6)
-
►
2009
(120)
- ► December 2009 (5)
- ► November 2009 (12)
- ► October 2009 (2)
- ► September 2009 (3)
- ► August 2009 (16)
- ► April 2009 (4)
- ► March 2009 (20)
- ► February 2009 (9)
- ► January 2009 (19)
-
►
2008
(139)
- ► December 2008 (15)
- ► November 2008 (16)
- ► October 2008 (17)
- ► September 2008 (2)
- ► August 2008 (2)
- ► April 2008 (12)
- ► March 2008 (25)
- ► February 2008 (16)
- ► January 2008 (6)
-
►
2007
(17)
- ► December 2007 (4)
- ► November 2007 (4)
- ► October 2007 (7)
Blog Labels
- research
- CSE
- security
- software
- web
- AI
- development
- hardware
- algorithm
- hackers
- medical
- machine learning
- robotics
- data-mining
- semantic web
- quantum computing
- Cloud computing
- cryptography
- network
- EMR
- search
- NP-complete
- linguistics
- complexity
- data clustering
- optimization
- parallel
- performance
- social network
- HIPAA
- accessibility
- biometrics
- connectionist
- cyber security
- passwords
- voting
- XML
- biological computing
- neural network
- user interface
- DNS
- access control
- firewall
- graph theory
- grid computing
- identity theft
- project management
- role-based
- HTML5
- NLP
- NoSQL
- Python
- cell phone
- database
- java
- open-source
- spam
- GENI
- Javascript
- SQL-Injection
- Wikipedia
- agile
- analog computing
- archives
- biological
- bots
- cellular automata
- computer tips
- crowdsourcing
- e-book
- equilibrium
- game theory
- genetic algorithm
- green tech
- mobile
- nonlinear
- p
- phone
- prediction
- privacy
- self-book publishing
- simulation
- testing
- virtual server
- visualization
- wireless
No comments:
Post a Comment