Towards Context-Aware Search and Analysis on Social Media Data #bigdata

“Social media data typically consists of non-curated, short messages that are shared among people, instead of being visited manually or crawled by an automatic agent. Messages may be distributed through an explicit network of friends and followers, openly visible or privately, according to the sender’s preferences. There is no publishing delay and the barrier of entry is low, often only requiring an email address. This leads to substantial volumes of content constantly being created, and an expectation of data currency. Online social networks are a source of “big data”. Our social graphs are made explicit; our interactions are recorded; our utterances are saved in machine readable format; we can be heard across the world as easily as across the room. Twitter alone generates a million messages every ???ve minutes; a four-day stream comprises around 10(9) messages. As a result of online social networking, massive volumes of diverse social media data that capture a sample of all human discourse are accessible online”

http://www.derczynski.com/sheffield/papers/scalable_sm.pdf

Published by

Cosimo Accoto

Research Affiliate at MIT | Author "Il Mondo Dato" (Egea) | Philosopher in Residence | Business Innovation Advisor | www.cosimoaccoto.com

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.