We are making continual improvements to the Sifter beta. Our goal is to develop the best possible user interface for Gnip’s PowerTrack filters when searching for historical Twitter data. Version 2 of the historical Twitter filtering system reflects a lot of great input from our early adopters. The work is far from done. This video introduces v2. What we need is your input. How can we make this tool for searching every undeleted tweet in history easier to use?
As part of getting new users to test our Sifter beta, we are awarding 12 #datagrants to academics every month this summer. All you need to do to be included in the July drawing is submit a valid historical Twitter estimate request using Sifter and then send us your CV. These prizes shave thousands of dollars off the cost of your research. The June social data and tools prize winners were:
Kelly Fincham, The Department of Journalism, Media Studies, and Public Relations at Hofstra University
“I will use the data and software prize to further my research and analysis of journalism practice on Twitter. My research agenda explores journalists’ evolving norms and practices on social media, specifically Twitter, in the U.S. and Ireland. This grant will help me to research and analyze this subject area in more depth.” @kellyfincham
“I am hoping to use the data and software prize for my PhD research on the recovery and rebuild after the Christchurch earthquake of 2011. I am particularly interested in framing and sentiment of tweets and am hoping to compare a historical data set during disaster response and recovery to the conversation about the rebuilt of the city which is still ongoing today. I am hoping to study the differences and similarities of conversations on Twitter now and then.” @tinserella
“I will like to integrate the collected data (tweets) in my final essay in order to get my Masters degree. The subject of my essay is: racism online.” @CarminaGodoy
“This award will be used to collect and analyze select data from the early group stages of the 2014 World Cup. Social media – including but not limited to Twitter – are increasingly integrated into traditional (TV, radio, print) media campaigns. At the 2014 World Cup, the hashtags #becausefootball and #becausefutbol were promoted throughout the televising of the games. Exploratory thematic analysis of these Tweets – enabled by Sifter and Discovertext – will describe how the use of these commercially-oriented hashtags are used in comparison to what we know about live event Twitter usage in the current body of research.” @warrensallen
“I plan to use the prize to capture and analyze online discussion and commentary about police use of automated license plate recognition (ALPR) systems and wearable cameras. In particular, I hope to examine discussions related to the public disclosure of data generated by these systems under freedom of information laws.” @newmedialaw
“This project will survey the current use of online social media by health organization for health campaign and analyze the reach and diffusion of campaign messages. Despite the ever growing number of online social media-based health campaigns, little work has been done to understand how interactive natures of online social media are used for public health promotion. For this project, Twitter data will be analyzed to enhance our understanding of how health organizations use social media for public health promotion and how such uses of online media platforms are received by the public.”
Abhay Gupta, Lecturer at Fairleigh Dickinson University
“I plan to use it to understand the dynamics of public opinion. In particular, I want to test various hypotheses on how major events (e.g. election wins, market crash, sports results) impact the sentiment and whether pre-event opinion analysis has any predictive power in explaining actual outcomes.” @EmpForesights
“I am looking forward to using the Texifter data and software to investigate how consumers and brands communicate on social media. In particular, I’m interested in how language use affects consumer behavior in online contexts. Given the extent to which consumers have and are continuing to adopt social media, this research should have important implications for marketing practitioners.” @vabarger
“I am studying the influence of social movements on changes in the law — specifically land law. I hope to use the prize to access Twitter data that can tell me about the relationships between movement actors, how they form their interests, and how these change over time.” @jrgbaxter
“I will use the software and data to continue my study of the lifecycle of policy initiatives. I used DiscoverText in my latest book Interpreting Hashtag Politics (Palgrave Macmillan, 2014). Historic Twitter data reveals the first mention of policies that enjoy several months of widespread attention before disappearing without trace. To understand why and how this occurs, I will continue to use DiscoverText to de-duplicate the data and develop thematic code sets with a team of research assistants.” @SRJeffares
Cristian Vaccari, Lecturer in Politics at the Royal Holloway University of London
“I am planning on using the data and software to analyze how politically motivated users of social media engage with mediated political events, such as televised leader debates and high-profile interviews, to better understand the interplay between television and social media in the flow of political messages.” @25lettori
Bill D. Herman
Remember: All you need to do to be included in the July drawing is submit a valid historical Twitter estimate request using Sifter and then send us your CV.
We could not be happier to announce that Texifter, a developer of advanced text data analytics software, is partnering with Gnip, the world’s largest provider of social data. Our Plugged In to Gnip partnership certifies Texifter as an industry leader committed to building innovative analytics solutions on top of reliable, sustainable, and complete social data. In joining Gnip’s partner program, Texifter joins the list of leading analytics providers like Microsoft, Salesforce, and Adobe. “The Plugged In program was created to really highlight the companies that are doing the most innovative things in social data,” according to Chris Moody, CEO of Gnip, “and Texifter is a great example of that.”
Texifter’s DiscoverText platform provides advanced data analytics solutions for social researchers in public and private institutions. Combining powerful tools with accessible interfaces, DiscoverText provides “five pillars of text analytics” – search, filtering, clustering, human-coding, and machine-learning. By partnering with Gnip, Texifter has access to historical Twitter data. Texifter recently launched “Sifter”, a tool to help users estimate Twitter volume associated with historical searches. The Sifter product gives users a free estimate of Twitter volume over a specific date range using advanced Gnip PowerTrack filtering. Customers who license historical Twitter data from Gnip can then access it for text analytics via a 30-day trial of DiscoverText.
“Texifter welcomes this opportunity to work even more closely with a company that we have admired and worked with for years,” said Stu Shulman, CEO of Texifter. “Gnip is an exceptionally reliable provider of social data products and services. Texifter customers will continue to see more benefit as we work with Gnip to deliver high quality products and services.”
Social Data & Tools: Prizes for Academics
We felt inspired by the recent #DataGrants experiment sponsored by Twitter that generated more than 1,300 proposals from 60 countries and resulted in six extremely interesting awards. One thing is clear: many more grants of social data licenses are needed to fuel academic research. Texifter is sponsoring social data and tools prizes for academics as a simple contest with 12 winners a month this summer. In addition to social data access, we understand that many academic researchers also need “point & click” web-based tools to simplify the data access and management tasks involving social media APIs and JSON.
The Prizes
We will award twelve social data prizes with text analytics software licenses every month this summer. These premium social data prizes include access to our powerful online DiscoverText tools to search, filter, cluster, code, and machine classify the data, as well as interactive visual reporting tools that include several specialized views for metadata eDiscovery, time series, deduplication, near-duplicate clustering, and other project attributes. No software programming skills are required. The twelve monthly prizes are:
- One grand prize per month of 10 Historical Twitter Days and credit for 1,000,000 Tweets plus one year of Enterprise access to DiscoverText. Up to one hour of free consulting on research design and social data cleaning.
- Two nearly grand prizes per month of 5 Historical Twitter Days and credit for 500,000 Tweets plus six months of Enterprise access to DiscoverText. Up to one hour of free consulting on research design and social data cleaning.
- Three prizes of six months of Enterprise access to DiscoverText plus 100,000 credits to capture day-forward Twitter data via the Gnip PowerTrack.
- Three prizes of six months of Enterprise access to DiscoverText plus 50,000 credits to capture day-forward Tumblr data via the Gnip PowerTrack.
- Three prizes of six months of Enterprise access to DiscoverText plus 25,000 credits to capture day-forward Disqus or WordPress data via the Gnip PowerTrack.
Rules to Enter
This prize drawing is designed to promote experimentation with the free historical Twitter estimation tool we have in beta known as Sifter. The application provides search and retrieve access to every undeleted Tweet in the history of Twitter.
- Create a free account on Sifter: https://sifter.texifter.com/Home/Registration.
- Indicate you are an academic and show your affiliation using a URL during or after the registration process.
- Send us a copy of your CV via email (firstname.lastname@example.org). You only need to do this once even if you enter every month.
- Use Sifter to generate at least one Gnip historical PowerTrack for Twitter estimate spanning no more than 10 historical Twitter days and returning no more than 1,000,000 tweets. Estimates are free and we encourage experimentation with sampling and PowerTrack operators. Create as many estimates as you like.
- Every user who creates a valid estimate (no more than 10 Twitter days and no more than 1,000,000 tweets) during a calendar month this summer will be entered into that month's drawing for a social data research grant prize.
- Every month, for at least the next three months starting with June 2014, we will hold a new drawing.
- A single user can only win one prize per month, but can enter the drawing every month. We will do a drawing at the end of each month limited to just the first 100 eligible entries to increase the chances of winning a prize to better than 1 in 10. Remember to visit Sifter early in the month before all 100 entries are taken.
- After the drawing each month we will publish a list of the 12 winners and their academic affiliations on the Texifter blog.
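The eligibility limits and the odds described in the rules above can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not how Texifter actually runs the drawing: the `Estimate` record, the function names, and the assumption that entries are taken in submission order with one entry per user are all hypothetical.

```python
import random
from dataclasses import dataclass

# Limits taken from the rules above.
MAX_DAYS = 10            # an estimate may span at most 10 historical Twitter days
MAX_TWEETS = 1_000_000   # and return at most 1,000,000 tweets
MAX_ENTRIES = 100        # only the first 100 eligible entries are drawn from
PRIZES_PER_MONTH = 12    # twelve winners each month


@dataclass(frozen=True)
class Estimate:
    """Hypothetical record of one Sifter estimate request."""
    user: str
    days: int
    tweets: int


def is_valid(estimate):
    """True if the estimate stays within both contest limits."""
    return estimate.days <= MAX_DAYS and estimate.tweets <= MAX_TWEETS


def eligible_entries(estimates):
    """Return the first MAX_ENTRIES distinct users with a valid estimate.

    Assumes `estimates` is already in submission order (an assumption).
    """
    entries, seen = [], set()
    for estimate in estimates:
        if is_valid(estimate) and estimate.user not in seen:
            seen.add(estimate.user)
            entries.append(estimate.user)
            if len(entries) == MAX_ENTRIES:
                break
    return entries


def draw_winners(entries, rng):
    """Pick winners without replacement, so no user wins twice in a month."""
    return rng.sample(entries, min(PRIZES_PER_MONTH, len(entries)))


# With a full pool, the chance of winning is 12 / 100 = 0.12.
odds_with_full_pool = PRIZES_PER_MONTH / MAX_ENTRIES
```

With all 100 slots filled, each entrant has a 12-in-100 chance, which is where the "better than 1 in 10" figure comes from; with fewer entrants the odds only improve.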
Social Data and Software Terms of Service
All contest-related social data will be stored in DiscoverText. Use of the data is governed both by the publisher and Texifter Terms of Service.
The Future Need for Tools and Data is Great
This small contest cannot satisfy the pent-up demand students and faculty have for tools and data. We do think that these prizes can equip a researcher with sufficient data and advanced analytic tools to run a successful pilot study, or to complete a graduate thesis proposal. It is our hope to grow the social data research grant program over time. If it helps to drive new awareness of the research ecosystem, Texifter would be happy to be a part of the innovative energy pouring into academic research studies of the impact and uses of social data.
Project Outputs
We will invite all of the contest winners to write about their project on the Texifter blog. This is optional, but we have had some great guest research posts lately about school bullying, elections, and reusable learning objects. We hope these software and social data research grants will lead to more reports of innovative teaching and research efforts.
A brief follow-up on Texifter. We successfully migrated DiscoverText to Microsoft’s Azure. The migration was very smooth, though we are going through a period of diminished search and filtering capabilities while the data re-indexes. The other capabilities appear stable. We also launched a new beta product on Azure that lets users get free estimates (and buy the data) self-serve from the full history of Twitter. The live prototype is “Sifter” (https://sifter.texifter.com). Finally, I have been elected a board member and Treasurer for the Big Boulder Initiative (https://bigboulderconf.com/about/). In that capacity, I will be playing a role helping to organize the social data industry association that will launch in June at Big Boulder.
2014 is looking good for Texifter. On January 31, 2014, the company re-acquired all assets and intellectual property related to DiscoverText, including the Sifter stack of language technologies for de-duplication, clustering, coding, and machine-learning, as well as the “CoderRank” patent. Going forward, we believe these tools can make a significant impact on the history of information.