How StockTwits Uses Machine Learning to Make Better Products

Fascinating behind the scenes interview of StockTwit’s Senior Data Scientist Garrett Hoffman. He shares great tidbits on how StockTwits uses machine learning for sentiment analysis. I’ve summarized the highlights below:

Idea generation is a huge barrier for active trading
Next gen of traders uses social media to make decisions
Garrett solves data problems and builds features for the StockTwits platform
This includes: production data science, product analytics, and insights research
Understanding social dynamics makes for a better user experience
Focus is to understand social dynamics of StockTwits (ST) community
Focuses on what’s happening inside the ST community
ST’s market sentiment model helps users with decision making
Users ’tag’ content for bullish or bearish classes
Only 20 to 30% of content is tagged
Using ST’s market sentiment model increases coverage to 100%
For Data Science work, Python Stack is used
Use: Numpy, SciPy, Pandas, Scikit-Learn
Jupyter Notebooks for research and prototyping
Flask for API deployment
For Deep Learning, uses Tensorflow with AWS EC2 instances
Can spin up GPU’s as needed
Deep Learning methods used are Recurrent Neural Nets, Word2Vec, and Autoencoders
Stays abreast of new machine learning techniques from blogs, conferences and Twitter
Follows Twitter accounts from Google, Spotify, Apple, and small tech companies
One area ST wants to improve on is DevOps around Data Science
Bridge the gap between research/prototype phase and embedding it into tech stack for deployment
Misconception that complex solutions are best
Complexity ONLY ok if it leads to deeper insight
Simple solutions are best
Future long-term ideas: use AI around natural language