Slaps Lab – Medium

Slaps Lab

Adventures in building custom datasets via Web Scrapping / Spotify API — Octane Playlist (37)…

This is a quick write up where we go through creating a custom dataset based on the Octane’s song playlist over a finite period of time…

4 min readNov 28, 2020

--

Adventures in building custom datasets via Web Scrapping / Spotify API — Octane Playlist (37)…

--

Slaps Lab
in
The Startup

Adventures in Building Custom Datasets Via Web Scrapping — Little Mermaid Edition

This is a quick, end to end write up where I go through parsing out a movie script from the web. We start with HTML and end up with a CSV.

5 min readNov 22, 2020

--

Adventures in Building Custom Datasets Via Web Scrapping — Little Mermaid Edition

--

Slaps Lab

Using Word Embeddings to help bridge different sets of vocab

This write up is meant to simulate a situation in which you already have a developed vocab but you are presented with terms outside of it.

4 min readNov 16, 2020

--

Using Word Embeddings to help bridge different sets of vocab

--

Slaps Lab

Centrality Metrics via NetworkX, Python

Zachary’s karate club is a widely used dataset [1] which originated from the paper “An Information Flow Model for Conflict and Fission in…

4 min readNov 15, 2020

--

Centrality Metrics via NetworkX, Python

--

Slaps Lab

Bag of Words via Python

This method allows us to focus on the occurrence of a term in a corpus, when order is not important.

2 min readNov 15, 2020

--

Bag of Words via Python

--

Slaps Lab

Ensemble Text Generator (LSTM) that uses content produced by the most influential authors…

This post builds upon previous ones where we pulled data from Reddit, built multiple social networks…

4 min readMay 5, 2020

--

Ensemble Text Generator (LSTM) that uses content produced by the most influential authors…

--

Slaps Lab

Building a Text Generator based on the most influential authors in the r/Siacoin…

Building an LSTM model using Keras based on the content produced by the most influential authors in the r/Siacoin Subreddit Community

6 min readMay 3, 2020

--

Building a Text Generator based on the most influential authors in the r/Siacoin…

--

Slaps Lab

Building a Centrality Metrics author filter for the r/Siacoin Subreddit Community

Using centrality metrics to filter authors to just the most influential (top=n) to aid in noise reduction.

2 min readApr 23, 2020

--

--

Slaps Lab

Extracting Social Networks from the r/Siacoin Subreddit Community

Walk-through on scrapping and building out Social Networks per month for the r/Siacoin Subreddit Community.

3 min readApr 21, 2020

--

Extracting Social Networks from the r/Siacoin Subreddit Community

--

Slaps Lab

Adventures in building custom datasets via Web Scrapping — ESPN Articles Edition

I recently decided that I wanted to build a Named-Entity Recognition (NER) NBA model. This is about how I went about getting the data.

3 min readApr 16, 2020

--

Adventures in building custom datasets via Web Scrapping — ESPN Articles Edition

--

Slaps Lab

Slaps Lab

Focused on generating original, compelling, short stories through the use of Artificial Intelligence.

Help
Status
About
Careers
Blog
Privacy
Terms
Text to speech
Teams