Video Library
Videos
Highlights, deep dives, and short-form explainers across AI, data science, and policy.
Video Links
Deep Dive Videos
Train Mixture of Experts Model from Scratch - Simpsons Edition, Youtube (Nov 2025)
A Practical Guide to Evaluating Generative AI Applications - Updated, Youtube (Nov 2025) · Blog post
RAG Retrieval Deep Dive: BM25, Embeddings, and the Power of Agentic Search, Youtube (Oct 2025) · Blog post
Evaluation for Generative AI - A simply explained starting point, Youtube (May 2025)
Using Reasoning LLMs (Claude with Python or Agno), Youtube (Apr 2025)
Get Started with Deepseek's GRPO using QWEN and Hugging Face, Youtube (Feb 2025)
Unit Testing for Natural Language (LLMs) + LMUnit model, Youtube (Feb 2025)
Training Kolmogorov-Arnold Networks (KAN) using Pytorch and Nixtla on M3/M4 Time Series Datasets, Youtube (Nov 2024)
Feature Selection Methods for Machine Learning, plus Feature Selection Curves, Youtube (Oct 2024) · Blog post
Start using Llama 3.2 Vision Models with Hugging Face Transformers (on Snowflake), Youtube (Oct 2024)
Practical Lessons in Building Generative AI: RAG and Text to SQL, Youtube (Sep 2024) · Blog post
Spark of AI: How Transfer Learning Unlocked AI's Potential, Youtube (Sep 2024) · Blog post
Interpretable ML Models, Youtube (Aug 2024) · Blog post
Intro to Generative AI and Trends (March 2024), Youtube (Jun 2024)
Model Interpretability and Explainability for Machine Learning Models, Youtube (Jun 2024)
Large Language Models (LLMs) Can Explain Their Predictions, Youtube (Jan 2024)
Evaluation for Large Language Models and Generative AI - A Deep Dive, Youtube (Nov 2023) · Blog post
NanoGPT using Simpsons Data: Get Started with Large Language Models, Youtube (Sep 2023)
16 Challenges for LLMs - Paper Highlights, Youtube (Aug 2023)
Llama 2 Paper Explained, Youtube (July 2023)
GPT or BERT? Reviewing the tradeoffs of using Large Language Models versus smaller models, Youtube (Jun 2023)
Building Better Large Language Models - Key Concepts for Prompting and Fine Tuning, Youtube (Apr 2023)
Efficient Large Language Model training with LoRA and Hugging Face PEFT, Youtube (Mar 2023)
Text style transfer in a spreadsheet using Hugging Face Inference Endpoints, Youtube (Nov 2022) · Blog post
SetFit: Few Shot Learning for Text Classification, Youtube (Oct 2022) · Blog post
Prediction Intervals with Conformal Inference: An Intuitive Explanation, Youtube (Sep 2022) · Blog post
LayoutLMv3 Training with CORD (receipts dataset), Youtube (Sep 2022)
Fine Tuning an Image Classifier on Indian Food Images, Youtube (Aug 2022)
Explanation Approaches for Transformers, Youtube (Aug 2022) · Blog post
Short Form Videos
Search by topic, keyword, or platform.
Reinforcement learning with my Eat Melon! Demo based ...
565,300 views2022-04-05
Reinforcement learning with my Eat Melon! Demo based on Karpathy #datascience #reinforcementlearning #techtok #machinelearning
Google dropped Gemini. Let's talk about the different...
253,633 views2023-12-06
Google dropped Gemini. Let's talk about the different sizes tweaked benchmarks multimodal trained on TPUs and how it's not that exciting. #g...
Curse of dimensionality reminds us to think carefully...
166,921 views2024-02-11
Curse of dimensionality reminds us to think carefully about feature selection. More isn’t always better. Use a feature selection curve. #d...
There are lots of open-source code assistant tools. S...
90,205 views2023-09-30
There are lots of open-source code assistant tools. Starcoder is the best known but many people are training and fine-tuning their own model...
Toolformer from Meta shows the possibilities of using...
88,831 views2023-02-13
Toolformer from Meta shows the possibilities of using APIs in an unsupervised way. #datascience #machinelearning #toolformer #largelanguagem...
Transformer Explainer is an interactive visualization...
69,100 views2024-08-11
Transformer Explainer is an interactive visualization tool to allow people to understand how transformers work through an end-to-end visuali...
Text to SQL is now easier with a large language model...
68,107 views2023-07-06
Text to SQL is now easier with a large language model released by Numbers Station called NSQL. #largelanguagemodels #nsql #numberstation #ma...
Med-Palm from Google for answering medical and clinic...
68,100 views2022-12-29
Med-Palm from Google for answering medical and clinical knowledge. #datascience #machinelearning #largelanguagemodels #medpalm
Control vectors are getting more widely supported mos...
65,529 views2024-03-17
Control vectors are getting more widely supported most recently in Llama.cpp. It’s another useful technique alongside prompting fine tunin...
Clustering with k-means. This skit was inspired by th...
64,800 views2022-12-31
Clustering with k-means. This skit was inspired by the examples in Schubert paper on stop using the elbow criterion for kmeans. Any other cl...
Very excited and Richard isn’t paying me for this - #...
64,100 views2022-05-06
Very excited and Richard isn’t paying me for this - #codetok #youdotcom #codingtiktok #python
Climax, a new transformer based model for predicting ...
61,400 views2023-02-08
Climax, a new transformer based model for predicting weather and climate forecasting. Great example of the flexibility of transformers based...
AI News for the week featuring OpenAI NVIDIA Google A...
59,515 views2023-10-21
AI News for the week featuring OpenAI NVIDIA Google Apple Stanford Hugging Face Anthropic and Microsoft. #machinelearning #openai #rajistics...
Simply explaining how ChatGPT works. All the technica...
56,100 views2022-12-07
Simply explaining how ChatGPT works. All the technical details of ChatGPT have not been released, so this is based on what OpenAI has been d...
Twitter open sourced it's recommendation algorithm. I...
55,500 views2023-04-01
Twitter open sourced it's recommendation algorithm. It's fun to look at someone else's production code and will be useful to people studying...
Tensorboard embedding projector - repost
52,400 views2024-07-01
Tensorboard embedding projector - repost
#onthisday tensorboard embedding projector. Let me kn...
49,500 views2023-07-01
#onthisday tensorboard embedding projector. Let me know if i should reshare these older videos.
This happens. #datascience #machinelearning #python #...
48,900 views2022-05-04
This happens. #datascience #machinelearning #python #codetok #programming
Some general advice on how to evaluate software packa...
47,700 views2022-11-30
Some general advice on how to evaluate software packages. #datascience #machinelearning #github
Point-E from #openai. Generating 3D point clouds from...
46,400 views2022-12-20
Point-E from #openai. Generating 3D point clouds from text #datascience #machinelearning
GPT4 hype that it will be 100 trillion parameters doe...
45,500 views2023-01-17
GPT4 hype that it will be 100 trillion parameters doesn’t make any sense. First, is the scaling laws in this video @rajistics and also thin...
Some common data distributions when modeling includin...
45,300 views2023-02-03
Some common data distributions when modeling including skewed and zero inflated. There are many other distributions, but just wanted people ...
Open source LLMs why they seem popular are not easy t...
43,496 views2023-06-03
Open source LLMs why they seem popular are not easy to get running in production settings. The current open source LLMs while getting better...
A simple explanation of what AI is. The video touches...
43,250 views2023-06-25
A simple explanation of what AI is. The video touches upon the impact of AI how AI works with a practical example and some of the reasons AI...
Stable diffusion, go run it yourself! It’s so awesome...
42,600 views2022-08-25
Stable diffusion, go run it yourself! It’s so awesome. #datascience #codetok #aipub #huggingface #machinelearning
Reply to @bird_3288 #python #rstat #datascience #anal...
41,600 views2022-03-12
Reply to @bird_3288 #python #rstat #datascience #analytics #programming
Working with embeddings today. #datascience #word2ve...
41,200 views2022-07-01
Working with embeddings today. #datascience #word2vec #embeddings #tensorflow #codetok #tensorboard
Data Centric AI helps to remind us not to focus too m...
40,900 views2023-02-24
Data Centric AI helps to remind us not to focus too much on the model or algorithms. In real data science, it’s more about understanding you...
It’s important to make sure your model is well calibr...
40,300 views2022-11-11
It’s important to make sure your model is well calibrated. This becomes especially important with imbalanced data. #machinelearning #datasci...
üöÄ Just get started on your journey to learn large ...
39,440 views2023-07-26
üöÄ Just get started on your journey to learn large language models! ü§î Is there a lot to learn? Yes! üòÖ ü§∑‚Äç‚ôÇÔ∏è But is it easy t...
This paper introduces vec2vec, a method that aligns t...
39,208 views2025-05-22
This paper introduces vec2vec, a method that aligns text embeddings from different language models—without access to the models or labeled d...
Diffusion models for markup. #datascience #machinelea...
38,700 views2022-10-13
Diffusion models for markup. #datascience #machinelearning #stablediffusion
So much going on around using generative tools for re...
38,609 views2023-04-12
So much going on around using generative tools for reasoning with tasks. HuggingGPT or Jarvis is focused on helping on solving AI tasks. Aut...
Interpretable models offer a great alternative to tra...
38,528 views2024-03-09
Interpretable models offer a great alternative to traditional machine learning algorithms. Generalized Additive Models like GA2M Rulefit and...
Google’s sparrow is the rumored competitor to OpenAI ...
37,900 views2023-01-22
Google’s sparrow is the rumored competitor to OpenAI ChatGPT. Check out the paper to see lots of examples of it chatting. It looks really go...
A GPT-5 + Gemini Pro pipeline might sort one apple ev...
35,944 views2025-10-01
A GPT-5 + Gemini Pro pipeline might sort one apple every 6 seconds. A simple light frequency sensor sorts hundreds per second. The outcome? ...
Some alternatives to clustering with k-means. This sk...
35,093 views2023-12-31
Some alternatives to clustering with k-means. This skit was inspired by the examples in Schubert paper on stop using the elbow criterion for...
This video explains the findings from the Google Rese...
34,282 views2025-07-26
This video explains the findings from the Google Research paper "Learning Without Training: The Implicit Dynamics of In-Context Learning" (a...
Always have a baseline model. For time series you can...
33,531 views2023-10-09
Always have a baseline model. For time series you can often compare to what happened in a previous time step like last week. There are error...
No big deal, use visualization #stats #datascience #...
32,200 views2022-01-14
No big deal, use visualization #stats #datascience #datasaurus #datascience #analytics #anscombe #visualization
Customer lifetime value is a common data science use ...
31,700 views2023-02-25
Customer lifetime value is a common data science use case. There are many ways to calculate this, but here I introduce the class RFM method ...
Knowledge distillation is a useful technique to build...
31,648 views2024-03-10
Knowledge distillation is a useful technique to build smaller high-performing models. DistilBERT is a great example of a widely used model t...
In 2023, Meta intern Guangxuan Xiao discovered that r...
31,543 views2025-08-09
In 2023, Meta intern Guangxuan Xiao discovered that removing the first few tokens in a sliding-window KV cache caused catastrophic degradati...
Learning curves, it’s a technique I use all the time ...
31,500 views2022-11-16
Learning curves, it’s a technique I use all the time when training models. Thanks to Todd C for showing me the best way to explain this. #da...
The Data Scientist title is worth $$$ €€€ £££ ¥¥¥. #d...
30,800 views2022-02-22
The Data Scientist title is worth $$$ €€€ £££ ¥¥¥. #datascience #dataanalyst #analytics
This video explores Apple’s recent study on large rea...
30,563 views2025-06-08
This video explores Apple’s recent study on large reasoning models and why they often fail to actually “reason.” It covers controlled puzzle...
Reminder to visualize your data with one of my favori...
30,389 views2023-10-30
Reminder to visualize your data with one of my favorites Anscombe's quartet #anscombesquartet #datavisualization #datascience #statistics #r...
Vicuna is awesome, go check it out. Its the latest LL...
29,200 views2023-03-31
Vicuna is awesome, go check it out. Its the latest LLama model and very impressive. I ended up cutting out the details on vicuna, since i fe...
Dealing with over plotting, another visualization tip...
28,800 views2023-01-08
Dealing with over plotting, another visualization tips from data to viz #datascience #machinelearning #statistics #datavisualization
7 Baseline Models: Time Series: Previous Value Anomal...
28,700 views2024-06-22
7 Baseline Models: Time Series: Previous Value Anomaly: p99 Search: BM25 Recommendation: Popularity Buy recommendations: last viewed Classif...
I think langchain is aweome, but the future is an eas...
28,574 views2023-03-17
I think langchain is aweome, but the future is an easy to use UI. Think Alteryx for LLMs. Langflow is a step in the right direction. #datasc...
The power of prompting! How to use a general purpose ...
28,550 views2023-11-29
The power of prompting! How to use a general purpose model to be a special purpose fine tuned model. It's really important to learn good pro...
Scaling laws help us figure out how manage the amount...
28,500 views2023-01-07
Scaling laws help us figure out how manage the amount of training data versus the model size. DeepMind showed with Chinchilla by using more ...
Visual question answering (VQA) is another cool task ...
28,500 views2022-12-03
Visual question answering (VQA) is another cool task you can do with machine learning. #datascience #machinelearning #visualquestionanswerin...
Segment Anything (SAM) is a new segmentation model fr...
27,500 views2023-04-06
Segment Anything (SAM) is a new segmentation model from Meta. It's a huge improvement over the state of the art and is going to change compu...
DeepSeekv3 is turning heads - the paper is also reall...
27,301 views2024-12-26
DeepSeekv3 is turning heads - the paper is also really good, check it all out at: https://github.com/deepseek-ai/DeepSeek-V3
YOLO is a seminal model in object detection for compu...
26,331 views2023-09-13
YOLO is a seminal model in object detection for computer vision. But what is even more interesting is the principal author Joseph Redmon and...
Pandas 2.0 combing with arrow. A short recap on how i...
25,100 views2023-03-01
Pandas 2.0 combing with arrow. A short recap on how it fits in with polars, dplyr, and data.table. #datascience #machinelearning #rstats #py...
RuterGPT is an inspirational story. Fine-tuning base ...
24,700 views2024-05-02
RuterGPT is an inspirational story. Fine-tuning base models allows you to do so much, including better language support. Check out the story...
Customer lifetime value is a common data science use ...
24,113 views2024-02-29
Customer lifetime value is a common data science use case. There are many ways to calculate this but here I introduce the classic RFM method...
YouChat and retrieval augmented models. To play aroun...
23,900 views2022-12-26
YouChat and retrieval augmented models. To play around with this, check out haystack from deepset. #datascience #machinelearning #youchat #...
Rust for machine learning. It’s useful in some cases ...
23,800 views2022-09-25
Rust for machine learning. It’s useful in some cases for ML, but learn python first. #datascience #codetok #python #machinelearning #rust
Showing the latent space for stable diffusion. Next v...
23,800 views2022-09-14
Showing the latent space for stable diffusion. Next video is on explainability. #stablediffusion #datascience #machinelearning #codetok #uma...
Other tips I should share? #datascience #timeseries #...
23,700 views2022-06-12
Other tips I should share? #datascience #timeseries #statistics #dataanalysis #python #codetok #mltok
Cheating has reared its head again over at Kaggle. So...
23,071 views2023-01-30
Cheating has reared its head again over at Kaggle. Some background for folks on Kaggle and cheating there. #datascience #machinelearning #ka...
Composer will be sharing their new generative AI mode...
22,900 views2023-02-26
Composer will be sharing their new generative AI models and they look amazing. They key is they decompose the image, which then provides a l...
Be skeptical of new models like TimeFM from Google (b...
22,632 views2024-02-04
Be skeptical of new models like TimeFM from Google (but still listen). For many reasons deep learning models do not work well for time serie...
An emerging trend of using large language models like...
22,531 views2023-05-20
An emerging trend of using large language models like GPT-4 for labeling data instead of using humans to annotate data: #datascience #machin...
Challenging the common assumption about normal data d...
22,500 views2025-02-04
Challenging the common assumption about normal data distributions, rajistics explains that real-world data often exhibits skewness, spikes a...
Missing data happens all the time. Don’t just jump to...
22,500 views2022-11-24
Missing data happens all the time. Don’t just jump to dropping rows or using imputation techniques. #dataengineering #statistics #datascienc...
LayoutLMv3 Training with CORD (receipts) dataset
22,313 views2022-09-09
LayoutLMv3 Training with CORD (receipts) dataset
Doing data analysis with large language models like C...
22,252 views2023-04-25
Doing data analysis with large language models like ChatGPT. It's going to be amazing as these technologies let us combine our data text and...
Let's dig into the detail for building your own large...
21,308 views2023-06-04
Let's dig into the detail for building your own large language model on a custom domain. The LLaVA-Med does a great breakdown of how they bu...
Always have a baseline model. For time series, you ca...
20,700 views2022-10-09
Always have a baseline model. For time series, you can often compare to what happened in a previous time step, like last week. There are err...
Graph databases accelerate multi-hop traversals, but ...
20,600 views2025-08-30
Graph databases accelerate multi-hop traversals, but most production queries are shallow (1–2 hops) that SQL or embeddings handle efficientl...
Breaking study from Harvard showing the impact of Lar...
20,400 views2023-09-18
Breaking study from Harvard showing the impact of Large Language Models like GPT-4 on office productivity. #datascience #gpt4 #officeproduct...
OpenAI's turmoil this last week will ensure enterpris...
20,339 views2023-11-19
OpenAI's turmoil this last week will ensure enterprise AI strategies will not depend on OpenAI. It's clear for any valuable AI systems it's ...
This video is based on work Omar did in tracking down...
20,248 views2023-06-27
This video is based on work Omar did in tracking down why Falcon was giving results that favored the Middle East. It's an example of how bia...
AI Engineer is starting to emerge as a new role. This...
19,998 views2023-06-30
AI Engineer is starting to emerge as a new role. This role works with LLMs and does prompt engineering and fine tuning of models. They typic...
Tesla self driving has been such a scam. I am so disa...
19,200 views2023-01-17
Tesla self driving has been such a scam. I am so disappointed. I really believed that self driving could be pretty useful (I knew it wasn’t ...
A walkthrough of the explainer dashboard. It contains...
19,200 views2022-11-29
A walkthrough of the explainer dashboard. It contains a lot of the tools you want when trying to explain your models. #datascience #machinel...
The video demonstrates the limitations of LLMs by sho...
19,115 views2025-02-16
The video demonstrates the limitations of LLMs by showcasing how various real-world AI problems are best solved using traditional machine le...
Everyone is using transformers! Are you working on op...
18,808 views2023-11-21
Everyone is using transformers! Are you working on optimizing your use? The community has been steadily finding ways to optimize transformer...
The Agony! #datascience #machinelearning #mltok #tec...
18,800 views2022-04-06
The Agony! #datascience #machinelearning #mltok #techtok #statistics
Building a question / answer application using a larg...
18,793 views2023-05-04
Building a question / answer application using a large language model is a great starter project. You will need to use a vector database and...
Evaluation for Large Language Models (LLMs) and Gener...
18,612 views2023-11-06
Evaluation for Large Language Models (LLMs) and Generative AI - A Deep Dive
One of my favorite methods for feature selection is r...
18,531 views2024-02-18
One of my favorite methods for feature selection is recursive feature elimination. It's very easy to do and a starting data scientist can co...
So what did I miss when you do error analysis? #machi...
18,400 views2022-12-10
So what did I miss when you do error analysis? #machinelearning #datascience #statistics #erroranalysis
Prompt injection attacks are a major security concern...
18,370 views2025-05-02
Prompt injection attacks are a major security concern when using large language models (LLMs) like ChatGPT. They allow attackers to overwrit...
Struggling with data validation - let’s dig into how ...
18,312 views2025-01-19
Struggling with data validation - let’s dig into how Pydantic's type system and validation framework elegantly solves these problems. The vi...
MuonClip used by Moonshot AI and developed by Keller ...
18,293 views2025-07-15
MuonClip used by Moonshot AI and developed by Keller Jordan was used during the training of their trillion-parameter Kimi 2 model, addresses...
Curse of dimensionality reminds us to think carefully...
18,100 views2023-02-11
Curse of dimensionality reminds us to think carefully about feature selection. More isn’t always better. Use a feature selection curve. #dat...
Bias in Medical Imaging #datascience #codetok #algori...
18,100 views2022-05-22
Bias in Medical Imaging #datascience #codetok #algorithmicbias #imaging #machinelearning #bias motivated by the comments from @rajistics
Histograms are a great visualization tool. Here are s...
18,000 views2023-02-10
Histograms are a great visualization tool. Here are some caveats and tips for using histograms. #datascience #statistics #datavisualization ...
Deepmind and OpenAI want everyone to focus on extreme...
17,873 views2023-05-27
Deepmind and OpenAI want everyone to focus on extreme risks of AI. This helps them hype up AI and make themselves more attractive. The reali...
The skit addresses the challenge of acquiring large v...
17,760 views2023-12-20
The skit addresses the challenge of acquiring large volumes of labeled data for machine learning projects. The video focuses on using machin...
Replying to @philosophywithsuf explaining the irony f...
17,700 views2022-12-15
Replying to @philosophywithsuf explaining the irony for pytorch building a graph and the history of tensorflow
The New York Times recently filed a lawsuit against O...
17,449 views2024-01-01
The New York Times recently filed a lawsuit against OpenAI. This is another of many copyright lawsuits against AI companies. While everyone ...
Reply to @milekumulator how I use GitHub #datascience...
17,400 views2022-05-07
Reply to @milekumulator how I use GitHub #datascience #github #codetok #python #sportsanalytics
Classification outcomes and probabilities #datascienc...
17,400 views2022-03-02
Classification outcomes and probabilities #datascience #machinelearning #algorithms
The LocalLlama subreddit received a citation in a rec...
17,063 views2023-06-28
The LocalLlama subreddit received a citation in a recent paper by Meta. Great reminder of the innovation you can get when models have a larg...
Customer lifetime value is a common data science use ...
17,031 views2024-03-05
Customer lifetime value is a common data science use case. There are many ways to calculate this but here I show how a data scientist would ...
Python Optimal Transport is an open source Python lib...
16,808 views2023-04-23
Python Optimal Transport is an open source Python library providing several solvers for optimization problems related to Optimal Transport f...
The hardest step is getting ComfyUI running on your c...
16,800 views2024-02-29
The hardest step is getting ComfyUI running on your computer (you need a GPU). Go do it! Then you can create the coolest images using stable...
Explaining how Emily Ocasio won second place with her...
16,700 views2023-03-29
Explaining how Emily Ocasio won second place with her project analyzing media coverage. I like her approach and highlights a growing trend o...
LLMs have a lot of security issues. From prompt injec...
16,462 views2024-01-18
LLMs have a lot of security issues. From prompt injection attacks extraction of training data data poisoning and even GPU based attacks. How...
Symbolic regression focuses on a mathematical represe...
16,362 views2023-08-25
Symbolic regression focuses on a mathematical representation of your data. It's helpful in many situations where you need an explainable mod...
Soundstorm is a new audio generation model from Googl...
16,247 views2023-07-17
Soundstorm is a new audio generation model from Google. It can rapidly generate high-quality audio. Google isn't making this model available...
Interpretable models are often overlooked, but a grea...
16,100 views2022-11-05
Interpretable models are often overlooked, but a great addition to your data science toolkit. Imodels is a great python package for getting ...
Replying to @Sam This video won't be popular but I ha...
16,000 views2023-05-14
Replying to @Sam This video won't be popular but I have to speak the truth. Meta AI has been really sharing out top notch open source models...
Like beautiful plots of data maps? Check out DataMapP...
15,995 views2024-01-09
Like beautiful plots of data maps? Check out DataMapPlot from Leland McInnes. To make the best use of this you will need to have your data c...
XGBoost 2.0 is out with some great new features inclu...
15,934 views2024-02-25
XGBoost 2.0 is out with some great new features including support for multi-target trees with vector-leaf outputs and learning to rank probl...
Quick introduction to optimization and for advanced f...
15,900 views2022-12-25
Quick introduction to optimization and for advanced folks, go run a notebook from gurobi or do the Kaggle Santa challenge. #datascience #mac...
I think langchain is aweome, but the future is an eas...
15,700 views2023-03-17
I think langchain is aweome, but the future is an easy to use UI. Think Alteryx for LLMs. Langflow is a step in the right direction. #datasc...
New state of the art embedding model, Instructor, for...
15,600 views2023-01-22
New state of the art embedding model, Instructor, for text is available! It accounts for task and domain when creating an mending. #datascie...
The best way to learning data science is working with...
15,500 views2022-12-17
The best way to learning data science is working with data. You don’t need to spend money on courses or books. Spending time doing useful pr...
Cheating has reared its head again over at Kaggle. So...
15,200 views2023-01-31
Cheating has reared its head again over at Kaggle. Some background for folks on Kaggle and cheating there. #datascience #machinelearning #ka...
Prompt injection attacks are a major security concern...
14,907 views2023-05-03
Prompt injection attacks are a major security concern when using large language models (LLMs) like ChatGPT. They allow attackers to overwrit...
This video breaks down why large language models can ...
14,716 views2025-06-15
This video breaks down why large language models can produce different outputs even with the same prompt, seed, and temperature. The culprit...
Replying to @Rajiv Shah long version of deep reinfor...
14,300 views2022-07-15
Replying to @Rajiv Shah long version of deep reinforcement learning video from Week 4
https://docs.google.com/presentation/d/1HEiuuOCni8Jao...
14,107 views2025-06-17
https://docs.google.com/presentation/d/1HEiuuOCni8Jao1DNbjxE6VTl9Nxl-q6ObLxto8PdLxc/edit?slide=id.g32c5831c733_0_368#slide=id.g32c5831c733_0...
To dig deeper go watch Sasha Rush's video on alternat...
13,905 views2023-12-14
To dig deeper go watch Sasha Rush's video on alternatives to attention: https://youtu.be/dKJEpOtVgXc?si=Lx94-51PsjGF-YZT Dig deeper with the...
Reply to @canutten1 Deep W with Atari Breakout #datas...
13,900 views2022-04-18
Reply to @canutten1 Deep W with Atari Breakout #datascience #reinforcementlearning #techtok #machinelearning
Grok 4 proves that scaling still delivers — trained w...
13,801 views2025-07-11
Grok 4 proves that scaling still delivers — trained with 100× more compute, it leads on Humanity’s Last Exam, ARC‑AGI, and tool-use benchmar...
Using LangChain with GPT3. I am seeing lots of cool d...
13,800 views2023-01-14
Using LangChain with GPT3. I am seeing lots of cool demos based on LangChain and needed to make I covered it. It’s an easy way to take advan...
When analyzing improvements in AI always take a look ...
13,757 views2023-11-05
When analyzing improvements in AI always take a look at the ablation studies. An important part is making sure the compute was held the same...
Bias in Generative AI. This post is based on a blog p...
13,613 views2023-05-19
Bias in Generative AI. This post is based on a blog post by text.io on bias in generative AI using an example of job postings. A great remin...
Curse of dimensionality reminds us to think carefully...
13,498 views2024-02-11
Curse of dimensionality reminds us to think carefully about feature selection. More isn’t always better. Use a feature selection curve. #d...
tiktok e182dd28dcee4103a056c2db6cbed6c7466e6f25
13,494 views2025-09-06
Visual Question/Answering with Document AI #datascien...
13,400 views2022-09-22
Visual Question/Answering with Document AI #datascience #analytics #codetok #huggingface #documentai
OpenAI announced their new deprecation policy and it'...
13,351 views2023-07-07
OpenAI announced their new deprecation policy and it's going to affect people who are using OpenAI's models in production. They will have to...
Tensorflow fans are probably seething since they were...
13,100 views2022-12-10
Tensorflow fans are probably seething since they were first and ignored. All good and will be easy for pytorch users to take advantage of m...
Don't let people overlook open source software. It mi...
12,932 views2024-01-19
Don't let people overlook open source software. It might be free but it's priceless. The Value of Open Source Software at https://papers.ssr...
Replying to @Rajiv Shah | data science & AI Llama-2 d...
12,800 views2023-07-22
Replying to @Rajiv Shah | data science & AI Llama-2 deep dive going through the paper by Meta. This is a 10-minute video but it still skips ...
OpenAI released GPT-4o mini. Let's look at the perfor...
12,800 views2024-07-19
OpenAI released GPT-4o mini. Let's look at the performance and cost of the model. We also assess how this affects competitors and the contin...
Solid forecasting advice and proved out in the M5 for...
12,705 views2024-03-03
Solid forecasting advice and proved out in the M5 forecasting competition. Start with simple baselines and statistical approaches and then a...
ChatGPT price drop. Let’s break down how much the pri...
12,700 views2023-03-02
ChatGPT price drop. Let’s break down how much the price dropped, how OpenAI could drop the price, the effects on performance, what is going ...
Diving into how Whisper v3 was trained. OpenAI used a...
12,468 views2023-11-07
Diving into how Whisper v3 was trained. OpenAI used a combination of weak learning and pseudo-labeling. #whisper #openai #rajistics Whisper:...
Uncensored models are here. Eric Hartford has been bu...
12,376 views2023-05-25
Uncensored models are here. Eric Hartford has been building the WizardLM series of models and sharing how he has been training the models. T...
LangChain added a new agent Plan and Execute. Looking...
12,375 views2023-05-14
LangChain added a new agent Plan and Execute. Looking forward to the more advanced use cases people will build with it. This was inspired by...
Saving you a trip to Twitter. #dataengineering #datab...
12,300 views2022-11-24
Saving you a trip to Twitter. #dataengineering #databases There is one big vendor left out. Probably get sued for leaving them out.
Earlier videos: @rajistics @rajistics #deeplearning #...
12,300 views2022-03-04
Earlier videos: @rajistics @rajistics #deeplearning #tensorflow #datascience #analytics
Using agents in langchain with gpt-3. You can do this...
12,291 views2023-03-04
Using agents in langchain with gpt-3. You can do this! Go check it out. #datascience #machinelearning #openai #gpt3 #langchain
Learn Regex, it will pay off #regex #datascience #pro...
12,200 views2022-03-26
Learn Regex, it will pay off #regex #datascience #programming #analysis
A new LLM focused on data annotation and labeling bea...
12,056 views2023-10-19
A new LLM focused on data annotation and labeling beats GPT4. It's built from Llama 13B and will be open source. #datascience #machinelearni...
Mixtral is a new model using a mixture of experts (Mo...
12,020 views2023-12-09
Mixtral is a new model using a mixture of experts (MoE) approach. It consists of 8x7B mistral models. It was pre-released on Friday look for...
ChatGPT for Robotics is the latest hot paper. Large l...
12,000 views2023-02-22
ChatGPT for Robotics is the latest hot paper. Large language models are the future interface. #datascience #machinelearning #largelanguagemo...
OpenAI AI classifier is a great example to remind peo...
12,000 views2023-02-04
OpenAI AI classifier is a great example to remind people of the limitations when detecting rare events. It’s not intuitive, so I showed the ...
Models that cheat, take shortcuts, and leak informati...
11,900 views2023-01-03
Models that cheat, take shortcuts, and leak information are all part of the data scientist life style. Ever my data scientist has a story li...
An experiment studying how well GPT4 can plan by usin...
11,845 views2023-08-13
An experiment studying how well GPT4 can plan by using Block World and Mystery World. #largelanguagemodels #gpt4 #aiplanning #blockworld #my...
Temperaure is an important parameter when working wit...
11,800 views2023-03-22
Temperaure is an important parameter when working with many models including got-3. This video gives a short background on temperature and t...
CLIP Interrogator is available over at the hugging fa...
11,800 views2022-10-25
CLIP Interrogator is available over at the hugging face spaces. Have fun! #datascience #machinelearning #stablediffusion #huggingface
What makes GPT-4 so special? One big part is the use ...
11,617 views2023-07-08
What makes GPT-4 so special? One big part is the use of a Mixture of Experts approach Let's start with how Galton used the wisdom of the cro...
Code Interpreter is out and it's pretty amazing at fi...
11,500 views2023-07-12
Code Interpreter is out and it's pretty amazing at first glance. However more experienced software developers and people concerned about dat...
Software licensing #github #codetok #gpl #programming...
11,500 views2022-05-10
Software licensing #github #codetok #gpl #programming #python #creativecommons #copyright
Japan said it was acceptable to use copyrighted mater...
11,495 views2023-06-01
Japan said it was acceptable to use copyrighted material such as text and images to train AI. This has the approach of United States and oth...
It pays to be organized. Find a friendly data enginee...
11,400 views2022-10-06
It pays to be organized. Find a friendly data engineer if you need to. #datascience #analytics
Running large language models and transformer models ...
11,301 views2023-09-05
Running large language models and transformer models locally in web browsers. Lot's of tools for doing this including mlc.ai transformers.js...
The best way to learning data science is working with...
11,254 views2023-12-17
The best way to learning data science is working with data. You don’t need to spend money on courses or books. Spending time doing useful ...
Yifan Zhao reverse-engineered Claude Code and uncover...
11,238 views2025-08-31
Yifan Zhao reverse-engineered Claude Code and uncovered that its secret isn’t hidden logic, but a sophisticated stack of prompts. By interce...
YOLO is a seminal model in object detection for compu...
11,200 views2024-09-13
YOLO is a seminal model in object detection for computer vision. But what is even more interesting is the principal author, Joseph Redmon an...
This post was based on great stuff on Twitter, especi...
11,200 views2022-12-01
This post was based on great stuff on Twitter, especially Ben’s Bites. I wanted to show the chat output, so wasn’t able to keep the original...
Learn about foundational models, especially in #nlp #...
11,100 views2022-04-23
Learn about foundational models, especially in #nlp #naturallanguageprocessing #datascience #deeplearning #analytics #techtok #openai
Replying to @chokokrem Best machine learning tools fo...
11,000 views2023-03-10
Replying to @chokokrem Best machine learning tools for competitions. Lots of great stuff here. #datascience #machinelearning #python #codeto...
Dolly from Databricks is an open source fine tuned in...
10,928 views2023-04-15
Dolly from Databricks is an open source fine tuned instruction large language model that can be used for commercial uses! Databricks has tak...
Replying to XYZ A quick tutorial using WizMap to visu...
10,904 views2023-07-03
Replying to XYZ A quick tutorial using WizMap to visualize embeddings. The process is extracting your embeddings using dimensionality reduct...
Reminder to visualize your data with one of my favori...
10,900 views2022-10-29
Reminder to visualize your data with one of my favorites #anscombesquartet #datavisualization #datascience #statistics
Reinforcement learning with my Eat Melon! Demo This d...
10,852 views2023-08-30
Reinforcement learning with my Eat Melon! Demo This demo is based on Karpathy's work. Link: https://bit.ly/raj_eatmelon #datascience #reinfo...
Why you should use group partitioning #datascience #m...
10,800 views2022-06-02
Why you should use group partitioning #datascience #machinelearning #statistics #codetok #deeplearning #andrewng
Histograms are a great visualization tool. Here are s...
10,759 views2023-02-09
Histograms are a great visualization tool. Here are some caveats and tips for using histograms. #datascience #statistics #datavisualization ...
GPT-4 showing amazing results in casual reasoning. Fo...
10,742 views2023-05-07
GPT-4 showing amazing results in casual reasoning. For practical purposes experiments are more useful than causal modeling. However this pap...
Nat.dev playground is awesome. Should be a great remi...
10,700 views2023-03-10
Nat.dev playground is awesome. Should be a great reminder of the diversity of large language models. #datascience #machinelearning #largelan...
Reply to @declinedher being above average. I will add...
10,600 views2022-02-10
Reply to @declinedher being above average. I will add citation in the comments. #statistics #regressiontothemean #aboveaverage
Feature engineering is an important part of the machi...
10,510 views2024-02-24
Feature engineering is an important part of the machine learning lifecycle. It’s part art and skill. It takes time to learn and the best d...
Replying to @anansaadi OpenAssistant is an open sourc...
10,500 views2023-02-19
Replying to @anansaadi OpenAssistant is an open source project that aims to provide a chat based assistant that connects to other sources of...
How are you using similarity search? #nearestneighbor...
10,500 views2022-06-26
How are you using similarity search? #nearestneighbor #annoy #spotify #datascience #statistics #codetok #python #similaritysearch
The power of prompting! How to use a general purpose ...
10,453 views2024-12-01
The power of prompting! How to use a general purpose model to be a special purpose fine tuned model. It’s really important to learn good pro...
Working with small datasets. Several tips including u...
10,401 views2023-03-06
Working with small datasets. Several tips including using crossvalidation, models like lasso, and running multiple interations with differen...
#greenscreenvideo Jealous. Go see how bad Meta bungle...
10,400 views2023-01-28
#greenscreenvideo Jealous. Go see how bad Meta bungled their chatbot @rajistics
It’s almost here. Full support for pandas in sklearn ...
10,400 views2022-10-18
It’s almost here. Full support for pandas in sklearn pipelines. #machinelearning #datascience #codetok #python #sklearn #sci-kit
OpenAI's new models look great and incorporate the la...
10,373 views2024-01-28
OpenAI's new models look great and incorporate the latest advances. But don't forget about the open source as well as some tips for thinking...
Accuracy versus Interpretability/Explainability is a ...
10,297 views2023-08-08
Accuracy versus Interpretability/Explainability is a typical tradeoff in machine learning. Depending on your use case you may favor one over...
Animated Drawings is really fun model from Meta. It c...
10,276 views2023-04-14
Animated Drawings is really fun model from Meta. It can take a sketch drawing and then animate it. Great example of combining several image ...
Updated! I am an idiot - This video explains how Mode...
10,202 views2025-04-04
Updated! I am an idiot - This video explains how Model Context Protocol (MCP) allows language models like Claude to interact with external t...
Short summary of my linger video on effieciently trai...
10,200 views2023-03-27
Short summary of my linger video on effieciently training a latge language model using PEFT and LoRA. #datascience #machinelearning #largela...
ChatGPT with the Code Interpreter can do a lot of com...
10,199 views2023-09-29
ChatGPT with the Code Interpreter can do a lot of common data science tasks. We are going to see more tools help with routine data science t...
When you build a synthetic dataset you know where the...
10,148 views2024-01-25
When you build a synthetic dataset you know where the noise is and where the signal is. This lets you better assess techniques for feature s...
Some tips for deploying large language models like Ll...
10,129 views2023-07-30
Some tips for deploying large language models like Llama. Start by building some benchmarks for your tasks to assess how your model performs...
To build generative AI models like the text-to-SQL sy...
10,104 views2024-03-25
To build generative AI models like the text-to-SQL system by Snowflake it is important to create a realistic and challenging training datase...
Claude 3 and lots of unbelievable claims. Let’s wal...
10,035 views2024-03-07
Claude 3 and lots of unbelievable claims. Let’s walk through some of the more viral reactions and explain what is going on. We also need t...
Thinking about the size of numbers becomes important ...
10,005 views2023-05-16
Thinking about the size of numbers becomes important when working with neural networks. This video touches about different techniques like u...
Picking a GPU for deep learning based on Tim Dettmers...
9,997 views2023-01-16
Picking a GPU for deep learning based on Tim Dettmers classic blog post. #datascience #machinelearning #deeplearning #gpu
Deep dive on how to improve large language models. I ...
9,988 views2023-04-28
Deep dive on how to improve large language models. I provide an introduction to zero-shot and few-shot learning methods. I also discuss the ...
This is a year old but still holds up pretty well. Th...
9,977 views2024-03-27
This is a year old but still holds up pretty well. The big difference is you may want to use TRL instead of PEFT for the training. But the c...
Reinforcement learning with my Eat Melon! Demo This ...
9,966 views2024-04-23
Reinforcement learning with my Eat Melon! Demo This demo is based on Karpathy's work. Link: https://bit.ly/raj_eatmelon #datascience #reinf...
Llama really upping it on training data. But this is ...
9,950 views2024-04-26
Llama really upping it on training data. But this is a trend with scaling laws to use more and more training data. #trainingdata #largelang...
Parquet and Arrow file formats #datascience #analytic...
9,943 views2022-05-31
Parquet and Arrow file formats #datascience #analytics #bigdata #codetok #dataengineer
How companies your data for training models will be a...
9,883 views2023-01-20
How companies your data for training models will be a big issue this year. GitHub is being sued for Copilot and Hugging Face has been buildi...
AI works with various data types: tabular unstructure...
9,874 views2024-03-16
AI works with various data types: tabular unstructured and semi-structured like JSON. While tabular data is most prevalent in enterprises Ge...
OpenAI plugins! Lets get everyones APIs working with ...
9,862 views2023-03-24
OpenAI plugins! Lets get everyones APIs working with LLMs! This isa good thing. #largelanguagemodels #langchain #openai #datascience #machin...
Reply to @sqwadiladida resources for learning about t...
9,762 views2022-04-23
Reply to @sqwadiladida resources for learning about transformer models in #naturallanguageprocessing #datascience #techtok #statistics #anal...
#onthisday showing the map of stable diffusion. #data...
9,730 views2023-09-15
#onthisday showing the map of stable diffusion. #datascience #machinelearning #stablediffusion #rajistics
Some great tips from Charlie over at Replicate on usi...
9,609 views2023-08-15
Some great tips from Charlie over at Replicate on using Llama 2. A guide to prompting Llama 2 - https://replicate.com/blog/how-to-prompt-lla...
Do you calibrate your models? For many types of model...
9,495 views2023-11-26
Do you calibrate your models? For many types of models you may need to calibrate them. This video reminds us of the importance of calibratio...
Using agents in langchain with gpt-3. You can do this...
9,416 views2023-03-04
Using agents in langchain with gpt-3. You can do this! Go check it out. #datascience #machinelearning #openai #gpt3 #langchain
ColPali: Efficient Document Retrieval with Vision Lan...
9,380 views2024-10-10
ColPali: Efficient Document Retrieval with Vision Language Models - https://arxiv.org/abs/2407.01449 - https://github.com/illuin-tech/colpal...
My data science setup for now #datascience #codetok #...
9,319 views2022-08-20
My data science setup for now #datascience #codetok #python #rstats #posit #vscode #googlecolab #digitalocean #conda
Hugging Face #reinforcementlearning class #datascienc...
9,150 views2022-04-26
Hugging Face #reinforcementlearning class #datascience #techtok #deeplearning #python
The feature or variables in auto insurance models. Le...
9,127 views2024-02-13
The feature or variables in auto insurance models. Learn from insurance good features can give you a lot of predictive power. #datascience #...
Working with small datasets. Several tips including u...
9,102 views2023-03-07
Working with small datasets. Several tips including using crossvalidation, models like lasso, and running multiple interations with differen...
Reviewing Anthropics latest research and OpenAI conti...
9,040 views2024-05-23
Reviewing Anthropics latest research and OpenAI continuing to fumble. Mapping the Mind of a Large Language Model: https://www.anthropic.com/...
Introducing myself, like a year too late. Hope this f...
8,964 views2022-11-28
Introducing myself, like a year too late. Hope this fills the gaps around this channel.
I posted this on LinkedIn today, but wanted to share ...
8,896 views2023-01-27
I posted this on LinkedIn today, but wanted to share here. GTP-3 is powerful, but sometimes domain specific models are going to do better. P...
Why do transformers lock onto “meaningless” tokens li...
8,872 views2025-10-24
Why do transformers lock onto “meaningless” tokens like [BOS] or punctuation? When one token’s activation spikes thousands of times higher, ...
Prompt sensitivity is a thing. This video covers how ...
8,830 views2024-01-11
Prompt sensitivity is a thing. This video covers how changes in formatting the persuasion used in prompts and prompt injection attacks are a...
Three major improvements to the transformer architect...
8,702 views2023-07-29
Three major improvements to the transformer architecture that everyone should know. They include Fast Attention Rotary Positional Embeddings...
Always have a baseline model. For time series, you ca...
8,644 views2022-10-09
Always have a baseline model. For time series, you can often compare to what happened in a previous time step, like last week. There are err...
Reinforcement Learning with AI Feedback (RLAIF) is an...
8,618 views2023-09-07
Reinforcement Learning with AI Feedback (RLAIF) is an emerging approach to replace Reinforcement Learning with Human Feedback (RLHF). It wor...
State-of-the-art results (100%!!) on widely used acad...
8,607 views2023-09-25
State-of-the-art results (100%!!) on widely used academic benchmarks (MMLU GSM8K HumanEval OpenbookQA ARC Challenge etc.). The model called ...
Reply to @midnightlibrarian #datasaurus #stats #dinos...
8,570 views2022-01-16
Reply to @midnightlibrarian #datasaurus #stats #dinosaur #analytics explaining #anscombe
AI News Update: Meta - Major spending on AI and conti...
8,449 views2024-01-20
AI News Update: Meta - Major spending on AI and continuing lawsuits over child safety NVIDIA - Stock at high, 1.5 Trillion market cap Google...
Some of my favorite machine learning visualizations. ...
8,389 views2024-05-05
Some of my favorite machine learning visualizations. Check them out to better understand how these algorithms work. If you work closely with...
Automating machine learning with Large Language Model...
8,388 views2023-05-01
Automating machine learning with Large Language Models (LLMs). While it's possible to ask ChatGPT to provide code for building a prediction ...
Videos with stable diffusion #datascience #machinelea...
8,327 views2022-09-07
Videos with stable diffusion #datascience #machinelearning #stablediffusion #codetok
NanoGPT is a simple fast repository for training/fine...
8,320 views2023-08-20
NanoGPT is a simple fast repository for training/finetuning medium-sized GPTs. I recommend it for getting a deeper understanding of large la...
Back! Time for AI on images. #datascience #computer...
8,284 views2022-07-12
Back! Time for AI on images. #datascience #computervision #objectdetection #yolo #machinelearning #codetok
Is RAG Actually Broken? A recent “semantic collapse” ...
8,280 views2026-01-01
Is RAG Actually Broken? A recent “semantic collapse” claim argues that embeddings fail at scale because distances compress in high dimension...
Try out these examples for yourself and lots more ava...
8,266 views2023-01-31
Try out these examples for yourself and lots more available. It’s scary cool how these models are working. #datascience #machinelearning #gp...
Using scaling laws to help us getter smaller models w...
8,252 views2023-04-13
Using scaling laws to help us getter smaller models with the same accuracy! Based on blog post by de Vries. #datascience #machinelearning #l...
Non-deterministic LLM inference is a deal.OpenAI has ...
8,182 views2023-11-14
Non-deterministic LLM inference is a deal.OpenAI has started offering it hoping the rest of the providers will also offer it for enterprise ...
It’s happened! Time series #datascience #timeseries ...
8,143 views2022-03-19
It’s happened! Time series #datascience #timeseries #analytics #statistics
Meta’s Cicero for playing Diplomacy is impressive and...
8,137 views2022-11-23
Meta’s Cicero for playing Diplomacy is impressive and a bit scary. #statistics #datascience #machinelearning #diplomacy
SKlearn Playground #datascience #machinelearning #sta...
8,137 views2022-04-12
SKlearn Playground #datascience #machinelearning #statistics #techtok #sklearn
Llama 3 the beginning of the end? Or will GPT5 up-end...
8,048 views2024-04-21
Llama 3 the beginning of the end? Or will GPT5 up-end everything (they have had over a year)? A skit based on a thread by Carmen Gutierrez o...
I closely monitor technology trends in AI. Following ...
8,044 views2024-05-04
I closely monitor technology trends in AI. Following huge developments at the end of 2022 and throughout 2023, the Generative AI space is no...
Language Models like ChatGPT can be modified by sever...
7,674 views2023-04-08
Language Models like ChatGPT can be modified by several methods including Prompting Instruction Fine-Tuning and Reinforcement Learning with ...
Applying reinforcement learning to teaching AI math. ...
7,666 views2025-02-11
Applying reinforcement learning to teaching AI math. This is based off a notebook using Group Relative Policy Optimization (GRPO) on a QWEN ...
Meta released Llama Guard for content moderation. It ...
7,660 views2023-12-08
Meta released Llama Guard for content moderation. It looks to be effective and very adaptable. This is part of their Purple Llama project ar...
Spaces gives you great interactive demos of many popu...
7,645 views2023-04-24
Spaces gives you great interactive demos of many popular sklearn examples. It's a great place to browse and even contribute back by add more...
Models and datasets have specific definitions. Models...
7,626 views2023-04-29
Models and datasets have specific definitions. Models consist of at least two licenses nowadays this has been an issue for LLaMA where the c...
The skit explains a dynamic pricing strategy that use...
7,617 views2025-07-19
The skit explains a dynamic pricing strategy that uses machine learning to adjust prices based on what customers are willing to pay, rather ...
Breaking News: Executive Order on AI Quick video on t...
7,591 views2023-11-01
Breaking News: Executive Order on AI Quick video on the main issues there is a lot more in the Order. It is over a 100 pages. #executiveorde...
You know your transformer basics? Let's go over Enco...
7,563 views2024-12-29
You know your transformer basics? Let's go over Encoder, Encoder-Decoder, and Decoder only models. If you want to dig deeper into the trans...
Context length has grown in importance for large lang...
7,549 views2023-07-02
Context length has grown in importance for large language models. A longer context length lets you pass more information to the model effect...
Entropy can be a useful measure in machine learning. ...
7,477 views2024-04-27
Entropy can be a useful measure in machine learning. Entropy and information gain is used in building decision trees. I have also seen entro...
Audio spectrogram transformer shows how widely we can...
7,438 views2022-12-19
Audio spectrogram transformer shows how widely we can use #machinelearning #datascience #mlaudio #deeplearning
Prompt engineering helped optimize model behavior whe...
7,407 views2025-07-04
Prompt engineering helped optimize model behavior when LLMs were less capable. But as models have improved, gains from prompt tweaks have di...
Lets talk about why enterprises are considering alter...
7,393 views2023-03-18
Lets talk about why enterprises are considering alternatives to chatGPT by looking to open source. An open source strategy can affect lots o...
I have a lot more tea #datarobot #corporategreed #dat...
7,356 views2022-06-22
I have a lot more tea #datarobot #corporategreed #datascience #codetok #techtok
Automatic Speech recognition in 3 lines of code using...
7,335 views2022-11-17
Automatic Speech recognition in 3 lines of code using wav2vec2 in transformers #datascience #machinelearning #huggingface #automaticspeechre...
Word as Image - great use of generative AI models lik...
7,294 views2023-03-07
Word as Image - great use of generative AI models like stable diffusion to create fonts. Check out the paper at wordasimage.github.io #datas...
Retrieval Augmented approaches are a great way to imp...
7,270 views2023-04-04
Retrieval Augmented approaches are a great way to improve your LLMs. Deepset shown in this video provides a set of tools but there are many ...
MiniGPT-4 brings us a multimodal model! It consists o...
7,241 views2023-04-17
MiniGPT-4 brings us a multimodal model! It consists of a vision encoder with a pretrained ViT and and an advanced Vicuna large language mode...
Target leakage in the CrowdAI dataset. Target leakage...
7,231 views2023-04-10
Target leakage in the CrowdAI dataset. Target leakage is a very common problem and everyone should understand it. I have seen even the smart...
Trying to talk about AGI in a reasonable manner. Ther...
7,197 views2023-11-09
Trying to talk about AGI in a reasonable manner. There needs to be more hype and more rigor in talking about AGI. The Deepmind paper provide...
Open Source with Stable Diffusion - #datascience #cod...
7,176 views2022-08-27
Open Source with Stable Diffusion - #datascience #codetok #machinelearning #stablediffusion #opensourcesoftware
Mechanistic interpretability hands on! Try Monitor: h...
7,141 views2024-12-04
Mechanistic interpretability hands on! Try Monitor: https://monitor.transluce.org/dashboard Monitor writeup: https://transluce.org/observabi...
Statistics sounds heavy but a lot of concepts are ver...
7,136 views2024-01-07
Statistics sounds heavy but a lot of concepts are very useful and can save you a lot of effort. This video is reminder of the many ways we u...
Anomaly detection is hard. This is an introduction to...
7,125 views2023-09-26
Anomaly detection is hard. This is an introduction to anomaly detection algorithms. The video focuses on the results for ADBench and what da...
Replying to @rajistics as promised, the feature or va...
7,107 views2023-02-12
Replying to @rajistics as promised, the feature or variables in auto insurance models. Keep the feedback coming. #datascience #machinelearni...
DePlot translates plots into readable tables that an ...
7,104 views2023-05-03
DePlot translates plots into readable tables that an LLM can query. It's based on the MatCha architecture with more fine-tuning on plots. Ni...
ImageBind the first AI model capable of binding data ...
7,095 views2023-05-11
ImageBind the first AI model capable of binding data from six modalities at once without the need for explicit supervision. It recognizes th...
Active learning uses an algorithm to help select what...
7,081 views2023-05-18
Active learning uses an algorithm to help select what data to label. Ideally using this approach people can get comparable model results usi...
The politics of ChatGPT, it’s no different than any o...
7,028 views2022-12-27
The politics of ChatGPT, it’s no different than any other technology and is not neutral. If you want a simple explanation of how ChatGTP wor...
Replying to @Data Storyteller Here are two examples ...
7,026 views2022-07-22
Replying to @Data Storyteller Here are two examples of data or target leakage. I bet people have other fun examples. #datascience #targetle...
If you want more details on the biggest advancements ...
6,983 views2023-12-22
If you want more details on the biggest advancements in AI for 2023 then find me on LinkedIn or Threads where I have a detailed post with al...
Feature engineering and data preprocessing are an imp...
6,980 views2023-02-27
Feature engineering and data preprocessing are an important part of the machine learning process. #datascience #machinelearning #featureengi...
Working with Categorical data using ordinal one hot (...
6,970 views2023-12-01
Working with Categorical data using ordinal one hot (dummy) and target encoding. Do you have your own favorite approach? And ChatGPT tells m...
It happens. Be careful. #aws #datascience #deeplearni...
6,940 views2022-03-30
It happens. Be careful. #aws #datascience #deeplearning #gpu
Cursor’s new Tab-RL model uses reinforcement learning...
6,923 views2025-09-13
Cursor’s new Tab-RL model uses reinforcement learning from real user feedback, rolling out checkpoints multiple times per day across 400M+ p...
TabPFN revolution in data science. Please don’t your ...
6,898 views2022-10-22
TabPFN revolution in data science. Please don’t your time on all this hype. Every week there is a revolution announced on Twitter. Ignore it...
My creator hero just released a great new book and we...
6,858 views2024-09-07
My creator hero just released a great new book and website. It's an excellent way to learn programming using JavaScript and build some very ...
Requested video - DSPy DSPy brings a systematic appro...
6,848 views2024-06-15
Requested video - DSPy DSPy brings a systematic approach to prompting that gives you better-designed workflows while also optimizing prompts...
Should you take the time to learn Kubernetes as a dat...
6,839 views2023-01-23
Should you take the time to learn Kubernetes as a data scientist? Or you already overloaded learning data science? #datascience #machinelear...
Do you have a missing data story? Missing data happen...
6,829 views2023-11-24
Do you have a missing data story? Missing data happens all the time. Should you just accept it? Drop rows? Use Imputation? or Keep digging? ...
Axolotl provides a declarative approach to fine tunin...
6,798 views2024-01-22
Axolotl provides a declarative approach to fine tuning large language models. It's very easy to get started with and much easier for folks n...
Data visualization tips #datascience #dataviz #analyt...
6,784 views2022-03-21
Data visualization tips #datascience #dataviz #analytics #datavisualization
Deep dive video on using explanations that could out ...
6,747 views2024-01-31
Deep dive video on using explanations that could out of large language models. This is something that is understudied but I find it quite us...
Synthetic datasets have given me a way to better unde...
6,739 views2023-01-25
Synthetic datasets have given me a way to better understand how to do feature selection and model explainability. Try it out sometime. #data...
This video had aged well. Models are very useful and ...
6,737 views2024-05-21
This video had aged well. Models are very useful and widely used for labeling data and generating data.
Replika and the growth of these character chatbots or...
6,722 views2023-02-14
Replika and the growth of these character chatbots or socialbots is emerging as a big use case within generative AI. Here is a recent contro...
Replying to @jbfjhcfv plotly is a great package for f...
6,714 views2022-10-05
Replying to @jbfjhcfv plotly is a great package for folks using R or Python. It’s open source, so anyone can use it. #datascience #visualiza...
Best practices for prompting is emerging. A couple of...
6,683 views2023-04-30
Best practices for prompting is emerging. A couple of simple rules is starting with a API based LLM and focus on building good prompts. This...
Using LangChain with GPT3. I am seeing lots of cool d...
6,679 views2023-01-14
Using LangChain with GPT3. I am seeing lots of cool demos based on LangChain and needed to make I covered it. It’s an easy way to take adv...
Deciding whether to use a Large Language Model or a s...
6,656 views2023-06-02
Deciding whether to use a Large Language Model or a smaller model? This video explores the tradeoffs between both approaches based on the la...
Three new multimodal models this week but only one re...
6,653 views2023-10-05
Three new multimodal models this week but only one respects data scientists. Once again it's Meta doing it right. #machinelearning #multimod...
In this video, I cover how researchers from Alibaba u...
6,623 views2025-06-30
In this video, I cover how researchers from Alibaba used supervised fine-tuning and reinforcement learning (GRPO) to improve workflow genera...
Speculating on GPT-4 size and performance. #datascien...
6,598 views2023-02-21
Speculating on GPT-4 size and performance. #datascience #machinelearning #gpt3 #gpt4 #openai see scaling law video: @rajistics
Quick intro to spacy, which is a standard tool for pe...
6,543 views2022-10-08
Quick intro to spacy, which is a standard tool for people doing natural language processing #nlp or text analytics. Not my best video, buts ...
Hugging Face announced a new valuation of $4.5 billio...
6,507 views2023-08-24
Hugging Face announced a new valuation of $4.5 billion! #datascience #machinelearning #huggingface
Beam search is an alternative way for LLMs to generat...
6,429 views2024-03-30
Beam search is an alternative way for LLMs to generate text. Let's walk through how beam search compares to greedy search. Alternatives incl...
Replying to @petererickson.art This was tough, a lot ...
6,417 views2022-09-10
Replying to @petererickson.art This was tough, a lot of ground to cover. Let me know what I messed up on. I also have related videos on embe...
4 ways to do Dimensionality Reduction - PCA, Autoenco...
6,394 views2024-11-10
4 ways to do Dimensionality Reduction - PCA, Autoencoders, TSNE, and UMAP Lots of reasons to do dimensionality reduction - you want to comp...
What kind are you? #datascience #statistics #python ...
6,376 views2022-06-24
What kind are you? #datascience #statistics #python #codetok #mltok #practicaldatascience
Amazon shared a new dataset with human-written long-f...
6,336 views2025-01-13
Amazon shared a new dataset with human-written long-form answers across 7 domains for assessing LLM performance in retrieval-augmented QA. I...
Examining the data used for training our our LLMs. Op...
6,307 views2023-04-20
Examining the data used for training our our LLMs. OpenAI is running into trouble in Europe since it won't disclose exactly what was used fo...
Roundup of this weeks news, let me know if you all li...
6,295 views2023-02-10
Roundup of this weeks news, let me know if you all like this format. I had a lot of fun making this. #datascience #machinelearning #dumbtech...
Random forests and their ease of use are important in...
6,244 views2023-02-18
Random forests and their ease of use are important in understanding modern data science. #datascience #machinelearning #statistics #randomfo...
What if you could build a research assistant that bea...
6,229 views2025-11-21
What if you could build a research assistant that beat top models for $0.01 a query? Meet DR Tulu-8B, powered by a training method called RL...
Reply to @bosstoastmaker tensorflow playground data ...
6,213 views2022-02-05
Reply to @bosstoastmaker tensorflow playground data > models #featureengineering #datascience #tensorflow #deeplearning #analytics #ai
Getting the best distance metric is crucial for solvi...
6,179 views2023-10-15
Getting the best distance metric is crucial for solving analytical problems. This video reviews Euclidean Manhattan Mahabolobis Levenshtein ...
Repost but scaling laws are still very important. Sca...
6,173 views2024-01-09
Repost but scaling laws are still very important. Scaling laws help us figure out how to manage the amount of training data versus the model...
How LLMs memorize information! Check out the Starcode...
6,162 views2023-10-17
How LLMs memorize information! Check out the Starcoder Memorization space by Mithril Security and the notebook so you can look for LLM memor...
My second try to explain in context learning or few s...
6,160 views2023-01-28
My second try to explain in context learning or few shot learning with large language models. It’s very cool and why these models are so ex...
Making efficient use of GPU Memory when training tran...
6,149 views2023-06-07
Making efficient use of GPU Memory when training transformer models. This video covers the Kernel Overhead Optimizer states Activation memor...
Meta’s less than open source model and some bad takes...
6,125 views2023-03-05
Meta’s less than open source model and some bad takes from Twitter. #datascience #machinelearning #largelanguagemodels #opensource #meta
The current FTC leader Khan is willing to confront la...
6,124 views2023-07-14
The current FTC leader Khan is willing to confront large tech companies about uncompetitive practices. There is a long history of abuses by ...
This will be fun! #python #codetok #datascience #prog...
6,046 views2022-05-02
This will be fun! #python #codetok #datascience #programming
Google announced Bard, but we still don’t know much. ...
6,006 views2023-02-06
Google announced Bard, but we still don’t know much. It has been based on Lambda which has been around for a while. This is a safe bet, not ...
Zero-shot object detection. #datascience #codetok #hu...
5,968 views2022-08-09
Zero-shot object detection. #datascience #codetok #huggingface #objectdetection #deeplearning #zeroshotclassification
Never underestimate the power of the status quo #data...
5,961 views2022-06-21
Never underestimate the power of the status quo #datascience #forecasting #statistics #SAS #python #codetok
Visualizing decision trees with dtreeviz. #datascienc...
5,948 views2022-12-28
Visualizing decision trees with dtreeviz. #datascience #machinelearning check out their GitHub and it’s pip install dtreeviz
Waterfall charts that show your progress as well as e...
5,930 views2023-11-14
Waterfall charts that show your progress as well as explaining the error! This is what I like to see when I see a visualization of model res...
Sports! #datascience #analytics #codetok #machinelear...
5,924 views2022-08-16
Sports! #datascience #analytics #codetok #machinelearning #rstats #footballanalytics #statistics
Do it! Get a server in the cloud. Build your skills....
5,900 views2022-03-27
Do it! Get a server in the cloud. Build your skills. #datascience #programming #analytics #digitalocean
Evaluation of Large Language Models is a critical top...
5,875 views2023-08-28
Evaluation of Large Language Models is a critical topic. Leaderboards provide little guidance for evaluation but have many flaws. I am very ...
Week 1. #reinforcementlearning #huggingface #datascie...
5,860 views2022-05-12
Week 1. #reinforcementlearning #huggingface #datascience #python #codetok #programming
Corporate research labs have changed academic work wi...
5,831 views2023-01-28
Corporate research labs have changed academic work with their reluctance to provide reproducible research and getting around blind peer revi...
Galactica by meta. Cool model, poor form on sharing i...
5,767 views2022-11-17
Galactica by meta. Cool model, poor form on sharing it out. #datascience #machinelearning I feel for students, it was going to write a lot ...
Reinforcement learning with my Eat Melon! Demo This ...
5,747 views2024-08-30
Reinforcement learning with my Eat Melon! Demo This demo is based on Karpathy's work. Link: https://bit.ly/raj_eatmelon #datascience #reinf...
Starting to see people productionizing GPT-3 workflow...
5,728 views2023-03-11
Starting to see people productionizing GPT-3 workflows. I am a bug fan of using large language midels. Here is how one data science dealt wi...
Async is the difference between waiting… and working....
5,725 views2025-11-22
Async is the difference between waiting… and working. It lets multiple I/O-bound tasks run concurrently on a single thread. When one request...
Trying to conserve tokens? Here are two approaches ma...
5,714 views2025-11-08
Trying to conserve tokens? Here are two approaches making waves right now. TOON cuts down on repeated syntax in structured data by replacing...
Don’t do analysis for the sake of analysis. Your anal...
5,705 views2022-02-22
Don’t do analysis for the sake of analysis. Your analysis should be synced with a business objective. #datascience #analysis #dataanalyst
AI only knows what's it's trained on. So beat it by d...
5,704 views2023-02-21
AI only knows what's it's trained on. So beat it by doing something new. The video shows recent examples of marines beating a surveillance s...
RAG systems don’t know what’s sensitive — unless you ...
5,688 views2025-07-05
RAG systems don’t know what’s sensitive — unless you tell them. Let’s talk about why access control is essential in Retrieval-Augmented Gene...
This video focuses on the difference between Word2Vec...
5,687 views2025-06-14
This video focuses on the difference between Word2Vec, standard Transformers and Sentence Transformers for creating document embeddings. It ...
SpeechT5 audio models getting added to transformers. ...
5,659 views2023-02-08
SpeechT5 audio models getting added to transformers. #datascience #machinelearning #huggingface #speecht5 #speechmodels #audiomodels
So what's inside those large language models? This vi...
5,655 views2023-06-08
So what's inside those large language models? This video explains the data pipeline for high-quality training data used in the latest LLMs l...
You might assume Vision-Language Models like Claude o...
5,647 views2025-10-30
You might assume Vision-Language Models like Claude or CLIP would crush defect detection. But on Amazon’s new Kaputt Dataset, the old-school...
The video covers Retrieval Augmented Generation (RAG)...
5,638 views2024-05-10
The video covers Retrieval Augmented Generation (RAG), a very popular approach for combining large language models and information retrieval...
Working with Categorical data using ordinal, one hot ...
5,638 views2022-09-17
Working with Categorical data using ordinal, one hot (dummy), and target encoding #datascience #statistics #analytics #featureengineering
SpeechT5 audio models getting added to transformers. ...
5,634 views2023-02-09
SpeechT5 audio models getting added to transformers. #datascience #machinelearning #huggingface #speecht5 #speechmodels #audiomodels
The history of data science. I have since learned to ...
5,624 views2024-02-18
The history of data science. I have since learned to make videos shorter and punchier.
#stitch with @debtcollective Marketing and PR. This ...
5,570 views2022-06-20
#stitch with @debtcollective Marketing and PR. This is a big topic and a lot of nuance isn’t in this video. Also relationships with academi...
Opus.ai very cool demo! If you want to build similar ...
5,561 views2023-04-05
Opus.ai very cool demo! If you want to build similar apps check out the text to code models. Santacoder is open source and they have shared ...
Interpreting stable diffusion #stabilitydiffusion #da...
5,537 views2022-09-16
Interpreting stable diffusion #stabilitydiffusion #datascience #codetok #machinelearning #texttoimage
Fine Tuning an Image Classifier on Indian Food Images
5,522 views2022-08-04
Fine Tuning an Image Classifier on Indian Food Images
#instructionfinetuning #rlhf #reinforcementlearning #...
5,506 views2024-04-11
#instructionfinetuning #rlhf #reinforcementlearning #pretrain Target leakage is a very common problem and everyone should understand it. Th...
Sports! #datascience #analytics #codetok #machinelear...
5,495 views2022-08-16
Sports! #datascience #analytics #codetok #machinelearning #rstats #footballanalytics #statistics
AI for other than productivity. Let's talk about how ...
5,481 views2023-09-28
AI for other than productivity. Let's talk about how people are really using AI. #datascience #machinelearning #rajistics #therapy Lilian We...
Critical question when framing out analytic questions...
5,467 views2022-09-13
Critical question when framing out analytic questions, since extrapolation has got me into trouble before. #datascience #analytics #codetok
With the writer's strike in the US this video reminds...
5,457 views2023-07-20
With the writer's strike in the US this video reminds us of the human and environmental costs of building AI. Three critical components for ...
Reply to @fondantlover datasaurus dozen howto #stats ...
5,447 views2022-01-17
Reply to @fondantlover datasaurus dozen howto #stats #anscombe #datascience #analytics
#xgboost short history. #datascience #statistics #mac...
5,427 views2022-05-14
#xgboost short history. #datascience #statistics #machinelearning #codetok
Dive into model error metrics! From simple mean error...
5,410 views2024-11-12
Dive into model error metrics! From simple mean error to Mean Squared Error (MSE) and Log Loss, let's see when you should use them. While MS...
No one actually knows what a data scientist does, tak...
5,338 views2022-12-07
No one actually knows what a data scientist does, take advantage of it.
ChatDoctor is a great example of fine tuning a large ...
5,320 views2023-03-22
ChatDoctor is a great example of fine tuning a large language model to get more factually correct output. This is an approach i expect many ...
Q* from OpenAI is getting the hype but let's focus on...
5,314 views2023-11-28
Q* from OpenAI is getting the hype but let's focus on the basics of their organization and the limitations of GPT-4 around planning. This vi...
Lets talk about how GPT-4 is going to affect enterpri...
5,289 views2023-04-01
Lets talk about how GPT-4 is going to affect enterprise analytics. My upcoming public talks: AI Summit in Montreal on April 20 & Arize AI ev...
I know the pain. But there are ways to make it easy ...
5,287 views2022-03-11
I know the pain. But there are ways to make it easy for people to use your code. #python #analysis #datascience
Document AI with LayoutLM #datascience #codetok #natu...
5,258 views2022-06-05
Document AI with LayoutLM #datascience #codetok #naturallanguageprocessing #layoutml #huggingface #🤗 #ocr #deeplearning #multimodal
Baseline models for time series.
5,255 views2024-10-09
Baseline models for time series.
Andrew Ng wrote recently on this no test set approach...
5,252 views2023-05-22
Andrew Ng wrote recently on this no test set approach that he is seeing when people are using prompt engineering. This is very different tha...
Examining the data used for training our our LLMs. Op...
5,241 views2023-04-20
Examining the data used for training our our LLMs. OpenAI is running into trouble in Europe since it won't disclose exactly what was used fo...
Data Quality in the AI Era - To learn more about this...
5,232 views2024-11-20
Data Quality in the AI Era - To learn more about this example, check out Hannaneh Hajishirzi - OLMo: Accelerating the Science of Language Mo...
Speed run - 8 minute video on 16 Challenges for using...
5,226 views2023-08-10
Speed run - 8 minute video on 16 Challenges for using large language models (LLMs) 1. Unfathomable Datasets 2. Tokenizer-Reliance 3. High Pr...
Anomaly detection is hard. This is an introduction to...
5,210 views2022-09-26
Anomaly detection is hard. This is an introduction to anomaly detection algorithms. The video focuses on the results for ADBench and what da...
How AIrbnb customer support is using generative AI. T...
5,175 views2023-01-16
How AIrbnb customer support is using generative AI. This is a great example of how @rajistics in context learning is growing and replacing t...
Singular value decomposition is one of many low rank ...
5,168 views2024-04-03
Singular value decomposition is one of many low rank methods when working with matrices. This video shares the intuition for why SVD matters...
Replying to @Davos What's the best algorithm? ü§î Th...
5,164 views2023-06-26
Replying to @Davos What's the best algorithm? ü§î There is no best algorithm! This is an excellent reminder of the free lunch theorem; no a...
Nat.dev playground is awesome. Should be a great remi...
5,144 views2023-03-10
Nat.dev playground is awesome. Should be a great reminder of the diversity of large language models. #datascience #machinelearning #largelan...
We all want to get paid. But just know you will end u...
5,128 views2022-08-19
We all want to get paid. But just know you will end up miserable. #datascience #codetok #analytics
How enterprises are dealing with ChatGPT it’s a prett...
5,090 views2023-02-05
How enterprises are dealing with ChatGPT it’s a pretty familiar cycle of grief. The good thing is it does open up lots of cool use cases. #...
In this video, I explain Jason Wei’s insight from his...
5,071 views2025-07-16
In this video, I explain Jason Wei’s insight from his recent blog post on the asymmetry of verification and what he calls Verifiers’ Law: AI...
Favorite tweet today. #statistics #datascience #codet...
5,071 views2022-05-19
Favorite tweet today. #statistics #datascience #codetok #machinelearning
This group is holding COLLIDE Data Conference on Octo...
5,046 views2023-08-30
This group is holding COLLIDE Data Conference on October 3-4 at Center Stage Theater üé∏üé≠ in the heart of midtown Atlanta Georgia. Regis...
Getting even bigger with all the new vision models
5,032 views2024-09-25
Getting even bigger with all the new vision models
Jokes explained - news in mid-May 2023 Google introdu...
5,010 views2023-05-12
Jokes explained - news in mid-May 2023 Google introduced Bard2 which performs on par with GPT3.5 and Claude from Anthropic. Google also anno...
#onthisday Time Series Decomposition is a great techn...
5,006 views2024-06-12
#onthisday Time Series Decomposition is a great technique for starting to understand a time series.
Are you GPU Poor? A deep dive into the state of GPUs ...
4,999 views2023-11-22
Are you GPU Poor? A deep dive into the state of GPUs based on the work of Dylan Patel of Semi Analysis. How are you coping with the lack of ...
Entropy can be a useful measure in machine learning. ...
4,992 views2023-04-27
Entropy can be a useful measure in machine learning. Entropy and information gain is used in building decision trees. I have also seen entro...
Crime seems easy to predict, but is super messy. #dat...
4,991 views2022-07-02
Crime seems easy to predict, but is super messy. #datascience #crimetok #chicago #statistics #crimonology #machinelearning #codetok #aisnake...
Uber’s FixRLeak system finds leaks with SonarQube, sc...
4,937 views2025-11-11
Uber’s FixRLeak system finds leaks with SonarQube, scopes them with Tree-sitter AST analysis, then lets GenAI safely patch only what it unde...
Is explainability important for you? #datascience #ex...
4,930 views2022-08-06
Is explainability important for you? #datascience #explainability #interpretability #statistics #codetalk #machinelearning
Grok 3 - The video explains how Grok 3's performance ...
4,899 views2025-02-23
Grok 3 - The video explains how Grok 3's performance claims rely on majority voting across 64 predictions (cons@64) rather than single predi...
Data drift analysis is a must for production workload...
4,848 views2023-03-13
Data drift analysis is a must for production workloads. Here is Uber’s D3 system fie automated drift analysis. This video covers types of da...
Using recursive feature elimination for feature selec...
4,836 views2025-02-21
Using recursive feature elimination for feature selection for machine learning.
Loss Functions - simple example of MAE versus RSME #d...
4,820 views2022-08-30
Loss Functions - simple example of MAE versus RSME #datascience #statistics #analytics #codetok #regression
NotebookLM - Convert your notes into a podcast Notebo...
4,808 views2024-10-30
NotebookLM - Convert your notes into a podcast NotebookLM: https://notebooklm.google/ Notebook LLama: https://github.com/meta-llama/llama-re...
Obliviate is now possible for LLMs. Microsoft researc...
4,788 views2023-10-07
Obliviate is now possible for LLMs. Microsoft researchers share an approach to get Large Language Models to unlearn information. #harrypotte...
GPT3.5 takes the bar exam with very little tuning. It...
4,775 views2022-12-30
GPT3.5 takes the bar exam with very little tuning. It does pretty well. #gpt #datascience #machinelearning #barexam #law
Cleaning data is such a pain. I remember having over ...
4,773 views2022-09-30
Cleaning data is such a pain. I remember having over 130+ unique combinations for US States in one project.
Reply to @shaggy335 #datascience #statistics #analyti...
4,768 views2022-05-03
Reply to @shaggy335 #datascience #statistics #analytics #techtok #machinelearning
What if Santa’s biggest problem this year is optimiza...
4,764 views2025-11-28
What if Santa’s biggest problem this year is optimization? Packing 200 Christmas tree toys into the smallest box is harder than it looks. Gr...
Mixing in a bit of law with the usual data science. L...
4,720 views2022-10-28
Mixing in a bit of law with the usual data science. Let me know if this is interesting or you waiting for the deep dive on dbscan clustering...
Text to Chart. It’s easier than ever to build great c...
4,705 views2023-02-15
Text to Chart. It’s easier than ever to build great charts using libraries like plotly or matplotlib. Are other people using ChatGPT for thi...
Stylometric analysis—specifically the detection of ov...
4,664 views2025-06-01
Stylometric analysis—specifically the detection of overused phrases known as "slop"—can reveal hidden changes in a language model's training...
Wrap up of current events going on with chat includin...
4,660 views2023-02-17
Wrap up of current events going on with chat including #openai #chatgpt #bing #amazon #datascience #machinelearning
DeepSeek-R1 didn’t copy human reasoning—it learned it...
4,654 views2025-09-19
DeepSeek-R1 didn’t copy human reasoning—it learned it. With pure RL (GRPO), it jumped from 15% to 80% on the AIME exam and began saying “wai...
Nvidia Prismer model for image captioning and zero sh...
4,653 views2023-03-15
Nvidia Prismer model for image captioning and zero shot visual question answering. It uses and ensemble or mixture of experts approach. #dat...
Try it out, link in comments. #huggingface #datascien...
4,640 views2022-06-19
Try it out, link in comments. #huggingface #datascience #reinforcementlearning #deeplearning #codetok #mltok Earlier weeks: @rajistics @raji...
History of the term regression and regression to the ...
4,633 views2022-02-08
History of the term regression and regression to the mean #statistics #datascience #galton #regression #heriditary
Deep dive into reasoning models. Notebook is freely a...
4,624 views2025-06-04
Deep dive into reasoning models. Notebook is freely available so go run it yourself. Notebook: https://github.com/rajshah4/LLM-Evaluation/b...
Data scientists will typically use regularization, wh...
4,611 views2022-11-12
Data scientists will typically use regularization, which means no p values. #machinelearning #datascience #statistics #pvalues
In this skit, a junior AI engineer tries to solve eve...
4,608 views2025-03-29
In this skit, a junior AI engineer tries to solve everything by giving the model more "thinking time" — but runs into the hard truth about v...
Reply to @bosstoastmaker shallow learning with tensor...
4,593 views2022-02-20
Reply to @bosstoastmaker shallow learning with tensorflow playground #datascience #tensorflow #python #machinelearning
Breaking down how advances in AI, from GPT to Veo 3 —...
4,566 views2025-05-24
Breaking down how advances in AI, from GPT to Veo 3 — owe their performance to massive, often ethically questionable datasets. It traces the...
This video discusses a Carnegie Mellon study comparin...
4,564 views2025-05-11
This video discusses a Carnegie Mellon study comparing prompt-based inference with fine-tuned large language models. The research found that...
Reply to @notryantaylor here it is without music. t...
4,539 views2022-02-11
Reply to @notryantaylor here it is without music. this is for my 4 kids who all text me that they are above average.
Thanks to barrnanas and AmplifyPartners
4,538 views2023-10-10
Thanks to barrnanas and AmplifyPartners
An Introduction to Dify.AI - A UI based tool for buil...
4,518 views2025-03-23
An Introduction to Dify.AI - A UI based tool for building Generative AI Agentic Workflows or Applications. I have a longer video on YT going...
Deep dive into Group Relative Policy Optimization (GR...
4,498 views2025-02-16
Deep dive into Group Relative Policy Optimization (GRPO), a Reinforcement Learning algorithm used by Deepseek in their R1 reasoning model. L...
Some common data distributions when modeling includin...
4,498 views2023-02-03
Some common data distributions when modeling including skewed and zero inflated. There are many other distributions, but just wanted people ...
#onthisday Showing the latent space for stable diffus...
4,495 views2023-09-10
#onthisday Showing the latent space for stable diffusion. #stablediffusion #datascience #machinelearning #codetok #umapêpravocê
Quantization used to be a post-training compromise, s...
4,478 views2025-11-16
Quantization used to be a post-training compromise, smaller and faster but at the cost of accuracy. Kimi K2-Thinking flips the script using ...
Highlighting some great work investigating basic beha...
4,478 views2025-02-08
Highlighting some great work investigating basic behavior of LLMs and finding issues with their reliability: Do Large Language Model Benchma...
Most people evaluate RAG the wrong way — just checkin...
4,467 views2025-03-30
Most people evaluate RAG the wrong way — just checking if the answer is “correct.” But great RAG needs to answer: how did it get there? In t...
Recommenders! I saw an internal presentation from Sim...
4,447 views2024-05-25
Recommenders! I saw an internal presentation from Simran on the effectiveness of implicit for a recommender system and wanted to share it. C...
AI researchers assumed more sensory data—like video—w...
4,435 views2025-06-27
AI researchers assumed more sensory data—like video—would lead to smarter, more reasoning-capable models. But it didn’t work. While video mo...
Labeling companies like Scale are hiring people to bu...
4,425 views2023-05-09
Labeling companies like Scale are hiring people to build and improve models based on these skills. By next year people in these fields shoul...
ggplot, matplotlib, plotly, and seaborn are what data...
4,423 views2022-10-03
ggplot, matplotlib, plotly, and seaborn are what data scientists use to make a plot or graph. #datascience #visualization #plots #analytics...
MobileLLM - A great paper that details experiments fo...
4,389 views2024-07-11
MobileLLM - A great paper that details experiments for the efficient architecture. It includes using SwiGLu, Deeper/thiner architect, reduci...
New good stuff. Let’s compare the performance, cost, ...
4,381 views2025-05-01
New good stuff. Let’s compare the performance, cost, and task alignment for using OpenAI o3 versus a small model trained with Group Relative...
Joys of autocomplete, who is with me? #datascience #p...
4,379 views2022-01-28
Joys of autocomplete, who is with me? #datascience #programming #vscode #jupyternotebook #coding #tabcompletion #python
X-decoder from Microsoft. Check out the instructional...
4,356 views2023-02-15
X-decoder from Microsoft. Check out the instructional text demo. I added in video released by the team at the bottom. If too many people don...
GPUs driven by NVIDIA are the key to today's AI. With...
4,349 views2023-09-19
GPUs driven by NVIDIA are the key to today's AI. Without this compute we would not have the models like GPT-4. Let's review why GPU performa...
Reply to @coronavirusvevo #xgboost #regression #stati...
4,326 views2022-02-18
Reply to @coronavirusvevo #xgboost #regression #statistics #datascience #algorithms
The video contrasts complex neural networks with simp...
4,305 views2025-03-12
The video contrasts complex neural networks with simpler, interpretable models like Generalized Additive Models (GAMs), which provide clear ...
Chronicles of OPT Training #meta #nlp #datascience #m...
4,297 views2022-05-12
Chronicles of OPT Training #meta #nlp #datascience #machinelearning #deeplearning #codetok #python
Agents just learned to talk without words. They pass ...
4,238 views2025-12-05
Agents just learned to talk without words. They pass thoughts directly in latent space, not text. It is faster, cheaper, and even boosts acc...
Notebook walkthrough of the 11B model using Hugging F...
4,233 views2024-09-30
Notebook walkthrough of the 11B model using Hugging Face Transformers on Snowflake The notebook highlights: Short background on vision langu...
Reminding everyone not to fall for the "research" rep...
4,233 views2024-06-08
Reminding everyone not to fall for the "research" reports from Forrester or Gartner. They should be treated like marketing material or PR ma...
Using diffusion for object detection in diffusiondet....
4,230 views2022-11-21
Using diffusion for object detection in diffusiondet. #datascience #machinelearning #objectdetection #computervision
Red light camera #chicago #datascience #redlightcame...
4,218 views2022-04-30
Red light camera #chicago #datascience #redlightcamera #anomalydetection #statistics #techtok #analytics
This skit highlights key data science concepts, aroun...
4,196 views2024-05-19
This skit highlights key data science concepts, around error analysis and iterative improvement in building models. It shows how data scient...
Facts #datascience #techtok #analytics #impostersyndrome
4,166 views2022-04-28
Facts #datascience #techtok #analytics #impostersyndrome
Relevance maps for image classification. Model explai...
4,137 views2022-11-21
Relevance maps for image classification. Model explainability is always important. #datascience #explainability #machinelearning #imageclass...
Wow! I am impressed with Claude’s new Skills feature....
4,109 views2025-10-18
Wow! I am impressed with Claude’s new Skills feature. It can make my life easier (and I know I sound like a shill, but this is super useful ...
My take on Objaverse, Llama, and Alpaca. Not a lot of...
4,104 views2023-03-25
My take on Objaverse, Llama, and Alpaca. Not a lot of respect for copyright or contract terms. #largelanguagemodels #datascience #machinelea...
This video explains how scaling laws—particularly fro...
4,046 views2025-04-16
This video explains how scaling laws—particularly from the Chinchilla paper—reveal a tradeoff between model size, training data, and compute...
Watch out for the flashy new algorithm.
4,016 views2024-10-22
Watch out for the flashy new algorithm.
Using AI for Pose Detection, this is such a cool appl...
4,008 views2022-09-05
Using AI for Pose Detection, this is such a cool application. #datascience #deeplearning #codetok #posedetection #sportsanalytics
Evaluating generative models means considering many f...
4,000 views2023-06-24
Evaluating generative models means considering many factors including prompts tokenization and evaluating generated results. This video shou...
8 Ways to improve your RAG Application 1. Metadata Fi...
3,999 views2025-05-09
8 Ways to improve your RAG Application 1. Metadata Filter 2. Semantic Chunking 3. Visual Language Model 4. Query Decomposition 5. Better Emb...
Software exec at the end is the best. Your quick intr...
3,976 views2022-10-30
Software exec at the end is the best. Your quick intro to patents, trademarks, copyright, and licenses. I see too many comments where peopl...
It’s rough for statisticians, machine learning is so ...
3,957 views2022-09-15
It’s rough for statisticians, machine learning is so popular #datascience #analytics #statistics #machinelearning
The enterprise AI landscape has radically shifted in ...
3,936 views2025-08-07
The enterprise AI landscape has radically shifted in 2025. Anthropic has overtaken OpenAI in enterprise usage by offering broader cloud acce...
Quick intro, let me know if a deeper dive is useful. ...
3,896 views2022-07-19
Quick intro, let me know if a deeper dive is useful. #translation #meta #datascience #machinelearning #huggingface
J.P. Morgan processes 50 million transactions a day —...
3,869 views2025-10-27
J.P. Morgan processes 50 million transactions a day — and they didn’t use GPT-5. By layering rules, text similarity, and a tiny 1.7 M-parame...
Collaborative filtering is a very popular and useful ...
3,868 views2025-05-21
Collaborative filtering is a very popular and useful way to build a recommender. However, getting explicit feedback is hard, and that is whe...
Just how smart is ChatGPT and other #largelanguagemod...
3,868 views2023-01-04
Just how smart is ChatGPT and other #largelanguagemodels? Big Bench is a set of benchmark tests to asses the performance of the models. And ...
Optimal Transport algorithms to efficiently allocate ...
3,864 views2025-04-23
Optimal Transport algorithms to efficiently allocate resources—in this case, croissants from eight bakeries to five cafes. It begins by cons...
ShinkaEvolve pairs evolutionary algorithms with LLMs ...
3,848 views2025-09-27
ShinkaEvolve pairs evolutionary algorithms with LLMs to invent new solutions faster. Using novelty-based rejection, smarter parent selection...
A reminder that most enterprises favor Apache and MIT...
3,845 views2023-01-06
A reminder that most enterprises favor Apache and MIT licenses. As a developer, use what you please. But to reach people working within comp...
5 things to look for when a new model is announced Li...
3,831 views2024-04-20
5 things to look for when a new model is announced License Real Open Source? - Apache 2 Commercial use? Strange conditions? Size of the mode...
Hallucinations from large language models are a conce...
3,818 views2023-05-06
Hallucinations from large language models are a concern. However balance them against the effectiveness of these models and the risks of usi...
Statistics sounds heavy but a lot of concepts are ver...
3,805 views2024-01-07
Statistics sounds heavy but a lot of concepts are very useful and can save you a lot of effort. This video is reminder of the many ways we u...
Research on productivity with the new AI code tools f...
3,789 views2025-10-10
Research on productivity with the new AI code tools from Stanford, inspired their talk I saw at the MLOps summit. Lots of great insights. Th...
LLMs can write SQL. That’s not the hard part. The har...
3,785 views2025-11-19
LLMs can write SQL. That’s not the hard part. The hard part is making sure the math matches how the business actually works. Docs help AI un...
Typical issues that often come up in everyday data sc...
3,773 views2022-12-04
Typical issues that often come up in everyday data science. Data scientists only spend a small amount of time on algorithms. #datascience #...
Everyone says “just add more agents.” This new Google...
3,768 views2025-12-14
Everyone says “just add more agents.” This new Google + MIT paper tested 180 multi-agent setups and found something uncomfortable: on averag...
As we store more information as vectors or embeddings...
3,761 views2024-04-16
As we store more information as vectors or embeddings vector databases are gaining importance. For small amounts of embeddings numpy or FAIS...
Building a Claude 3.7 AI Researcher: The Framework Di...
3,759 views2025-04-24
Building a Claude 3.7 AI Researcher: The Framework Dilemma I built this three ways using Claude 3.7's extended thinking capabilities with a ...
Encoders come in three flavors: * Encoder only conver...
3,739 views2025-09-06
Encoders come in three flavors: * Encoder only converts single texts into embeddings. * Bi-encoder encodes queries and documents separately ...
I really liked this paper from DeepMind on Synthetic ...
3,736 views2024-04-18
I really liked this paper from DeepMind on Synthetic Data. It highlights a lot of interesting uses of synthetic data along with concerns ab...
Unit Testing Deep Dive: ⚡ Evaluating Unit Tests with ...
3,733 views2025-02-10
Unit Testing Deep Dive: ⚡ Evaluating Unit Tests with LMUnit 🎯 Polar plot visualizations of multi-dimensional scores 🔬 K-means clustering ...
WorkArena and WebArena are some newer real benchmarks...
3,731 views2024-03-15
WorkArena and WebArena are some newer real benchmarks for real-world tasks. To build wider automation, itÔøΩs going to be essential to solve...
Six distributions. One video. A surprisingly relatabl...
3,722 views2025-11-29
Six distributions. One video. A surprisingly relatable breakdown of why the real world does not follow the bell curve. Diving into Normal, P...
Glitch tokens can have some unusual effects. The most...
3,715 views2024-05-12
Glitch tokens can have some unusual effects. The most well known was the SolidGoldMagikarp token. The folks at Cohere dug into this and spen...
Cosine similarity is a must know when working with ve...
3,690 views2022-10-14
Cosine similarity is a must know when working with vectors. It’s very useful and widely used in #machinelearning #datascience #statistics S...
Picking a GPU for deep learning based on Tim Dettmers...
3,688 views2023-01-17
Picking a GPU for deep learning based on Tim Dettmers classic blog post. #datascience #machinelearning #gpu #deeplearning (had to repost thi...
Knowledge distillation helps make smaller models that...
3,677 views2025-03-13
Knowledge distillation helps make smaller models that work well. DistilBERT is a popular small model created using this method. Resources: D...
Researchers at Princeton ran 20,000 tests across nine...
3,633 views2025-10-19
Researchers at Princeton ran 20,000 tests across nine benchmarks—spending $40,000—to see how AI agents really perform. Turns out the Agent M...
Its uncomfortable how good this analogy is. I have so...
3,619 views2023-09-09
Its uncomfortable how good this analogy is. I have so much more material i left out. Enjoy this dont know how long it will stay up. #largela...
4 Data Science Fails. These are a handful of ways tha...
3,615 views2025-06-12
4 Data Science Fails. These are a handful of ways that society pushes back on data science approaches. It's good to understand why these wer...
Use it! #python #conda #codetok #datascience
3,607 views2022-05-16
Use it! #python #conda #codetok #datascience
In a previous video, I focused on OpenAI’s model, but...
3,605 views2025-02-04
In a previous video, I focused on OpenAI’s model, but this issue goes far beyond just one example. AI content detectors suffer from the base...
Reviewing some new research looking into prompting ve...
3,604 views2024-05-07
Reviewing some new research looking into prompting versus fine tuning. They both have their place, but prompting performance can continue to...
his video breaks down how Gemini 2.5 Pro, a publicly ...
3,591 views2025-08-01
his video breaks down how Gemini 2.5 Pro, a publicly available model, solved 5 out of 6 problems from the IMO 2025 without any fine-tuning. ...
The future will be many different LLMs some open sour...
3,585 views2023-12-24
The future will be many different LLMs some open source and some proprietary. Other like Yann Lecun think differently. Yann's thread: https:...
In this deep dive, I go beyond the RAG basics to focu...
3,581 views2025-10-13
In this deep dive, I go beyond the RAG basics to focus on the most critical component: Retrieval. We'll provide a practical framework for th...
Fun way to talk about K-means algorithm #datascience ...
3,539 views2022-08-22
Fun way to talk about K-means algorithm #datascience #codetok #analytics #machinelearning
LLMs are approximate retrievers that are mimicking pl...
3,526 views2023-08-12
LLMs are approximate retrievers that are mimicking plans rather than truly planning. Great argument put forth by Subbarao Kambhampati who is...
Why you want prediction intervals instead of point pr...
3,516 views2022-09-20
Why you want prediction intervals instead of point predictions #datascience #machinelearning #statistics #predictioninterval
I need to focus on adding more Regularization to my l...
3,512 views2022-11-19
I need to focus on adding more Regularization to my life. #datascience #statistics #regularization
Cleanlab is open source and will improve your data qu...
3,507 views2023-01-30
Cleanlab is open source and will improve your data quality. It’s so underrated. This was hard to record vertically, so go try it out. #datas...
Repost which has held up well: Language Models like C...
3,499 views2024-04-10
Repost which has held up well: Language Models like ChatGPT can be modified by several methods including Prompting Instruction Fine-Tuning a...
Thinking about the size of numbers becomes important ...
3,497 views2024-05-17
Thinking about the size of numbers becomes important when working with neural networks. This video touches on different techniques like usin...
Word as Image - great use of generative AI models lik...
3,481 views2023-03-07
Word as Image - great use of generative AI models like stable diffusion to create fonts. Check out the paper at wordasimage.github.io #datas...
Why you want prediction intervals instead of point pr...
3,461 views2022-10-09
Why you want prediction intervals instead of point predictions. This is a repost because the first one was taken down. #datascience #codeto...
Zero shot learning #datascience #machinelearning #hug...
3,460 views2022-05-27
Zero shot learning #datascience #machinelearning #huggingface #nlp #naturallanguageprocessing #statistics background: @rajistics #codetok
Q* from OpenAI is getting the hype but let's focus on...
3,458 views2023-11-28
Q* from OpenAI is getting the hype but let's focus on the basics of their organization and the limitations of GPT-4 around planning. This vi...
Let's talk about how copyright intersects large langu...
3,455 views2023-08-19
Let's talk about how copyright intersects large language models around training LLMs outputs of LLMs and watermarking mechanisms. #datascien...
Roundup of all the big headlines, hope this is fun fo...
3,448 views2023-03-04
Roundup of all the big headlines, hope this is fun for you all. I laugh while making these, but wonder how many of you get all the refeenenc...
Google announced Bard, but we still don’t know much...
3,437 views2023-02-06
Google announced Bard, but we still don’t know much. It has been based on Lambda which has been around for a while. This is a safe bet, no...
Give it up to Data Engineers. #dataengineering #data...
3,423 views2022-03-08
Give it up to Data Engineers. #dataengineering #datascience #analytics
Training an image classifier using ü§ó transformers ...
3,412 views2022-08-04
Training an image classifier using ü§ó transformers #datascience #analytics #codetok #deeplearning #huggingface Longer video at other site ...
Hyperparameter optimization or search is an important...
3,408 views2024-06-20
Hyperparameter optimization or search is an important step in many machine learning algorithms. I cover a few of the basic approaches, inclu...
Everyone’s racing to make RAG faster — but my latest ...
3,396 views2025-10-11
Everyone’s racing to make RAG faster — but my latest tests show that might be the wrong goal. Agentic RAG, with multiple retrievals and a re...
Oasis is an interactive generative world model based ...
3,393 views2024-11-05
Oasis is an interactive generative world model based on diffusion transformers. It takes keyboard input and generates gameplay in an autoreg...
Feeling overwhelmed by all the hyperparameter options...
3,375 views2025-06-21
Feeling overwhelmed by all the hyperparameter options in XGBoost? This video walks through practical tips — from grid search and random sear...
Don't get caught up in the hype. The main value for L...
3,366 views2024-04-12
Don't get caught up in the hype. The main value for LLMs is marketing. Most of us are better off working on evaluation and prompting rather ...
OpenAI made routing the secret weapon inside GPT-5 — ...
3,365 views2025-08-23
OpenAI made routing the secret weapon inside GPT-5 — Sam Altman even admitted when it broke, the model felt dumber. Now researchers have gon...
With the growth of open-source LLMs many leaderboards...
3,363 views2023-06-10
With the growth of open-source LLMs many leaderboards to rank these models are emerging. Several different methodologies are used including ...
Are sophisticated agents really better? With GPT-5 un...
3,360 views2025-08-17
Are sophisticated agents really better? With GPT-5 unlocking Agentic AI, I break down four practical best practices—simplicity, structure, o...
#reinforcementlearning #huggingface #datascience #dee...
3,357 views2022-05-26
#reinforcementlearning #huggingface #datascience #deeplearning #codetok #deepqlearning - Week 1: @rajistics
Meta fumbled the open-source lead; Qwen—Alibaba Cloud...
3,349 views2025-08-16
Meta fumbled the open-source lead; Qwen—Alibaba Cloud’s open-weight family—has taken it, with Apache-2.0 models spanning 0.6B → 235B MoE (~2...
From the article: How do Authors’ Perceptions about t...
3,335 views2022-11-23
From the article: How do Authors’ Perceptions about their Papers Compare with Co-authors’ Perceptions and Peer-review Decisions? #statistics...
#duet with @hugging_face looks like we are on Tik Tok...
3,334 views2022-06-08
#duet with @hugging_face looks like we are on Tik Tok. Go try mini Dalle, go to hF.co
Sweep AI showed how to really make autocomplete work ...
3,323 views2025-09-20
Sweep AI showed how to really make autocomplete work in JetBrains. They moved from plain Fill-in-the-Middle to syntax-aware training on real...
Getting explainability when working with transformer ...
3,310 views2022-10-19
Getting explainability when working with transformer based vision models. Uses Captum on the backend, but makes it easy to get image attribu...
Reply to @declinedher long history of analyzing waste...
3,294 views2022-01-31
Reply to @declinedher long history of analyzing wastewater for drug residue #dataanalysis #wastewater #drugs #monitoring @rajistics
Your weekly dose of LLM news. I liked this because it...
3,287 views2022-11-06
Your weekly dose of LLM news. I liked this because it had interesting results with a smart approach. #datascience #machinelearning #largelan...
Large language models don’t just process language—the...
3,261 views2025-06-20
Large language models don’t just process language—they build internal spatial maps. This video breaks down the paper “Linear Spatial World M...
Tensorflow playground, link in comments, #tensorflow ...
3,252 views2022-02-04
Tensorflow playground, link in comments, #tensorflow #deeplearning #datascience #analytics #neuralnetworks
Understanding a confusion matrix, Part I video: @raji...
3,244 views2022-03-07
Understanding a confusion matrix, Part I video: @rajistics #datascience #statistics #machinelearning #confusionmatrix
In context learning, let’s dig deeper and let me know...
3,242 views2022-12-14
In context learning, let’s dig deeper and let me know what I should do next. #machinelearning #datascience #largelanguagemodels #incontextl...
Save this! - Deep Dive on Time Series with Kolmogorov...
3,229 views2024-11-03
Save this! - Deep Dive on Time Series with Kolmogorov-Arnold Networks (KAN) - 1. Toy Dataset Notebook: https://colab.research.google.com/dr...
Highlighting BerTopic #datascience #statistics #nlp #...
3,229 views2022-05-25
Highlighting BerTopic #datascience #statistics #nlp #huggingface #codetok
Reply to @nolankeller23
3,229 views2022-03-09
Reply to @nolankeller23
I like notebooks for data science, but others differ....
3,203 views2022-07-26
I like notebooks for data science, but others differ. #datascience #jupyternotebook #codetok #python
Sharing my favorite data science news and resources, ...
3,189 views2022-12-16
Sharing my favorite data science news and resources, find it bit.ly/raj_reads #machinelearning #datascience
What are you favorite tips for error analysis? #datas...
3,179 views2022-06-17
What are you favorite tips for error analysis? #datascience #statistics #analytics #machinelearning #codetok #mltok
Then my team builds data pipelines for the next eight...
3,159 views2022-09-17
Then my team builds data pipelines for the next eight months #datascience #dataengineering #analytics
I also think of this when walking through security or...
3,151 views2025-02-06
I also think of this when walking through security or thinking about cheaters.
I love #rstats, but spend most of my time now in #pyt...
3,145 views2022-07-21
I love #rstats, but spend most of my time now in #python #datascience #codetok #machinelearning
Sentence transformers are awesome. Lets talk about th...
3,138 views2024-06-13
Sentence transformers are awesome. Lets talk about the differences between Word2vec, Transformers, and Sentence Transformers.
Ring Attention lets you split up the attention calcul...
3,130 views2024-04-14
Ring Attention lets you split up the attention calculation across GPUs to allow much longer context lengths. LLMs are using this to scale to...
Peak ML #datascience #codetok #huggingface #gradio #h...
3,125 views2022-08-24
Peak ML #datascience #codetok #huggingface #gradio #huggable #imageclassification
Deep dive into Devin as an AI software engineer. Chec...
3,117 views2025-01-18
Deep dive into Devin as an AI software engineer. Check the deep dive by Answer AI: https://www.answer.ai/posts/2025-01-08-devin.html Inspi...
Regularization is a technique to keep your model from...
3,116 views2023-12-03
Regularization is a technique to keep your model from overfitting. It's widely used in machine learning. #datascience #statistics #regulariz...
Your AI agent isn’t dumb. It’s forgetful. Most agents...
3,108 views2025-12-19
Your AI agent isn’t dumb. It’s forgetful. Most agents redo the same work every run instead of learning from success. This video shows how th...
Synthetic datasets have given me a way to understand ...
3,108 views2023-01-25
Synthetic datasets have given me a way to understand better how to do feature selection and model explainability. Try it out sometime. #data...
Some data analysis tips: 1. Your data might be unrepr...
3,095 views2023-08-02
Some data analysis tips: 1. Your data might be unrepresentative 2. Think about what was collected and what wasn't 3. Not all data is useful ...
Visualizations for showing variation in the data or u...
3,095 views2022-12-05
Visualizations for showing variation in the data or uncertainty. Based on Unfair Comparisons by Eli Holder. #datascience #machinelearning #s...
Using LLMs for tools is an emerging trend this year. ...
3,093 views2024-04-05
Using LLMs for tools is an emerging trend this year. Cohere has focused on it with it's new Command-R+ model that focuses on enterprise use ...
Pair programming is some of my favorite times as a da...
3,084 views2023-03-19
Pair programming is some of my favorite times as a data scientist. I am starting to use ChatGPT to fill that role lately. Its useful for me....
Think you know how Reinforcement Learning for LLMs re...
3,066 views2025-11-02
Think you know how Reinforcement Learning for LLMs really works? The secret isn't "just do more training” — it's about where your data comes...
Overdue for sports analytics #datascience #analytics ...
3,042 views2022-08-12
Overdue for sports analytics #datascience #analytics #codetok #sportsanalytics #machinelearning
Ensemble learning with majority voting can improve de...
3,041 views2025-03-14
Ensemble learning with majority voting can improve decision accuracy, as demonstrated when three 70%-accurate models combined outperform a s...
Contrastive learning is common for folks working in N...
3,039 views2022-11-03
Contrastive learning is common for folks working in NLP and images. This was new to me, so wanted to share the intuition a bit more widely. ...
Feature engineering with categorical variables and ge...
3,033 views2024-09-18
Feature engineering with categorical variables and gender cateogries feom chatgpt.
Leakage is omnipresent #datascience #analytics #codet...
3,032 views2022-08-18
Leakage is omnipresent #datascience #analytics #codetok #targetleakage
I was so focused! Data is hard. #datascience #dataa...
3,030 views2022-06-07
I was so focused! Data is hard. #datascience #dataanalysis #statistics #codetok #mltok #machinelearning
In his talk, Denny Zhou outlined four strategies for ...
3,026 views2025-08-11
In his talk, Denny Zhou outlined four strategies for improving LLM reasoning without increasing model size: eliciting reasoning via intermed...
Google Colab, Kaggle, and LangChain are all great way...
3,017 views2023-01-21
Google Colab, Kaggle, and LangChain are all great ways to start learning this weekend! #datascience #machinelearning #kaggle #googlecolab #l...
Watch out for leakage, it happens even to the best. ...
2,983 views2022-07-10
Watch out for leakage, it happens even to the best. #datascience #statistics #dataleakage #targetleakage #machinelearning
Training an image classifier using 🤗 transformers #d...
2,969 views2022-08-04
Training an image classifier using 🤗 transformers #datascience #analytics #codetok #deeplearning #huggingface Longer video at other site us...
Good times, what was your first ML model? #titanic #...
2,958 views2022-06-15
Good times, what was your first ML model? #titanic #datascience #statistics #codetok #machinelearning #rstats #python
Rerankers improve search by examining whether documen...
2,956 views2025-03-12
Rerankers improve search by examining whether documents truly answer your specific question, unlike retrievers that only match similar words...
As language models expand into fuzzier domains like m...
2,944 views2025-08-03
As language models expand into fuzzier domains like medical advice and policy summarization, traditional training signals break down. This v...
I still havent tried copilot. Have you? #datascience...
2,940 views2022-07-27
I still havent tried copilot. Have you? #datascience #codetok #codex #copilot #python
It’s taken a while to accept this. #python #programmi...
2,930 views2022-03-28
It’s taken a while to accept this. #python #programming #datascience
Reply to @pal_protty negative reinforcement and #reg...
2,926 views2022-02-27
Reply to @pal_protty negative reinforcement and #regressiontothemean . Link to the Nylon calculus #basketball article in comments. #datasc...
This video explains temperature, one of the most cruc...
2,911 views2025-03-28
This video explains temperature, one of the most crucial settings when working with GPT models. Temperature controls the randomness of the m...
Ensembling is key method in machine learning. This vi...
2,905 views2023-03-15
Ensembling is key method in machine learning. This video introduces ensembling through majority voting. #datascience #machinelearning #ensem...
Most enterprise GenAI pilots fail to deliver measurab...
2,903 views2025-08-22
Most enterprise GenAI pilots fail to deliver measurable ROI because of structural and organizational gaps rather than model quality. The rep...
#datascience #analytics #statistics #wald #survivorsh...
2,897 views2022-03-10
#datascience #analytics #statistics #wald #survivorshipbias
Amazing how this stuff keeps getting better #datascie...
2,890 views2022-08-14
Amazing how this stuff keeps getting better #datascience #codetok #machinelearning #codex
95% of GenAI pilots fail, not because of the model bu...
2,874 views2025-11-16
95% of GenAI pilots fail, not because of the model but because of the approach. The simple playbook to actually scale is Usability, Utility,...
#duet with @the.rachel.woods #rachelwoods Go hustle b...
2,858 views2023-04-14
#duet with @the.rachel.woods #rachelwoods Go hustle but don’t take it personally when they dont respond. Instead wait your time. And then ...
LLMs can streamline existing AI/ML operations by repl...
2,852 views2025-02-22
LLMs can streamline existing AI/ML operations by replacing specialized models (e.g., BERT for classification, SpaCy for NER, T5 for summariz...
Text to video models including text2video. The models...
2,843 views2023-03-26
Text to video models including text2video. The models are grtting better and there is now a place over at the hugging face hub to find them....
Replying to @urdar635 watermarking output from AI mod...
2,840 views2022-12-09
Replying to @urdar635 watermarking output from AI models is something that is being considered. It’s done by adding some “signal” to the out...
Feature Selection Deep Dive, Notebook at: https://bit...
2,836 views2024-10-13
Feature Selection Deep Dive, Notebook at: https://bit.ly/raj_fs
Let’s talk about common challenges in human annotatio...
2,815 views2025-05-05
Let’s talk about common challenges in human annotation for AI training data, particularly around ambiguous label definitions and inconsisten...
Reply to @grahamkechnie #wastewater #cornovirus #anal...
2,803 views2022-02-03
Reply to @grahamkechnie #wastewater #cornovirus #analysis #cryptic
Big data bowl submissions are going in and lots of gr...
2,797 views2023-01-12
Big data bowl submissions are going in and lots of great sports analytic work. This one is on strain for evaluating pass rushers. #datascien...
How to Build a State-of-the-Art Retrieval System? Les...
2,791 views2025-01-04
How to Build a State-of-the-Art Retrieval System? Lessons from Kaggle's Top Solution 👀 Let's look at the winning solution from Raja Biswas ...
#onthisday a classic debate notebooks versus scripts
2,775 views2023-07-26
#onthisday a classic debate notebooks versus scripts
Exclusive interview with openAI asking all the questi...
2,773 views2023-05-19
Exclusive interview with openAI asking all the questions you wished the ask. Including: What's the deal with the name? How do you feel about...
This skit highlights the gap between traditional mode...
2,764 views2025-07-27
This skit highlights the gap between traditional model evaluation metrics (like precision) and real-world deployment concerns. The developer...
How are you using similarity search? Looking at Spoti...
2,748 views2024-06-26
How are you using similarity search? Looking at Spotify's annoy for nearest neighbor search for embeddings. #spotify #annoy #embeddings #ra...
NASA uses generative AI for manufacturing parts for s...
2,744 views2024-10-28
NASA uses generative AI for manufacturing parts for space. It’s a great use of generative technology and you can start seeing how it will ch...
This video illustrates the limitations of long-contex...
2,736 views2025-04-13
This video illustrates the limitations of long-context LLMs across real benchmarks. While models like GPT-4o perform well on retrieval tasks...
Reply to @garlic_gworl #fakeai #datascience #mechical...
2,672 views2022-01-23
Reply to @garlic_gworl #fakeai #datascience #mechicalturk #aiethics #labor #ai
Highlight great research from #anthropic studying the...
2,659 views2022-12-21
Highlight great research from #anthropic studying the behavior of large language models. #machinelearning #datascience #largelanguagemodels
urse of dimensionality reminds us to think carefully ...
2,657 views2023-02-11
urse of dimensionality reminds us to think carefully about feature selection. More isn’t always better. Use a feature selection curve. #da...
Compute keeps getting cheaper. GPUs keep getting fast...
2,653 views2025-12-17
Compute keeps getting cheaper. GPUs keep getting faster. So why do bigger models feel less efficient? This video breaks down a real technica...
In this comprehensive talk (adapted from my presentat...
2,649 views2025-11-02
In this comprehensive talk (adapted from my presentation at ODSC), I provide a practical, hands-on framework for evaluating your GenAI and L...
Recent work in Text-to-SQL shows that once you get pa...
2,647 views2024-08-31
Recent work in Text-to-SQL shows that once you get past demo datasets, the performance drops. By incorporating human expertise, you can buil...
Great tips to save money, processing time, and improv...
2,632 views2024-06-08
Great tips to save money, processing time, and improve speed in the FrugalGPT paper. The video covers three types of strategies to reduce th...
Get shiny to run on hugging face spaces (or even some...
2,625 views2023-01-15
Get shiny to run on hugging face spaces (or even some other web app) #huggingface #posit #rstudio #shiny #datascience
Yikes: Gen AI is not that easy. Going over some recen...
2,616 views2024-06-30
Yikes: Gen AI is not that easy. Going over some recent stories about difficulty getting Generative AI to production. On vacation this week,...
Models and datasets have specific definitions. Models...
2,613 views2024-05-15
Models and datasets have specific definitions. Models consist of at least two licenses nowadays, this has been an issue for LLaMA where the ...
Find all 5 metrics mentioned? Unit test approach is a...
2,612 views2024-06-25
Find all 5 metrics mentioned? Unit test approach is a great way of thinking about evaluations for generative AI. Check out my yt for a longe...
Many techniques for text similarity including lexical...
2,585 views2024-09-28
Many techniques for text similarity including lexical, semantic, and has strategies. Here is a short list of some popular methods: 1. FuzzyW...
Explanations for transformers gently #datascience #co...
2,543 views2022-08-18
Explanations for transformers gently #datascience #codetok #deeplearning
Isolation Forests are an anomaly detection algorithm ...
2,527 views2024-08-03
Isolation Forests are an anomaly detection algorithm that builds trees to partition data, isolating "lonely" points or outliers with fewer p...
The Keras versus Pytorch benchmarking drama. This isn...
2,527 views2024-04-06
The Keras versus Pytorch benchmarking drama. This isn't about picking sides. I want to point out how difficult it is to do these sorts of b...
Getting prediction intervals with conformal predictio...
2,526 views2022-09-21
Getting prediction intervals with conformal prediction. This is a very simple intro, it can do much more. #datascience #statistics #predicti...
Synthetic data improves model performance only when i...
2,520 views2025-10-08
Synthetic data improves model performance only when it expands coverage rather than replicating existing distributions. Using LLMs to genera...
AG (Retrieval Augmented Generation) addresses fundame...
2,514 views2025-01-26
AG (Retrieval Augmented Generation) addresses fundamental limitations of base LLMs like ChatGPT, which can generate incorrect technical info...
Intro for AI Literacy #datascience #machinelearning #...
2,496 views2022-01-14
Intro for AI Literacy #datascience #machinelearning #ai #programming #literacy #alleninstitute
5 things to look for when a new model is announced Li...
2,492 views2025-04-19
5 things to look for when a new model is announced License, Size of the Model, Benchmarks, Training Data/Details, Fine-Tuning & Tech Specs
Got some time this weekend? Go build a web demo. #dat...
2,491 views2022-11-25
Got some time this weekend? Go build a web demo. #datascience #statistics #shinyr #rstats #python #gradio #streamlit
Open source can be a lot of work #opensource #techtok...
2,477 views2022-04-15
Open source can be a lot of work #opensource #techtok #programming #python #github
It’s not easy to answer questions. Techniques like mu...
2,467 views2025-01-10
It’s not easy to answer questions. Techniques like multi retrieval and multi hop we use every day without thinking about it. However, with A...
Statistics leverages randomness across machine learni...
2,459 views2025-01-13
Statistics leverages randomness across machine learning applications, including random forests, dropout in neural networks, and hyperparamet...
Don't overspend getting into data science. This eposi...
2,450 views2024-08-20
Don't overspend getting into data science. This eposide is dedicated to the snap-on and ikon controversy. Reminders: Basic macbook, Google c...
AI agents used to shut down mid-task or hallucinate v...
2,446 views2025-07-06
AI agents used to shut down mid-task or hallucinate vending empires. Now? They're beating humans at long-horizon business simulations. From ...
This video explains why FP16 (16-bit floating point) ...
2,445 views2025-05-16
This video explains why FP16 (16-bit floating point) isn't always suitable for training neural networks due to instability caused by limited...
Flux unifies text-to-image and image editing in a sin...
2,431 views2025-09-29
Flux unifies text-to-image and image editing in a single model. By working in latent space, using flow matching, and applying adversarial di...
OpenFE is an automated Feature Engineering package. I...
2,419 views2024-09-08
OpenFE is an automated Feature Engineering package. I found out about this through Kaggle, check out the notebook for all the links: Noteboo...
LLM-as-a-judge isn’t broken. Our mental model is. Ins...
2,416 views2025-12-28
LLM-as-a-judge isn’t broken. Our mental model is. Instead of fixing the judge with prompts, this video shows how calibration can turn cheap,...
A debate whether AI evals are worth the effort. The H...
2,416 views2025-09-06
A debate whether AI evals are worth the effort. The Hacker says benchmarks don’t reflect reality, eval sets are brittle, and vibes or natura...
A simple explanation of what AI is. The video touches...
2,413 views2024-06-26
A simple explanation of what AI is. The video touches upon the impact of AI, how AI works with a practical example, and some of the reasons ...
Having some fun connecting a spreadsheet to a ML mode...
2,409 views2022-11-04
Having some fun connecting a spreadsheet to a ML model. It wasn’t too hard and it’s pretty cool to have it working this way. #datascience #...
Earthquake visualization from lazarusA #datascience #...
2,400 views2022-02-01
Earthquake visualization from lazarusA #datascience #datavisualization #visualization #julia #python #earthquakes
Check out my earlier videos on Block World. The lates...
2,387 views2024-09-24
Check out my earlier videos on Block World. The latest paper is: LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on...
Fine Tuning Sentence Transformers MedEmbed: Fine-Tune...
2,379 views2024-10-25
Fine Tuning Sentence Transformers MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR: https://huggingface.co/blog/abhinand/mede...
I have no attention span. How will I learn from these...
2,375 views2022-09-26
I have no attention span. How will I learn from these videos? #datascience #codetok #python
We keep building AI copilots that look great in demos...
2,366 views2025-12-26
We keep building AI copilots that look great in demos and fail in the real world. This skit shows a common mistake: designing for chat, pers...
Shallow learning with tensorflow playground #datascie...
2,364 views2022-02-17
Shallow learning with tensorflow playground #datascience #tensorflow #python #machinelearning #deeplearning
Google’s sparrow is the rumored competitor to OpenA...
2,361 views2023-01-21
Google’s sparrow is the rumored competitor to OpenAI ChatGPT. Check out the paper to see lots of examples of it chatting. It looks really ...
Go explore if you are new #datascience #techtok #anal...
2,359 views2022-04-14
Go explore if you are new #datascience #techtok #analytics
Any old school SAS users out there? #datascience #sta...
2,353 views2022-04-21
Any old school SAS users out there? #datascience #statistics #sas
Baseline models are important when comparing differen...
2,342 views2024-04-07
Baseline models are important when comparing different models. Benchmark datasets are handy to for seeing how a model does for a specific sc...
OpenAI’s GPT-5 Codex bakes in adaptive compute — trim...
2,326 views2025-09-16
OpenAI’s GPT-5 Codex bakes in adaptive compute — trimming steps for simple edits and expanding for complex coding, all inside one model. But...
When creating simulated data, you have complete contr...
2,324 views2025-01-20
When creating simulated data, you have complete control over which elements represent meaningful patterns and which represent random variati...
I like to stay practical and plenty to get excited ab...
2,321 views2022-08-10
I like to stay practical and plenty to get excited about and get worries about without AGI. AGI is artifical general intelligence and the id...
Claude is amazing, but still plenty of room for AI to...
2,320 views2024-07-04
Claude is amazing, but still plenty of room for AI to improve. Let's dig into two challenging benchmarks for LLMs, Connections and MUSR. The...
Applying a classic methodology of ablation when worki...
2,310 views2022-11-12
Applying a classic methodology of ablation when working with stable diffusion prompts. Ablation is very common in many techniques to underst...
In this satirical video, a customer requests a modifi...
2,288 views2025-04-10
In this satirical video, a customer requests a modified ChatGPT aligned with their political views, and the vendor explains various technica...
Applying a classic methodology of ablation when worki...
2,285 views2022-11-12
Applying a classic methodology of ablation when working with stable diffusion prompts. Ablation is very common in many techniques to underst...
Predict social outcomes is not doable by #ai #ethics ...
2,280 views2022-05-11
Predict social outcomes is not doable by #ai #ethics #bias #datascience #statistics #snakeoil
A couple of techniques we use to compress models. Thi...
2,270 views2025-02-01
A couple of techniques we use to compress models. This saves GPU memory and can reduce the amount of compute needed. Model distillation comp...
Evaluation is critical for LLMs and there is an entir...
2,251 views2024-11-06
Evaluation is critical for LLMs and there is an entirely new generation of evaluation applications coming. Here I show off Braintrust, which...
RAG doesn't work out of the box! There are many possi...
2,248 views2024-06-01
RAG doesn't work out of the box! There are many possible issues with the answers. In this paper, researching RAG in the legal context, the a...
The video pokes fun at the hype and fear surrounding ...
2,247 views2025-04-01
The video pokes fun at the hype and fear surrounding GPT-4, AI job loss, and tech sensationalism. It reminds data professionals that most re...
Model Risk Management (MRM), important but can be fru...
2,244 views2022-03-15
Model Risk Management (MRM), important but can be frustrating. #datascience #regulatedindustries #explainability #statistics
What happens when humans stop fearing AI—and start le...
2,237 views2025-06-10
What happens when humans stop fearing AI—and start learning from it? This video explores how superhuman AI didn’t just beat humans at Go or ...
Pandas versus Polars Check out: https://github.com/p...
2,236 views2024-11-28
Pandas versus Polars Check out: https://github.com/pola-rs/polars Polars vs. pandas: What’s the Difference? https://blog.jetbrains.com/pych...
Latency is a key factor but there are others when thi...
2,235 views2023-08-15
Latency is a key factor but there are others when thinking about deploying large language models. Let's discuss tradeoffs between latency th...
Fundamentals folks. A great example is the paper on p...
2,224 views2025-05-29
Fundamentals folks. A great example is the paper on police misconduct. It highlights a lot of great data science practices (more than I coul...
Profit Curve, See earlier parts on Classification Mar...
2,220 views2022-03-15
Profit Curve, See earlier parts on Classification Martrics here: @rajistics @rajistics #datascience #statistics #confusionmatrix
Quick intro, let me know if a deeper dive is useful. ...
2,186 views2022-07-19
Quick intro, let me know if a deeper dive is useful. #translation #meta #datascience #machinelearning #huggingface
YouChat. Looks impressive, I will try it out this wee...
2,168 views2022-12-24
YouChat. Looks impressive, I will try it out this weekend and let you know.
Anthropic is starting to preview their model and peop...
2,166 views2023-01-07
Anthropic is starting to preview their model and people are comparing it to ChatGPT. Thanks to Riley Goodside for sharing screenshots. It lo...
Data science is a pretty awesome job. Much better th...
2,159 views2022-10-07
Data science is a pretty awesome job. Much better than my past jobs of working thr IT helpdesk or painting rocks. #datascience #analytics #...
Automated Feature Engineering has lots of great tools...
2,151 views2024-09-02
Automated Feature Engineering has lots of great tools. But remember, automation isn't a full substitute for human expertise and subject matt...
Sql just doesn’t go away and is hipper than ever. #da...
2,151 views2022-10-18
Sql just doesn’t go away and is hipper than ever. #datascience #dataengineering
Hugging Face’s INTIMA benchmark tests how AI handles ...
2,142 views2025-08-26
Hugging Face’s INTIMA benchmark tests how AI handles emotional boundaries—and the results are worrying. Across 368 prompts, major models oft...
Active Learning prioritizes labeling the most informa...
2,130 views2025-05-18
Active Learning prioritizes labeling the most informative data points—typically those near the decision boundary—based on model uncertainty....
A couple of examples of what not to do and what you s...
2,129 views2022-12-23
A couple of examples of what not to do and what you should do when presenting your data science results to the business. #datascience #stati...
Some good lessons in Amazon's efforts to automate war...
2,119 views2025-05-15
Some good lessons in Amazon's efforts to automate warehouse item stowage. Despite sophisticated hardware, vision systems, and algorithms, th...
A fun breakdown of the three split methods in XGBoost...
2,114 views2024-10-27
A fun breakdown of the three split methods in XGBoost—Exact, Approx, and Histogram—and how each speeds up model training. See which methods ...
Human in the loop is important, but it's not a silver...
2,114 views2024-09-01
Human in the loop is important, but it's not a silver bullet. #aiethics #tesla #cigna #rajistics Cigna: https://www.healthcaredive.com/news...

AI is starting to outperform humans in surprising pla...
2,110 views2025-12-13
AI is starting to outperform humans in surprising places: ad creative, systems optimization, even algorithm design. But look closer and a pa...
Feature Selection Methods: A critical part of machine...
2,107 views2024-09-21
Feature Selection Methods: A critical part of machine learning is identifying the best set of features. Some popular techniques include: Bor...
VLLM is one of the most widely used serving platforms...
2,087 views2024-08-17
VLLM is one of the most widely used serving platforms for LLMs. It's also very easy to get started with. Check it out if you are hosting you...
Cryptic error messages. Cmon. Give it up for actionab...
2,086 views2022-03-12
Cryptic error messages. Cmon. Give it up for actionable error messages that make coding a downhill sport.
#opensource #explainability #datascience #statistics ...
2,080 views2022-05-15
#opensource #explainability #datascience #statistics #codetok #programming my intro video: @rajistics
Basic techniques for handling imbalanced datasets: NO...
2,077 views2024-10-17
Basic techniques for handling imbalanced datasets: NOT SMOTE Metrics sensitive to imbalance Algorithms robust Upsampling for large datasets ...
Robotics won’t scale like LLMs until perception, eval...
2,073 views2025-11-21
Robotics won’t scale like LLMs until perception, evaluation, and embodiment align. We still compress away crucial spatial information, measu...
Your regular reminder that you should translate the i...
2,073 views2022-06-04
Your regular reminder that you should translate the impact of your model into something your stakeholders care about. #datascience #statist...
Limits of AI around compute, memory, and interconnect...
2,068 views2024-09-06
Limits of AI around compute, memory, and interconnection bandwidth. AI and Memory Wall - https://arxiv.org/pdf/2403.14123 Fire-Flyer AI-HPC:...
The Physics of Language Models by Zeyuan Allen-Zhu Ch...
2,062 views2024-11-24
The Physics of Language Models by Zeyuan Allen-Zhu Check out: ICML 2024 Tutorial: Physics of Language Models - https://youtu.be/yBL7J0kgldU?...
Text to Chart. It’s easier than ever to build great...
2,059 views2023-02-15
Text to Chart. It’s easier than ever to build great charts using libraries like plotly or matplotlib. Are other people using ChatGPT for t...
Stackoverflow and Github Copilot
2,059 views2023-07-25
Stackoverflow and Github Copilot
Composer will be sharing their new generative AI mode...
2,058 views2023-02-26
Composer will be sharing their new generative AI models and they look amazing. They key is they decompose the image, which then provides a l...
Reply to @misho9000 anomaly detection is hard #datas...
2,055 views2022-05-01
Reply to @misho9000 anomaly detection is hard #datascience #statistics #techtok #anomalydetection #machinelearning
Microsoft’s chatbot meltdown showed what happens when...
2,048 views2025-09-24
Microsoft’s chatbot meltdown showed what happens when AI runs without oversight. In this skit, our “naïve vs. expert” duo break down why hum...
GPT-3 is powerful, but sometimes domain-specific mode...
2,036 views2023-01-26
GPT-3 is powerful, but sometimes domain-specific models will do better. Pick the right tool for the job. #datascience #machinelearning #hugg...
This was from last year, but still holds up.
2,035 views2024-10-19
This was from last year, but still holds up.
Beam search improves text generation by considering m...
2,028 views2025-03-25
Beam search improves text generation by considering multiple candidate sequences instead of just picking the highest probability token at ea...
Trees are so nice to work, but dont forget these step...
2,012 views2022-07-15
Trees are so nice to work, but dont forget these steps for other algorithms. #datascience #xgboost #randomforest #statistics #machinelearni...
Q 4,5, and 7 from the Allen Institute survey #datasci...
2,005 views2022-02-23
Q 4,5, and 7 from the Allen Institute survey #datascience #medialiteracy #ai @rajistics @rajistics @rajistics
Zero-shot object detection. #datascience #codetok #hu...
1,989 views2022-08-09
Zero-shot object detection. #datascience #codetok #huggingface #objectdetection #deeplearning #zeroshotclassification
Myth versus Reality. #sql #datascience #analytics
1,984 views2022-03-25
Myth versus Reality. #sql #datascience #analytics
Stable diffusion 2.0 just dropped and a lot of unhapp...
1,983 views2022-11-25
Stable diffusion 2.0 just dropped and a lot of unhappy people. Who knew giving away software could create so much angst. #datascience #stab...
DSBench: How Far are Data Science Agents Becoming Dat...
1,979 views2024-09-23
DSBench: How Far are Data Science Agents Becoming Data Science Experts? A challenging benchmark to evaluate LLM systems on real-world data s...
Reply to @mrjohnlueders #scrum #datascience #agile #s...
1,972 views2022-02-02
Reply to @mrjohnlueders #scrum #datascience #agile #softwareengineering #analytics
GDPval is OpenAI’s new benchmark that tests AI on rea...
1,967 views2025-09-26
GDPval is OpenAI’s new benchmark that tests AI on real professional tasks from industries that drive GDP. Models like GPT-5 are graded by hu...
This didn't happen to me recently :) To learn more: A...
1,965 views2024-12-12
This didn't happen to me recently :) To learn more: An Empirical Analysis of the Python Package Index (PyPI) - https://arxiv.org/pdf/1907.11...
Feature engineering and data preprocessing are an imp...
1,959 views2023-02-27
Feature engineering and data preprocessing are an important part of the machine learning process. #datascience #machinelearning #featureengi...
An older video, but still very useful. Take time to l...
1,951 views2024-06-19
An older video, but still very useful. Take time to look at the actual errors of your model. It’s seems obvious, but too often people just s...
You can’t make this stuff up. Can I just say modeling...
1,951 views2022-04-28
You can’t make this stuff up. Can I just say modeling? #datascience #analytics #statistics #scrum #techtok
Pricing optimization is a data science use case that ...
1,949 views2024-07-16
Pricing optimization is a data science use case that is growing. In some areas, like many states in the United States, it is not allowed for...
Sentence Transformers - https://sbert.net/ MTEB: Mass...
1,947 views2024-10-15
Sentence Transformers - https://sbert.net/ MTEB: Massive Text Embedding Benchmark - https://huggingface.co/blog/mteb Notebook - https://gith...
Fairness in models #datascience #analytics #fairnessm...
1,936 views2022-03-19
Fairness in models #datascience #analytics #fairnessml #bias #algorithms
Replying to @darianv19 semantic search versus lexicon...
1,932 views2022-07-05
Replying to @darianv19 semantic search versus lexicon search. Emeddings help power semantic search. #datascience #embeddings #python
Reply to @mat.cov05 annotator agreement puts a ceili...
1,917 views2022-06-19
Reply to @mat.cov05 annotator agreement puts a ceiling on your model performance #datascience #statistics #codetok
Population Stability Index is a popular way to measur...
1,910 views2024-05-30
Population Stability Index is a popular way to measure feature drift or data drift when monitoring machine learning models. I am doing a tal...
#onthisday reposting an older video from last year th...
1,892 views2023-08-22
#onthisday reposting an older video from last year that illustrates kmeans
Who enjoys explaining how ML models work? #machinelea...
1,879 views2022-07-04
Who enjoys explaining how ML models work? #machinelearning #datascience #statistics #codetok
Transformers aren’t new anymore #datascience #codetok...
1,869 views2022-06-10
Transformers aren’t new anymore #datascience #codetok #deeplearning #machinelearning #statistics
I need more time to code. #datascience #programming #...
1,862 views2022-04-15
I need more time to code. #datascience #programming #techtok #python
It has happened. #datascience #codetok #machinelearni...
1,846 views2022-05-25
It has happened. #datascience #codetok #machinelearning #analytics
Those GPUs. #datascience #codetok #python #analytics...
1,839 views2022-05-04
Those GPUs. #datascience #codetok #python #analytics #aws
Nvidia Prismer model for image captioning and zero sh...
1,835 views2023-03-15
Nvidia Prismer model for image captioning and zero shot visual question answering. It uses and ensemble or mixture of experts approach. #dat...
Repost, but still useful. Some tips for deploying lar...
1,810 views2024-07-30
Repost, but still useful. Some tips for deploying large language models like Llama. Start by building some benchmarks for your tasks to asse...
We like to say “data is data” and that scale fixes ev...
1,810 views2026-01-07
We like to say “data is data” and that scale fixes everything. This skit questions that assumption using recent work on Data Shapley. By mea...
Checking out Flan T5 large language models. Let me kn...
1,804 views2022-11-09
Checking out Flan T5 large language models. Let me know what wisdom you can find in this model. #machinelearning #datascience #largelanguage...
Aged well - With the growth of open-source LLMs, many...
1,784 views2024-06-10
Aged well - With the growth of open-source LLMs, many leaderboards to rank these models are emerging. Several different methodologies are us...
4 Data Science Fails.These are a handful of ways that...
1,784 views2024-06-06
4 Data Science Fails.These are a handful of ways that society pushes back on data science approaches. It's good to understand why these were...
I have done a lot of good work in untitled python not...
1,774 views2022-06-30
I have done a lot of good work in untitled python notebooks. #datascience #machinelearning #python #codetok #thosethatgetitgetit
GPT- 3 trivia and French pastries I enjoyed at the 🤗...
1,761 views2023-02-01
GPT- 3 trivia and French pastries I enjoyed at the 🤗 offsite. #datascience #machinelearning #gpt3 #openai #huggingface
Pandas 2.0 combing with arrow. A short recap on how i...
1,755 views2023-03-01
Pandas 2.0 combing with arrow. A short recap on how it fits in with polars, dplyr, and data.table. #datascience #machinelearning #rstats #py...
Image captioning models - GIT from Microsoft and BLIP...
1,743 views2023-01-05
Image captioning models - GIT from Microsoft and BLIP from salesforce #datascience #machinelearning #imagecaptioning
No click bait on this account. Feeling sick today (an...
1,733 views2022-07-06
No click bait on this account. Feeling sick today (and upset between Roe and Highland). Mailing it in today. #datascience #statistics #analy...
Distance Metrics in Data Science
1,731 views2024-10-24
Distance Metrics in Data Science
Lots of real world problems, it pays to know distribu...
1,725 views2022-07-08
Lots of real world problems, it pays to know distributions like tweedie. Still sick, so you get old tik tok from my drafts. #datascience #...
YouChat and retrieval augmented models. To play aroun...
1,701 views2022-12-26
YouChat and retrieval augmented models. To play around with this, check out haystack from deepset. #datascience #machinelearning #youchat #c...
credit to Gavin from work - #codetok #stackoverflow #...
1,697 views2022-05-05
credit to Gavin from work - #codetok #stackoverflow #programming #python
Replying to @Data Storyteller Here are two examples o...
1,690 views2022-07-22
Replying to @Data Storyteller Here are two examples of data or target leakage. I bet people have other fun examples. #datascience #targetlea...
Conformal prediction.
1,684 views2024-10-04
Conformal prediction.
Fundamentals folks. A great example is the paper on p...
1,674 views2024-05-27
Fundamentals folks. A great example is the paper on police misconduct. It highlights a lot of great data science practices (more than I coul...
Replying to @rajistics here are two themes I wanted t...
1,671 views2022-12-18
Replying to @rajistics here are two themes I wanted to highlight. The second candidate showed more analytic maturity.
Code not working? start with the documented examples ...
1,660 views2022-09-03
Code not working? start with the documented examples #datascience#rstats #machinelearning #codetok #python
New state of the art embedding model, Instructor, for...
1,650 views2023-01-22
New state of the art embedding model, Instructor, for text is available! It accounts for task and domain when creating an mending. #datascie...
ABCs of Generative AI: Anything But Chatbots -- There...
1,642 views2024-07-13
ABCs of Generative AI: Anything But Chatbots -- There is so much value with generative AI, don't get trapped into just building chatbots. No...
It works. #datascience #analytics #codetok #statistic...
1,625 views2022-05-13
It works. #datascience #analytics #codetok #statistics #dataanalyst
In this video, statistical modeling evolved from manu...
1,621 views2025-02-19
In this video, statistical modeling evolved from manual processes requiring explicit data preparation (scaling, transformations, imputation)...
Reply to @ereb0s_rl #datascience #analytics #techtok ...
1,594 views2022-04-24
Reply to @ereb0s_rl #datascience #analytics #techtok #rstats #kaggle #fastair #machinelearning
AI that makes you feel better. The paper is Inducing ...
1,593 views2022-10-14
AI that makes you feel better. The paper is Inducing Positive Perspectives with Text Reframing. You can find a demo over at 🤗 hugging face ...
Video coming on Text Generational in Colab
1,583 views2023-07-25
Video coming on Text Generational in Colab
Data data data.
1,573 views2024-08-01
Data data data.
Recent research on model compression with quanitizati...
1,572 views2024-10-21
Recent research on model compression with quanitization from Neural Magic, go check it out We Ran Over Half a Million Evaluations on Quantiz...
And even better when they submit an issue #datascienc...
1,572 views2022-08-17
And even better when they submit an issue #datascience #codetok #opensource
#duet with @Sylar2.5 #parodysong #datascience #codetok
1,561 views2022-08-30
#duet with @Sylar2.5 #parodysong #datascience #codetok
Staying busy and doing a public talk on Generative AI...
1,541 views2022-12-27
Staying busy and doing a public talk on Generative AI. It will be about 40 minutes so gives me chance to dive into more details and answer q...
Segment Anything (SAM) is a new segmentation model fr...
1,536 views2023-04-06
Segment Anything (SAM) is a new segmentation model from Meta. It's a huge improvement over the state of the art and is going to change compu...
Replying to @joshhenny it’s great time to learning ab...
1,532 views2022-11-14
Replying to @joshhenny it’s great time to learning about #largelanguagemodels or #stablediffusion #datascience #machinelearning
Regression to the mean with the Madden Curse and Spor...
1,527 views2022-01-20
Regression to the mean with the Madden Curse and Sports Illustrated Jinx #datascience #analytics #stats #maddencurse #sijinx #regression
Datasets have worldviews from Google PAIR, link in co...
1,519 views2022-02-09
Datasets have worldviews from Google PAIR, link in comments, #datascience #bias #machinelearning #ethics #pair-google #statistics
After simple baselines, Anomaly detection is hard.
1,517 views2024-09-26
After simple baselines, Anomaly detection is hard.
GDG DevFest Ukraine, sign up! #datascience #codetok ...
1,495 views2022-06-11
GDG DevFest Ukraine, sign up! #datascience #codetok #huggingface #dallemini #bigscience #devfestforukraine #standwithukraine
I much prefer working through code examples than deco...
1,476 views2022-09-20
I much prefer working through code examples than decoding equations. I can’t be the only one. #datascience #statistics
How else can you work? #datascience #stackoverflow #c...
1,471 views2022-07-29
How else can you work? #datascience #stackoverflow #codetok
Recipe for Word and Sentence Embeddings: Word2Vec and...
1,470 views2024-10-11
Recipe for Word and Sentence Embeddings: Word2Vec and now Static Embeddings for word embeddings, and to get more context, use the sentence t...
Facts. We need data. #datascience #statistics #analys...
1,462 views2022-05-01
Facts. We need data. #datascience #statistics #analysis #techtok
Replying to @minisdlatvia my big tip for learning dat...
1,458 views2022-07-16
Replying to @minisdlatvia my big tip for learning data science #datascience #machinelearning #analytics #codetok #webapps #gradio #streamlit...
Curriculum Learning is about ordering your training d...
1,447 views2024-07-27
Curriculum Learning is about ordering your training data. It's another useful technique that you should consider. Some background: Overview:...
Level set expectations early! People have unrealisti...
1,447 views2022-06-07
Level set expectations early! People have unrealistic views. #datascience #dataanalytics #statistics #codetok
#insurance #regulation #datascience #statistics #inte...
1,447 views2022-05-21
#insurance #regulation #datascience #statistics #interpretablemodels #codetok
TikTok video #7445674328907123999
1,446 views2024-12-07
TikTok video #7445674328907123999
Reply to @zythesciguy Reply to @zythesciguy #datascie...
1,445 views2022-05-29
Reply to @zythesciguy Reply to @zythesciguy #datascience #statistics #codetok
Shapley Values in Machine Learning
1,424 views2022-08-10
Shapley Values in Machine Learning
My blog post journey from Jekyll to Quarto. #bloggin...
1,419 views2024-06-04
My blog post journey from Jekyll to Quarto. #blogging #rajistics #jekyll #quarto #posit
Seen this being hashed out on Twitter and had to join...
1,417 views2022-08-26
Seen this being hashed out on Twitter and had to join #dataengineering #codetok #duckdb #spark #datascience
Mmm. Food classification.
1,409 views2024-08-04
Mmm. Food classification.
Its always longer than you want to get your data prep...
1,402 views2022-08-07
Its always longer than you want to get your data prepped. #datascience #dataengineering #analytics #codetok
Keep your data science projects page loaded by making...
1,400 views2022-06-03
Keep your data science projects page loaded by making open source versions of your work. #datascience #codetok #programming #python
Evaluation data is so so important when working on ma...
1,395 views2024-08-09
Evaluation data is so so important when working on machine learning or generative AI projects. Labeling data is an imporant task and you can...
What a data scientist does #datascience #analytics #c...
1,393 views2022-09-04
What a data scientist does #datascience #analytics #codetok #python
Scaling LLMs. Two years ago OpenAI had a big lead. Si...
1,378 views2025-01-07
Scaling LLMs. Two years ago OpenAI had a big lead. Since then, other companies have learned to effectively scale their model training.
Clustering with kmeans
1,363 views2024-08-22
Clustering with kmeans

Every once in a while, I go back and try to build som...
1,359 views2025-11-09
Every once in a while, I go back and try to build some AI from the ground up. Lately, its been "Mixture of Experts" (MoE) models, and I foun...
Generative AI is awesome. If you need more ideas chec...
1,356 views2024-07-25
Generative AI is awesome. If you need more ideas check out all 35 real world examples for using Generative AI / LLMs that Evidently has put ...
Great data scientists figure out the best questions c...
1,356 views2022-11-05
Great data scientists figure out the best questions come from talking to people. #datascience book is practical python and opencv by rosebr...
Agents in AI Some effective examples for using agents...
1,349 views2025-01-04
Agents in AI Some effective examples for using agents based on posts from Anthropic and Hugging Face. Hugging Face - SmolAgents: https://hug...
Machibr learning tradeoffs between explainabilty and ...
1,345 views2024-08-06
Machibr learning tradeoffs between explainabilty and accuracy.
Your data science 101 reminder when working with clas...
1,289 views2022-10-10
Your data science 101 reminder when working with classification models. #datascience #statistics #codetok
What tools are way too hard to use? #datascience #st...
1,285 views2022-03-05
What tools are way too hard to use? #datascience #statistics #analytics
This fits so well for agents. Timeless.
1,282 views2025-02-05
This fits so well for agents. Timeless.
Filling in those job duties 🚩🚩 #datascience #codetok
1,275 views2022-08-23
Filling in those job duties 🚩🚩 #datascience #codetok
ChatGPT has sucked up a lot of my attention. Will do ...
1,265 views2022-12-07
ChatGPT has sucked up a lot of my attention. Will do a post soon on how it works.
I was rocking with SPSS back in 2009. I didn’t start...
1,263 views2022-03-20
I was rocking with SPSS back in 2009. I didn’t start using R until a few years later. We had to pay $$$ for a basic regression. #datascien...
Technical AI systems are vulnerable to out-of-distrib...
1,260 views2025-02-20
Technical AI systems are vulnerable to out-of-distribution behaviors, as shown through evasion techniques like unusual movement patterns and...
Reminder to be smart about how you using your trainin...
1,254 views2022-12-13
Reminder to be smart about how you using your training data. #machinelearning #datacentricai #datascience #waymo #reinforcementlearning
Should you take the time to learn Kubernetes as a dat...
1,215 views2023-01-23
Should you take the time to learn Kubernetes as a data scientist? Or you already overloaded learning data science? #datascience #machinelear...
Offering ways to improve your machine learning models...
1,211 views2022-07-23
Offering ways to improve your machine learning models #huggingface #datascience #codetok #datacentricai #adversarial
Tracking Covid - #datascience #analytics #wastewater ...
1,206 views2022-01-27
Tracking Covid - #datascience #analytics #wastewater #covid19 #data #monitoring
Have some projects in your github #datascience #githu...
1,204 views2022-08-03
Have some projects in your github #datascience #github #codetok
Probe the data #dataanalysis #datascience #statistics...
1,164 views2022-08-01
Probe the data #dataanalysis #datascience #statistics #bias
Who would have known that my pronunciation of LaTeX w...
1,137 views2022-10-15
Who would have known that my pronunciation of LaTeX would be such a big deal and so divisive. It’s all good. Go listen for yourself. @rajist...
Having some fun connecting a spreadsheet to a ML mode...
1,136 views2022-11-04
Having some fun connecting a spreadsheet to a ML model. It wasn’t too hard and it’s pretty cool to have it working this way. #datascienc...
Replying to @chairstaple so many good distance metric...
1,130 views2022-10-02
Replying to @chairstaple so many good distance metrics - what’s yours? This video covers Hamming, Levenshtein, Euclidean, Manhattan, and Ma...
Evaluation of Large Language Models is a critical top...
1,124 views2024-08-29
Evaluation of Large Language Models is a critical topic. Leaderboards provide little guidance for evaluation but have many flaws. I posted t...
Catch me on the Practically intelligent podcase eposi...
1,115 views2024-05-29
Catch me on the Practically intelligent podcase eposide 13. I talk about open source, enterprise AI, and the crazinees of the LLM space.
Getting the best distance metric is crucial for solvi...
1,108 views2024-10-06
Getting the best distance metric is crucial for solving analytical problems. This video reviews Euclidean, Manhattan, Mahabolobis, Levenshte...
üöÄ Just get started on your journey to learn large ...
1,097 views2023-07-26
üöÄ Just get started on your journey to learn large language models! ü§î Is there a lot to learn? Yes! üòÖ ü§∑‚Äç‚ôÇÔ∏è But is it easy t...
Some hints on how to evaluate Github projects.
1,081 views2024-08-28
Some hints on how to evaluate Github projects.
New style of content, let me know if you want more li...
1,074 views2022-11-13
New style of content, let me know if you want more like this. Predict sentiment #machinelearning #datascience #transformers #huggingface
Long video in comments, #huggingface #datascience #re...
1,065 views2022-07-15
Long video in comments, #huggingface #datascience #reinforcementlearning #deeplearning #codetok #mltok Earlier weeks: @Rajiv Shah @Rajiv Sha...
My favorite was a training on how to use zoom #securi...
1,037 views2022-05-24
My favorite was a training on how to use zoom #securitytraining #codetok
Quick introduction to optimization and for advanced f...
1,031 views2022-12-24
Quick introduction to optimization and for advanced folks, go run a notebook from gurobi or do the Kaggle Santa challenge. #datascience #mac...
AI only knows what's it's trained on. So beat it by d...
1,025 views2023-02-21
AI only knows what's it's trained on. So beat it by doing something new. The video shows recent examples of marines beating a surveillance s...
Netflix $1 million dollar prize #datascience #ai #net...
1,024 views2022-01-09
Netflix $1 million dollar prize #datascience #ai #netflix #PepsiApplePieChallenge
Reasoning and planning are key weaknesses of LLMs. Th...
1,023 views2024-08-13
Reasoning and planning are key weaknesses of LLMs. This video was from last year, but the issue still remains. My guess is we will see addit...
Wow. Look at that subscription revenue.
1,017 views2024-07-12
Wow. Look at that subscription revenue.
Comparing algorithms spiral dataset, #datascience #ma...
1,014 views2022-01-24
Comparing algorithms spiral dataset, #datascience #machinelearning #algorithms #gbm #logisticregression
Beer and diapers story of association of products. #d...
1,012 views2022-02-18
Beer and diapers story of association of products. #datascience #recommendationsystems #marketing #analytics #correlation
Still making PowerPoint-2008-level charts? AI gives y...
1,010 views2025-12-01
Still making PowerPoint-2008-level charts? AI gives you a shortcut to stunning visuals and real design skills. Pick the level that fits wher...
CLIP Interrogator is available over at the hugging fa...
1,010 views2022-10-25
CLIP Interrogator is available over at the hugging face spaces. Have fun! #datascience #machinelearning #stablediffusion #huggingface
Looking forward to a lot more videos in 2023, let me ...
1,005 views2023-01-01
Looking forward to a lot more videos in 2023, let me know topics I should cover. For all my videos, I put them in an airtable spreadsheet av...
Predicting NCAA basketball #marchmadness #datascience...
1,004 views2022-03-25
Predicting NCAA basketball #marchmadness #datascience #sportsanalytics #illinois
It’s tough to be content #codetok #techtok #datascien...
987 views2022-05-10
It’s tough to be content #codetok #techtok #datascience #programming
What’s the deal with those competition rules #datasci...
968 views2022-08-28
What’s the deal with those competition rules #datascience #codetok #analytics #kaggle
Creating music videos with stable diffusion and whisp...
966 views2022-09-28
Creating music videos with stable diffusion and whisper. This colab notebook uses a dream studio backend for the images. Another great step ...
ChatGPT for Robotics is the latest hot paper. Large l...
961 views2023-02-22
ChatGPT for Robotics is the latest hot paper. Large language models are the future interface. #datascience #machinelearning #largelanguagemo...
News flash: Data scientists spend lots of time on dat...
947 views2022-07-24
News flash: Data scientists spend lots of time on data prep/exploration #datascience #dataengineering #analytics #codetok
Vicuna is awesome go check it out. Its the latest LLa...
928 views2023-03-30
Vicuna is awesome go check it out. Its the latest LLama model and very impressive. I ended up cutting out the details on vicuna since i feel...
Reply to @anthonycomputer Dive in and start! Lots of...
899 views2022-04-20
Reply to @anthonycomputer Dive in and start! Lots of great stuff out there. #datascience #techtok #analytics
Great way to get under the skin of your data scientis...
897 views2022-10-08
Great way to get under the skin of your data scientist. #datascience #analytics #codetok
I am awful about writing tests. This is why I don’t w...
896 views2022-04-01
I am awful about writing tests. This is why I don’t write production code. #datascience #cstok #programminghumor #codetok
Simple tip, never claim causation. Unless you have an...
886 views2022-11-05
Simple tip, never claim causation. Unless you have an experimental design, it’s hard to prove. #datascience #machinelearning #statistics
Temperature is an important parameter when working wi...
881 views2023-03-21
Temperature is an important parameter when working with many models including got-3. This video gives a short background on temperature and ...
From Spiegelhalter interview on Artists of Data Scien...
867 views2022-05-23
From Spiegelhalter interview on Artists of Data Science podcast #datascience #statistics #codetok #dataanalysis
Practical Lessons for building generative AI: I share...
863 views2024-10-02
Practical Lessons for building generative AI: I share the latest research and earned wisdom on building generative AI applications and touch...
I have lived this. #conwayslaw #softwaredevelopment #...
846 views2022-05-17
I have lived this. #conwayslaw #softwaredevelopment #codetok #programming
Rerunning your old code #datascience #techtok #progra...
843 views2022-04-12
Rerunning your old code #datascience #techtok #programming #analytics
Data science work is hard to schedule and plan. It co...
843 views2022-01-26
Data science work is hard to schedule and plan. It conflicts with agile methods. #datascience #machinelearning #dataanalytics #agile #scrumm...
I have no desire to build data infrastructure. I will...
802 views2022-12-02
I have no desire to build data infrastructure. I will leave that to my #dataengineer friends. #datascience
Climax, a new transformer based model for predicting ...
799 views2023-02-07
Climax, a new transformer based model for predicting weather and climate forecasting. Great example of the flexibility of transformers based...
Corporate research labs have changed academic work wi...
798 views2023-01-28
Corporate research labs have changed academic work with their reluctance to provide reproducible research and getting around blind peer revi...
#aifilter #aifilterchallenge had to try it out and go...
795 views2022-12-17
#aifilter #aifilterchallenge had to try it out and got a bit more buff
Dreaded git push error. Had a little help tonight. #...
780 views2022-07-23
Dreaded git push error. Had a little help tonight. #git #datascience #python
Context engineering works, until it doesn’t. Recursiv...
755 views2026-01-04
Context engineering works, until it doesn’t. Recursive Language Models ask a sharper question: why are humans managing memory, search, and p...
ChatDoctor is a great example of fine tuning a large ...
749 views2023-03-22
ChatDoctor is a great example of fine tuning a large language model to get more factually correct output. This is an approach i expect many ...
It’s exasperating. #techtok #datascience #programming
731 views2022-04-22
It’s exasperating. #techtok #datascience #programming
New to Unix or Bash? This is a fast, visual walkthrou...
721 views2026-01-04
New to Unix or Bash? This is a fast, visual walkthrough of the core terminal commands every beginner should know: where you are, how to move...
Being above average part II. Cite in comments. @raji...
710 views2022-02-13
Being above average part II. Cite in comments. @rajistics #statistics #regressiontothemean #aboveaverage
Tuskegee Airman by geo karamanis links to code in com...
674 views2022-02-14
Tuskegee Airman by geo karamanis links to code in comments #TidyTuesday #rstats #datascience #datavisualization
self driving cars and data quality - LOA - #datascien...
674 views2022-01-25
self driving cars and data quality - LOA - #datascience #machinelearning #selfdrivingcar #stanford #data #stats
Those pesky outliers.
666 views2022-10-20
Those pesky outliers.
My second try to explain in context learning or few s...
660 views2023-01-27
My second try to explain in context learning or few shot learning with large language models. It’s very cool and why these models are so e...
One of my favorites for #explainability #datascience ...
655 views2022-06-28
One of my favorites for #explainability #datascience #statistics #interpretability #codetok #python #machinelearning
Some things are bigger than data science, I have a pe...
637 views2022-02-25
Some things are bigger than data science, I have a personal connection here and have to express my support. #ukraine #priceoffreedom #datasc...
Reply to @noleli median versus mean
622 views2022-02-11
Reply to @noleli median versus mean
The damage I have done with root access. What have yo...
611 views2022-09-09
The damage I have done with root access. What have you done? #codetok #python
Pair programming is some of my favorite times as a da...
609 views2023-03-19
Pair programming is some of my favorite times as a data scientist. I am starting to use ChatGPT to fill that role lately. Its useful for me....
ChatGPT for Robotics is the latest hot paper. Large l...
587 views2023-02-22
ChatGPT for Robotics is the latest hot paper. Large language models are the future interface. #datascience #machinelearning #largelanguagemo...
Our fellow algorithms calling mom featuring our linea...
581 views2022-10-21
Our fellow algorithms calling mom featuring our linear model, XGBoost, and Neural Networks. I had fun making them.
False positive and false negative #datascience #stati...
575 views2022-02-06
False positive and false negative #datascience #statistics #decionmaking #classificationalgorithm #algorithm
Scaling laws help us figure out how manage the amount...
563 views2023-01-07
Scaling laws help us figure out how manage the amount of training data versus the model size. DeepMind showed with Chinchilla by using more ...
Deciding whether to use a Large Language Model or a s...
549 views2023-06-02
Deciding whether to use a Large Language Model or a smaller model? This video explores the tradeoffs between both approaches based on the la...
We all play roulette with stackoverflow. #programming...
538 views2022-03-24
We all play roulette with stackoverflow. #programming #datascience #python
It’s almost here. Full support for pandas in sklear...
534 views2022-10-18
It’s almost here. Full support for pandas in sklearn pipelines. #machinelearning #datascience #codetok #python #sklearn #sci-kit
Don’t feel bad if you havent put a machine learning m...
533 views2022-08-11
Don’t feel bad if you havent put a machine learning model into production. Lots of valuable data scientist haven’t done fhat.
Machine learning engineer growing career #machinelear...
520 views2022-01-29
Machine learning engineer growing career #machinelearning #datascience #dataengineering #programming #ai #career #programmingbootcamp #stati...
Anthropic is starting to preview their model and peop...
508 views2023-01-08
Anthropic is starting to preview their model and people are comparing it to ChatGPT. Thanks to Riley Goodside for sharing screenshots. It lo...
At least it will be faster to build the second time. ...
501 views2022-09-29
At least it will be faster to build the second time. Ugh. How often have you had to recode something?
So many hyperparameters - this is from pytorch foreca...
495 views2022-01-30
So many hyperparameters - this is from pytorch forecasting #datascience #machinelearning #hyperparameters #coding #algorithms #modeling
Getting explainability when working with transformer ...
490 views2022-10-19
Getting explainability when working with transformer based image or vision models. Uses Captum on the backend, but makes it easy to get imag...
Talk business not data science metrics to have a busi...
490 views2022-02-16
Talk business not data science metrics to have a business impact #datascience #machinelearning #statistics #analytics
Reinforcement learning with my Eat Melon! Demo based ...
476 views2022-04-05
Reinforcement learning with my Eat Melon! Demo based on Karpathy #datascience #reinforcementlearning #techtok #machinelearning
AI Literacy, Question 1, can AI think by itself? #ai ...
471 views2022-01-16
AI Literacy, Question 1, can AI think by itself? #ai #datascience #programming #counsciousness #alleninstitute #literacy #capcut
Replying to @rajistics as promised, the feature or va...
435 views2023-02-12
Replying to @rajistics as promised, the feature or variables in auto insurance models. Keep the feedback coming. #datascience #machinelearni...
How I added my TikTok, instagram, and YouTube videos ...
426 views2024-04-02
How I added my TikTok, instagram, and YouTube videos to my website. I used Buzzlytics to gather the information, python to munge all the da...
TikTok video #7141035352076094762
388 views2022-09-08
TikTok video #7141035352076094762
How companies use your data for training models will ...
356 views2023-01-19
How companies use your data for training models will be a big issue this year. GitHub is being sued for Copilot and Hugging Face has been bu...
ChatGPT price drop. Let’s break down how much the p...
342 views2023-03-02
ChatGPT price drop. Let’s break down how much the price dropped, how OpenAI could drop the price, the effects on performance, what is goin...
I hope this pain isn’t shared widely #techtok #powerp...
332 views2022-04-12
I hope this pain isn’t shared widely #techtok #powerpoint #datascientist
Big data bowl submissions are going in and lots of gr...
330 views2023-01-13
Big data bowl submissions are going in and lots of great sports analytic work. This one is on strain for evaluating pass rushers. #datascien...
Explanations in Machine Learning
314 views2022-08-10
Explanations in Machine Learning
Logical song full explanation here: @rajistics #sij...
308 views2022-01-21
Logical song full explanation here: @rajistics #sijinx #maddencurse #stats #analytics #regression
Recognizing Meta AI's contribution
307 views2023-05-14
Recognizing Meta AI's contribution
The pain. Data munging on poorly prepped data. #datas...
297 views2022-03-22
The pain. Data munging on poorly prepped data. #datascience #analytics #csv
Overdue for sports analytics #datascience #analytics ...
286 views2022-08-12
Overdue for sports analytics #datascience #analytics #codetok #sportsanalytics #machinelearning
Accuracy is not your friend for most problems #datasc...
280 views2022-01-13
Accuracy is not your friend for most problems #datascience #media #norythm
2022 plans #2022 #datascience #ai #machinelearning #s...
279 views2022-01-19
2022 plans #2022 #datascience #ai #machinelearning #skillup start with AI Literacy @rajistics
i love notebooks #notebooks #programming #rstats
274 views2022-01-12
i love notebooks #notebooks #programming #rstats
It’s important to make sure your model is well cali...
262 views2022-11-11
It’s important to make sure your model is well calibrated. This becomes especially important with imbalanced data. #machinelearning #datas...
Open Source with Stable Diffusion - #datascience #cod...
256 views2022-08-27
Open Source with Stable Diffusion - #datascience #codetok #machinelearning #stablediffusion #opensourcesoftware
Using agents in langchain with gpt-3. You can do this...
245 views2023-03-04
Using agents in langchain with gpt-3. You can do this! Go check it out. #datascience #machinelearning #openai #gpt3 #langchain
Contrastive learning is common for folks working in N...
243 views2022-11-03
Contrastive learning is common for folks working in NLP and images. This was new to me, so wanted to share the intuition a bit more widely. ...
Meta’s less than open source model and some bad tak...
224 views2023-03-05
Meta’s less than open source model and some bad takes from Twitter. #datascience #machinelearning #largelanguagemodels #opensource #meta
Reminder to visualize your data with one of my favori...
222 views2022-10-29
Reminder to visualize your data with one of my favorites #anscombesquartet #datavisualization #datascience #statistics
AI that makes you feel better. The paper is Inducing ...
208 views2022-10-14
AI that makes you feel better. The paper is Inducing Positive Perspectives with Text Reframing. You can find a demo over at ü§ó hugging fac...
Your weekly dose of LLM news. I liked this because it...
207 views2022-11-06
Your weekly dose of LLM news. I liked this because it had interesting results with a smart approach. #datascience #machinelearning #largelan...
Interpretable models are often overlooked, but a grea...
203 views2022-11-05
Interpretable models are often overlooked, but a great addition to your data science toolkit. Imodels is a great python package for getting ...
What did i do this time? I hope your IT experienced ...
199 views2022-07-13
What did i do this time? I hope your IT experienced go much better.
Interpretable models are often overlooked, but a grea...
182 views2022-11-05
Interpretable models are often overlooked, but a great addition to your data science toolkit. Imodels is a great python package for getting ...
Random forests and their ease of use are important in...
181 views2023-02-18
Random forests and their ease of use are important in understanding modern data science. #datascience #machinelearning #statistics #randomfo...
Using AI for Pose Detection, this is such a cool appl...
171 views2022-09-05
Using AI for Pose Detection, this is such a cool application. #datascience #deeplearning #codetok #posedetection #sportsanalytics
Dreams of a better GPU #gpu #nvidia #deeplearning #ga...
170 views2022-02-12
Dreams of a better GPU #gpu #nvidia #deeplearning #gaming #datascience
Pie chart fails #stats #datascience #datavisualizatio...
169 views2022-01-15
Pie chart fails #stats #datascience #datavisualization #piechart #analytics #fails
Explaining how Emily Ocasio won second place with her...
145 views2023-03-29
Explaining how Emily Ocasio won second place with her project analyzing media coverage. I like her approach and highlights a growing trend o...
Crowdsource labor for #ai #machinelearning - longer v...
129 views2022-01-22
Crowdsource labor for #ai #machinelearning - longer video explaining this coming out later today.
Lets talk about why enterprises are considering alter...
126 views2023-03-18
Lets talk about why enterprises are considering alternatives to chatGPT by looking to open source. An open source strategy can affect lots o...
Starting to see people productionizing GPT-3 workflow...
120 views2023-03-11
Starting to see people productionizing GPT-3 workflows. I am a bug fan of using large language midels. Here is how one data science dealt wi...
Dealing with over plotting, another visualization tip...
113 views2023-01-08
Dealing with over plotting, another visualization tips from data to viz #datascience #machinelearning #statistics #datavisualization
OpenAI plugins! Lets get everyones APIs working with ...
109 views2023-03-23
OpenAI plugins! Lets get everyones APIs working with LLMs! This isa good thing. #largelanguagemodels #langchain #openai #datascience #machin...
AI Literacy, Q2, Driverless cars #ai #datascience #dr...
109 views2022-01-18
AI Literacy, Q2, Driverless cars #ai #datascience #driverless #cars #tesla #fsd #alleninstitute
Try out these examples for yourself and lots more are...
101 views2023-01-31
Try out these examples for yourself and lots more are available. It’s scary cool how these models are working. #datascience #machinelearni...
Learning curves, it’s a technique I use all the tim...
95 views2022-11-16
Learning curves, it’s a technique I use all the time when training models. Thanks to Todd C for showing me the best way to explain this. #...
My data science setup for now #datascience #codetok #...
87 views2022-08-20
My data science setup for now #datascience #codetok #python #rstats #posit #vscode #googlecolab #digitalocean #conda
Data drift analysis is a must for production workload...
79 views2023-03-13
Data drift analysis is a must for production workloads. Here is Uber’s D3 system fie automated drift analysis. This video covers types of ...
Just how smart is ChatGPT and other #largelanguagemod...
79 views2023-01-04
Just how smart is ChatGPT and other #largelanguagemodels? Big Bench is a set of benchmark tests to asses the performance of the models. And ...
No big deal, use visualization #stats #datascience #d...
73 views2022-01-14
No big deal, use visualization #stats #datascience #datasaurus #datascience #analytics #anscombe #visualization
Replying to @anansaadi OpenAssistant is an open sourc...
63 views2023-02-19
Replying to @anansaadi OpenAssistant is an open source project that aims to provide a chat based assistant that connects to other sources of...
Applying a classic methodology of ablation when worki...
63 views2022-11-12
Applying a classic methodology of ablation when working with stable diffusion prompts. Ablation is very common in many techniques to underst...
Cleanlab is open source and will improve your data qu...
61 views2023-01-29
Cleanlab is open source and will improve your data quality. It’s so underrated. This was hard to record vertically, so go try it out. #dat...
Models that cheat, take shortcuts, and leak informati...
58 views2023-01-03
Models that cheat, take shortcuts, and leak information are all part of the data scientist life style. Ever my data scientist has a story li...
tiktok e2a990da04ca9451707be3db1b1bbef66c5254a5
2024-09-30
tiktok d7bef697298bb1d985ca3c6f7cbd11ed7ae7d689
2024-09-01
tiktok 417158c1e75afdec1eb5bc35f46b7b138c6056ca
2024-08-31
tiktok a6aa66a4f29979c305632aa3bf16159062777ba7
2024-08-30
tiktok 0e623c2cf183c1083703c98f65cd692b76e6aa5b
2024-08-29
tiktok b9a370cab2d3b0936212e9c2ce29d961e2ab3cd4
2024-08-22
tiktok 9df3656300e252c0d3de70d269aad4bc567806c0
2024-08-20
5 tiers of data centers
2024-07-18
5 tiers of data centers
tiktok 0b122ac2e9694c92b91164414905566fdd2fc14f
2024-07-12
tiktok 4cf7ae463f2bad712f25c9e362e25157c65521a6
2024-07-05
tiktok 8c066ba61667767f6b38fe18b5dd8e1e281f6683
2024-07-01
7 Baseline Models: Time Series: Previous Value Anomal...
2024-06-22
7 Baseline Models: Time Series: Previous Value Anomaly: p99 Search: BM25 Recommendation: Popularity Buy recommendations: last viewed Classif...
tiktok bb7abbecd2c5cd54f01726919f43c5d803879533
2024-06-06
tiktok 19e9c4dc3117a7c860ebb3df3e3f7ce69ef12488
2024-06-04
tiktok 954676c979b9f2656d789b3e0f97f299509c178d
2024-06-01
tiktok 775eb1bfdcf68644342be85b1c8c554005253e0c
2024-05-30
tiktok 7fd83346daf58d13afdb3984dcb33cc9195efd7a
2024-05-29
tiktok 00c676f59f8cf572ebe8336fe2df7c4b0bd5ca5e
2024-05-27
tiktok cf32a89cf851869db4c43616089a42d2cf26b89f
2024-05-21
The politics of ChatGPT, it’s no different than any...
2022-12-27
The politics of ChatGPT, it’s no different than any other technology and is not neutral. If you want a simple explanation of how ChatGTP w...
tiktok de97fd3138a8f8d4d080bb166276cd00c49af5ca
2025-01-02
tiktok 64b0a4d7321788a7ba7541b749e80ecaffb9cd6f
2024-12-23
My take on Objaverse Llama and Alpaca. Not a lot of r...
2023-03-25
My take on Objaverse Llama and Alpaca. Not a lot of respect for copyright or contract terms. #largelanguagemodels #datascience #machinelearn...
Ensembling is key method in machine learning. This vi...
2023-03-14
Ensembling is key method in machine learning. This video introduces ensembling through majority voting. #datascience #machinelearning #ensem...
Best machine learning tools for competitions. Lots of...
2023-03-09
Best machine learning tools for competitions. Lots of great stuff here. #datascience #machinelearning #python #codetok
tiktok 0d4cfd4336634201f295e09f99e92ef5fa6a54c7
2024-12-19
tiktok 1c7fa2839202a58d61ba4c7afa4a28eae107cef1
2025-03-21
tiktok ba59e0260fcb7eee8f65083fe8390511e75acf3d
2024-10-30
tiktok ab70212cbc5914f13b1643b96d76fc60de8dea1d
2024-10-29
tiktok 08ffeb08df0e47f4d3fbbc5c7120071a912b9276
2024-10-28
tiktok caa90da79fbc05977cd1c19f1a42e9568bfa8bc6
2024-10-13
tiktok 3f0c6902fa5a79371eb5d6cda9f068ad9f77c706
2024-10-11
tiktok 48d3b4d7d189f76a66980535eae81feeb890348e
2024-10-06
tiktok b030263c4a5b1a509660564c61b6c54d2b265b4d
2024-10-02
tiktok d3e1d6b2b51a4f6d63a13025820aa59e02e742b1
2024-09-20
tiktok ffcd51d466fbaf06c6b067719c672165f926cd17
2024-05-25
tiktok 844e96adce97f89ef0cc605924da96b094d0fe0f
2024-05-17
tiktok b9c0b9b5b5823a6d96689096093f16871b5f156e
2024-05-04
tiktok 325733951ba1b2b080519863609b58df0876fcc6
2024-05-02
tiktok a5562e05a4234cd0e1c6c939c12f26c0cc677602
2024-04-27
tiktok 54aba190d1b3c5efca9aef8da61c0123890bc641
2024-04-23
tiktok 35db4b850d175713598b9f83c91925299673d3a8
2024-04-18
tiktok b8f5c81c0a4f7eb4981a87d3af5d05a2b3ef5f5b
2024-04-14
tiktok 39fe20eb758016a4a43ecd1d3b552263c0a86de4
2024-04-12
tiktok 2de5791aaafd1430249a1f1b6a97b40454657c60
2024-04-10
Data Centric AI helps to remind us not to focus too m...
2023-02-23
Data Centric AI helps to remind us not to focus too much on the model or algorithms. In real data science, it’s more about understanding y...
tiktok c50c73941e834e5984e3e0c1e90117d5cbebf631
2023-02-19
OpenAI AI classifier is a great example to remind peo...
2023-02-04
OpenAI AI classifier is a great example to remind people of the limitations when detecting rare events. It’s not intuitive, so I showed th...
Picking the right GPU for deep learning based on Tim ...
2023-01-17
Picking the right GPU for deep learning based on Tim Dettmers blog post.
Clustering with k-means. This skit was inspired by th...
2022-12-31
Clustering with k-means. This skit was inspired by the examples in Schubert paper on stop using the elbow criterion for kmeans. Any other cl...
Models that cheat, take shortcuts, and leak informati...
2023-01-03
Models that cheat, take shortcuts, and leak information are all part of the data scientist life style. Ever my data scientist has a story li...
Metaâs Cicero AI that plays Diplomacy and knows how...
2022-11-24
Metaâs Cicero AI that plays Diplomacy and knows how to get its way with people. #datascience
DiffusionDet is bringing generative approaches to obj...
2022-11-21
DiffusionDet is bringing generative approaches to object detection #computervision
Regularization is something I need more in my everyda...
2022-11-19
Regularization is something I need more in my everyday life.
Automatic speech recognition using transformers. It i...
2022-11-17
Automatic speech recognition using transformers. It is that easy!
Editing facts in large language models. An exciting a...
2022-11-06
Editing facts in large language models. An exciting approach that is probing LLMs. #largelanguagemodels #datascience #machinelearning
Software exec at the end is the best. Your quick intr...
2022-11-01
Software exec at the end is the best. Your quick intro to patents, trademarks, and licenses. I see too many comments where people get confus...
Stable diffusion for markup. This is about better und...
2022-10-30
Stable diffusion for markup. This is about better understanding how to go from text to image, not a practical solution. #stablediffusion #da...
Mixing in some law with data science. #craiyon #dalle...
2022-10-30
Mixing in some law with data science. #craiyon #dallemini #stablediffusion #texttoimage #machinelearning #datascience
Reminder to visualize your data #datascience #datavis...
2022-10-30
Reminder to visualize your data #datascience #datavisualization #statistics
tiktok c117fdd786b5fa4b9ca9bf6e63bb5784bbaa2f2b
2023-04-05
tiktok 09d58e11fb8f71d12e407b242351d657cb0331c5
2023-04-02
tiktok 1bfd35d388a29397ed0c8b2413093d3e6846b824
2023-04-01
tiktok 91cfc737c7bfbab33af3e1938be71a135a91b710
2023-03-27
tiktok 0386ff01a07604d0d2939e3c1a395093b9918e48
2023-03-26
tiktok 23ef998b251599c7ad0769b331d3c7388ef46b0b
2023-03-25
tiktok 92b2c950113e3e344c676be2387696f7e2760e53
2023-03-22
tiktok 7600368163371f79348dbde7355ad7b2624f8a25
2023-03-22
tiktok b01c415cbe197395d672a1697f15f8bcce0f6c62
2023-03-19
tiktok 57f46e0b2a673ca85ce1c18b516dc3fb951315d4
2023-03-18
tiktok 9afa0af2e2d89cec29cc5ce519ad2fce9f569801
2023-03-17
tiktok df04d2a352228dd8fe797b2f60216402bea19596
2023-03-15
tiktok b6bc845ad22ac829802bc2b587b1104739cb7b37
2023-03-15
tiktok d4f56cd69671cddaf9659ed34261197c90be41f9
2023-03-13
tiktok 0e24ee241dd470ca7526ee5f0b53e99e277820d0
2023-03-11
tiktok 5cc3823f5e80faa4ba21bb0be4549d9f381c8381
2023-03-10
tiktok ce63b22e2a11d8eac544f2b46a8723093f0f35bd
2023-03-02
tiktok 7c4249bfb070958df2e8dfd21c1651cf7da25266
2023-02-27
Is explainability important for you? #datascience #ex...
2022-08-06
Is explainability important for you? #datascience #explainability #interpretability #statistics #codetalk #machinelearning
tiktok b16b91c28d975c34028f8e7f66812744ed670595
2022-07-27
Learn about foundational models, especially in #nlp #...
2022-04-23
Learn about foundational models, especially in #nlp #naturallanguageprocessing #datascience #deeplearning #analytics #techtok #openai
Loss Functions - simple example of MAE versus RSME #d...
2022-08-30
Loss Functions - simple example of MAE versus RSME #datascience #statistics #analytics #codetok #regression
Rust for machine learning. It’s useful in some case...
2022-09-25
Rust for machine learning. It’s useful in some cases for ML, but learn python first. #datascience #codetok #python #machinelearning #rust
Diffusion models for markup. #datascience #machinelea...
2022-10-13
Diffusion models for markup. #datascience #machinelearning #stablediffusion
AI that makes you feel better. The paper is Inducing ...
2022-10-14
AI that makes you feel better. The paper is Inducing Positive Perspectives with Text Reframing. You can find a demo over at ü§ó hugging fac...
TabPFN revolution in data science. Please don’t you...
2022-10-22
TabPFN revolution in data science. Please don’t your time on all this hype. Every week there is a revolution announced on Twitter. Ignore ...
Reminder to visualize your data with one of my favori...
2022-10-29
Reminder to visualize your data with one of my favorites #anscombesquartet #datavisualization #datascience #statistics
Software exec at the end is the best. Your quick intr...
2022-10-30
Software exec at the end is the best. Your quick intro to patents, trademarks, copyright, and licenses. I see too many comments where people...
Checking out Flan T5 large language models. Let me kn...
2022-11-09
Checking out Flan T5 large language models. Let me know what wisdom you can find in this model. #machinelearning #datascience #largelanguage...
New style of content, let me know if you want more li...
2022-11-13
New style of content, let me know if you want more like this. Predict sentiment #machinelearning #datascience #transformers #huggingface
Learning curves, it’s a technique I use all the tim...
2022-11-16
Learning curves, it’s a technique I use all the time when training models. Thanks to Todd C for showing me the best way to explain this. #...
Automatic Speech recognition in 3 lines of code using...
2022-11-17
Automatic Speech recognition in 3 lines of code using wav2vec2 in transformers #datascience #machinelearning #huggingface #automaticspeechre...
Galactica by meta. Cool model, poor form on sharing i...
2022-11-17
Galactica by meta. Cool model, poor form on sharing it out. #datascience #machinelearning I feel for students, it was going to write a lot o...
I need to focus on adding more Regularization to my l...
2022-11-19
I need to focus on adding more Regularization to my life. #datascience #statistics #regularization
Meta’s Cicero for playing Diplomacy is impressive a...
2022-11-23
Meta’s Cicero for playing Diplomacy is impressive and a bit scary. #statistics #datascience #machinelearning #diplomacy
A couple of examples of what not to do and what you s...
2022-12-23
A couple of examples of what not to do and what you should do when presenting your data science results to the business. #datascience #stati...
The politics of ChatGPT, it’s no different than any...
2022-12-27
The politics of ChatGPT, it’s no different than any other technology and is not neutral. If you want a simple explanation of how ChatGTP w...
Dtreeviz 2.0 - Visualizing Decision Trees
2022-12-28
Dtreeviz 2.0 - Visualizing Decision Trees
GPT3.5 takes the bar exam with very little tuning. It...
2022-12-30
GPT3.5 takes the bar exam with very little tuning. It does pretty well. #gpt #datascience #machinelearning #barexam #law
Clustering with k-means. This skit was inspired by th...
2022-12-31
Clustering with k-means. This skit was inspired by the examples in Schubert paper on stop using the elbow criterion for kmeans. Any other cl...
Image captioning models - GIT from Microsoft and BLIP...
2023-01-05
Image captioning models - GIT from Microsoft and BLIP from salesforce #datascience #machinelearning #imagecaptioning
Scaling laws help us figure out how manage the amount...
2023-01-07
Scaling laws help us figure out how manage the amount of training data versus the model size. DeepMind showed with Chinchilla by using more ...
Dealing with over plotting, another visualization tip...
2023-01-08
Dealing with over plotting, another visualization tips from data to viz #datascience #machinelearning #statistics #datavisualization
Using LangChain with GPT3. I am seeing lots of cool d...
2023-01-14
Using LangChain with GPT3. I am seeing lots of cool demos based on LangChain and needed to make I covered it. It’s an easy way to take adv...
Picking a GPU for deep learning based on Tim Dettmers...
2023-01-16
Picking a GPU for deep learning based on Tim Dettmers classic blog post. #datascience #machinelearning #deeplearning #gpu
How companies use your data for training models will ...
2023-01-19
How companies use your data for training models will be a big issue this year. GitHub is being sued for Copilot and Hugging Face has been bu...
Google’s sparrow is the rumored competitor to OpenA...
2023-01-21
Google’s sparrow is the rumored competitor to OpenAI ChatGPT. Check out the paper to see lots of examples of it chatting. It looks really ...
Should you take the time to learn Kubernetes as a dat...
2023-01-23
Should you take the time to learn Kubernetes as a data scientist? Or you already overloaded learning data science? #datascience #machinelear...
I can’t make this stuff up. OpenAI released their c...
2023-01-31
I can’t make this stuff up. OpenAI released their classifier and I saw all these messages about how ineffective it is. Wanted to get this ...
My second try to explain in context learning or few s...
2023-01-27
My second try to explain in context learning or few shot learning with large language models. It’s very cool and why these models are so e...
How enterprises are dealing with ChatGPT it’s a pre...
2023-02-05
How enterprises are dealing with ChatGPT it’s a pretty familiar cycle of grief. The good thing is it does open up lots of cool use cases. ...
Climax, a new transformer based model for predicting ...
2023-02-07
Climax, a new transformer based model for predicting weather and climate forecasting. Great example of the flexibility of transformers based...
Random forests and their ease of use are important in...
2023-02-18
Random forests and their ease of use are important in understanding modern data science. #datascience #machinelearning #statistics #randomfo...
Replying to @anansaadi OpenAssistant is an open sourc...
2023-02-19
Replying to @anansaadi OpenAssistant is an open source project that aims to provide a chat based assistant that connects to other sources of...
Feature engineering and data preprocessing are an imp...
2023-02-27
Feature engineering and data preprocessing are an important part of the machine learning process. #datascience #machinelearning #featureengi...
Pandas 2.0 combing with arrow. A short recap on how i...
2023-03-01
Pandas 2.0 combing with arrow. A short recap on how it fits in with polars, dplyr, and data.table. #datascience #machinelearning #rstats #py...
ChatGPT price drop. Let’s break down how much the p...
2023-03-02
ChatGPT price drop. Let’s break down how much the price dropped, how OpenAI could drop the price, the effects on performance, what is goin...
OpenAI plugins! Lets get everyones APIs working with ...
2023-03-23
OpenAI plugins! Lets get everyones APIs working with LLMs! This isa good thing. #largelanguagemodels #langchain #openai #datascience #machin...