Abhik Sarkar
Paper
Computer Vision
Deep Residual Learning for Image Recognition
You Only Look Once: Unified, Real-Time Object Detection
YOLO9000: Better, Faster, Stronger
YOLOv3: An Incremental Improvement
YOLOv4: Optimal Speed and Accuracy of Object Detection
Natural Language Processing
Attention Is All You Need
Visual Transformer Language Representation Models
CLIP: Connecting Text and Images
Blog
Natural Language Processing
The Illustrated Transformer
The Illustrated GPT-2 (Visualizing Transformer Language Models)
Transformers from Scratch
Book
Machine Learning
Machine Learning Design Patterns: Solutions to Common Challenges in Data Preparation, Model Building, and MLOps
Designing Machine Learning Systems: An Iterative Process for Production-Ready Applications