- Introduction
- Tools for generating images using description
- Research paper collection
- Improving deep learning performance
- Multi-model deep learning
- AI-based ppt generation
- Visualize your dreams
- ChatGPT
- Multiple modalities of Data to enhance AI
- Computer Vision
-
- Additional models for classification, pose estimation, depth estimation, and ReID are just gotten from paperswithcode rankings
- YOLOv8 is very good for exploring different computer vision tasks since it has detection, segmentation, classification, pose, and tracking (ByteTrack and BoT-SORT without ReID for now) implemented
- Object detection
- CNN-based
- Requires relatively little data (especially if pretrained with datasets such as COCO) fast inference
- YOLOv8 (still no actual paper)
- YOLOv6.3
- Situationally better than v8, but it isn't as user-friendly and doesn't have additional functionality (segmentation, classification, pose)
- https://arxiv.org/abs/2301.05586
- Transformer-based detection
- Generally requires more data and has slower inference but can detect many more types of objects/classes
- DETR
- Grounding DINO: SOTA Zero-Shot Objection Detection
- Can detect new classes with no additional training, one of their examples is a dog's tail
- https://blog.roboflow.com/grounding-dino-zero-shot-object-detection/
- https://arxiv.org/abs/2303.05499
- DINO v2
- CNN-based
- Object segmentation
- YOLOv8 also has segmentation
- Classification
- YOLOv8 also has classification
- CoCa: Contrastive Captioners
- Pose estimation
- YOLOv8 also has pose
- ViTPose+
- Tracking
- BoT-SORT
- Deep OC-SORT
- Better than BoT-SORT in some metrics, and worse in others, but it also runs faster (good for edge devices where the lower fps of BoT-SORT can impact the motion model)
- https://arxiv.org/abs/2302.11813
- Person ReID
- Fast-ReID: A Pytorch Toolbox for General Instance Re-identification
- DenseIL – Dense Interaction Learning for Video-based Person Re-identification
- Depth estimation
-
- Speech
- Computer Vision
-
Novel Algorithms for enhancing AI
-
Generative AI and its implications
- Science in AI and AI in Science
- Healthcare application
- Protein folding
- Physics informed neural networks
- https://docs.nvidia.com/deeplearning/modulus/modulus-sym/user_guide/theory/architectures.html#deeponet
- https://arxiv.org/pdf/2304.00567.pdf
- https://arxiv.org/pdf/2207.05748.pdf
- https://arxiv.org/pdf/2111.13587.pdf
- https://arxiv.org/pdf/2304.13799.pdf
- https://developer.nvidia.com/modulus?ncid=so-link-410285-vt25#cid=hpc03_so-link_en-us
- Trends in AI
- https://www.capgemini.com/wp-content/uploads/2017/09/five_senses_pov.pdf
- https://aiindex.stanford.edu/wp-content/uploads/2021/11/2021-AI-Index-Report_Master.pdf
- https://hai.stanford.edu/sites/default/files/ai_index_2019_report.pdf
- https://aiindex.stanford.edu/wp-content/uploads/2023/04/HAI_AI-Index-Report_2023.pdf