Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Summary: A summary of the Mamba architecture.
Topics: Mamba, Foundation models
Date: December, 2023
Summary: A summary of the Mamba architecture.
Topics: Mamba, Foundation models
Date: December, 2023
Summary: A summary of the Google DeepMind Gemini tech report.
Topics: Gemini, Foundation models
Date: December, 2023
Summary: A lecture covering the basics of self-supervision in Computer Vision.
Topics: Self-supervised learning
Date: November, 2023
[slides]
Summary: A lecture covering the basics of vision transformers.
Topics: Transformers
Date: November, 2023
[slides]
Summary: A short summary of recent AI-related news.
Topics: AI news
Date: June, 2023
[slides]
Summary: A short summary of recent AI-related news.
Topics: AI news
Date: June, 2023
[slides]
Summary: A short summary of recent AI-related news.
Topics: AI news
Date: May, 2023
[slides]
Summary: A short summary of recent AI-related news.
Topics: AI news
Date: May, 2023
[slides]
Summary: A short summary of recent AI-related news.
Topics: AI news
Date: May, 2023
[slides]
Summary: A short summary of recent AI-related news.
Topics: AI news
Date: May, 2023
[slides]
Summary: A short summary of recent AI-related news.
Topics: AI news
Date: May, 2023
[slides]
Summary: A short summary of recent AI-related news.
Topics: AI news
Date: April, 2023
[slides]
Summary: A very short introduction to the field of computer vision.
Topics: Computer Vision, introduction, tasks
Date: April, 2023
[slides]
Summary: A summary of the SIGBOVIK 2023 paper 'Large Language Models are Few-Shot Publication Scoopers'.
Topics: Scooping, Large Language Models, SIGBOVIK
Date: March, 2023
[slides]
Summary: A description of the GPT-4 technical report by OpenAI.
Topics: GPT-4, Large Language Model, Capabilities, Risks
Date: March, 2023
[slides]
Summary: Samuel's tutorial on NP-complete problems.
Topics: P versus NP, decision problems, verification algorithms, NP-completeness
Date: February, 2023
[slides]
Summary: A short description of 'What is the alignment problem?' by Jan Leike.
Topics: AI alignment, capabilities, the hard problem of alignment
Date: March, 2023
[slides]
Summary: Samuel's tutorial on task-parallel computing.
Topics: Task parallelism, data parallelism, Moore's law, Dennard scaling, Amdahl's law, Gustafson's law
Date: February, 2023
[slides]
Summary: Samuel's tutorial on bucket sort.
Topics: bucket sort, sorting, algorithms, coding
Date: January, 2023
[slides]
Summary: A description of the work 'BLOOM: A 176B-Parameter Open-Access Multilingual Language Model' by Le Scao et al., published on arXiv in November 2022 as part of the BigScience Workshop. This work provides an overview of the BLOOM model and the efforts involved in its creation.
Topics: foundation models, large language models, multilingual models
Date: January, 2023
[slides]
Summary: Samuel's tutorial on radix sort.
Topics: radix sort, sorting, algorithms, coding
Date: January, 2023
[slides]
Summary: Samuel's tutorial on counting sort.
Topics: sorting, algorithms, coding
Date: January, 2023
[slides]
Summary: Samuel's tutorial on lower bounds on the fastest possible comparison sorting algorithms.
Topics: lower bounds, sorting, algorithms, coding
Date: January, 2023
[slides]
Summary: Samuel's tutorial on the quicksort algorithm.
Topics: Quicksort, sorting, algorithms, coding
Date: January, 2023
[slides]
Summary: Samuel's tutorial on Heapsort and the binary heap data structure.
Topics: Heapsort, binary heap, priority queue, data structures, algorithms, coding
Date: December, 2022
[slides]
Summary: Samuel's tutorial on the B-tree data structure.
Topics: B-trees, data structures, algorithms, coding
Date: December, 2022
[slides]
Summary: Samuel's tutorial on the red-black tree data structure.
Topics: Red-black trees, data structures, algorithms, coding
Date: December, 2022
[slides]
Summary: A description of the work 'Crosslingual Generalization through Multitask Finetuning' by Niklas Muennighoff et al., published on arXiv in November 2022 as part of the BigScience Workshop. This work introduced the BLOOMZ and mT0 models.
Topics: multitask finetuning, foundation models, large language models, multilingual models
Date: November, 2022
[slides]
Summary: Samuel's tutorial on the binary search tree data structure.
Topics: Binary search trees, data structures, algorithms, coding
Date: October, 2022
[slides]
Summary: A description of the work 'Scaling Instruction-Finetuned Language Models' by Hyung Won Chung et al., published on arXiv in October 2022. This work introduced the Flan-PaLM 540B model.
Topics: instruction finetuning, foundation models, large language models
Date: October, 2022
[slides]
Summary: A short description of the ReCo framework introduced in the work 'ReCo: Retrieve and Co-segment for Zero-shot Transfer' by G. Shin, W. Xie and S. Albanie, published at NeurIPS in 2022.
Topics: semantic segmentation, foundation models, zero shot transfer
Date: October, 2022
[slides]
Summary: Samuel's tutorial on big O notation and other asymptotic notation (little o, big theta, big omega, little omega).
Topics: Complexity, data structures, algorithms, analysis
Date: October, 2022
[slides]
Summary: Samuel's tutorial on the hash table data structure.
Topics: Hash tables, data structures, algorithms, coding
Date: September, 2022
[slides]
Summary: A description of 'PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts' by S. Bach et al., presented as a demo at ACL 2022 as part of the BigScience project.
Topics: Prompting, annotation tools, NLP
Date: September, 2022
[slides]
Summary: A description of the 🤗 Datasets library, as summarised by Q. Lhoest et al. in their EMNLP 2021 demo paper.
Topics: Hugging Face, machine learning, NLP
Date: September, 2022
[slides]
Summary: Samuel's tutorial on row-major and column-major ordering for multi-dimensional array storage.
Topics: computing, memory, arrays
Date: September, 2022
[slides]
Summary: Samuel's tutorial on 'Why numbering should start at zero' by Edsger W. Dijkstra, published in 1982 as EWD 831.
Topics: Dijkstra, counting, programming conventions
Date: September, 2022
[slides]
Summary: A slow video description of the Codex research paper.
Topics: codex, language models, foundation models, coding
Date: July, 2022
[slides]
Summary: A slow video description of the GPT-3 research paper.
Topics: language models, foundation models, GPT-3, scaling
Date: July, 2022
[slides]
Summary: A slow video description of the introduction to the Foundation Models report from Stanford.
Topics: foundation models, emergence, homogenisation
Date: June, 2022
[slides]
Summary: A video digest of the QB-Norm research paper.
Topics: vision and language, hubness, cross modal retrieval
Date: June, 2022
[slides]
Summary: A video digest of the DINO research paper.
Topics: computer vision, few-shot learning, vision and language
Date: June, 2022
[slides]
Summary: A video digest of the Flamingo research paper.
Topics: computer vision, few-shot learning, vision and language
Date: May, 2022
[slides]
Summary: A video digest of research ideas related to self-supervised learning and the pseudo-labelling algorithm.
Topics: computer vision, self-supervised learning, pseudo-labelling
Date: April, 2022
[slides]
Summary: A video digest of the CLIP research paper.
Topics: computer vision, zero-shot learning, vision and language
Date: April, 2022
[slides]
Summary: A video digest of ideas related to neural network architectures, scaling laws and transformers.
Topics: neural network architectures, scaling laws, transformers
Date: April, 2022
[slides]
Summary: A video digest of research related to anonymisation in egocentric video.
Topics: computer vision, privacy
Date: April, 2022
[slides]
Note on usage/credits:
Please feel free to use the slides and videos if you find them useful.
If so, please maintain references to the original authors of content where they are noted.