GPT Reviews
Microsoft and Apple drop OpenAI Board plans due to increased regulatory scrutiny in the AI sector.
Research papers on enhancing mathematical reasoning capabilities of large language models and improving mathematical problem-solving capabilities in visual contexts using Multi-modal Large Language Models (MLLMs).
FlashAttention-3, an algorithm that speeds up attention mechanism in large language models by up to 2 times faster than previous versions, while maintaining accuracy with lower precision numbers.
Adaptive In-Context Learning, a technique that simplifies the overall machine learning pipeline, making it more accessible for more organizations.
Contact:ย ย sergi@earkind.com
Timestamps:
00:34 Introduction
02:01ย Microsoft, Apple Drop OpenAI Board Plans as Scrutiny Grows
03:41ย Reproducing GPT-2 in C and CUDA
04:48ย Adaptive In-Context Learning
06:25ย FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision
07:53 Fake sponsor
11:35ย MAVIS: Mathematical Visual Instruction Tuning
13:38 Outro