Jesus RodriguezinTowards AIInside GameNGen: Google DeepMind’s New Model that can Simulate Entire 1993’s DOOM Game in Real TimeGameNGen represents a major milestone in creating generative AI models that can interact with complex real world environments.6d ago16d ago1
Jesus RodriguezinTowards AIHow NVIDIA Pruned and Distilled Llama 3.1 to Create Minitron 4B and 8BThe new models are using state of the art pruning and distillation techniques.Aug 261Aug 261
Jesus RodriguezinTowards AIInside The AI Scientist: The AI Agent for Open-Ended Scientific DiscoveryThe framework combines different generative AI models to streamline scientific research from idea to paper.Aug 202Aug 202
Jesus RodriguezinTowards AIMeet PromptPoet: The New Prompt Engineering Framework that Everyone is Talking AboutOriginally created by Character.ai, PromptPoet abstracts some of the core building blocks of prompt engineering.Aug 121Aug 121
Jesus RodriguezinTowards AIMeet Gemma Scope and ShieldGemma: Google DeepMind’s New Releases for Interpretability and…The two frameworks are part of the Gemma 2 release.Aug 6Aug 6
Jesus RodriguezinTowards AIInside DeepMind’s AlphaProof and AlphaGeometry 2: Two Models that Achieved Silver Medal Status in…One model focuses on algebra and number theory, while the other mastered geometry.Jul 29Jul 29
Jesus RodriguezinTowards AIInside NuminaMath: The AI Model that Took The First Place In the AI Math OlympiadThe model used strong data curation, fine-tuning processes, and algorithmic improvements to reach the top of the AIMO leaderboard.Jul 22Jul 22
Jesus RodriguezUnderstanding FlashAttention-3: One of the Most Important Algortihms to Make Transformers FastThe new version takes full advatange of H100 capabilities to improve attention in transformer models.Jul 15Jul 15
Jesus RodriguezinTowards AIInside 4M-21: Apple Small Model that Works Across 21 ModalitiesThe new model could be the foundation of Apple’s on-device AI strategy.Jul 82Jul 82
Jesus RodriguezInside DSPy: A Framework for Algorithmic Prompt OptimizationLaunched a few months ago, the framework has rapidly become one of the most complete LMP stacks in the market.Jun 241Jun 241