Called meta-rewards learning, the new method is a very interesting development in reinforcement learning.

Source: https://builtin.com/machine-learning/reinforcement-learning

I recently started an AI-focused educational newsletter, that already has over 80,000 subscribers. TheSequence is a no-BS (meaning no hype, no news etc) ML-oriented newsletter that takes 5 minutes to read. The goal is to keep you up to date with machine learning projects, research papers and concepts. Please give it a try by subscribing below:

Reinforcement learning has been at the center of some of the biggest artificial intelligence(AI) breakthroughs of the last five years. In mastering games like Go, Quake III or StarCraft, reinforcement learning models demonstrated that they can surpass human performance and create unique long-term strategies…


Called cognitive shifted neurons, the new method brings adaptability capabilities to traditional meta-learning techniques.

Source: https://www.keystepmedia.com/adaptability-change/

I recently started an AI-focused educational newsletter, that already has over 80,000 subscribers. TheSequence is a no-BS (meaning no hype, no news etc) ML-oriented newsletter that takes 5 minutes to read. The goal is to keep you up to date with machine learning projects, research papers and concepts. Please give it a try by subscribing below:

Adaptability is one of the key cognitive abilities that defined us as humans. Even as babies, we can intuitively shift between similar tasks even if we don’t have prior training on them. This contrasts with the traditional train-and-test approach of most artificial intelligence(AI) systems…


Machine Learning

NetHack poses new challenges to RL algorithms.

Image Credit: Facebook Research

I recently started an AI-focused educational newsletter, that already has over 80,000 subscribers. TheSequence is a no-BS (meaning no hype, no news etc) ML-oriented newsletter that takes 5 minutes to read. The goal is to keep you up to date with machine learning projects, research papers and concepts. Please give it a try by subscribing below:

Reinforcement learning(RL) has been at the center of some of the most impressive achievements in artificial intelligence(AI) in the last decade. From DeepMind’s famous AlphaGo to milestones in games such as StarCraft II, Dota 2 or Minecraft, RL remains one of the fastest growing…


Weekly newsletter with over 80,000 subscribers that discusses impactful ML research papers, cool tech releases, the money in AI, and real-life implementations.

📝 Editorial: Why Mobile Deep Learning is Tougher Than You Think

Mobile devices represent a primary runtime for our daily interactions with machine learning models. However, the vast majority of machine learning experiences in mobile devices are delivered in a server-side architecture with the machine learning model executing in a cloud environment and exposing results to mobile apps via an API. From training, personalization to computational resource consumption, the mobile deep learning paradigm presents many inefficiencies for mobile architectures. The holy grail of mobile deep learning is to build models that can execute natively and efficiently in mobile devices. …


A very clever solution to one of the most difficult challenges in reinforcement learning.

Source: https://medium.com/syncedreview/a-look-at-the-case-for-bayesian-deep-learning-ffa38dfd7124

I recently started an AI-focused educational newsletter, that already has over 80,000 subscribers. TheSequence is a no-BS (meaning no hype, no news etc) ML-oriented newsletter that takes 5 minutes to read. The goal is to keep you up to date with machine learning projects, research papers and concepts. Please give it a try by subscribing below:

Artificial intelligence(AI) agents often operate in environments with partial or incomplete information. In those settings, agents are often forced to find a balance between exploring the environment or taking actions that yield an immediate reward. The exploration-exploitation dilemma is one of the fundamental frictions…


Artificial Intelligence

Glow is an iconic interesting research about deep neural networks that can generalize with small training sets.

Image Credit: OpenAI

I recently started an AI-focused educational newsletter, that already has over 80,000 subscribers. TheSequence is a no-BS (meaning no hype, no news etc) ML-oriented newsletter that takes 5 minutes to read. The goal is to keep you up to date with machine learning projects, research papers and concepts. Please give it a try by subscribing below:

Since the early days of machine learning, artificial intelligence scenarios have faced with two big challenges in order to experience mainstream adoption. First, we have the data efficiency problem that requires machine or deep learning models to be trained using large and accurate datasets…


From holding times to exchange activity, blockchain dataset reveal some fascinating insights about the behavior of investors during the last few weeks.

Source: https://viaqe.com/shock-prediction-from-analyst-knowing-every-step-of-bitcoin-50000-correction/

In recent weeks, Bitcoin has experienced one of the most aggressive corrections in its market history. A lot has been written about the market activity of Bitcoin in centralized exchanges during this period but there are some fascinating insights that can be derived from analyzing the activity in the Bitcoin blockchain. Today, I would like to highlight some blockchain indicators that reveal a unique perspective about the recent activity in the Bitcoin market.

1) Activity by Medium Term Holders Increase Drastically

IntoTheBlock’s UTXO Age indicator shows that activity in Bitcoin UTXOs that between 3-to-6 months increased drastically while the activity between 1-to-3 months holders had the sharpest decrease…


Deep Learning

These three basic ideas should be put in place in any machine learning modeling experiment.

Source: https://towardsdatascience.com/overfitting-vs-underfitting-ddc80c2fc00d

I recently started an AI-focused educational newsletter, that already has over 80,000 subscribers. TheSequence is a no-BS (meaning no hype, no news etc) ML-oriented newsletter that takes 5 minutes to read. The goal is to keep you up to date with machine learning projects, research papers and concepts. Please give it a try by subscribing below:

Overfitting is considered one of the biggest challenges in modern deep learning applications. Conceptually, overfitting occurs when a model generates a hypothesis that is too tailored to a specific dataset to the data making it impossible to adapt to new datasets. A useful analogy…


Weekly newsletter with over 80,000 subscribers that discusses impactful ML research papers, cool tech releases, the money in AI, and real-life implementations.

📝 Editorial: AI Incumbents and Their Favorite ML Frameworks

Traditionally, open-source innovation in technology markets is targeted to challenge the incumbents in the field and boost a new generation of startups that can capitalize on the innovations of contributors to open-source projects. From the OS wars to recent cloud or big data trends, technology markets are full of examples in which an open-source project challenges big technology incumbents, which then react by creating their own alternative. Machine learning (ML) has challenged those conventional dynamics. …


By training robots by playing games, DeepMind takes a unique angle at the exploration-exploitation dilemma.

Image Credit: DeepMind

I recently started an AI-focused educational newsletter, that already has over 80,000 subscribers. TheSequence is a no-BS (meaning no hype, no news etc) ML-oriented newsletter that takes 5 minutes to read. The goal is to keep you up to date with machine learning projects, research papers and concepts. Please give it a try by subscribing below:

Creating agents that can learn like children is one of the ultimate goals of artificial intelligence. Disciplines such as reinforcement learning(RL) are fully devoted to create self-learning models that can use a combination of punishment and reward feedback to master a new task. However…

Jesus Rodriguez

CEO of IntoTheBlock, Chief Scientist at Invector Labs, I write The Sequence Newsletter, Guest lecturer at Columbia University, Angel Investor, Author, Speaker.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store