AI: What the Heck is Going On?

Matt Furnari
06/19/2024

We all grew up with movies of AI and it always seemed to be decades off. Then ChatGPT was announced and suddenly it's everywhere.

It's like the future arrived. What happened ? How did we get here?

Was it a breakthrough in Artificial Intelligence?

No. Not really. ChatGPT and it's kin use a Transformer Network, built on Neural Networks. Neural Nets were invented in 1943. The only recent innovation has been the use of Multi-Head Attention, thanks to a paper from Google in 2017 ("Attention is All You Need").

    Was it breakthrough in Learning?

    Again, no. Not really. Transformer networks are trained using data and Backpropogation. The modern version of backpropogation was invented in 1982 (Rumelhart). So why is this happening now? And why is NVidia now the largest company in the world?

    So why is this happening now? And why is NVidia now the largest company in the world?
    In 2012 researchers Alex Krizhevsky, in collaboration with Ilya Sutskever (the ex-chief Scientist of OpenAI) and Geoffrey Hinton (the ex-head of Google Brain) built a Neural Network for image recognition that was trained using GPUs -- NVidia GPUs. 

    Since then, the field has focused on putting more data, and more GPUs behind bigger models.

    Chinchillas and LLaMas

    Chinchilla is a family of Large Language Models (LLMs) developed by DeepMind and presented in March 2022. The goal was to explore the effects of more data, more compute, and bigger neural networks. What happens if we increase each? Is there some limit? Will we continue to get better performance? 

    The researchers found that increasing compute, data, and model size (in tandem) should pretty much always lead to better results

    What does that mean exactly?

    Bigger is Better

    That means that the industry doesn't really need to worry about fundamental breakthroughs in AI techniques, they just need to throw more data, more compute and more bigger models at the problem.
    "Microsoft, Meta, and Google’s parent company, Alphabet, disclosed this week that they had spent more than $32 billion combined on data centers and other capital expenses in just the first three months of the year. " - WSJ
    "Microsoft (MSFT.O), and OpenAI are working on plans for a data center project that could cost as much as $100 billion and include an artificial intelligence supercomputer called ‘Stargate’"  - Reuters

    Where does it end?

    No one knows if there is a limit to the Transformer architecture. Some people like LeCun of Meta argue that this architecture is fundamentally limited because they lack fundamental understanding and reasoning abilities. 

    Obviously, Google, Microsoft and others are willing to bet hundreds of billions of dollars to that we aren't close to the limits of the architecture.

    AGI / ASI

    No one knows exactly the capacities of LLMs and if they can lead to Artificial General Intelligence or Artificial Super Intelligence. Mostly because we don't understand how they work internally. We are still poking the LLMs with sticks, like confused chimpanzees, hoping to gain some understanding (Towards Monosemanticity: Decomposing Language Models With Dictionary Learning https://transformer-circuits.pub/2023/monosemantic-features). 

    I guess we will soon find out.

    Read More

    Is Your AI a Toy or a Tool? Here’s How to Tell (And Why It Matters)

    11/07/2024
    As artificial intelligence (AI) becomes a powerful part of our daily lives, it’s amazing to see how many directions the technology is taking. From creative tools to customer service automation...
    Read more

    Stop Going Solo: Why Tech Founders Need a Business-Savvy Co-Founder (And How to Find Yours)

    10/24/2024
    Hey everyone, Justin Brochetti here, Co-founder of Intelligence Factory. We're all about building cutting-edge AI solutions, but I'm not here to talk about that today. Instead, I want to share...
    Read more

    Why OGAR is the Future of AI-Driven Data Retrieval

    09/26/2024
    When it comes to data retrieval, most organizations today are exploring AI-driven solutions like Retrieval-Augmented Generation (RAG) paired with Large Language Models (LLM)...
    Read more

    The AI Mirage: How Broken Systems Are Undermining the Future of Business Innovation

    09/18/2024
    Artificial Intelligence. Just say the words, and you can almost hear the hum of futuristic possibilities—robots making decisions, algorithms mastering productivity, and businesses leaping toward unparalleled efficiency...
    Read more

    A Sales Manager’s Perspective on AI: Boosting Efficiency and Saving Time

    08/14/2024
    As a Sales Manager, my mission is to drive revenue, nurture customer relationships, and ensure my team reaches their goals. AI has emerged as a powerful ally in this mission...
    Read more

    Prioritizing Patients for Clinical Monitoring Through Exploration

    07/01/2024
    RPM (Remote Patient Monitoring) CPT codes are a way for healthcare providers to get reimbursed for monitoring patients' health remotely using digital devices...
    Read more

    10X Your Outbound Sales Productivity with Intelligence Factory's AI for Twilio: A VP of Sales Perspective

    06/28/2024
    As VP of Sales, I'm constantly on the lookout for ways to empower my team and maximize their productivity. In today's competitive B2B landscape, every interaction counts...
    Read more

    Practical Application of AI in Business

    06/24/2024
    In the rapidly evolving tech landscape, the excitement around AI is palpable. But beyond the hype, practical application is where true value lies...
    Read more

    AI: What the Heck is Going On?

    06/19/2024
    We all grew up with movies of AI and it always seemed to be decades off. Then ChatGPT was announced and suddenly it's everywhere...
    Read more

    Paper Review: Compression Represents Intelligence Linearly

    04/23/2024
    This is post is the latest in a series where we review a recent paper and try to pull out the salient points. I will attempt to explain the premise...
    Read more

    SQL for JSON

    04/22/2024
    Everything old is new again. A few years back, the world was on fire with key-value storage systems...
    Read more

    Telemedicine App Ends Gender Preference Issues with AWS Powered AI

    04/19/2024
    AWS machine learning enhances MEDEK telemedicine solution to ease gender bias for sensitive online doctor visits...
    Read more

    Read More

    Is Your AI a Toy or a Tool? Here’s How to Tell (And Why It Matters)

    11/07/2024
    As artificial intelligence (AI) becomes a powerful part of our daily lives, it’s amazing to see how many directions the technology is taking. From creative tools to customer service automation...
    Read more

    Stop Going Solo: Why Tech Founders Need a Business-Savvy Co-Founder (And How to Find Yours)

    10/24/2024
    Hey everyone, Justin Brochetti here, Co-founder of Intelligence Factory. We're all about building cutting-edge AI solutions, but I'm not here to talk about that today. Instead, I want to share...
    Read more

    Why OGAR is the Future of AI-Driven Data Retrieval

    09/26/2024
    When it comes to data retrieval, most organizations today are exploring AI-driven solutions like Retrieval-Augmented Generation (RAG) paired with Large Language Models (LLM)...
    Read more

    The AI Mirage: How Broken Systems Are Undermining the Future of Business Innovation

    09/18/2024
    Artificial Intelligence. Just say the words, and you can almost hear the hum of futuristic possibilities—robots making decisions, algorithms mastering productivity, and businesses leaping toward unparalleled efficiency...
    Read more

    A Sales Manager’s Perspective on AI: Boosting Efficiency and Saving Time

    08/14/2024
    As a Sales Manager, my mission is to drive revenue, nurture customer relationships, and ensure my team reaches their goals. AI has emerged as a powerful ally in this mission...
    Read more

    Prioritizing Patients for Clinical Monitoring Through Exploration

    07/01/2024
    RPM (Remote Patient Monitoring) CPT codes are a way for healthcare providers to get reimbursed for monitoring patients' health remotely using digital devices...
    Read more

    10X Your Outbound Sales Productivity with Intelligence Factory's AI for Twilio: A VP of Sales Perspective

    06/28/2024
    As VP of Sales, I'm constantly on the lookout for ways to empower my team and maximize their productivity. In today's competitive B2B landscape, every interaction counts...
    Read more

    Practical Application of AI in Business

    06/24/2024
    In the rapidly evolving tech landscape, the excitement around AI is palpable. But beyond the hype, practical application is where true value lies...
    Read more

    AI: What the Heck is Going On?

    06/19/2024
    We all grew up with movies of AI and it always seemed to be decades off. Then ChatGPT was announced and suddenly it's everywhere...
    Read more

    Paper Review: Compression Represents Intelligence Linearly

    04/23/2024
    This is post is the latest in a series where we review a recent paper and try to pull out the salient points. I will attempt to explain the premise...
    Read more

    SQL for JSON

    04/22/2024
    Everything old is new again. A few years back, the world was on fire with key-value storage systems...
    Read more

    Telemedicine App Ends Gender Preference Issues with AWS Powered AI

    04/19/2024
    AWS machine learning enhances MEDEK telemedicine solution to ease gender bias for sensitive online doctor visits...
    Read more