Intelligence Factory AI

AI: What the Heck is Going On?

6/19/2024

We all grew up with movies about AI, and it always seemed to be decades off. Then ChatGPT was released, and suddenly AI is everywhere.

It's like the future arrived. What happened? How did we get here?

Was it a breakthrough in Artificial Intelligence?

No. Not really. ChatGPT and its kin use a Transformer Network, built on Neural Networks. Neural Nets were invented in 1943. The only recent innovation has been the use of Multi-Head Attention, thanks to a paper from Google in 2017 ("Attention Is All You Need").
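To give a feel for what attention does, here is a minimal sketch of scaled dot-product attention for a single head, written in plain Python. It is only an illustration: the function names and the toy numbers are made up for this example, and a real Transformer runs several such heads in parallel on learned projections of its inputs (that is the "multi-head" part) and combines their outputs.

```python
import math

def softmax(scores):
    """Turn a list of raw scores into weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention for one head.

    queries, keys: lists of vectors that all have length d_k
    values: one vector per key
    Returns one output vector per query.
    """
    d_k = len(keys[0])
    dim = len(values[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        weights = softmax(scores)
        # Each output is a weighted average of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(dim)]
        outputs.append(out)
    return outputs

# Toy example with made-up numbers: 2 tokens, 2-dimensional vectors.
q = [[1.0, 0.0], [0.0, 1.0]]
k = [[1.0, 0.0], [0.0, 1.0]]
v = [[10.0, 0.0], [0.0, 10.0]]
result = attention(q, k, v)
print(result)
```

Each token ends up with a blend of all the value vectors, weighted toward the tokens whose keys best match its query.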

Was it a breakthrough in Learning?

Again, no. Not really. Transformer networks are trained using data and Backpropagation. The modern version of backpropagation was popularized in 1986 (Rumelhart, Hinton, and Williams). So why is this happening now? And why is NVIDIA now the most valuable company in the world?

Image credit: Raysonho @ Open Grid Scheduler / Scalable Grid Engine, own work, CC0

In 2012, Alex Krizhevsky, in collaboration with Ilya Sutskever (the former Chief Scientist of OpenAI) and Geoffrey Hinton (formerly of Google Brain), built a Neural Network for image recognition, known as AlexNet, that was trained using GPUs, specifically NVIDIA GPUs. Since then, the field has focused on putting more data and more GPUs behind bigger models.

Chinchillas and LLaMas

Chinchilla is a family of Large Language Models (LLMs) developed by DeepMind and presented in March 2022. The goal was to explore the effects of more data, more compute, and bigger neural networks. What happens if we increase each? Is there some limit? Will we continue to get better performance? The researchers found that increasing compute, data, and model size (in tandem) should pretty much always lead to better results. What does that mean exactly?
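One common way to make "in tandem" concrete is a rule of thumb often drawn from the Chinchilla results: train on roughly 20 tokens per model parameter, and estimate training compute as about 6 FLOPs per parameter per token. Neither constant appears in this post, and both are approximations rather than exact laws. The function name below is hypothetical. A rough sketch under those assumptions:

```python
# Rule-of-thumb constants. Both are approximations, not exact laws,
# and are assumptions for this sketch.
TOKENS_PER_PARAM = 20       # rough compute-optimal ratio associated with Chinchilla
FLOPS_PER_PARAM_TOKEN = 6   # common estimate of training FLOPs per parameter per token

def compute_optimal_plan(params):
    """Return (training tokens, training FLOPs) for a model with `params` parameters."""
    tokens = TOKENS_PER_PARAM * params
    flops = FLOPS_PER_PARAM_TOKEN * params * tokens
    return tokens, flops

# Compare a few model sizes (1B, 10B, and 70B parameters).
for params in (1e9, 10e9, 70e9):
    tokens, flops = compute_optimal_plan(params)
    print(f"{params:.0e} params -> {tokens:.1e} tokens, {flops:.1e} training FLOPs")
```

Under these assumptions, a 70-billion-parameter model calls for about 1.4 trillion training tokens, which matches the size and data budget reported for Chinchilla itself. Note that a 10x bigger model needs 10x the data and therefore about 100x the compute.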

Bigger is Better

The scaling result means that the industry doesn't really need to worry about fundamental breakthroughs in AI techniques; it just needs to throw more data, more compute, and bigger models at the problem.

"Microsoft, Meta, and Google’s parent company, Alphabet, disclosed this week that they had spent more than $32 billion combined on data centers and other capital expenses in just the first three months of the year." - WSJ
"Microsoft (MSFT.O) and OpenAI are working on plans for a data center project that could cost as much as $100 billion and include an artificial intelligence supercomputer called ‘Stargate’" - Reuters

Where does it end?

No one knows if there is a limit to the Transformer architecture. Some people, like Yann LeCun of Meta, argue that the architecture is fundamentally limited because models built on it lack real understanding and reasoning abilities. Obviously, Google, Microsoft, and others are willing to bet hundreds of billions of dollars that we aren't close to the limits of the architecture.

AGI / ASI

No one knows exactly what LLMs are capable of, or whether they can lead to Artificial General Intelligence (AGI) or Artificial Super Intelligence (ASI), mostly because we don't understand how they work internally. We are still poking the LLMs with sticks, like confused chimpanzees, hoping to gain some understanding (see "Towards Monosemanticity: Decomposing Language Models With Dictionary Learning", https://transformer-circuits.pub/2023/monosemantic-features). I guess we will soon find out.
