🧠 Neural Networks Basics

Understanding the Building Blocks of AI

🔬 What is a Neuron?

The Basic Building Block

A neuron is the fundamental unit of a neural network. Just like neurons in your brain receive electrical signals from other neurons, artificial neurons (software modules called nodes) receive inputs, process them, and produce an output.

Brain Inspiration: In the human brain, neurons form a complex, highly interconnected network and send electrical signals to help us process information. Artificial neural networks mimic this with software nodes that perform mathematical calculations.

Worked Example

Suppose Input = 2, Weight (w₁) = 0.5, and Bias = 0:

Output = Input × Weight + Bias
Output = 2 × 0.5 + 0 = 1.0

Key Concepts

  • Input: The data fed into the neuron (like signals in your brain)
  • Weight: How important this input is - positive weights excite the neuron, negative weights suppress it
  • Bias: A constant that shifts the output up or down
  • Output: The result after applying the formula (like a neuron firing)
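The formula above can be expressed in a few lines of Python (a minimal sketch; the function name `neuron` is our own):

```python
def neuron(x, weight, bias=0.0):
    """Single-input neuron: scale the input by its weight, then shift by the bias."""
    return x * weight + bias

# Input = 2, Weight = 0.5, Bias = 0
print(neuron(2, 0.5))  # → 1.0
```

Changing the weight or bias changes what the neuron "decides" — for example, `neuron(2, 0.5, 1.0)` shifts the same output up to 2.0.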

💡 Real-World Analogy

Think of a neuron like a decision-maker: It receives multiple pieces of information (inputs), weighs how important each piece is (weights), adds a personal bias, and then decides whether to "fire" or pass the signal along. Just like how your brain neurons decide whether to send electrical signals based on the inputs they receive!

🔢 Multiple Inputs

Real Neurons Have Many Inputs

In practice, neurons receive multiple inputs at once. Each input has its own weight, and all contributions are summed together. This is how artificial neural networks mimic the brain - where each neuron receives signals from thousands of other neurons!

Multi-Input Neuron

Output = (Input₁ × Weight₁) + (Input₂ × Weight₂) + Bias
Output = (1 × 0.5) + (2 × 0.3) + 0 = 1.1
Result: 1.1
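The weighted sum above translates directly into code (again a minimal sketch with illustrative names):

```python
def neuron(inputs, weights, bias=0.0):
    """Multi-input neuron: weighted sum of all inputs, plus the bias."""
    return sum(x * w for x, w in zip(inputs, weights)) + bias

# Input₁ = 1 (w₁ = 0.5), Input₂ = 2 (w₂ = 0.3), Bias = 0
print(neuron([1, 2], [0.5, 0.3]))  # → 1.1
```

The same function works for a neuron with thousands of inputs — just pass longer lists.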

⚡ Activation Functions

Adding Non-Linearity

Activation functions transform the neuron's output, allowing neural networks to learn complex patterns. Without them, neural networks would only be able to learn linear relationships!

Three Common Activation Functions

ReLU

Rectified Linear Unit

max(0, x)

Returns 0 for negative values, otherwise returns the input unchanged. Most popular in modern networks!

Sigmoid

Squashing Function

1 / (1 + e⁻ˣ)

Squashes any input to a value between 0 and 1. Great for probabilities!

Tanh

Hyperbolic Tangent

(eˣ - e⁻ˣ) / (eˣ + e⁻ˣ)

Similar to sigmoid but outputs between -1 and 1. Centered around zero!

Why Use Them?

Key Benefits

✓ Enable learning complex patterns
✓ Introduce non-linearity
✓ Control output range
✓ Help with gradient flow
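The three functions above are one-liners in Python, using only the standard library:

```python
import math

def relu(x):
    """Rectified Linear Unit: max(0, x)."""
    return max(0.0, x)

def sigmoid(x):
    """Squashes any input into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

def tanh(x):
    """Like sigmoid, but outputs in (-1, 1) and is centered at zero."""
    return math.tanh(x)

print(relu(-2.0), relu(2.0))  # → 0.0 2.0
print(sigmoid(0.0))           # → 0.5
print(tanh(0.0))              # → 0.0
```

Note how ReLU discards negative values entirely, while sigmoid and tanh keep every input but compress it into a fixed range.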

🏗️ Building a Network

Connecting Neurons Together

A neural network is created by organizing neurons into layers. Each layer processes information and passes it to the next layer, allowing the network to learn increasingly complex features.

Input Layer

Where data enters the network. Input nodes receive information from the outside world and pass it along to the next layer for processing.

📥

Hidden Layers

Where the magic happens! Hidden layers take input from the input layer or other hidden layers. They analyze the output from the previous layer, process it further, and pass it to the next layer. Networks can have many hidden layers!

🔮

Output Layer

Gives the final result of all data processing. Can have single or multiple nodes. For binary (yes/no) problems, use 1 output node (gives 1 or 0). For multi-class problems, use multiple output nodes.

📤

Deep Learning

Networks with several hidden layers and millions of neurons are called "deep" neural networks - hence "deep learning"! They can map any input type to any output type.

🏔️

Understanding Weights in Networks

Connections between neurons are represented by numbers called weights:

  • Positive weight: One node excites (activates) another
  • Negative weight: One node suppresses another
  • Higher weight values: More influence on other nodes
  • In deep networks: Millions of these weighted connections work together!

Visual Network Architecture

x₁
x₂
x₃

Input Layer
(3 nodes)

h₁
h₂
h₃
h₄

Hidden Layer
(4 nodes)

y

Output Layer
(1 node)

Each node in one layer connects to every node in the next layer through weighted connections

🎓 How Networks Learn

The Learning Process

Neural networks learn by adjusting their weights and biases to minimize errors. This process is called training!

Training Requirements: Simple networks might need hundreds or thousands of training examples. Deep neural networks need much more training - often millions of examples - compared to other machine learning methods.

1. Forward Pass

Data flows through the network from input to output, producing a prediction.

2. Calculate Loss

Compare the prediction to the correct answer. The difference is the "loss" or error.

📊

3. Backpropagation

Calculate how much each weight contributed to the error. Work backwards through the network.

4. Update Weights

Adjust weights to reduce the error. Repeat this process thousands of times!

🔄

The Learning Formula

New Weight = Old Weight - (Learning Rate × Gradient)
  • Learning Rate: How big of a step to take (usually small, like 0.01)
  • Gradient: The direction and magnitude to adjust the weight
  • Goal: Minimize the loss function over many iterations
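The update rule above is enough to train a toy one-weight "network". This sketch (our own illustrative example) learns y = 2x from three examples, with the gradient of a squared-error loss worked out by hand:

```python
# Learn the mapping y = 2x with a single weight.
weight = 0.0
learning_rate = 0.01
data = [(1, 2), (2, 4), (3, 6)]  # (input, target) pairs, target = 2 * input

for step in range(1000):
    for x, target in data:
        prediction = weight * x               # 1. forward pass
        error = prediction - target           # 2. loss = error ** 2
        gradient = 2 * error * x              # 3. d(loss)/d(weight)
        weight -= learning_rate * gradient    # 4. update rule

print(round(weight, 3))  # → 2.0
```

In a real network, backpropagation computes this same kind of gradient for millions of weights at once instead of one.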

💡 Key Takeaways

What You've Learned

  • 🧠 Neurons (nodes) are software modules that process inputs using weights and biases, inspired by brain neurons
  • 🔢 Multiple inputs are combined using weighted sums - positive weights excite, negative weights suppress
  • ⚡ Activation functions add non-linearity, enabling complex learning beyond simple linear relationships
  • 🏗️ Three layer types: Input (receives data), Hidden (processes patterns), Output (produces results)
  • 🏔️ Deep networks have several hidden layers with millions of neurons and can map any input to any output
  • 🎓 Learning happens through forward passes, loss calculation, backpropagation, and weight updates
  • 📊 Training data: Simple networks need hundreds/thousands of examples; deep networks need millions

🌟 The Big Picture

Artificial neural networks are software programs that mimic how the human brain processes information. By connecting millions of artificial neurons with weighted connections, these networks can learn to recognize patterns, make predictions, and solve complex problems - just like how your brain learns from experience!

📊 Simple Neural Networks

  • Few hidden layers (1-2)
  • Hundreds to thousands of neurons
  • Training data: Hundreds to thousands of examples
  • Good for: Basic pattern recognition, simple classifications
  • Faster to train

🏔️ Deep Neural Networks

  • Several hidden layers (3+)
  • Millions of neurons linked together
  • Training data: Millions of examples needed
  • Good for: Complex tasks like image recognition, language understanding
  • Can map any input type to any output type

🚀 Ready for the Next Lesson?

Your Progress: 3 of 8 lessons complete

Next: Understanding Language Models

You've mastered neural networks! Now discover how these networks process and understand human language.

📚 What You'll Learn:

  • Tokenization: Breaking text into processable pieces
  • Word embeddings: How words become meaningful numbers
  • Attention mechanism: Teaching AI to focus on relevant words
  • Transformer architecture: The breakthrough powering modern LLMs