CS495 - Machine Learning, Fall 2009
Topics: random variables, probability distributions, expected value, variance, binomial distribution, Gaussian (normal) distribution
Relevant readings: Table 5.2 and Section 5.3 through 5.3.4 in Mitchell
Prob and stats

- Probability, statistics, and sampling theory aren't just useful in ML. They are generally useful in computer science (such as in the study of randomized algorithms).
Some basics

- A random variable is a numerical outcome of some random "experiment"
  - It can be thought of as an unknown value that changes every time it is observed
  - Similar to Math.random()
  - Example: the result of rolling a die
- A random variable's behavior is governed by its probability distribution
  - The probability distribution is a specification of the likelihood of the random variable taking on different values
  - Example: for rolling a die, the probability distribution is uniform, meaning that each possibility (1, 2, 3, 4, 5, 6) has equal probability (1/6) of coming up
- Random variables can be discrete or continuous
- The probability that random variable X takes on the value x is denoted Pr(X = x)
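The die example above can be sketched in Python (the function names here are illustrative, not from the reading):

```python
import random

def roll_die():
    """Observe the random variable: a fresh value on every call,
    just like Math.random() gives a fresh value each time."""
    return random.randint(1, 6)

def die_pmf(x):
    """Pr(X = x) for a fair six-sided die: uniform over {1, ..., 6}."""
    return 1 / 6 if x in {1, 2, 3, 4, 5, 6} else 0.0
```

Note that the probabilities over all possible outcomes sum to 1, as they must for any probability distribution.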
Continuous probability distributions

- Consider a uniform continuous random variable X over the range [0, 2]
- What is Pr(X = 1/2)? Zero.
- What is Pr(X = 1/4)? Zero.
- What is Pr(1/4 ≤ X ≤ 1/2)? 1/8.
- So for continuous random variables, we work in terms of ranges of values, rather than the probability of equaling some value exactly
- Think of it this way: what's the probability of throwing a dart and hitting exactly the right spot on the wall? Zero. But after it's thrown, it actually hit some exact spot. Therefore zero probability doesn't necessarily mean "impossible".
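For a uniform distribution, the probability of a range is just the interval's length divided by the total width of the support. A small sketch (the function name is my own):

```python
def uniform_interval_prob(lo, hi, a, b):
    """Pr(a <= X <= b) for X uniform on [lo, hi].

    The density is the constant 1 / (hi - lo), so the probability of an
    interval is its (clipped) length times that density."""
    a, b = max(a, lo), min(b, hi)  # clip the interval to the support
    if b <= a:
        return 0.0
    return (b - a) / (hi - lo)
```

For X uniform on [0, 2], the interval [1/4, 1/2] has length 1/4 and the density is 1/2, giving 1/8; a single point has length 0, giving probability 0.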
Some basic notions

- Summation notation: Σ_{i=0}^{n} a_i means a_0 + a_1 + ··· + a_n
- The expected value (a.k.a. mean) of a random variable Y is E[Y] = Σ_i y_i · Pr(Y = y_i), where the sum is over all possible outcomes
  - Just a weighted average (the continuous case would be an integral)
- The variance of a random variable Y is Var(Y) = E[(Y − E[Y])²]
  - This is a measure of how much Y deviates from the mean
- The standard deviation of a random variable Y is simply √Var(Y)
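These definitions translate directly to code. A sketch for a discrete distribution, representing the pmf as a dict from outcomes to probabilities (that representation is my own choice):

```python
def expected_value(pmf):
    """E[Y] = sum over outcomes y of y * Pr(Y = y): a weighted average."""
    return sum(y * p for y, p in pmf.items())

def variance(pmf):
    """Var(Y) = E[(Y - E[Y])^2]: expected squared deviation from the mean."""
    mu = expected_value(pmf)
    return sum((y - mu) ** 2 * p for y, p in pmf.items())

# The fair-die distribution: uniform over {1, ..., 6}
die = {x: 1 / 6 for x in range(1, 7)}
```

For the die, the mean comes out to 3.5 and the variance to 35/12 ≈ 2.92.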
Discrete uniform distribution

- Suppose X is a discrete random variable with uniform probability distribution over the set {1, 2, 3, ..., m}
- What is E[X]? (m + 1)/2
- What is Var(X)? (m² − 1)/12
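These closed forms can be checked against the definitions directly. A quick verification sketch using exact rational arithmetic (helper names are my own):

```python
from fractions import Fraction

def direct_mean(m):
    """E[X] computed straight from the definition: sum of x * Pr(X = x)."""
    p = Fraction(1, m)  # each outcome has probability 1/m
    return sum(Fraction(x) * p for x in range(1, m + 1))

def direct_var(m):
    """Var(X) = E[(X - E[X])^2] computed straight from the definition."""
    p = Fraction(1, m)
    mu = direct_mean(m)
    return sum((Fraction(x) - mu) ** 2 * p for x in range(1, m + 1))
```

Using Fraction avoids floating-point rounding, so the direct sums match the closed forms (m + 1)/2 and (m² − 1)/12 exactly.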
Continuous uniform distribution

- Suppose X is a continuous random variable with uniform probability distribution over the interval [a, b]
- What is E[X]? (a + b)/2
- What is Var(X)? (b − a)²/12
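A seeded Monte Carlo sketch can sanity-check these formulas by sampling (the sample size and tolerance are arbitrary choices of mine):

```python
import random

def sample_uniform_stats(a, b, n=200_000, seed=0):
    """Estimate the mean and variance of Uniform[a, b] from n samples."""
    rng = random.Random(seed)  # fixed seed for reproducibility
    xs = [rng.uniform(a, b) for _ in range(n)]
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs) / n
    return mean, var
```

For [0, 2], the estimates should land near the theoretical mean (0 + 2)/2 = 1 and variance (2 − 0)²/12 = 1/3.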
Binomial distribution

- Imagine flipping an unfair coin n times, where the coin has probability p of landing on heads
- A binomial distribution answers: "how many times did we get heads?"
- Is this a discrete or continuous distribution? Discrete.
- Suppose X is a random variable with a binomial probability distribution
- What is E[X]? np
- What is Var(X)? np(1 − p)
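Both results follow from the binomial pmf, Pr(X = k) = C(n, k) · p^k · (1 − p)^(n − k). A sketch that verifies them numerically (helper names are my own):

```python
from math import comb

def binomial_pmf(n, p, k):
    """Pr(X = k): probability of exactly k heads in n flips."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)

def binomial_mean_var(n, p):
    """Mean and variance computed directly from the pmf definitions."""
    mean = sum(k * binomial_pmf(n, p, k) for k in range(n + 1))
    var = sum((k - mean) ** 2 * binomial_pmf(n, p, k) for k in range(n + 1))
    return mean, var
```

For example, with n = 10 and p = 0.3 the direct sums agree with np = 3 and np(1 − p) = 2.1.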
Gaussian (normal) distribution

- If you add lots of random variables together, the sum is itself a random variable whose probability distribution tends to look like a Gaussian
- Thus, Gaussian distributions show up a lot in nature
- It's a "bell-shaped curve"
- The probability density for a Gaussian with mean µ and variance σ² is:

  e^(−(x − µ)² / (2σ²)) / (σ√(2π))
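A direct translation of that density, plus a crude numerical check that it integrates to 1 (the midpoint integrator here is just for illustration):

```python
from math import exp, pi, sqrt

def gaussian_pdf(x, mu, sigma2):
    """Density of a Gaussian with mean mu and variance sigma2."""
    return exp(-(x - mu) ** 2 / (2 * sigma2)) / sqrt(2 * pi * sigma2)

def integrate(f, lo, hi, steps=100_000):
    """Midpoint-rule numerical integration of f over [lo, hi]."""
    dx = (hi - lo) / steps
    return sum(f(lo + (i + 0.5) * dx) for i in range(steps)) * dx
```

Integrating the standard normal density over [−8, 8] captures essentially all the probability mass, so the result is very close to 1.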
Application
I
Suppose we are programming Naive Bayes and want to handle a continuous variable
Application
I
Suppose we are programming Naive Bayes and want to handle a continuous variable
I
One way we could do this: measure the mean and variance of the relevant examples, model them with a probability distribution (such as a Gaussian), and use that distribution to help determine whether “yes” or “no” is more likely for the given instance
Application
I
Suppose we are programming Naive Bayes and want to handle a continuous variable
I
One way we could do this: measure the mean and variance of the relevant examples, model them with a probability distribution (such as a Gaussian), and use that distribution to help determine whether “yes” or “no” is more likely for the given instance