Generating Functions
From charlesreid1
For coverage of generating functions specific to Donald Knuth's Art of Computer Programming, see AOCP/Generating Functions.
Contents
- 1 Notes
- 1.1 Sedgewick Flajolet: Analysis of Algorithms
- 1.1.1 Ordinary Generating Functions
- 1.1.2 Addition
- 1.1.3 Differentiation
- 1.1.4 Integration
- 1.1.5 Partial Sum Of Prior Terms
- 1.1.6 Multiplication
- 1.1.7 Expanding Generating Function
- 1.1.8 Example: Two-Term Recurrence
- 1.1.9 Example: Three-Term Recurrence
- 1.1.10 Example: Imaginary Roots
- 1.1.11 Example: Quicksort Recurrence
- 1.2 Trotter Chapter 8: Generating Functions
- 1.3 Trotter: Chapter 8 Exercises
- 1.1 Sedgewick Flajolet: Analysis of Algorithms
- 2 Related Pages
- 3 Flags
Notes
Sedgewick Flajolet: Analysis of Algorithms
Ordinary Generating Functions
Definition: define an ordinary generating function using the relation:
Notation for denoting a particular coefficient:
To add coefficeints raised to the N power, add a coefficient to z:
For example, powers of 2 coefficients are given by the generating function:
Can also perform operations on generating function.
Addition
To add generating functions together:
then:
Differentiation
Can also take derivative of series to get:
Also note that the derivative of the basic generating function is given by:
This operation can be performed an arbitrary number of times:
Integration
Can also integrate generating functions. Suppose we have a generating function:
then
For example:
Partial Sum Of Prior Terms
A more exotic operation is a partial sum - this allows us to write a sequence that is the sum of each prior term in the sequence. For example, suppose we start with our usual generating function G(z):
Now we can multiply the generating function by itself to get:
This can be obtained via the following operations:
(note that the last step there is just switching m for m-k.) Continuing:
Some examples of this: start with the log generating function, as before,
Multiplication
Multiplying two generating functions is called the convolution of two OGFs. Given two generating functions:
Now the product of these two gives the series:
This gives us the series:
Example:
Expanding Generating Function
Expanding a generating function is the process of expressing unknown generating function as power series.
Typically using Taylor Theorem to expand a function in terms of its derivatives, or using a known generating function (OGF or EGF)
Example: Two-Term Recurrence
Suppose we have the recurrence:
then we can write this recurrence as:
Now multiply by , and sum over all n
Solving this for G,
Now using partial fractions gives
(where the partial fractions expansion results in +1 and -1 as the numerators.)
Expanding this gives:
Example: Three-Term Recurrence
Suppose we have the recurrence:
Expanding and obtaining an expression for the nth term:
Note that if we had a term in the denominator with a higher multiplicity, say a multiplicity of p, we would end up with
Example: Imaginary Roots
Suppose we have an irreducible quadratic in our simplified/combined generating function:
Recurrence:
Now multiply by :
Expanding/solving/simplifying:
Now we have an irreducible quadratic. Expanding this using partial fractions yields:
Now the nth term of the series can be written as:
Example: Quicksort Recurrence
Consider the recurrence relation for quicksort.
To solve this, we can turn each term in the recurrence into a generating function. Start by multiplying both sides by and then sum over all n:
now summing over n:
Looking at each term and turning it into a generating function, we can see the first term (left hand side) is the derivative of our usual generating function:
Note that this is a differential equation, which can be solved using an integrating factor:
This leads to a solution for the differential equation:
Now we have our solution:
Finally, we get a expression for the nth term in the series:
Trotter Chapter 8: Generating Functions
Alternative Generating Functions Introduction
To approach generating functions: consider the problem of distributing apples among children.
We can even boil it down to a super simple case: distributing n apples among n children.
In this case, if there are no apples, the coeff is 0. If there is 1 apple, 1 way to distribute. If 2 apples, 1 way to distribute. Etc.
This makes the generating function:
And this is, as we already know, equivalent to:
Now consider the case where there are two children. Now, each child has an independent scenario in which it gets a particular number of apples. To cover all of these combinations we can multiply two copies of the generating function together:
Now in this case, there is no leading term (no zero-apples case), but there is also no one-apples case, either, since there is no way to obtain an x term. This corresponds to the scenario we have set up, in which each child must have one apple.
That is, when we defined the generating function, we defined the generating function such that if there are no apples, there are no possible configurations. This means that any child getting zero apples is not a possible configuration.
If we went back and changed the generating function to
then we would end up with one possible configuration when a given child has zero apples. This would carry through, so that when there are two children, there is one possible configuration when there are zero apples (z = 0; the leading constant term is 1), and there are also two possible configurations (z + z = 2z) for the case when there is one apple (z = 1; the 2z term gives us 2 configurations).
So, as mentioned the series contains no terms for the case, because each child must have 1 apple.
If we extend this to n children, we will multiply n copies of the original series. Consider the 5-child case:
Here, the smallest term is which corresponds to a single possible configuration when there are five apples.
If we look at the term this represents the number of ways we can distribute n apples among a certain number of children (the number of children determines the final form of the function, i.e., the structure of the problem; n just determines which coefficient of the function we're asking for, i.e., the size of the problem).
Therefore, the coefficient of in our generating function represents the number of ways that we can combine the number of apples each child gets (indexed by k):
Suppose we have 6 apples. Then we are asking, how many ways can we distribute 6 apples among 5 children? This is a relatively easy question, one that can be answered with the binomial coefficient. What we are asking is, how many ways can we combine 5 different k values (one for each child, indicating how many apples they get, or the exponent on the z term) to get a total of 6, which is the total number of apples available.
For five children,
To combine 6 z terms together to get z^6, with the restriction that one z must come from each group, results in the following sets of choices:
which is 5 choose 1, or
We could also get this result through a little bit ore advanced use of calculus and derivatives: suppose that we replace the long expanded generating function with the shorter version, to get a generating function for the one-child case:
and for the five-child case:
Now we can observe that the denominator can be obtained via:
so that
now this can be simplified to
Almost finished. Now we can use
Just to take this example one step further, let's suppose we decide that no child can get more than 4 apples. Then the generating function for each child will terminate after 4 terms (any term beyond that represents the case where a child has more than 4 apples and therefore we do not have any possibilities) and will be:
While this latter calculus approach looks hairy, with practice it becomes much easier to spot how to combine and simplify series. (See exercises below.)
EXAMPLE Colored Marbles with Complicated Restrictions
Let's look at an example that uses less trivial restrictions.
Suppose we are preparing cans of marbles. Each can of marbles will have 20 red, blue, or green marbles. How many different outcomes are there if we require each jar have at least 1 red marble, no more than three blue marbles, any number of yellow marbles, and the number of green marbles must be a multiple of 4?
We begin this problem by starting with an easier problem: let's count the number of outcomes if we put n marbles into each can. We'll create a generating function whose coefficients give this number, and we can then find the 20th coefficient. As with the generating functions above, the term corresponds to six marbles in each can, and the coefficient indicates the number of different ways that our three k values can be combined to get 6.
Now, to write the generating functions for each marble, given our various restrictions:
Start with red, which is easiest - we require at least one red marble, so the case of must correspond to 0 outcomes for red.
so finally we get our generating function for red:
Next, for blue, we require that we have no more than three blue marbles, so any terms larger than will be 0:
Yellow marbles can be any number:
Green marbles are a bit more tricky - we require they only occur in multiples of four, which means the coefficients of the generating function are nonzero only when the degree is divisible by 4:
Notice that we can do a variable transformation , and this becomes the generating function:
Now, this becomes:
and undoing our substitution gives:
Now the generating function for the entire lot is:
Simplifying with Mathematica or Wolfram Alpha:
Analysis of Generating Function
One step, at this point, is to find the Taylor Series expansion of this expression, which would give us our coefficients and the counts of different configurations. We will do that in the next section. However, we can do further analysis to get expressions for the coefficients analytically.
First, we can associate the on the bottom to the sum via the derivative:
and recall that this gives us a link to an infinite series, because , as we have seen many times thus far.
So, we can replace the derivative term with an infinite sum:
and this gives us the generating function in terms of a sum:
Why do all of this? The next steps give us the payoff: we get a closed form for the coefficients.
Notice the triangle number , which we will see crop up in the next section. The triangle number in that form, , rather than its usual form (which gives the sum of the integers from 1 to n), indicates that this is a binomial coefficient. Recall from AOCP/Binomial Coefficients that:
Thus, if we bring the x inside the infinite sum and simplify the expression, we get:
Last, we do a variable shift with the infinite series, shift n up by one, which pops out a 0 term, and write:
or equivalently,
Now we can give the closed-form answer to the question: How many fruit baskets are there? (n+1) choose 2. For a can of 20 marbles, that's:
Indeed, this is a lot nicer than just using Mathematica, but Mathematica certainly scales better:
Taylor Series Analysis
Now, we can use this generating function to explore the possibilities in our system. Here's the Taylor Series expansion to the first 10 terms:
We can actually verify the first few terms.
The case of the z term (k=1 marbles). Let us first examine the z term (k=1 marble). When we have a single marble, there is only one configuration that works, and that is the case of a single red marble. No other marbles are required, strictly speaking. Thus the coefficient is 1.
The case of the z^2 term (k=2 marbles). The case of 2 marbles is more interesting: one slot has to be occupied by a red marble, so the remaining slot can be occupied by a red marble, a blue marble, a yellow marble, or a green marble. But green marbles can only occur in multiples of 4, so they're out. That leaves 3 possible configurations:
RR, RB, RY
The case of the z^3 term (k=3 marbles). If we have 3 marbles, we must have one slot occupied by a red marble. The remaining slots can be occupied by red, blue, or yellow marbles. Green marbles are still out. This gives 3 additional possibilities, added to the prior 3 possibilities, for 6 total:
RRR RRB RRY RBB RBY RYY
Notice the way that the configurations are arranged - in groups of size 3, 2, 1. The total possibilities are 3 + 2 + 1 = 6, which is a triangle number.
Another way to think about this is, we are selecting from a set of 3 colored marbles, which one to put in each spot. Each spot sees the same set of 3 colored marbles, so this is different from saying 3 marbles (RYB) are going into two slots. It is actually like saying, when we choose what color to put into the first slot we are choosing from three marbles (RYB), and when we choose what color to put into the second slot we are also choosing from three marbles (RYB).
The case of the z^4 term (k=4 marbles). This configuration has 10 possibilities. 10 is a triangle number - just like 6.
RRRR RRRB RRRY RRBB RRBY RRYY RBBB RBBY RBYY RYYY
Again, notice the configuration arrangements: (3 + 2 + 1) + (2 + 1) + (1) = 4 + 3 + 2 + 1
These numbers are coming from the term in the Taylor Series analysis we did above:
If we have a term like k-choose-2, that will result in the (k-1)th triangle number. In this case, we have a term (n-1)-choose-2, which will give us the nth triangle numbers. That's why, when we had 3 marbles, we got the 3rd triangle number, when we had 4 marbles, we had the 4th triangle number, etc.
Exponential Generating Functions
What we've seen so far is pretty cool. But the case of marbles in a can, or boxes to pack in a truck, is independent of order. What do we do if we have a problem in which order matters?
In these situations we want to use exponential generating functions. We define an analogous function to to generate the sequence 1,1,1,1,...: we use the series expansion
which gives a basic exponential generating function of:
As before, we will seek to put all functions/sequences of interest in terms of this base function.
EXAMPLE Counting Ternary Strings
Suppose we have ternary strings (numbers consisting of the digits 0, 1, and 2) of length n, and we wish to know how many of these strings have an even number of 0s.
To solve this problem, we apply a formulaic procedure:
First, we determine the generating function for each of the possible choices (0, 1, 2 in this case).
Then, we multiply the generating functions together to get the overall generating function.
For 1s and 2s, we have as many digits as we would like, so the generating functions are the normal 1, 1, 1, 1 series:
For the 0s, we can only have an even number of terms, so we need the series
Unlike with linear generating functions, where complex numbers eliminate particular terms, exponential functions make this a little easier to do.
Start by writing out the exponential sequence that we are using:
then substitute -x for x, to get a slightly different series:
If we add these two series together, the odd terms will cancel out, and we will have precisely the terms we want. This identity can be written:
Now we have the generating function for the 0 digit:
Now the overall generating function becomes:
Now the exponential functions can be converted to infinite series, and the series can be combined and simplified to yield the series form of the generating function.
What this means, in the end, is that the coefficient of the nth term of the infinite series expansion of our generating function is
and we can obtain a closed-form expression for the total number of ternary strings of length n having an even number of 0s. It is:
Trotter: Chapter 8 Exercises
Problem 3
In problem 3, we're given some generating functions, and asked to find a closed form for the nth term.
Part a
Find nth term of generating function:
Solving this one is as simple as applying the binomial theorem:
So the nth term will be:
Part b
Find nth term of generating function:
This one is easy to do, intuitively, since the fourth power means we have a normal generating function that generates every fourth term. It is relatively easy to do by hand, therefore, but more complicated to do formally. Here's the formal approach.
Start by factoring this as , then do a partial fractions decomposition:
Now the goal is to combine these terms into a single series by re-expressing each expression with x in it in terms of , using various tricks shown above and from calculus.
The first term is simple enough:
The second term can be written as a series in (-x):
The last term is a bit challenging. I used Wolfram Alpha to get a series expansion for this function. Here it is:
Converting this to a series expression requires some imaginary numbers to eliminate the odd terms of the series. We have:
where .
These three terms can now be combined:
which becomes
(This is technically the final answer.)
Let's interpret this result a bit more.
Notice that the imaginary terms ensure the series has no odd values of k. For even values of k, the term has an alternating value of +2 or -2. Specifically, if k is divisible by 4 (0 mod 4), we have a value of +2, and if k is 2 mod 4, we have a value of -2.
The term will be +2 when k is even and 0 otherwise, so this contributes no odd terms to the series.
Since there are no odd terms and the values of k that are 2 mod 4 cancel out, we are left with:
So the entire series looks like:
Part c
Again, we're given a generating function and asked to find the nth term.
In part c we have the complement function to part b, so hold on to your hats.
The first two terms of the partial fraction expansion are the same as in part b:
The first two terms become:
The last term is a series that, analogous to the one from part b, has only odd terms. This time we'll take a shortcut and write the series expansion as:
Still struggling to glue this all together, but basically the two terms and will cancel out with each other for odd k, and will cancel out with the imaginary term for even k, for a final series solution that skips every other odd term:
Related Pages
Flags
Combinatorics
Combinatorial Structures - Order Does Not Matter Ordinary generating functions
Labelled Structures - Order Matters Enumerating Permutations: String Permutations Generating Permutations: Cool · Algorithm M (add-one) · Algorithm G (Gray binary code)
Combinatorics Problems Longest Increasing Subsequence · Maximum Value Contiguous Subsequence · Racing Gems Cards (poker hands with a deck of 52 playing cards)
|
Algorithms Part of Computer Science Notes
Series on Algorithms
Algorithms/Sort · Algorithmic Analysis of Sort Functions · Divide and Conquer · Divide and Conquer/Master Theorem Three solid O(n log n) search algorithms: Merge Sort · Heap Sort · Quick Sort Algorithm Analysis/Merge Sort · Algorithm Analysis/Randomized Quick Sort
Algorithms/Search · Binary Search · Binary Search Modifications
Algorithms/Combinatorics · Algorithms/Combinatorics and Heuristics · Algorithms/Optimization · Divide and Conquer
Algorithms/Strings · Algorithm Analysis/Substring Pattern Matching
Algorithm complexity · Theta vs Big O Amortization · Amortization/Aggregate Method · Amortization/Accounting Method Algorithm Analysis/Matrix Multiplication
Estimation Estimation · Estimation/BitsAndBytes
Algorithm Practice and Writeups Project Euler · Five Letter Words · Letter Coverage
|