Paths that consist of up-steps of one unit and down-steps of units, being bounded below by a horizontal line , behave like ordered tuples of -Dyck paths, provided that . We describe the general case, allowing also to be larger. Arguments are bijective and/or analytic.
A Dyck path consists of up-steps and down-steps, one unit each,
starts at the origin and returns to the origin after steps, and never goes below the -axis. The enumeration involves the
ubiquitous Catalan numbers [1]. The family of -Dyck paths is defined similarly, but
the down-steps are now by units
in one step. Practically every book on combinatorics has something about
this; we only give two citations, [2, 3]. The generating function of these objects, according
to length (the number of steps) can be found by a first return to the
-axis decomposition, where the last represents the down-step that brings
the path back to the -axis for the
first time. The Figure 1 describes this readily; the term ‘1’
refers to the empty path (of length 0). (An equivalent concept is the
family of -ary trees; there
are bijections between paths and trees, and various parameters translate
accordingly.)
Figure 1: The Decomposition of Generalized Dyck Paths for k = 2. Reading from left to
right, the Decomposition Leads to
One can only return to the -axis after a multiple of steps, as each down-step requires
up-steps for compensation.
Consequently we may write . Furthermore we set , making the equation amenable to
the Lagrange inversion, and we can compute the
coefficients of , Now we compute we will use these coefficients of powers of in the next section. The contour
is a small circle in the -plane;
the substitution from to does not change the winding number.
Such computations are quite common in the context of lattice paths
and/or trees.
2. -Dyck
Paths
Selkirk [4]
introduced an extra parameter to
the family of -Dyck paths. The
paths might go below the -axis,
but never go below the horizontal line .
Figure 2: A -Dyck path: Down-steps of 3 Units and Bounded below by the Line −3
Theorem 1. For , the number of -Dyck paths of length is given by
Equivalently, the generating function of -Dyck paths by length is given by
.
That has indeed these
coefficients was discussed in the previous section. It also enumerates
ordered -tuples of -Dyck paths, and the bijection in [4] is between these
two families of objects. It is to be noted that [5] contains somewhat equivalent statements in the
language of -ary trees, and
instead of a boundary line, the nodes are coloured, and the colour of
the root plays a role similar to the in -Dyck paths. We report this
information from [4].
In the present note we want to look at this again, but we also want
to explain what happens if the condition is no longer satisfied, i.e.,
if is allowed. The
generating function is no longer , and has to be replaced by
something more complicated.
But let us start with some bijective arguments. While we consider
-Dyck paths in the following
figures, the illustrations are representative for other values of as well.
Figure 3: The Path from Figure 2, Lifted up 3 Units, with a Sequence of Up-steps in the
Beginning
We lift up a -Dyck paths by
units and add up-steps in the beginning, as can be
seen in Figure 3. The resulting path is a -Dyck path, but does not end on level 0,
but rather on level . It is
classical (see [3, page321]) that
these paths have generating function , thanks to a decomposition
that is sketched in Figure 4. The first part, according to this
decomposition, ends, where the -axis (=level 0) is visited for the last
time. After an up-step, the second part starts and ends when the level 1
is visited for the last time, and so on. All these parts are -Dyck paths themselves, and altogether
up-steps have been
identified.
Figure 4. The Path from Figure 3, Decomposed
Removing extra up-steps, we
are at the generating function again, and the decomposition
gives us (ordered) -Dyck paths. In the example, we get 4
paths, see Figure 5,
Figure 5: Decomposed into 4 Paths; The Second and Third Paths are the Empty Path ε
The paper [4]
provides the same decomposition into -Dyck paths. We hope that our
alternative description is natural and easy to understand.
Since in this running example, two copies of the empty path appear,
we provide an additional example:
Figure 6. A Path for k = 3 and its Decomposition
Observe that the operation “shifting up the path by units” makes the old
origin the first point where the level is reached. In the beginning,
there are the extra up steps; the
new origin is at the left end of these added steps. If this is indeed the only option to
reach this level for the first time without going below the -axis by using a down-step.
If is arbitrarily large, this
is no longer true: We can “live” in the strip with boundaries 0 and
, end at the highest level, and
make one up-step to reach the -path. Figure 7 is a
drawing explaining this.
Figure 7: The Path has a First Part F and a Second Part G. The Length of F will Later be
Called J and Analyzed for k = 1
The decomposition of -Dyck
paths ending on level into and (as shown in Figure 7) is
canonical: starts when the level
has been reached for the first
time, and what comes before is called ; this ends with an up-step, and the part
before “lives” in the strip .
So it is evident that for general the first part has more freedom. In the next section
we will use generating functions to understand the roles of and better. In particular, the length of
the may vary now.
3. Generating Functions
At the moment, we leave the bijective arguments and go back to the
original question about -Dyck
paths with the negative boundary . In order to obtain the relevant
generating functions, we also introduce (temporarily) an upper boundary
at . This means that we consider
paths, living in the strip .
This has the advantage that the generating functions that will appear
are rational. We consider generating functions , where marks the level of the endpoint,
The recursion ,
provided all indices are within the interval , is easy to understand; if one ends
on level , one must have been at
level or at level before the last step. This is best
written as a linear system
The entry 1 on the righthand side corresponds to the function . Let be the determinant of the matrix with
rows (and columns). We find by
expanding along the first column, say, the recursion The solution is This can be
proved by induction on or by
other methods (possibly by the use of Lambert’s and Lagrange’s trinomial
equation; see [6], but this
has not been investigated). For , these polynomials are sometimes
called Fibonacci polynomials and appear e. g. in [7]; this highly cited paper deals with the
height of planar trees, and this is equivalent to the height of Dyck
paths. For , they appear in
[8] when computing
generating functions related to -Dyck paths with a boundary. A general
reference about this method is [9].
We want to link the polynomials to the generating function : The characteristic equation for
the sequence is . The
equation satisfied by is, as
discussed in the introductory section, . Upon setting , we see that this is the same
equation. Consequently has an
explicit expression where the are the other roots of the
characteristic equation. If one now takes a limit for and fixed the contributions coming from the other
roots will disappear, with the result . We will use this now.
According to Cramer’s rule to solve the linear system of equations,
we find and Since we do not need the upper boundary at level
, we push it to infinity:
We are only interested in the instance , with the result . The quantity , with lower boundary
and upper boundary (=no upper restriction) is the
generating function of -Dyck
paths. So, we obtained for the enumeration of -Dyck paths,
(The summation is over all integers satisfying the inequalities .) In the last
expression, we explicitly wrote to emphasize the dependency of the
generating function on the parameter . This explains once again that for
we get the simple
result . In general, the
generating function is given by
The number of -Dyck paths of
length is then given by
The polynomial has alternating
coefficients, and it is quite likely that there is some sort of an
inclusion-exclusion principle underlying.
The polynomial does not have
a combinatorial meaning itself, but we may write
The generating function has a
combinatorial meaning: In the linear system, first replacing by 0 and then and by , leads to , and it counts the
-Dyck paths living in the strip
, ending at the highest level
. The function differs only by an extra
factor , representing an up-step,
touching the level for the first
time. This is exactly the decomposition as given in Figure 7.
Paths ending on their highest level appear in [10, 11].
It is interesting to compare and , by taking their
quotient. This is Taking the limit
of this for (only
Stirling’s formula for factorials is required), we get with This sum
is indeed for , but takes smaller values,
when gets larger in relation to
. This matches intuition that,
when is large, the first part of
the path that does not (yet) hit the level tends to be longer, and the
contribution of the rest, which is measured against , tends to be
smaller.
Figure 8:, and is growing
4. Asymptotics
We want to study the parameter as in the drawing of
Figure 7. It is given by , for some . Recall that each path is uniquely
decomposed by the -part, until the
level has been reached for the
time, and the remainder, the -part.
In order to be able to do explicit calculations, we restrict
ourselves to the classical case
of Dyck paths. (The case of general , which seems to be less combinatorial,
and more analytical, is left as a challenge.) Then the recursion of
second order
admits the solution
with the (classical) substitution , borrowed from [7].
The parameter has an
automatic contribution of , which
we can add later. In our setting, we are interested in , and we are
concentrating on . This means
that we consider the random variable and call it the
parameter of interest. The advantage of this procedure is that now we
only have generating functions in , for which we write .
The probability generating function of interest is
with the
generating function of the Catalan numbers, enumerating Dyck paths by
half-length: the quantity in the denominator is the total number of
objects (not counting the additional up steps in the beginning); the
numerator is the unique decomposition into the -part and the -part. An additional variable is used to count the length of the
-part.
The number of paths of length with parameter is then given as .
We still need to compute ,
the generating function of paths not going below the line and ending on the -axis again.
This can be computed by the linear system as a quotient of the usual
determinants. Consider paths bounded below by , and above by . As a check, we get the product of the two
components and
as it should.
We study now using a second
variable to count the parameter
of interest. We compute, and further We do not try to simplify this any further, but
expand it around , which
corresponds to the singularity , which is relevant for
Dyck-paths. We are in the regime called “sub-critical”, compare [3], where the singular expansion
of numerator and denominator is of the same type:
whence the asymptotic expansion of the parameter of interest is given by . One does not need to
switch back to the -world, since
this quotient can also be obtained via , in
The reader might recall that , and this is
how the singularity
translates into and vice versa.
A recent application of this (including a proof of a conjectured
limiting distribution) is in [12]. In our instance, we are led to
and the quotient of interest is
Multiplying this by 2, in order to switch from half-length to length,
and adding the fixed contribution
leads to the average value of :
In this subcritical regime it is also relatively easy to determine
the discrete limiting distribution. We start from
the quotient does
not depend on the parameter, and can thus be replaced by its limit at
the singularity , or,
easier at : (This simplification
stems from the fact that the limit of a product is the product of
limits, and, more generally, this holds for the local expansions as
well.)
Hence, the discrete limiting distribution is given by the (probability)
generating function
One can even read off coefficients explicitly: (The contour is again a
small circle in the -plane, and
since for small , the curve also winds around the origin
exactly once in the -plane.) This quantity can be interpreted as the
probability that in an (almost) infinitely long path the parameter has value .
Figure 9: Limiting Distribution: t = 7, t = 10, t = 14
Acknowledgment
I thank Nancy Gu for valuable discussions. Most of this work was
developed while I visited the Technical University in Graz, Austria. I
thank my host Peter Grabner for his hospitality. Furthermore, I thank
two referees who pointed out several formulations and statements that
lacked in clarity. In this revised version I provided much more detailed
explanations and fixed some inaccuracies.
Conflict of
interest
The author declares no conflict of interest.
References:
Stanley, R.P., 2015. Catalan Numbers. Cambridge University Press, New York, 2015.
Banderier, C. and Flajolet, P., 2002. Basic analytic combinatorics of directed lattice paths. Theoretical Computer Science, 281(1-2), pp.37-80.
Flajolet, P., and Sedgewick, R., 2009. Analytic Combinatorics. Cambridge University Press, Cambridge.
Selkirk, S.J., 2019. MSc-thesis: On a generalisation of k-Dyck paths. Stellenbosch University.
Gu, N.S., Prodinger, H. and Wagner, S., 2010. Bijections for a class of labeled plane trees. European Journal of Combinatorics, 31(3), pp.720-732.
de Bruijn, N.G., Knuth, D.E. and Rice, S.O., 1972. The average height of planted plane trees. In Graph Theory and Computing (pp. 15-22). Academic press.
Prodinger, H., 2019. On some problems about ternary paths: a linear algebra approach. Rocky Mountain Journal of Mathematics, 51, pp. 709–720.
Prodinger, H., 2015. Analytic methods. Handbook of enumerative combinatorics, Discrete Mathematics and Applications (Boca Raton), pp.173-252.
Bousquet-Méloû, M. and Ponty, Y., 2007. Culminating paths. Discrete Mathematics & Theoretical Computer Science, 10, pp.125–152.
Hackl, B., Heuberger, C., Prodinger, H. and Wagner, S., 2016. Analysis of bidirectional ballot sequences and random walks ending in their maximum. Annals of Combinatorics, 20(4), pp.775-797.
Prodinger, H., 2016. Returns, Hills, and t-ary Trees. Journal of Integer Sequences, 19, Article 16.7.2.