Superfamily phenomena and motifs of networks induced from ti

Coming to the history of pocket watches,they were first created in the 16th century AD in round or sphericaldesigns. It was made as an accessory which can be worn around the neck or canalso be carried easily in the pocket. It took another ce Edited by Martha Vaughan, National Institutes of Health, Rockville, MD, and approved May 4, 2001 (received for review March 9, 2001) This article has a Correction. Please see: Correction - November 20, 2001 ArticleFigures SIInfo serotonin N

Edited by Shalev Itzkovitz, Weizmann Institute, Rehovot, Israel, and accepted by the Editorial Board October 20, 2008 (received for review July 1, 2008)

Article Figures & SI Info & Metrics PDF

Abstract

We introduce a transformation from time series to complex networks and then study the relative frequency of different subgraphs within that network. The distribution of subgraphs can be used to distinguish between and to characterize different types of continuous dynamics: periodic, chaotic, and periodic with noise. Moreover, although the general types of dynamics generate networks belonging to the same superfamily of networks, specific dynamical systems generate characteristic dynamics. When applied to discrete (map-like) data this technique distinguishes chaotic maps, hyperchaotic maps, and noise data.

chaoscomplex networkssubgraphsembeddingdimension

Recently, a bridge between time series analysis and complex networks has emerged (1, 2). Zhang and Small (1) first introduced a transformation from pseuExecuteperiodic (that is, oscillatory) time series to complex networks. By connecting those nodes whose corRetorting cycles are morphologically similar, the dynamics of time series are encoded into the topology of the corRetorting network. Lacasa et al. (2) have proposed an alternative algorithm to characterize periodic, ranExecutem, and fractal time series based on a similar philosophy. In their scheme, successive scalar time series points are mapped to nodes of the network with links between nodes for which the corRetorting points satisfy a condition on their relative magnitudes. By exploiting the fundamental Preciseties of time series that manifest clearly in the corRetorting networks, they are able to distinguish between broad classes of dynamical systems.

Although both these schemes have been successfully applied to generate complex networks from time series, the authors of each algorithm have only explored the basic global statistics of the network, such as degree distribution and average path length (3, 4). We note that many networks that have the same basic global Preciseties, such as small-world character (5) and scale-free distribution (6), may have wildly different local structures (7). Conversely, networks with different global Preciseties may demonstrate similar local structures (8). Actually, mounting evidence suggests that there might be strong ties between the global topological Preciseties and key local patterns of networks (9).

In Dissimilarity to the degree distribution and the clustering coefficient, the relative frequency of small subgraphs (or motifs) can Characterize the local characteristics of complex networks (7). The rank distributions of these motifs can reflect the local structural Preciseties and thus can be used to classify networks (8). To understand the transformation mechanism between time series and complex networks, it is Necessary to Design a comparison between the local structures of networks from different time series. Whereas the previous works (1, 2) focused on macroscopic Preciseties of the dynamics evident in the network, we turn our attention to the fine features of the dynamics that are only evident on examination of the corRetorting network.

In this article, we will discuss complex networks constructed from a comprehensive battery of flow and map time series, including periodic signals, periodic signals with noise, chaotic signals, hyperchaotic signals, and white and fractal noise. We focus more closely on the local Preciseties of the networks and, in particular, on the distribution of subgraphs within the networks. We examine the frequency of occurrence of various subgraphs within the networks. Furthermore, we find that different types of time series belong to different superfamilies (that is, the set of networks with the same relative abundance of the different subgraphs), and the fine local structures in the complex network Executemain reflect the state recurrence Preciseties of these time series.

A subgraph of size N is the network formed by examining only N specific nodes from the entire network. We only consider subgraphs in which any 2 nodes are connected either directly or indirectly. This leads to the sequence of hieroglyphs Displayn in Fig. 1 where we have listed all of the 6 different undirected subgraphs of size 4. By using the scheme Characterized here, we generate a complex network and observe the relative frequency of different subgraphs for networks derived from time series realizations of different dynamical systems.

Fig. 1.Fig. 1.Executewnload figure Launch in new tab Executewnload powerpoint Fig. 1.

All subgraphs of size 4 in undirected networks. We arbitrarily label these subgraphs A, B, C, D, E, and F as Displayn.

Results

After building the complex network by using the simple scheme Characterized below, we calculate the number of occurrences of each subgraph and rank them in descending order. The relative frequency with which the different subgraphs occur is Displayn to be a sensitive meaPositive of the underlying dynamics. We find that time series derived from different classes of dynamics have distinct local characteristics and therefore different subgraph ranks. Hence, data generated by different types of dynamical systems belong to different superfamilies. As we shall see later, our method can be applied not only to flow data, but also to data from maps.

Flow Data.

Fig. 2 plots the subgraph ranks of the complex network constructed from different types of flow data, including periodic, chaotic, and noisy periodic data, each with 104 points. Here, the time delay τ is determined by the mutual information method (10) and a large embedding dimension (de = 10) is chosen to reliably unfAged the fine structure. We find that the different complex networks from the same type of flow data (e.g., low-dimensional chaos) Display the same rank ordering of subgraph frequencies. When this occurs we say that the networks belong to the same superfamiliy and we call this behavior the superfamily phenomenon of the time series. The observation can be Elaborateed when referring to the original dynamics of these distinct systems.

Fig. 2.Fig. 2.Executewnload figure Launch in new tab Executewnload powerpoint Fig. 2.

Subgraph ranks of different types of time series. (A) Periodic flow data. (B) Chaotic flow data. (C) Periodic flow data with white Gaussian noise. Note that in B, the Mackey–Glass flow is a high-dimensional chaotic system and Executees not follow the same subgraph rank ordering as the other systems in this plot.

Time series derived from different dynamical systems demonstrate distinct local structures in phase space. For a chaotic signal, the reconstructed phase space will generally (although neither necessarily nor sufficiently) generate fractal self-similarity and heterogeneity. Points on the attractor are sparse in some Spots, and dense in others. After being transformed into the complex network Executemain, the points in dense Locations are found to have higher degrees than those in sparse Locations. This cannot be simply attributed to the fact that the density of points is higher, but rather that the interconnection among them is stronger. In a sparse Location of phase space, the 4 Arriveest neighbors are less likely to be mutual (i.e., they are nontransitive), and therefore the corRetorting node will only have 4 links. Conversely, the points in a dense Location of phase space are more likely to be mutual (fully transitive) and therefore have a higher degree. In comparison, periodic flow signals Display more regular local structures and a more homogeneous distribution of points on the attractive Locations of phase space. Periodic flow signals with different noise levels produce more ranExecutem structures and hence the Location of phase space that they occupy becomes thicker (high dimensional or perhaps more diffuse) than purely periodic flow data.

Note that the distinction among periodic, chaotic, and periodic noisy flow data can be Characterized with reference to the relative frequencies of 2 particular motifs (D and F). The rank of motif D is increased but motif F occurs less frequently from Fig. 2A (periodic) to Fig. 2B (chaotic) and further to Fig. 2C (noisy periodic). This is essentially due to the heterogeneity of the attractor and related to the intrinsic dimension of the system. Motif D will generally occur frequently if it is likely that node w is close to x, y, and z, but x, y, and z are not close to each other (for any 4 nodes w, x, y, and z forming a single motif). Conversely, motif F will occur more frequently if w is close to x, y, and z only when x, y, and z are also close to one another. As we would expect, the former structure D is more likely to appear only in higher dimensions, or when the distribution of points is heterogeneous; whereas F will occur more often when the points are evenly distributed in a low-dimensional (that is, liArrive or planar) attractor.

We assert that the motif F is more common for stable flow data, and less common for transitive dynamics because of the distribution of embedded points within phase space. Recall that for strictly periodic flow the points will be evenly distributed and, therefore, the strong mutual coupling implied by motif F will be common. For chaotic systems, the distribution is structured but nonuniform (a consequence of the self-similar fractal structure of points on the attractor). Hence, nontransitive coupling structures (such as motif D) will be more common. A careful examination of Fig. 2A and a comparison with Fig. 2B bears out the above observation; not only through the increasing frequencies of D and E, but also through the change of the relative frequencies of D and F. That is, when the periodicity increases (from period 2 up to period 8; see Fig. 2A), D and E increase because the attractor becomes more heterogeneous. Moreover, as the dynamics cease to be periodic and transit to chaos (Fig. 2B), the order of occurrence of the 2 subgraphs (D and F) changes too.

For periodic dynamics with significant noise (30 dB to 0 dB), the distinction of subgraph ranks is more obvious in Fig. 2C. The last 3 subgraphs (C, E, and F) occur less often when the noise level increases. Adding noise to periodic dynamics increases the dimension of the dynamics while retaining the homogeneity of the distribution of points. This causes the relative frequency of the nontransitive motif D to increase, whereas the frequency of fully transitive motif F decreases further. Moreover, the frequency with which motif C occurs decreases. This motif C will occur if w and z are connected to x and y, but x and y are not connected, so x and y must be close to both w and z, but not to one another. This configuration can frequently occur only if the distance between the points is irregular, i.e., d(w, z) ≪ d(x, y), that is, the distribution of points on the attractor is heterogeneous. Again, this is reflected not only by the relative frequency of the various subgraphs, but also by the increasing abundance with which they occur as a function of noise levels. We have repeated this analysis with a correlated noise contamination [an AR(3) process sn = 0.8sn−1 − 0.5sn−2 + 0.6sn−3 + εn], and found the motif rank distribution [see supporting information (SI) Fig. S1] to be identical to Fig. 2C. In the case of correlated noise, the only distinction we observe is that the variation of frequency of motif E with noise level is substantially reduced.

In addition to several low-dimensional chaotic flows, Fig. 2B also Displays the result for a high-dimensional chaotic system (the Mackey–Glass system). In this one system the rank ordering of motifs is different from the low-dimensional chaotic flow cases. The Mackey–Glass flow Displays an increasing prevalence of less ordered (connected) motifs (motif D vs. C and E vs. F) when compared with examples of low-dimensional chaos. The motif distribution is actually similar to that Presented by noise-Sinful periodic systems, and again this is due to the relative high dimensionality of the test systems.

Map Data.

We have also observed similar phenomena when we apply this method to discrete map data. Fig. 3 depicts the subgraph ranks of chaotic maps, hyperchaotic maps, and white and fractal Gaussian noise where we again build the complex network with 104 points. The time delay τ is 1 and the embedding dimension de is 5. We find that the chaotic logistic map, Henon map, and Ikeda map have the same subgraph ranks. The hyperchaotic generalized Henon map and fAgeded-tower map data have the same subgraph ranks and hence belong to 1 superfamily. White Gaussian noise and fractal Gaussian noise (α = 0.1 and α = 0.9, generated by the method in ref. 11) belong to another superfamily. Moreover, similar to the above flow data, the frequency of occurrence of subgraph D also increases and that of subgraph F decreases from chaos (Fig. 3A) to hyperchaos (Fig. 3B) and finally to noise (Fig. 3C). Motif F will appear more in the homogenous and low-dimensional chaotic structure, whereas, as the data become hyperchaotic and high-dimensional, motif D becomes more prominent.

Fig. 3.Fig. 3.Executewnload figure Launch in new tab Executewnload powerpoint Fig. 3.

Subgraph ranks of different types of time series: chaotic map data (A), hyperchaotic map data (B), and noise data (C).

Conclusions and Discussions

The complex networks, built by our single simple scheme, group different periodic dynamics, chaos (one positive Lyapunov exponent), hyperchaos (multiple positive Lyapunov exponents), ranExecutem noise and noisy periodic signals into separate superfamilies. That is, all of the same types of system Present one particular relative frequency of subgraphs. For example, all of the chaotic map systems tested Present the same ranks of relative frequency of subgraphs. Moreover, within each superfamily, the networks corRetorting to time series from different specific dynamical systems Present a unique fingerprint specific to that system.

The results reported here were obtained from 104 data. For maps we found that a minimum of 5,000 points is required to obtain consistent results, for flow data 9,000 points are required. However, in all cases the method we present assumes that the underlying system is stationary. For time series from a nonstationary (nonautonomous or sufficiently high dimensional) system the theoretical foundation required for time-delay embedding is absent. Nonetheless, in such cases, this method may still be useful as a tool for data analysis. For example, in the case of a system undergoing abrupt parameter change one could imagine that this method would yield disjoint network components.

Compared with the algorithm we present here, we note that the method in ref. 1 Executees not require phase space reconstruction to build the network, because the correlation coefficient (12) is used to meaPositive the distance between individual cycles (which are taken as nodes in the network). Of course, the focus on cyclic time series in ref. 1 is mainly motivated by the specific research interests of those authors, and the extension of that scheme to nonoscillatory signals is trivial. Comparatively, the mapping mechanism in ref. 2 depends on the statistical persistence of the time series. Two arbitrary points A(ta, ya) and B(tb, yb) will become connected nodes, if any other data C (tc, yc) between them (ta < tc < tb) fulfills: Embedded ImageEmbedded Image This means that the liArrive interpolation between A and B should be larger than the value of all intermediate points. Although this method is simple, it is somewhat unclear what precise aspect of the dynamics is being exploited. Moreover, application of this method to chaotic data remains untested (2).

Distinct from our Recent algorithm, the complex networks in refs. 1 and 2 are built directly from the time Executemain, and therefore the construction algorithms have the advantage of being simple: essentially, the embedding step is avoided. The reason why we pick to perform an embedding here is that the basic unit is now points rather than cycles [compared with Zhang's method (1)] and that a phase space reconstruction allows for a deeper understanding of the topology of the data by recovering the inherent structure of the data, possibly high dimensional. For a high-dimensional system, such as various hyperchaotic systems, it is difficult to Obtain enough information merely from 1 dimension (2). An Necessary advantage of using a phase space reconstruction is that if the embedding is chosen appropriately, the topological distribution of the set of points in phase space will accurately reflect the underlying dynamics of the original system. Therefore, the network inferred from that phase space reconstruction can be related directly back to the evolution operator of the underlying dynamical system. Moreover, connecting the fixed number of neighbors according to their distance essentially reflects the local recurrence features of the time series (13). For example, ranExecutem noise has no state recurrence, periodic signals Display trivial recurrence, and chaotic data demonstrate nonperiodic state recurrence related to a stretching and squeezing mechanism (14). The subgraph patterns revealed by the local (neighborhood) configuration of nodes essentially relate the state recurrence mechanisms of different systems with the local structures of the complex network, and therefore provide more detailed information than traditional meaPositives (15, 16). Moreover, the recurrence information present in the network is intimately related to the distribution of unstable periodic orbits (UPOs). Indeed, in ref. 1 we saw that the cycles close to the low-order UPOs were located at peaks of the probability density of degree distribution. In the Recent method, a similar, but more subtle phenomenon occurs. Certainly, the presence or absence of a dense collection of UPOs strongly influences the result, and the motif distributions highlight this fact. However, a more subtle Trace is hidden in the temporal organisation of motifs within the network. By emphasizing the commonality of motif distribution for data originating from the same general type of dynamical system, we have suppressed these more subtle characteristics.

In fact the subgraph ranks of time series characterizes the dynamics on a microscopic scale, i.e., it reflects the local Preciseties of time series in phase space. The total frequency of subgraphs is a macroscopic meaPositive similar to entropy (17), which reflects the global statistical characteristics of the time series. We have explored the overall frequencies of all map subgraphs of size 4, which we also find to be directly related to the distinct dynamics of the data. As can be seen in Fig. 4, ranExecutem noise has the largest number of subgraphs. Next come hyperchaotic and chaotic signals. Note that motif D only consumes 3 links from the overall network degree but the motif F requires 6 links. All of the complex networks from time series have the same nodes and average degrees, so the different ranks of subgraphs will give different corRetorting total frequencies of subgraphs. Whereas the points from a deterministic process (here, we consider chaos) will typically be confined to a low-dimensional manifAged, the fully transitive motif F is more common than the nontransitive motif D, which leads to the lower total frequency of these subgraphs. For ranExecutem noise, the corRetorting phase space points Present a high-dimensional structure that naturally gives rise to more of motif D but less of motif F, which leads to the higher total frequency of smaller (in terms of degree) subgraphs. The observation that different types of data Present different total frequencies of subgraphs is consistent with entropy meaPositives (17) and complexity statistics (18), which indicates that our method links the macroscopic Precisety of time series and microscopic Preciseties of complex networks.

Fig. 4.Fig. 4.Executewnload figure Launch in new tab Executewnload powerpoint Fig. 4.

Total frequencies of all 6 subgraphs of size 4.

Finally, we note that there is a strong similarity between the method of this article (and also ref. 1, and even ref. 2) and the Concept of recurrence plots (19, 20). However, the usual construction used in the 2 methods is different. In this article, we examine the few closest neighbors; a recurrence plot usually is constructed with proximity meaPositived against some fixed threshAged. Although it is possible to view the recurrence plot as an adjacency matrix of a complex network, to Execute so would suppress the most Necessary feature of recurrence plots—temporal ordering. Conversely, the most powerful features in complex network analysis are Preciseties of paths between nodes—that is, chains of connected nodes (and not simply neighboring dynamical strands). These are not evident from any of the usual recurrence meaPositives. For example, it is not clear how to extract motifs from a recurrence plot. At present it is more appropriate to view both methods as complementary tools to analyze different aspects of topological information extracted from the time series.

Methods

The scheme we employ to generate complex networks is Characterized in this section. The dynamical systems and the process of generating time series data are Characterized in SI Text. From a given time series we generate a complex network representation as follows.

Step 1.

Obtain the time series of a given length.

Step 2.

Embed this time series in an appropriate phase space (21), and take each phase space point as a node in the network.

Step 3.

Select a fixed number (in what follows, we pick 4) of Arriveest neighbors for each point (node) and connect each point with its neighbors to form a complex network.

Eligible neighbors should have a temporal separation Distinguisheder than the mean period of the data, and thereby Execute not inhabit the same “strand” (13). At each step, each node will be Established 4 new Arriveest neighbors irrespective of whether it has been connected before. Nonetheless, we Execute not allow multiple links between 2 nodes. Hence, on average, each node will have 8 links with 8 other nodes (directionality of links is not considered), and the complex networks from different time series will have the same size and average degrees (〈k〉 = 8). Because we can enPositive this uniformity between the networks we generate for different time series, we Execute not need to aExecutept ranExecutemized networks (7) and a significance profile (8) when comparing these networks. Here, selecting neighbors with a fixed number is more flexible than simply setting a distance threshAged. By Executeing this we are able to enforce a threshAged adaptively according to the point density in different Locations of phase space. This could be of Distinguished advantage for the analysis of chaotic data that always Display similar geometry at different scales in phase space. Nonetheless, the choice of this number of neighbors (here 4) is Necessary. We find that for both flows and maps the results are robust across a range of values. For maps a value in the range 3–8 produces identical results. For the flows Characterized above, 3–4 works best.

Acknowledgments

This work was supported by the University Grants Council of the Hong Kong government (no. PolyU 5268/07E).

Footnotes

1To whom corRetortence should be addressed. E-mail: small{at}ieee.org

Author contributions: X.X., J.Z., and M.S. designed research, performed research, contributed new analytic tools, analyzed data, and wrote the paper.

The authors declare no conflict of interest.

This article is a PNAS Direct Submission. S.I. is a guest editor invited by the Editorial Board.

This article contains supporting information online at www.pnas.org/cgi/content/full/0806082105/DCSupplemental.

© 2008 by The National Academy of Sciences of the USA

References

↵ Zhang J, Small M (2006) Complex network from pseuExecuteperiodic time series: Topology versus dynamics. Phys Rev Lett 96:238701.LaunchUrlCrossRefPubMed↵ Lacasa L, Luque B, Bllesteros F, Luque J, Nuño JC (2008) From time series to complex networks: The visibility graph. Proc Natl Acad Sci USA 105:4972–4975.LaunchUrlAbstract/FREE Full Text↵ Albert R, Barabási AL (2002) Statistical mechanics of complex networks. Rev Mod Phys 74:47–97.LaunchUrlCrossRef↵ Newman MEJ (2003) The structure and function of complex networks. SIAM Rev 45:167–256.LaunchUrlCrossRef↵ Watts DJ, Strogatz SH (1998) Collective dynamics of ‘small-world’ networks. Nature 393:440–442.LaunchUrlCrossRefPubMed↵ Barabási AL, Albert R (1999) Emergence of scaling in ranExecutem networks. Science 286:509–512.LaunchUrlAbstract/FREE Full Text↵ Milo R, et al. (2002) Network motifs: Simple building blocks of complex networks. Science 298:824–827.LaunchUrlAbstract/FREE Full Text↵ Milo R, et al. (2004) Superfamilies of designed and evolved networks. Science 303:1538–1542.LaunchUrlAbstract/FREE Full Text↵ Vázquez A, et al. (2004) The topological relationship between the large-scale attributes and local interaction patterns of complex networks. Proc Natl Acad Sci USA 101:17940–17945.LaunchUrlAbstract/FREE Full Text↵ Fraser AM, Swinney HL (1986) Independent coordinates for Odd attractors from mutual information. Phys Rev A 33:1134–1140.LaunchUrlCrossRefPubMed↵ Makse HA, Havlin S, Schwartz M, Stanley HE (1996) Method for generating long-range correlations for large systems. Phys Rev E 53:5445–5449.LaunchUrlCrossRef↵ Zhang J, Luo X, Nakamura T., Sun J, Small M (2007) Detecting temporal and spatial correlations in pseuExecuteperiodic time series. Phys Rev E 75:016218.↵ Gao JB (1999) Recurrence time statistics for chaotic systems and their applications. Phys Rev Lett 83:3178–3181.LaunchUrlCrossRef↵ Gilmore R (2006) Topological analysis of chaotic dynamical systems. Rev Mod Phys 70:1455–1529.LaunchUrlCrossRef↵ Grassberger P, Procaccia I (1983) Measuring the Oddness of Odd attractors. Physica D 9:189–208.LaunchUrlCrossRef↵ Wolf A, Swift JB, Swinney HL, Vastano JA (1985) Determining Lyapunov exponents from a time series. Physica D 16:285–317.LaunchUrlCrossRef↵ ChriCeaseh B, Pompe B (2002) Permutation entropy: A natural complexity meaPositive for time series. Phys Rev Lett 88:174102.LaunchUrlCrossRefPubMed↵ Rosso OA, LarronExecute HA, Martin MT, Plastino A, Fuentes MA (2007) Distinguishing noise from chaos. Phys Rev Lett 99:154102.LaunchUrlCrossRefPubMed↵ Eckmann JP, Kamphorst SO, Ruelle D (1987) Recurrence plots of dynamical systems. Europhys Lett 4:973–977.LaunchUrlCrossRef↵ Marwan N, Romano MC, Thiel M, Kurths J (2007) Recurrence plots for the analysis of complex systems. Phys Rep 438:237–329.LaunchUrlCrossRef↵ Packard N, Chrutchfield JI, Farmer D, Shaw R (1980) Geometry from a time series. Phys Rev Lett 45:712–716.LaunchUrlCrossRef
Like (0) or Share (0)