\documentclass[a4paper,twoside,11pt]{article}
\usepackage{latexsym,amsmath,amssymb,amsthm}
\usepackage{appendix}
\usepackage{setspace}
\usepackage{makeidx}         % allows index generation
\usepackage[bottom]{footmisc}% places footnotes at page bottom

%\doublespacing
\onehalfspacing

\newcommand{\dt}{Dennett}
\newcommand{\bb}{Barab\'{a}si}
\newcommand{\bbb}{Bassett and Bullmore}
\newcommand{\cn}{connection}
\newcommand{\swn}{small-world network}
\newcommand{\Swn}{Small-world network}
\newcommand{\sw}{small-world}
\newcommand{\er}{Erd\H{o}s \& R\'{e}nyi}
\newcommand{\ws}{Watts' and Strogatz'}
\newcommand{\nw}{network}
\newcommand{\fl}{functional}
\newcommand{\ml}{mathematical}
\newcommand{\scl}{synchronisation likelihood}
\newcommand{\cy}{connectivity}


\begin{document}

\title{The Feasibility of using Small-World Networks as models in Informatics, and in particular, the brain.}
\author{Samantha Lam \\
        s$0790004$ \\
        IRR Literature Review \\
        Due: Friday, 30th November 2007}
\date{}
\maketitle

\begin{center}\textbf{Abstract}
\end{center}

This review investigates the concept of a small-world network, its feasibility and applications to the area of informatics, in particular the modelling of the brain.
Approximately, a small-world network is a graph in which most nodes are not neighbours of one another, but where most nodes are a small number of hops or steps away
from all others.
%\newpage
\tableofcontents
\newpage
\section{Introduction}
%The first major introduction of small-world networks arose from Watts and Strogatz's %paper `Collective dynamics of `small-world' networks' in 1998 which garnered much %interest as it showed how its characteristics (high clustering, short characteristic %path length) proved to be applicable in areas such as dynamical systems, spatial games %and many other self-organising systems.
\subsection{Motivation}
A \swn\;can be seen as one which combines the properties of a regular and random network.  To analyse these \swn, they are usually represented by concepts derived from graph theory (See appendix for formal definitions).  A node is used to represent a certain property and edges connect these concepts appropriately.  The structural properties of these graphs are characterised by their path length, $L(p)$, and clustering coefficient, $C(p)$.  As suggested by their names, the path length measures the typical separation between two edges in the graph and the clustering coefficient the `cliquishness' of a typical neighbourhood.  Thus, a \sw\;network is a graph which has short path length but a high clustering coefficient.

It is due to the strong mathematical foundations of graph theory and the intuitive analogy of the way one thinks in categories linking to other categories like a \sw\;network that motivated the investigation into these \swn.  We look at how accurately these \swn\;model certain networks in areas in Informatics such as the World-Wide Web (WWW) and in particular, the brain.

\subsection{History}
The work of Paul Erd\H{o}s \& Alfr\'{e}d R\'{e}nyi on random graphs in the 1960's provided the basis model of a network which tends to be used for comparison methods.  In many studies it is shown that the \sw\;model characterises phenomenon better than random network-based ones.  \Swn s were popularised by Stanley Milgram's `small-world experiment' in 1967 which implied the notion of the `six degrees of separation' phenomenon.  However, it was not until 1998 with \ws\;esteemed paper entitled `Collective dynamics of `small-world' networks' which garnered much interest and research into the area.  In this paper, they introduced \swn\;as a realisable alternative to previous categorisation of a network either being regular or random, with many studies which seemed to confirm this `goodness of fit'.  Malaquais et al.\cite{Mala} provided a good theoretical argument towards why a \sw\;architecture would work as a model of the human mind, in particular appealing to the way people categorise and group concepts together, e.g. types of food.  More concrete support for this is found in research by Postma et al.\cite{Postma} which will be discussed later.

Another highly influential paper was published in 1999 by \bb\;et al.\cite{Bara} which criticises both the random and \ws\;model as incomplete - they lacked the incorporation of two major aspects of `real' world networks.  First, both models assume a network starts and ends with a fixed number of nodes.  In reality, many real world networks are open and form by the continuous addition of new nodes throughout the lifetime of the network, e.g. the WWW grows with the addition of new webpages.  Second, both models are wired and rewired randomly whereas many real networks exhibit preferential connectivity, e.g. a new webpage is more likely to include links to well-known pages with high connectivity.  More importantly, \bb\;et al. discovered that the inclusion of these two factors led to a power law distribution, $P(k) \approx k^{-\gamma}$.  This finding is of particular interest because recent studies in networks in Informatics have shown to exhibit a power law distribution, e.g. for a \fl\;\nw\;of activated voxels\cite{Eguilez}.

However, caution and interest also emerged through the \ml\;analysis of this power law distribution as dealt with by Amaral et al.\cite{Amaral}, more specifically the robustness of a power law distributed network against random error.  One of the main features of these networks is the high occurrence of `hubs', i.e. nodes with degrees that are significantly higher than the average.  Since failures occur at random and the vast majority of nodes are those with small degree, the likelihood that a hub would be affected is almost negligible.\footnote{http://en.wikipedia.org/wiki/Scale-free\_network}  The caution is naturally in the event of removing one or more of these `hubs'; the result would be a \nw\;consisting of a sparse graph which would give a less \ml ly rich structure to the \nw\;and hence, less information.  In short, scale-free networks are highly robust against errors, but potentially vulnerable to planned attack\cite{Amaral}.

Thus, the history of \swn\;arose from a sound and continually developing \ml\;theory with practical applications to \nw s and can be seen to relate to some \cn ist models of the brain.  This already demonstrates a rich prospect of research in the theory of \swn\;alone, without even knowing its applications.  However, this is just one of the aspects of \swn\;which generates interest.  Our aim is to see how much potential there is for the brain to be modelled as a \swn\;and vice-versa.

\section{Arguments For and Against}
\subsection{Examples of Effectiveness}
We begin by looking at the initial effectiveness of \swn s for modelling specific types of systems related to the brain.  In \ws\cite{Watts} seminal paper they give the influential example of the completely mapped neural \nw\;of the (nematode) worm, \emph{C. Elegans}\cite{Achacoso} where an edge joins two neurons if they are connected by either a synapse or a gap junction. This \nw\;exhibited \sw\;properties with $L>L_{rand}$ and $C \gg C_{rand}$.  A property of path length of a random \nw, $L_{rand}$, is that it is short, which \swn s are also characterised by.  Here, although it is seen that $L>L_{rand}$, the actual figures are $2.65 > 2.25$ which is only $~ 15\%$ difference and for practical reasons, it is fairly justified to take $L \approx L_{rand}$ (as we shall see later).  Thus, we take the notion of $C \gg C_{rand}$ (which is certainly justified since $0.28 \gg 0.05$ here) along with $L \approx L_{rand}$ as the properties of \swn s.

Furthermore, one of the proposed future work areas from the paper was the plausibility of the brain having a \swn\;structure.  A study that showed \swn s of coupled phase oscillators synchronising as readily as a standard model presented the possible relevance of it to the observed synchronisation of widely separated neurons in the visual cortex\cite{Gray}.

In 2006, Bassett and Bullmore\cite{Bassett} published a very promising paper entitled ``Small-Brain Networks'' which provided further support for using \sw\;models for understanding the structure and function of the brain.  They presented results from studies of the visual cortex of the macaque monkey and the cat from several researchers and deduced \swn\;structures from them.  To quote: \textit{``The first graphical analyses of mammalian cortical \nw, in the early 1990s, preceded the mathematical development of the \sw\;model but identified many features of anatomical connectivity that would later be recognized as compatible with it"}.

In 1991, Felleman and Van Essen\cite{Felleman} compiled an anatomical connectivity matrix that summarised 305 axonal \cn s between 32 areas of the visual cortex in the macaque monkey.  They found that on producing several levels of cortical processing by differentiating connections with regard to the (laminar) structure of the cortex, results showed the existence of multiple separated parallel processing streams with fewer \cn s between the streams.

In particular, the segregation of the ventral and dorsal streams was confirmed by Young et al.\cite{Young1} who had began considering anatomical connectivity matrices from tract-tracing studies of the macaque monkey and cat also.  Their connectivity matrix was drawn from 301 \cn s and 30 areas in the visual cortex of the macaque monkey with \cn s being coded as 0 (weak/absent) to 2(strong \& reciprocal).  Similarly, Scannell et al.\cite{Scannell1}\cite{Scannell2} analysed results from tract-tracing of the cat with a connectivity matrix derived from 1139 \cn s between 65 cortical areas and again, similarly coded with 1 for weak/sparse up to 3 for strong/dense.

Both analyses of matrices were done using nonmetric multidimensional scaling (NMDS) which plots strongly connected areas in closely while maximising the graphical distance between non-connected areas.  The resulting graphs showed strong interconnectivity between the dorsal and ventral areas confirming Felleman and Van Essen in Young et al.'s studies and for Scannell et al., four distinct clusters of strong local interconnectivity were seen - designated visual, auditory, somatosensorimotor and frontolimbic - with relatively sparse connectivity between the clusters themselves.  Furthermore, \bbb\;notes that in a study by Hilgetag et al.\cite{Hiltegag}, \textit{the larger scale macaque and cat cortical connectivity matrices described above were formally shown to have small-world properties, that is, relatively high clustering ($\gamma \gg 1$) and short path lengths ($\lambda \sim 1$) compared to random networks}.

A less biological data driven approach was taken by Postma et al.\cite{Postma} where they examined the semantic associative network for a set of words and measured the characteristic path length and clustering coefficient at various stages of development.  They artificially created a semantic net in two ways by using data taken from the MRC psycholinguistic database containing word-association and age-of-acquisition data published by Michael Wilson\footnote{http://ling.ed.ac.uk/help/mrc} online.  The \cn s were defined by a study by Kiss et al.\cite{Kiss} where people were asked to associate freely on target words.  The association data were used for two models - first, in combination with age-of-acquisition data where they were translated into age-dependent semantic \nw s and second, they were trained without the age-of-acquisition data as a hetero-associative memory (HAM) model of semantic development.

For the first model the presence of an associative relation between any two concepts is represented by the $N \times N$ matrix $\textbf{A}$.  Each element $A(t, i, j) = A(t, j, i)$ of the matrix $\textbf{A(t)}$ ($t$ represents age in years $\times 100$) specifies the presence ($A(t, i, j) = 1$) or absence ($A(t, i, j) = 0$) of an association between the concepts $i$ and $j$ at time $t$.  The total number of concepts $N$ is acquired at age $t = 700$ and for any concept $h$ that is not yet acquired, it is represented by $A(t, h, j) = A(t, j, h) = 0, 1 \leq j \leq N$.  Also, for each target word $w$, a response word $r$ was generated by a proportion $\lambda(w,r)$ of the subjects.  Thus, $A(700,w,r) = s[\lambda(w,r)],$ with $s[x] =$ the step function,\footnote{i.e. $s[x] = 1, x>0$ and $s [x] = 0$ otherwise.}  and for $t<700$, $A(t,w,r) = s[aoa(w) - t] \times s[aoa(r) - t] \times A(800,w,r)$, with $aoa(w)$ being the age of acquisition of word $w$.

For the second model Postma et al. presented the concepts acquired to a HAM model consisting of $M$ in and output nodes and $M^2$ adaptive weights.  Associations among concepts were represented by means of clipped Hebbian learning, i.e at each presentation of a pattern-pair, weights are set to $w(i,j) = min[1, w(i,j)' + \lambda p(i)q(j)].$  (Aside: Note that the usage of Hebbian learning is subject to criticism which we shall deal with later.)  Nevertheless, Postma et al. sets $\lambda = 0.2$, trains the network seven times to simulate the developmental stages and at each stage pairs of concepts are randomly selected with a probability proportional to their association strength.  We also note that they do not give any justification for the choice of these numbers, one can only presume that they should be arbitrary for independence of results.  They incorporate stochastic synapses to accommodate the association of a single input vector to multiple output vectors which is a modification to traditional HAM modules which associate single input vectors to single output ones.  For $p(j)$ an input vector, elements of output vector $q(i)$ is defined as:
$$q(i) = \Sigma_{j=1}^{M} f(w_{i,j}) p(j),$$
with $f:\mathbb{R} \rightarrow \{0,1\}, f(x) = 1,$ for $rand<x, f(x) = 0$ otherwise, and $rand$ is a random value taken from the uniform distribution on the unit interval.

Thus, the resulting tables are as follows:
\begin{center}
\begin{tabular}{| c | c | c | c | c | c | c | c | c |}
\hline			
  \textbf{$t$} &  \textbf{100} & \textbf{200} & \textbf{300} & \textbf{400} & \textbf{500} & \textbf{600} & \textbf{700} & \textbf{adult} \\
  \hline
  $N_{1}$ &  - & 30 & 322 & 728 & 1066 & 1185 & 1205 & 8210 \\
  \hline
  $N_{2}$ &  20 & 35 & 266 & 677 & 999 & 1011 & 1205 & - \\
  \hline
  $k_{1}$ &  - & 3.2 & 10.5 & 14.7 & 17.0 & 17.7 & 17.1 & 58.7 \\
  \hline
  $k_{2}$ &  1.2 & 3.0 & 9.4 & 12.1 & 16.4 & 17.0 & 18.0 & - \\
\hline
\end{tabular}
\end{center}
where the `adult' column specifies $N_1$ and $k_1$ for the entire set of concepts for which association data are available for the first model and $N_2$ and $k_2$ are associated with the HAM model.

The analysis of the results were then done by calculating the clustering coefficients and path lengths.  Postma et al. then compared these values with ones obtained from a random network (denoted by letting $p = 1$) so that they got $L \approx L(1)$ and $C \gg C(1)$ which is akin to \ws\;criteria from earlier.  Overall, the main findings of Postma et al.'s paper are that the characteristic path length remains almost constant despite expansion of the network ($C \approx 0.13, L \approx 2.7$) and that these values indicate small-worldliness.  It is again noteworthy that the binary association network produced here are rather crude approximations of the human associative strengths and that the almost stagnant path length didn't form until the later stages of development ( $\approx \frac{400-500}{100}$ years).  However, the conclusion which mentions the apparent tendency of the human semantic \nw\;to optimise small-worldliness by maintaining a constant path length and low clustering coefficient despite increasing number of concepts is also discussed in \bbb's paper from earlier.

\bbb\;suggest that the \sw\;brain \nw\;topology may have been selected to optimise the problem of cost-efficient information processing.  They gave several citations which provide supports for this claim that the anatomical structure of the brain has evolved with a \sw\;structure for information optimisation such as the segregation of visual cortexes of the cat and macaque monkey as seen earlier.  Furthermore, they give recent studies based on electroencephalogram (EEG), magnetoencephalogram (MEG) and \fl\; magnetic resonance imaging (fMRI).  To quantify these measures, e.g. using EEG or fMRI, several methods such as recording time series for several regions of the brain and then estimating the \fl\;connectivity between a given pair of regional time series in terms of their mutual information, correlation or partial correlation.\cite{Bassett}  Finer details can be found in \cite{Bassett} but for the purpose of this review we don't need a deep understanding of these procedures, we need just be aware that the relative merits of these various technical options remains to be fully evaluated, as also noted by \bbb.

According to \bbb, the first graph theoretical analysis of MEG data was reported by Stam\cite{Stam} who measured the activity from 126 MEG sensors in five people who were in a resting state (eyes closed).  Activity between sensors was estimated using \scl\;after the raw data was filtered into EEG bands of $\delta, \theta, \alpha, \beta$ and $\gamma$ whose magnitudes are approximately $<$ 4 Hz, 4-8 Hz, 8-12 Hz, 13-30 Hz and $>$30 Hz, respectively\footnote{See http://www.crossroadsinstitute.org/eeg.html for further explanation of these bands in physical terms.}  The resulting \fl\;\cy\;matrices were thresholded to create a set of graphs depicting brain \fl\;\nw\;corresponding to each of the frequency bands.  Their results showed that the $\alpha$ and $\beta$ bands had a regular topology whereas the more extreme $\delta, \theta$ and $\gamma$ bands had \sw\;properties.  A similar study was conducted by Micheloyannis et al.\cite{Mich} in 2006 with 28 sensors and 14 individuals.  The analysis was similarly done (using \scl), however, they found evidence for \sw\;topology on \textbf{all} frequency bands.  \bbb's conclusion regarding these conflicting results is that \textit{`the currently available data are inconclusive at this point'.}

However, it is noteworthy to us that in the second study Micheloyannis et al. had significantly more participants (14 versus 5) and that the task involved was quite different - a memory test versus Stam et al.'s resting state.

In relation to fMRI, despite the narrow bandwidths suggesting initial limitations for study, appropriate \fl\;\cy\;analysis such as wavelet-based introduced by Achard et al.\cite{Achard} who reported \sw\; properties of fMRI networks at all frequencies in the range 0.007-0.45 Hz.

In addition, as mentioned in the introduction, the degree distribution of a \nw\;can provide information of the processes determining its formation and growth.  \bbb\;also report an fMRI study by Egu\'{i}lez et al.\cite{Eguilez} who found a power law degree distribution for a \fl\;\nw\;of activated voxels.  However, the degree distribution of whole-brain fMRI networks of cortical regions has also been described as an exponentially truncated power law\cite{Achard}\cite{Bassett}.  This means that the probability of highly connected hubs existing in the brain is somewhat less than in a \swn\;but is still more likely than a random one.  Moreover, \bbb\;remind us that many complex systems that are physically embedded or constrained also exhibit this truncated power law.  Thus the existence of such a distribution in the brain could reflect the constraint of aging of brain regions\cite{Doro}.  Several other examples were also given where the node can `age' i.e. not form new connections, or becomes isolated, i.e. old \cn s removed.  These aspects coincide with \bb's incorporation of growth and preferential attachment which gives an indication of a possible fruitful area of research.

\subsection{Criticisms}
This section takes several arguments from Asim Roy's `Artificial neural networks: a science in trouble' paper\cite{Roy}.  Though the nature of this review is not specifically dealing with neural \nw s, the form of a \swn\;is of a connectionist kind and in particular with regard to our second paper where Postma et al. modelled word association with a HAM model which is a form of artificial neural \nw.  The first major misconception mentioned by Roy is the (mis)assumption that `no synaptic change signals are allowed to the cells from other sources within the brain.'  This can be specifically applied critically to Postma et al.'s use of the HAM model which is based on Hebbian learning.  Hebbian learning simply assumes that `neurons that fire together, wire together', thereby not having any other further influences.

Likewise, Roy's explanation of the misconception that the brain learns instantly from each and every learning example presented to it is another flaw in Postma et al.'s HAM model.  From everyday experience, it is not difficult to see that human learning is not always instantaneous, but may occur later based on information it has already collected and stored in the brain.  In Postma et al.'s HAM model the network was `trained' 7 times to produce the results observed as `learning'.  According to Roy, this is insufficient to be taken as `learning' as it is `memoryless' (as described in his second misconception). Inputs, weights and functions are updated simultaneously and no information (analogous to memory) is retained which is counterintuitive to the way humans (or their brains) learn.  Thus one must take into account these factors which may hinder the plausibility of the \swn\;observation constructed from Postma et al.'s HAM model.

Moreover, though Roy's arguments do not apply directly to \bbb's paper, one must be aware of the incompleteness of the \sw\;phenomenon results of the visual cortex of the cat and macaque monkey and the various EEG and fMRI studies.  In particular, because there are different ways of forming and analysing a \cy\;matrix, more specifically the way one encodes the relative strengths of connections can be done less discretely (e.g. instead of 0,1,2, one can have a larger, more continuous range).  One cannot disregard the possibility that a different method of analysis may reveal a differently network topology, however unlikely.

\section{Summary/Conclusion}
Overall, as seen throughout this review, there are many reports, in particular from biological data, that indicate the existence of small-worldliness in the structure of the brain.  Conversely, we have Postma et al.'s paper where the modelling of human memory association was made and a \sw\;structure also observed.  However, as highlighted in our `Criticisms' section, caution must be taken due to the use of artificial neural networks to simulate Postma et al.'s results which led to the \sw\;observation.

Thus, in conclusion, it \textit{does} appear that there is much potential for the brain to be modelled as a \swn, however, more so `bottom-up' rather than `top-down'.  That is, it would be more beneficial to research with more concrete, biological data to confirm the \sw\;phenomena.  We say so because, thus far, we have seen that it appears to hold true for the visual cortexes of the cat and the macaque monkey but little, other than somewhat conflicting results with EEG and fMRI scans have been done with regards to the human brain.  Furthermore, another avenue of interest may be to invest research into the mathematical foundations as done by \bb, in particular the possibility of incorporating the notion of `dying' nodes, which \bb\;surprisingly hasn't mentioned.  (He only mentions growth, but e.g. in the WWW, it is common for webpages to become inaccessible.)

\newpage
\section{Appendix}
\subsection{Mathematical concepts (as defined by \bbb)}
From graph theory, $n =$ number of nodes, $k =$ number of edges, degree of node = number of edges on each node.

The \textbf{degree distance} of a graph is the probability distribution of $k$ ($=$ degree).

\textbf{Random graphs} have exponential degree distribution: $P(k) \sim \exp^{-\alpha k}$.

Some \textbf{complex systems} have power law distribution: $P(k) \sim k^{-\alpha}$, e.g. world-wide web.

Physically embedded networks have exponentially truncated power law distribution of the form: $P(k) \sim k^{\alpha-1} \exp^{\frac{k}{kc}}$.

\textbf{Path Length} between two nodes $=$ the minimal number of edges that must be traversed to form a direct connection between the two nodes of interest.

\textbf{Clustering coefficient} of a node is the measure of the number of edges that exist between its nearest neighbours.

\textbf{C has maximum normalised value} if all the nearest neighbours of an index node are also nearest neighbours of each other.

\textbf{Minimum path length and minimum clustering coefficient of a graph} is estimated by averaging $L_i$ and $C_i$ for each of the $i = 1,\ldots,n$ nodes.

In a \emph{random graph}, the average minimum path length is typically \emph{short} and clustering coefficient, \emph{small}:
$$L_{rand} \sim \dfrac{\ln N}{\ln (\frac{k}{N} - 1)}$$
$$C_{rand} \sim \dfrac{\frac{K}{N}}{N}.$$

For a \swn: $\lambda = \dfrac{L}{L_{rand}} \sim 1$ and $\gamma = \dfrac{C}{C_{rand}} > 1$.
\newpage
\section{Bibliography}
\begin{thebibliography}{30}

\bibitem{Achacoso}
Achacoso TB, Yamamoto WS,
\newblock ``AY's Neuroanatomy of C. Elegans for Computation'',
\newblock CRC Press, Boca Raton, FL, 1992.

\bibitem{Achard}
Achard S, Salvador R, Whitcher B, Suckling J, Bullmore E,
\newblock ``A resilient, low-frequency, small-world human brain functional network with highly connected association cortical hubs'',
\newblock {\sl J Neurosci}, vol. 26, p. 63-72, 2006.

\bibitem{Amaral}
Amaral LAN, Scala A, Barth\'{e}l\'{e}my M, Stanley HE,
\newblock ``Classes of small-world networks'',
\newblock {\sl Proceedings of the National Academy of Sciences of the USA}, vol. 97, no. 21, p. 11149-11152, 2000.

\bibitem{Bara}
Barab\'{a}si A-L, Albert R,
\newblock ``Emergence of Scaling in Random Networks'',
\newblock {\sl Science}, vol. 286, no. 5439, p. 509-512, 1999.

\bibitem{Bassett}
Bassett DS, and Bullmore E,
\newblock ``Small-Brain Networks'',
\newblock {\sl The Neuroscientist}, vol. 12, no. 6, p. 512-523, 2006.

\bibitem{Doro}
Dorogovtsev SN, Mendes JFF,
\newblock ``Evolution of networks with aging of sites'',
\newblock {\sl Physical Review E}, vol. 62, p. 1842, 2000.

\bibitem{Eguilez}
Egu\'{i}lez VM, Chialvo DR, Cecchi GA, Baliki M, Apkarian AV,
\newblock ``Scale-free brain functional networks'',
\newblock {\sl Physical Review Letters}, 94, 018102, 2005.

\bibitem{Felleman}
Felleman DJ, Van Essen DC,
\newblock ``Distributed hierarchical processing in the primate cerebral cortex'',
\newblock {\sl Cerb Cortex}, vol. 1, p. 1-47, 1991.

\bibitem{Gray}
Gray CM, K\''{o}nig P, Engel AK, Singer W,
\newblock ``Oscillatory responses in cat visual cortex exhibit intercolumnar synchronisation which reflects global stimulus properties'',
\newblock {\sl Nature}, vol. 338, p. 334-337, 1989.

\bibitem{Hiltegag}
Hiltegag CC, Burns GAPC, O'Neill MA, Scannell JW,
\newblock ``Anatomical connectivity defines the organization of clusters of cortical areas in the macaque and cat'',
\newblock {\sl Philos Trans R Soc Lond B Biol Sci}, vol. 355, p. 91, 2000.

\bibitem{Kiss}
Kiss GR, Armstrong C, Milroy R, Piper J,
\newblock ``An associative thesaurus of English and its computer analysis'',
\newblock {\sl in The Computer and Literary Studies},
\newblock Aitken AJ, Bailey RW, Hamilton-Smith N (Eds.), University Press, Edinburgh, 1973.

\bibitem{Mala}
Malaquias JL, da Rosa AC, Correia CM,
\newblock ``A Small-World Model of the Human Mind'',
\newblock {\sl SAC '06}, Dijon, France, April 2006.

\bibitem{Mich}
Micheloyannis S, Pachou E, Stam CJ, Vourkas M, Erimaki S, Tsirka V,
\newblock ``Using graph theoretical analysis of multi channel EEG to evaluate the neural efficiency hypothesis'',
\newblock {\sl Neuroscience Letters}, 402, p. 273-277, 2006.

\bibitem{Morel}
Morelli LG, Abramson G, Kuperman MN,
\newblock ``Associative memory on a small-world neural network'',
\newblock {\sl The European Physical Journal B-Condensed Matter}, vol. 38, p. 495-500, 2004.

\bibitem{Postma}
Postma E, Roebroeck E, LaCroix J,
\newblock ``The Development of Small-world Semantic Networks'',
\newblock {\sl in Connectionst Models of Learning, Development and Evolution: The 6th Neural Computation and Psychology Workshop, Li\`{e}ge, Belgium, 16-18 September 2000}, Section 4, p. 293-302,
\newblock Robert M. French and Jacques P. Sougn\'{e} (Eds.), Springer-Verlag, London Limitied, 2001.

\bibitem{Roy}
Roy A,
\newblock ``Artificial neural networks: a science in trouble'',
\newblock {\sl ACM SIGKDD Explorations Newsletter}, vol. 1, no. 2, p. 33-38,
2000.

\bibitem{Scannell1}
Scannell JW, Blakemore C, Young MP,
\newblock ``Analysis of connectivity in the cat cerebral cortex'',
\newblock {\sl J Neurosci}, vol. 15, p. 1463-1483, 1995.

\bibitem{Scannell2}
Scannell JW, Burns GAPC, Hilgetag CC, O'Neill MA, Young MP,
\newblock ``The connectional organization of the cortico-thalamic system of the cat'',
\newblock {\sl Cereb Cortex}, vol. 9, p. 277-299, 1999.

\bibitem{Stam}
Stam CJ,
\newblock ``Functional connectivity patterns of human magnetoencephalographic recordings: a ``small-world" network?'',
\newblock {\sl Neuroscience Letters}, 355, p. 25-28, 2004.

\bibitem{Vernon}
Vernon M,
\newblock ``Phd Literature Review'',
\newblock {\sl www.cus.cam.ac.uk/~mcv21/litreview.pdf}, 2003.

\bibitem{Watts}
Watts DJ, Strogatz SH,
\newblock ``Collective dynamics of `small-world' networks'',
\newblock {\sl Nature}, vol. 393, no. 6684, p. 440-442, 1998.

\bibitem{wiki}
http://en.wikipedia.org/wiki/Scale-free\_network

\bibitem{Young1}
Young MP, Scannell JW, O'Neill MA, Hilgetag CC, Burns GAPC, Blakemore C,
\newblock ``Non-metric multidimensional scaling in the analysis of neuroanatomical connection data and the organization of the primate cortical visual system'',
\newblock {\sl Philos Trans R Soc Lond B Biol Sci}, vol. 348, p. 281-308, 1995.

\bibitem{Young2}
Young MP,
\newblock ``Objective analysis of the toplogical organization of the primate cortical visual system'',
\newblock {\sl Nature}, vol. 358, p. 152-155, 1992.

%\bibitem{PR:1}
%Peter~J. Rizzo, Ed.,
%\newblock {\sl Title of Conference Proceedings}, Location of Symposium, June
%  1987. Sponsoring Organization, Name of publisher.

%\bibitem{RL:1}
%R.~E. Larson,
%\newblock ``Title of dissertation,
%\newblock Ph.D. dissertation, Stanford University, Stanford, California, 1964.

%\bibitem{EG:1}
%Elaine~G. Haschke,
%\newblock ``Title of thesis,''
%\newblock M.S. thesis, Drofnats University, West Yellowstone, Montana, May
%  1993.

%\bibitem{JS:2}
%J.~D. Simon,
%\newblock ``Lemma on theorem,'' private communications, November 1963.

%\bibitem{HC:1}
%Han Chen,
%\newblock ``Title of booklet,'' Texas A\&M University, College Station, Texas,
%  July 1989.

\end{thebibliography}

\end{document} 