(Click on tutorial title to access the abstract and the author's vita. |
||||||
|
Dario Floreano
Paul J. Werbos
Frank Lewis
Bernard Widrow |
E. Tzanakou
Harold Szu
X. Liu, A. Srivastava &
W. Mio |
Pierre Baldi
Leonid Perlovsky
Vladimir Cherkassky
Johan A. K. Suykens |
Nik Kasabov
A. Staiano & R. Tagliaferri
Joao Luis Garcia Rosa |
|||
| Abstracts and Vitas | (Back to Top of Authors) | |||||
|
Bioinformatics and Machine Learning: the Prediction of Protein Structures on a Genomic Scale Pierre Baldi This tutorial focuses on the application of machine learning methods, and in particular of neural networks, to problems in protein structure prediction. The tutorial first addresses the general problem of how to apply neural networks techniques to variable-size structured data, such a sequences, trees, and graphs in bioinformatics and other applications. The tutorial covers the design of feedforward and recursive neural network architectures for the prediction of protein structural features, such as secondary structures, relative solvent accessibility, disulphide bridges, and contact maps. Simulations and state-of-the-art results are presented and discussed together with more general issues, such as tradeoffs and integration with other methods, and applications to other related problems. Keywords: neural networks, recurssive neural networks, bioinformatics, sequence analysis, protein, protein structure prediction, structural proteomic Vita: Pierre Baldi is a Professor in the School of Information and Computer Sciences and the Department of Biological Chemistry at the University of California, Irvine and the Director of the Institute for Genomics and Bioinformatics at UCI. Born and raised in Europe, he received his PhD from the California Institute of Technology in 1986. From 1986 to 1988 he was a postdoctoral fellow at the University of California, San Diego. From 1988 to 1995 he held faculty and member of the technical staff positions at the California Institute of Technology and at the Jet Propulsion Laboratory. He was CEO of a startup company from 1995 to 1999 and joined UCI in 1999. He is the recipient of a 1993 Lew Allen Award at JPL and a Laurel Wilkening Faculty Innovation Award at UCI. Dr. Baldi has written over 100 research articles and four books: • Modeling the Internet
and the Web--Probabilistic Methods and Algorithms, Wiley, (2003); His research focuses in
AI, machine learning, computational biology, and biological and chemical informatics. New Formulations for Predictive Learning Vladimir Cherkassky Most existing learning algorithms have been developed under standard formulations of the learning problem, such as classification or regression. However, many real-life applications motivate approaches that go beyond traditional learning methods. These new directions include new types of inference different from standard inductive inference (i.e., transduction), as well as new formulations of the learning problem. This tutorial will review standard learning formulations using the framework of Vapnik-Chervonenkis (VC) learning theory, and then discuss several possible ‘non-standard’ formulations. The tutorial consists of four parts: (1) Motivation for new learning formulations. We provide historical, philosophical and application-driven motivations for developing new learning formulations and new types of inference (as opposed to research on new learning algorithms) under general conceptual framework of Vapnik’s Statistical Learning Theory. Then we present standard inductive learning formulations, and critically review the assumptions underlying such formulations. Relaxing some of these assumptions leads to several ‘non-standard’ formulations as discussed below. (2) Non-inductive types of inference. These include non-inductive types of inference such as transduction and selection learning formulations. We describe these formulations, and several recent applications studies comparing inductive vs. transductive learning. (3) Application-driven formulations. We outline general framework for mapping application requirements onto a learning problem formulation, and then provide examples of such new application-driven formulations. Such examples range from financial engineering (trading) to identity fraud detection. (4) Multiple model estimation. All standard inductive learning formulations assume that all available (or training) data can be described by a single statistical model. For example, in classification setting the goal is to estimate a (single) decision boundary. Likewise, under regression formulation, the goal is to estimate a single target function from finite and noisy data samples. However, there are many applications which naturally lead to learning formulations where the goal is to estimate several models from available (finite) data. For example, in computer vision, a sequence of video frames may contain several moving objects (with overlapping trajectories), so the problem of multiple motion estimation can be viewed under multiple model estimation framework. I will describe a new learning formulation for such settings, and then introduce SVM-based learning algorithms for this new formulation. Finally, I will discuss methodological issues arising in using learning methods for various real-life applications, such as: - clear separation between the learning problem formulation and the solution approach (e.g, learning algorithm); - importance of mapping application requirements onto an appropriate problem formulation, rather than inventing/ reinventing new learning algorithms; - empirical comparisons of learning algorithms. INTENDED AUDIENCE: researchers and practitioners interested in understanding new learning formulations motivated by practical application requirements, rather than standard formulations motivated by mathematical modeling considerations. OTHER CONSIDERATIONS: the proposed tutorial is based on two plenary talks (on the same subject of New Learning Problem Formulations) given by the instructor at ICANN-01 in Vienna, Austria and ANNIE-04 in St Louis, Missouri. Keywords: Inductive principle, inductive inference, neuroimaging, new learning formulations, Popper's falsifiability, regularization, statistical learning, support vector machines, transduction, Vapnik-Chervonenkis theory Vita:
Vladimir Cherkassky is Professor of
Electrical and Computer Engineering at the University of Minnesota. He
received Ph.D. in Electrical Engineering from University of Texas at Austin
in 1985. His current research is on methods for predictive learning from
data, and he has written a monograph Learning From Data published by Wiley
in 1998. Prof. Cherkassky has served on the Governing Board of INNS. He has
served on editorial boards of IEEE Transactions on Neural Networks, the
Neural Networks Journal, the Natural Computing Journal and the Neural
Processing Letters. He served on the program committee of major
international conferences on Artificial Neural Networks. He was Director of
NATO Advanced Study Institute (ASI) From Statistics to Neural Networks:
Theory and Pattern Recognition Applications held in France, in 1993. He has
presented numerous tutorials on neural network and statistical methods for
learning from data. Evolutionary Robotics Dario Floreano Evolutionary Robotics is
the artificial evolution of control systems for autonomous robots. Most of
the past and ongoing research in Evolutionary Robotics is concerned with
evolution of neural controllers for which available learning algorithms are
not suitable. The tutorial will put special emphasis on how Evolutionary
Robotics can be used to study open questions in evolutionary biology,
neurophysiology, and cognitive science. A short introduction to artificial
evolution will allow participants to fully understand and replicate most of
the experiments by using freely available software. Keywords: mobile robots, evolutionary systems, autonomous systems, time-dependent networks, control systems Vita: DARIO FLOREANO is Professor of Intelligent Systems at the Swiss Federal Institute of Technology in Lausanne (EPFL) where he is director of the Institute of Systems Engineering. His research activities include embodied and embedded neural networks, artificial evolution, robotics, bio-mimetic electronics, self-organizing systems, and artificial life. He held senior research positions at the National Research Council in Roma, at the University of Stirling, at the Swiss Federal Institute of Technology in Lausanne, and at Sony Computer Science Laboratory in Tokyo. Dario published more than 100 peer-reviewed papers, authored 2 books, and edited 3 other books. His book with S. Nolfi, Evolutionary Robotics, was reprinted by MIT Press three times over the last four years. He co-organized 6 international conferences and joined the program committee of more than 50 other conferences. He is on the editorial board of the journals Neural Networks, Genetic Programming and Evolvable Machines, Adaptive Behavior, Artificial Life, Connection Science, IEEE Transactions on Evolutionary Computation, and Autonomous Robots. He is co-founder and member of the Board of Directors of the International Society for Artificial Life, Inc. and member of the Board of Governors of the International Society for Neural Networks.(Back to Top of Abstracts) (Back to Top of Authors) Evolving Connectionist Systems: Biological Principles, Models and Applications Prof. Nik Kasabov (nkasabov@aut.ac.nz) Evolving connectionist systems (ECOS) is
a general paradigm of neural systems that evolve their structure, their
functionality and their internal knowledge representation through continuous
learning from data and interaction with the environment. The learning
process can be: on-line, off-line, incremental, supervised, unsupervised,
active, sleep/dream, etc. ECOS are inspired by the evolving processes in the
brain. These processes are self-organized and are governed by both external
stimuli (data) and internal parameters - genes. The tutorial consists of
three parts, each presented in 35 minutes with a break of 15 min. after the
first two parts. Part one presents biological principles of evolving brains,
that include: synaptic changes through learning, neuron creation, brain
development, the role of genes for brain functions and malfunctions, gene
interaction networks, the role of evolution. Keywords: Computational Intelligence, Neuroinformatics, Bioinformatics, Knowledge-based neural networks, Evolving connectionist systems, Gene regulatory networks, Computational neurogenetic modeling. Vita:
Professor Nik Kasabov is the Founding Director and the Chief
Scientist of the Knowledge Engineering and Discovery Research Institute
KEDRI, Auckland (http://www.kedri.info/).
He obtained a MSc and a PhD from the Technical University of Sofia. At
present he holds a Chair of Knowledge Engineering at the School of Computer
and Information Sciences at Auckland University of Technology. He is a
Fellow of the Royal Society of New Zealand, Fellow of the New Zealand
Computer Society, and a Senior Member of IEEE. Kasabov is the chair of the
Adaptive Systems Task Force of the Neural Network Technical Committee of the
IEEE Computational Intelligence Society. He is a member of the Board of
Governors of the INNS and a board member of the Asia Pacific Neural Network
Assembly (APNNA). His main research interests are in the areas of
intelligent information systems, soft computing, neuro-computing,
bioinformatics, brain study, speech and image processing, data mining and
knowledge discovery, where he has published 10 books, 380 refereed papers,
25 patents and other work. He has an extensive academic experience, holding
positions at the Technical University of Sofia- Bulgaria, University of
Essex - UK, University of Otago -New Zealand. Kasabov is on the editorial
boards of 8 international journals and has been on the PC of 50
international conferences in the last 5 years. He chaired the series of
ANNES conferences (1993-2001). More information of Prof. Kasabov can be
found on the Web site:
http://www.kedri.info. Neural Networks for Dynamic Systems Feedback Control F. L. Lewis Feedback control systems occur
throughout nature. Included is the biological cell, which uses feedback
mechanisms to pump ions across its membrane to sustain its equilibrium, or
homeostasis. Volterra showed that the population balance of predator and
prey species of fish in a closed pond is due to a feedback mechanism. Darwin
showed that feedback over long time periods is responsible for natural
selection. Man-made feedback control systems make possible the reliable
performance of all modern complex systems including aircraft, spacecraft,
vehicle motor systems, submarine stabilization, and robotic industrial
machines. Though Thomas Newcomen invented the steam engine in 1712, the
Industrial Revolution started in earnest only after J. Watt invented the
feedback control systems for the steam engine in 1786. Keywords: Feedback control using neural networks, Dynamical neural network control, Nonlinear adaptive control systems Vita:
Dr. F.L. Lewis is a University Distinguished Scholar Professor and the
Moncrief-O'Donnell Endowed Chair at The University of Texas at Arlington.
His current interests include intelligent control, neural and fuzzy systems,
MEMS control, wireless sensor networks, nonlinear systems, robotics,
condition-based maintenance, and manufacturing process control. He is the
author/co-author of 4 U.S. patents, 150 journal papers, 240 refereed
conference papers, nine books including Optimal Control, Optimal Estimation,
Applied Optimal Control and Estimation, Aircraft Control and Simulation,
Control of Robot Manipulators, Neural Network Control, High-Level Feedback
Control with Neural Networks and the IEEE reprint volume Robot Control. He
is a Fellow of the IEEE, a Fulbright Awardee, a member of the New York
Academy of Sciences, and a registered Professional Engineer in the State of
Texas. He has served as Visiting Professor at Democritus University in
Greece, Hong Kong University of Science and Technology, Chinese University
of Hong Kong, National University of Singapore. He is an elected Guest
Consulting Professor at both Shanghai Jiao Tong University and South China
University of Technology. He is the recipient of an NSF Research Initiation
Grant and the American Society of Engineering Education F.E. Terman Award,
and was selected as Engineer of the Year in 1994 by the Ft. Worth IEEE
Section. He was appointed to the NAE Committee on Space Station in 1995 and
to the IEEE Control Systems Society Board of Governors in 1996. In 1998 he
was selected as an IEEE Control Systems Society Distinguished Lecturer. He
is a Founding Member of the Board of Governors of the Mediterranean Control
Association. .Nonlinear Manifolds in Patten Recognition and Image Analysis
Xiuwen Liu,
Anuj Srivastava and
Washington Mio Web site: http://www.cavis.fsu.edu/manifold-tutorials/ (A preliminary version of the presentation is available at http://cavis.fsu.edu/manifold-tutorials/manifolds-ijcnn2005.pdf). Many of the problems in learning can be posed as an optimization one on underlying manifolds. While Euclidean space is commonly used due to its simplicity, intrinsic constraints of problems often lead to manifolds that are nonlinear in nature. For example, common to component analysis techniques (such as PCA, ICA, FDA, and so on), one often requires the components to be orthonormal and this requirement leads to Stiefel manifold. By exploiting the geometric structure of the manifolds, one can often design and derive efficient and effective algorithms. The main goal of this tutorial is to give a systematic coverage of nonlinear manifolds in pattern recognition and image analysis areas. A novel aspect of this tutorial is that we will introduce the tools from differential geometry that are necessary to understand existing techniques and more importantly to build new ones. Another important feature of this tutorial is that we will use live demonstrations to show the effectiveness of such techniques for pattern recognition. For example, by using optimal component analysis, we can often improve recognition performance significantly. It is widely recognized that neural network performance often depends critically on the features that are used. Compared to other areas, systematic study on feature extraction and selection is very limited. These nonlinear manifold techniques provide solutions to some of these problems by optimization in the feature space, which is often nonlinear in nature. We believe this tutorial on nonlinear manifolds is very timely and should be very useful for reseachers in pattern recognition, computer vision, machine learning, and other related areas. This tutorial will consist of four main parts (see http://www.cavis.fsu.edu/manifold-tutorials/). In the first part, we will define nonlinear manifolds, and give examples to motivate the importance of nonlinear manifolds especially in the pattern recognition and image analysis areas. In the second part, we will cover the mathematical tools from differential geometry for optimization on nonlinear manifolds, including differentiable manifolds, examples of manifolds, tangent vectors, tangent spaces, examples of tangent vectors and tangent spaces, differential geometry of manifolds, geodesics, numerical techniques for calculating geodesics, and optimization on nonlinear manifolds. In the third part, we will extend usual techniques from statistics onto nonlinear manifolds, including probability distributions on manifolds, intrinsic means and covariances, sampling and estimation on manifolds, and learning and hypothesis testing on manifolds. In the last part, we will give two concrete examples that are important for many pattern recognition and image analysis applications: optimal component analysis, and shape inference and clustering on shape manifolds. We will give live demonstrations using techniques presented in this tutorial. Keywords: Nonlinear manifolds, optimal component analysis, pattern recognition, feature extraction, statistics on nonlinear manifolds, image analysis, Grassman manifold, and Stiefel manifold. Vita: Dr. Xiuwen Liu is a computer scientist with research interests in computer vision, biologically plausible principles for visual modeling, and image analysis. He has developed and implemented distinctive computational texture models that provide quantitative explanations for texture perception as well as successful applications in texture synthesis, image segmentation, object recognition, and terrain classification. His work with Dr. Srivastava is the first to computationally derive sparse filters that are optimal for recognition. His expertise in efficient computing has helped translate many theoretical ideas into efficient practical algorithms. He is a co-director of the Center for Applied Vision and Imaging Sciences (CAVIS) at the Florida State University. Dr. Anuj Srivastava has expertise in the areas of statistical image analysis, signal processing, and statistical inferences on nonlinear manifolds. He has worked on applications of automated target recognition (ATR), derived performance bounds for statistical ATR, derived new probability models for natural images, derived nonlinear filtering algorithms on Lie groups and their quotient spaces, and is currently working on algorithms for statistical shape analysis. He directs the Center for Applied Vision and Imaging Sciences (CAVIS) at the Florida State University.
Dr. Washington Mio
is a geometric topologist with expertise in the areas of topology and
geometry of manifolds, who is currently working on problems in computer
vision and pattern recognition. Dr. Mio’s present investigations include the
algorithmic study of the differential geometry of various spaces of curves,
energy functionals on these spaces, and applications to shape and image
analysis. His previous research includes the study of the geometric topology
of manifolds and generalized manifolds, group actions on these spaces, knot
theory, and submanifold and map transversality. Dr. Mio is a co-director of
CAVIS. Integrating Language and Cognition: New Results in Computational Intelligence Leonid Perlovsky,
Ph.D. During the last decades computational intelligence methods have being developed taking advantage of the known mechanisms of the mind and brain. Much is known; still the knowledge is far from complete. Among the big unknown are relationships between cognition and language: do we think with words? Or are words just labels used at the end of the thinking process? The talk reviews the known mechanisms of cognition and suggests a computational mechanism that the mind might use for integrating cognition and language. Mathematics is related to results of other sciences studying the mind and brain. Mathematical descriptions of conscious and unconscious, imagination, symbols, concepts and emotions, including aesthetic emotion will be briefly discussed. The talk will review recent results in origin and evolution of language. Applications to recognition, fusion, and Internet search engines based on language understanding will be discussed. The course focuses on the current understanding of the fundamental principles of working of the mind in the areas of cognition and language, their computational implementations, and practical applications. The course discusses the mind mechanisms, including concepts, emotions, instincts, behavior, language, cognition, understanding, thinking, intuitions, conscious and unconscious, abilities for formation of symbols and aesthetic feelings. Computational techniques are given for these mechanisms and abilities. The goals of the course are: First, to provide a basic mathematical understanding of the working of the mind. Second, to demonstrate practical applications of these mechanisms for pattern recognition, tracking, fusion, search engines, and for integrated systems combining sensor signals and communication data. Third, to outline future research directions. Historical and current difficulties in developing intelligent systems (IS) and applications will be discussed along with how the mind and new computational techniques overcome these difficulties. By the end of the course, students will be familiar with several general applications addressed by IS, especially, smart search engines, computational difficulties encountered over fifty years, and basic novel approaches to overcoming these difficulties. Targeted for: Individuals interested in the development and application of intelligent systems and intelligent signal processing, especially in the area of smart search engines based on language understanding. This course assumes a basic understanding of theory of probability. Keywords: mind, brain, concepts, emotions, instincts, cognition, language, recognition, tracking, fusion, search engines Vita:
Dr. Leonid Perlovsky is Principal Research
Physicist and Technical Advisor at the Air Force Research Laboratory.
Previously, from 1985 to 1999, he served as Chief Scientist at Nichols
Research, a $0.5 B high-tech organization, leading the corporate research in
information science, intelligent systems, neural networks, optimization,
sensor fusion, and algorithm development. In the past he served as professor
at Novosibirsk University and New York University. He participated as a
principal in commercial startups developing tools for text understanding,
search engines, biotechnology, and financial predictions. He published about
50 papers in refereed scientific journals and about 180 papers in
conferences, delivered invited keynote plenary talks and authored a book
"Neural Networks and Intellect: model-based concepts", Oxford University
Press, 2001. Dr. Perlovsky serves on IEEE Computational Intelligence
Technical Committee, Computational Intelligence Society Multimedia Tutorial
Committee, Chair IEEE Boston Computational Intelligence Chapter, IEEE World
Congress on Computation Intelligence Operational Committee, Program Chair
for IEEE International Conference on Computational Intelligence Measurement,
General Chair for IEEE KIMAS conference, and as Editor-in-Chief for an
Elsevier journal “Physics of Life Reviews”. Biologically Plausible Neural Networks Joao Luıs Garcia Rosa The objectives of the proposed tutorial are to provide artificial neural networks notions and to show how this intelligent computational tool is related to neuroscience. Why current artificial neural network models are so distant from biological ones? Existing connectionist mathematical models are compared with biological neurons and networks and the main differences are stressed. Desired characteristics in biologically plausible artificial neural networks and their corresponding elements in cerebral cortex are discussed. Neuroscience basic concepts about neuron physiology, its connections and the physical chemistry of central nervous system are approached. Several biological features are being taken into account in order to achieve new models that restore the artificial neural systems first concerns. Connectionist models based on neuroscience are about to be considered the next generation of artificial neural networks, inasmuch as nowadays models are far from biology, mainly for mathematical simplicity reasons. The tutorial encompasses the biological neuron, chemical and electrical synapses, the roles of transmitters and receptors, and the neuromodulators of the synaptic function. How the brain became a model to such machine learning technique? Conventional and biologically plausible connectionist algorithms are presented. Why backpropagation, the most used supervised connectionist algorithm, is considered biologically implausible? Some connectionist architectures are taken into account: feedforward, recurrent, and bi-directional. The latter structure is considered more physiologically realistic. A number of applications are presented. They show that the alleged biologically plausible artificial neural networks are even more computationally efficient than conventional ones. Keywords: Artificial neural networks, Biologically plausible connectionist architectures, Biologically plausible connectionist algorithms, Computational neuroscience. Vita: João Luís Garcia Rosa is
Titular Professor of Pontifical Catholic University of Campinas (PUC-Campinas),
São Paulo, Brazil. He got his doctorate in Computational Linguistics from
Linguistics Department, Language Studies Institute, State University of
Campinas (Unicamp), Sao Paulo, Brazil, in 1999, and his master's degree in
Electrical Engineering (Artificial Intelligence), from Department of
Computer Engineering and Industrial Automation, School of Electrical and
Computer Engineering, State University of Campinas in 1993. He was graduated
in Electrical Engineering (Automation and Electronics) from the latter
department in 1983. His research interests are: Artificial Neural Networks,
Natural Language Processing, Computational Psycholinguistics, and
Biologically Plausible Connectionist Models. His academic experience: taught
postgraduate level courses for masters level on computing systems:
Intelligent Systems, Artificial Neural Networks, and Natural Language
Processing disciplines, and taught graduate level courses for Computer
Engineering and System Analysis: Programming Languages, Artificial
Intelligence, and Formal Languages and Automata disciplines, at Pontifical
Catholic University of Campinas (PUC-Campinas), Campinas, Sao Paulo, Brazil.
Since 1998, he contributes to the field of Biologically Plausible Artificial
Neural Networks with published papers and supervision of undergraduate and
graduate students. Data Visualization of High Dimensional Scientific Data Antonino Staiano and
Roberto Tagliaferri The recent technological advances are producing huge data sets in almost all fields of scientific research, from astronomy to genetics. Although each research field often requires ad-hoc, fine tuned, procedures to properly exploit all the available information inherently present in the data, there is an urgent need for a new generation of general computational theories and tools capable to boost most human activities of data analysis. Traditional data analysis methods, in fact, are inadequate to cope with such exponential growth in the data volume and especially in the data complexity (ten or hundreds of dimensions of the parameter space). Among the data mining methodologies, visualization plays a key role in developing good models for data especially when the quantity of data is large. For a scientist, i.e. the expert in a specific domain, is essential the need for a visual environment that facilitates exploring high-dimensional data dependent on many parameters. Data visualization is an important means of extracting useful information from large quantities of raw data. The human eye and brain together make a formidable pattern detection tool, but for them to work the data must be represented in a low-dimensional space, usually two or three dimensions. Even quite simple relationship can seem very obscure when the data is presented in tabular form, but are often very easy to see by visual inspection. Many algorithms for data visualization have been proposed by both neural computing and statistics communities, most of which are based on a projection of the data onto a two or three dimensional visualization space. This tutorial embraces a number of these visualization techniques both linear and nonlinear: • Principal Component Analysis (PCA), a classical linear projection method for mapping the data to a lower dimensional space. PCA does this with a linear transformation that preserves as much data variance as possible. • Probabilistic PCA. The disadvantage of PCA is that it does not define a generative model and indeed there is no principled interpretation of its error function. However classical PCA can be made into a density model by using a latent variable approach, derived from factor analysis, i.e. Probabilistic Principal Component Analysis (PPCA). PPCA is a Gaussian mixture model in which each Gaussian has a covariance matrix which is diagonal and has a single variance parameter to describe all the variance in each component. • Mixture of PPCA. Because PCA only defines a linear projection of data, it is a rather limited technique. One way around this is to use a collection of local linear models. The attraction of this is that each model is simpler to understand and usually easier to fit. PCA, PPCA and mixture of PPCA are appropriate when the data is linear or approximately piece-wise linear. An alternative approach is to use global nonlinear methods: • Self Organizing Maps (SOM), a neural network algorithm based on a competitive learning which summarizes a set of data vectors in a high-dimensional space by a set of reference vectors organized on a lower dimensional sheet (usually two dimensional). SOM has been used for a wide variety of applications thanks to its simplicity and for its several plotting options. Although SOM provides easy of computation and powerful visualizations it, indeed, does not define any density model and suffers of other drawbacks which can be overcame employing nonlinear latent variable models; • Generative Topographic Mapping (GTM), a nonlinear latent variable model in which the density is a Gaussian mixture model. The centre of each Gaussian is constrained to lie on a smooth manifold of low dimension (usually 1 or 2) but all Gaussians have a common spherical covariance; • Probabilistic Principal Surfaces (PPS), a generalization of GTM with oriented covariance. PPS allows a spherical manifold in 3D space which is better suitable to characterize high-dimensional data which usually is sparse and located at the periphery (curse of dimensionality). While such algorithms can usefully display the structure of simple and more complex data sets, they often prove inadequate in the face of data sets which are too complex. A single two or three dimensional projection, even if it is nonlinear, may be insufficient to capture all the interesting aspects of the data set. For example, the projection which best separates two clusters may not be the best for revealing internal structure within one of the clusters. This motivates the consideration of a hierarchical model involving multiple two or three dimensional visualization spaces. The goal is that the top-level projection should display the entire data set, perhaps revealing the presence of clusters, while lower-level projections display internal structure within individual clusters, such as the presence of subclusters, which might not be apparent in higher-level projections. Therefore, the tutorial reviewes hierarchical linear (based on mixture of PPCA) and nonlinear (based on GTM) latent variable models and concludes by illustrating a new proposed hierarchical model based on Probabilistic Principal Surfaces. Keywords: Informatics, Data mining Vita:
Antonino Staiano is a post-doctoral research fellow at the Department of
Mathematics and Computer Science, University of Salerno, Italy. He got his
PhD in Computer Science at University of Salerno in 2003, discussing a
thesis on unsupervised neural network models for the extraction of
scientific information from astronomical databases. Since October 2000 he
has been working within the AstroNeural collaboration project (a joint
project together with the Department of Physical Science, University
Federico II of Naples) to the development of neural -based tools for data
visualization and clustering for astronomical and genetic data mining. He
has worked at European Southern Observatory (ESO) - Space Telescope European
Coordinating Facility and is currently involved in the European Virtual
Observatory. Fields of interest and research: Data Mining, Data
Visualization, Statistical Pattern Recognition, Bioinformatics, Machine
Learning, Ensemble of learning systems, Evolutionary Algorithms. Support Vector Machines and Kernel Based Learning Johan Suykens The use of kernel based learning techniques has become increasingly popular, largely stimulated by the work on support vector machines as originally introduced within the context of statistical learning theory. Standard support vector machines for classification and regression lead to solving convex optimization problems. The problem formulation and solution is characterized by a primal and dual problem where multilayer neural network interpretations can be given in both worlds. In contrast with many classical models, support vector machines are able to learn and generalize in huge dimensional input spaces. The method makes use of a high dimensional feature map (which should not be explicitly known) in relation to a Mercer kernel (the so-called kernel trick). Notions as large margin classification and regularization techniques play an important role at this point. The method can also be viewed as a functional optimization problem in reproducing kernel Hilbert spaces (RKHS). The kernel trick has been further applied to a wider variety of problems such in kernel Fisher discriminant analysis (KFDA), kernel principal component analysis (KPCA), kernel canonical correlation analysis (KCCA), kernel partial least squares (KPLS), kernel clustering and others. Optimization formulations with primal and dual problem has been given with least-squares support vector machines (LS-SVM), thereby extending support vector machine methodology to a wider range of problems for regression, classification, supervised and unsupervised learning, recurrent networks and control. One often has the choice to either solve the primal or the dual problem depending on the dimensionality of the feature map (or its approximation) and select the most suitable representation for the given problem at hand (e.g. high dimensional input space or large data sets). Kernels have also been customized towards specific application areas such as in textmining, bioinformatics or in relation to graphical models. Novel techniques of hierarchical kernel machines even allow to find the model together with the tuning parameters by solving a convex problem. At the same time one can achieve in this way e.g. sparse representations, stability of learning machines and input selection. In this tutorial we outline the main concepts of support vector machines and kernel based learning, show interesting research directions and illustrate the methods with many successful real-life examples in different areas such as e.g. time-series prediction, nonlinear modelling and microarray data analysis. Keywords: Support vector machines, kernel based learning, convex optimization, primal and dual problem, supervised and unsupervised learning
Links:
Vita:
Johan A.K. Suykens is an associate professor
with K. U. Leuven Belgium. His research interests are mainly in the areas of
the theory and application of neural networks and nonlinear systems. He is
author of the books "Artificial Neural Networks for Modelling and Control of
Non-linear Systems" (Kluwer Academic Publishers) and "Least Squares Support
Vector Machines" (World Scientific) and editor of the books "Nonlinear
Modeling: Advanced Black-Box Techniques" (Kluwer Academic Publishers) and
"Advances in Learning Theory: Methods, Models and Applications" (IOS Press).
In 1998 he organized an International Workshop on Nonlinear Modelling with
Time-series Prediction Competition. He is a Senior IEEE member and has
served as associate editor for the IEEE Transactions on Circuits and Systems
- Part I (1997-1999) and Part II (since 2004) and since 1998 he is serving
as associate editor for the IEEE Transactions on Neural Networks. He
received an IEEE Signal Processing Society 1999 Best Paper Senior Award and
several Best Paper Awards at International Conferences. He is a recipient of
the International Neural Networks Society INNS 2000 Young Investigator
Award for significant contributions in the field of neural networks. He has
served as Director and Organizer of a NATO Advanced Study Institute on
Learning Theory and Practice (Leuven 2002) and as a program co-chair for the
International Joint Conference on Neural Networks IJCNN 2004. Unsupervised Learning Real World Applications Prof. Harold Szu The tutorial presents unsupervised learning artificial neural networks (ANN), which yields unbiased computational intelligence (CI), information acquisition, feature extraction and knowledge discovery. Their applications in bioinformatics, brain study, unmanned robot control, finance and business are briefly presented. The necessary and sufficient conditions are that (1) all CPU Brain is always on all the time and is kept at an isothermal equilibrium temperature 37oC and (2) each CPU Brain is equipped with power of 5 sensors pairs e.g. two eyes, two ears, etc. altogether 10 dimensional vector time series for input data space. The overwhelming incoming excitations must not be replenished, a natural choice is letting fluctuating disagreements so-called noise decay away, and only reinforce coincident signals so-called features. This biomimetics is our minimum free energy principle of unsupervised feature extraction & fusion methodology. The tutorial proves these conditions (1) (2) constituted the basis of unsupervised learning ANN and real world applications in remote sensing and early tumor detection are given for computer scientists, neuroscientists, biologist, engineers and graduate students. Keywords: Unsupervised Learning, Computational Intelligence, Knowledge-based neural networks, Computational Neurogenetic modeling. Vita: Harold Szu received in 1971 Ph. D. in Physics/Statistical Mechanics from Prof. George Uhlenbeck at the Rockefeller University, New York, N.Y. and worked at Naval Research Lab Wash DC in three Divisions, and became the Information Science Group Leader of Naval Surface Warfare Center at White Oak (1990-1996) and at Dahlgren VA(1997-now). He has been a Fellow of
SPIE (since 1995) for neural nets, Fellow of OSA (1996) for adaptive
wavelets, Fellow of IEEE (1997) for bi-sensor fusion, Fellow of Am. Inst.
Med. & Bio. Eng. 2004 for medical image diagnoses. Dr. Szu is a Champaign
of brain-style computing for decades, a founder, former president, and a
current governor of International Neural Network Society (INNS), He has
contributed to the establishment of 30 SIG/Chapter worldwide. Notably, he
has scientifically contributed to the unsupervised redundancy reduction of
sensory pair fusion by postulating the thermodynamic free energy for
learning. He received INNS D. Gabor Award, in 1997, “for outstanding
contribution to neural network applications in information sciences and
pioneer implementations of fast simulated annealing search," and Italy
Academy the Eduardo R. Caianiello Award, 1999, for “elucidating and
implementing a chaotic neural net as a dynamic realization for fuzzy logic
membership function." Dr. Szu is a foreign academician of Russian Academy
of Nonlinear Sciences in 1999 for the homeostasis unsupervised learning
based on constant Cybernetic Temperature. Feature Extraction in Computational Intelligence Evangelia
Micheli-Tzanakou, PhD One of the major problems a researcher faces is what is learned from data obtained by various methods and different techniques. This tutorial will discuss and compare topics such as: Statistical Advances and Challenges, as well as Feature Extraction in Computational Intelligence methodologies. Often a simple model describes the data well, simply because the S/N ratio is too small for detection of more complex structures-which for example is the case with medical data involving human subjects. One has a lot of variability both in intra- and inter-sets of data. Some important Simple Tools that have been used for a long time are: Linear Regression, Discriminant analysis, Principal Component Analysis etc. In all of these, the size of the data set matters. Huge data sets create memory problems. The question is how do we handle different data types and how do we handle them? What if the data are correlated? What if we have complex data structures? In this tutorial you will learn more about Computational Intelligence, how to get to know your data and how to do Feature Extraction. Some examples of "features" will be given and different feature extraction methods will be discussed. Keywords: Vita: Dr. Tzanakou is Professor II and director of the Computational Intelligence Laboratories (2000-present); Chair of Biomedical Engineering Dept at Rutgers University (1990-2000);established the BME Undergraduate curriculum in 1999. Has an active research group and has published over 250 scientific papers. Author/co-author of 2 books. Has graduated 37 MS and PhDs. Fellowships: IEEE Fellow (1992);Founding Fellow, American Institute for Medical & Biological Eng. (AIMBE), 1993; Member of Sigma Xi and Eta Kappa Nu; Listed in American Men and Women in Science; Honorary Member of the British Brain Research Association; Honorary Member of the European Brain and Behavior Society. Awards: NJ Women of Achievement Award, 1995; Featured in the “Notable Twentieth Century Scientists, 1994; Achievement Award Recipient, Society of Women Engineers, 1992; Outstanding IEEE Branch Advisor/Counselor Award, 1985 ; cited as a Pioneer in the IEEE Web page (http://www.ieee.org/innovators). Editorships: Editorial
Board, IEEE Trans. On Nano-BioScience (2002-present); Editorial Board
Biomedical Engineering On Line (2001-present ); Associate Editor, IEEE
Transactions on Neural Networks (2000-present); Book Series Editor in
Biomedical Engineering, Plenum/Klewer (1999-present); Book Series Editor,
Biomedical Systems, IES Book series, CRC Press (1997-present); Editorial
Board, IEEE Trans. Biomedical Information Tech.(1997-2001); Editorial Board,
International J. On Advanced Computational Intelligence (1997-1999);
Associate Editor, IEEE Transactions on Neural Networks (1989-1992). Neural Networks That Actually Work In Prediction and Decision/Control: Common Misconceptions Versus Real-World Success Paul J. Werbos Neural nets are used in a growing variety of real-world applications, in a growing number of areas (including the modeling of natural intelligence). But this has been a mixed blessing, as various subcultures have not kept up with each other. For example, at http://www.eas.asu.edu/~nsfadp/ applications of adaptive dynamic programming (aka advanced reinforcement learning) are reported which beat the previous best performance in electric power grid control, aircraft control, in large-scale logistics, etc. -- but best performance requires integrating concepts from many subcultures. Yet in some subcultures, people are still using "mappings from sensory to motor coordinates" (direct adaptive control), whose limits were clear years ago. Some are using indirect methods inspired by control theory with well-known stability theorems -- theorems whose conditions are rarely satisfied, and methods which are easily improved upon. Similarly, in prediction, some subcultures falsely believe that recurrent networks are hard to train and of marginal benefit, even as people in industry use them routinely, and report performance better than extended Kalman filters and equal (at lower cost) to elaborate particle filter methods. The foundations of learning also bear on the practical choices. This tutorial will try to offer a kind of practical roadmap of these issues, and suggest how it points towards future functionality in tasks as large-scale as what brains can handle. Keywords: adaptive dynamic programming (advanced reinforcement learning) Vita: Paul J. Werbos is best known as the original inventor of backpropagation, as part of his Harvard PhD thesis, which was reprinted in full in his book the Roots of Backpropagation, Wiley 1994, along with his classic 1990 tutorial on backpropagation through time for Proc IEEE. Also, his 1967 article in Cybernetica first proposed the idea of approximating dynamic programming as a way to improve reinforcement learning, elaborated in many later papers. He was one of the three original two-year presidents of the International Neural Network Society, and winner of the IEEE Pioneer Award. He is Program Director for Control, Networks and Computational Intelligence at NSF, which actively seeks more proposals in this area. He has also been active in many cross-cutting funding initiatives; for example, he serves on the Working Group for energy storage and distribution of the interagency Climate Change Technology Program, and coordinated the NASA-NSF-EPRI solicitation on space solar power (NSF 02-098). He is also on the Planning Committee of the Millennium Project of the United Nations University (http://millennium-project.org), and has published a few papers on quantum foundations and technology (see arXiv.org, physics and nonlinear systems). He also has two degrees in economics from Harvard and the London School of Economics. He is on the governing
boards of INNS and of the IEEE Industrial Electronics Society. Cognitive Memory Bernard Widrow
Regarding the workings of the human mind, memory and
pattern recognition seem to be intertwined. You generally do not have one
without the other. Taking inspiration from life experience, a new form of
computer memory has been devised. It has been used successfully in diverse
applications such as visual aircraft identification, aircraft navigation,
and human facial recognition. Other uses are being explored. The basic
idea will have many new areas of application.
Keywords: cognitive memory, pattern recognition, applications Vita: Bernard Widrow received the S.B., S.M., and Sc.D. degrees in Electrical Engineering from the Massachusetts Institute of Technology in 1951, 1953, and 1956, respectively. He joined the MIT faculty and taught there from 1956 to 1959. In 1959, he joined the faculty of Stanford University, where he is currently Professor of Electrical Engineering. He began research on adaptive filters, learning processes, and artificial neural models in 1957. Together with M.E. Hoff, Jr., his first doctoral student at Stanford, he invented with LMS algorithm on in the Autumn of 1959. Today, this is the world's most widely used learning algorithm. He has continued working on adaptive signal processing, adaptive controls, and neural networks since that time. Dr. Widrow is a Life Fellow of the IEEE and a Fellow of AAAS. He received the IEEE Centennial Medal in 1984, the IEEE Alexander Graham Bell Medal in 1986, the IEEE Signal Processing Society Medal in 1986, the IEEE Neural Networks Pioneer Medal in 1991, the IEEE Millennium Medal in 2000, and the Benjamin Franklin Medal for Engineering from the Franklin Institute of Philadelphia in 2001. He was inducted into the National Academy of Engineering in 1995 and into the Silicon Valley Engineering Council Hall of Fame in 1999. Dr. Widrow is a past
president and currently a member of the Governing Board of the International
Neural Network Society. He is a member of the AdCom of the IEEE
Computational Intelligence Society. He is associate editor of several
journals and is the author of over 100 technical papers and 18 patents. He
is co-author of "Adaptive Signal Processing" and "Adaptive Inverse Control,"
both Prentice-Hall books. A new book, "Quantization Noise," is in
preparation. |
||||||
|
|
||||||
|
This page (http://www.mlpadgett.org/IJCNN05tutorials) was last modified on June 15, 2005. Contact: m.padgett(at)ieee.org |
||||||