Average customer rating:
|
Bioinformatics and Computational Biology Solutions Using R and Bioconductor (Statistics for Biology and Health)
Manufacturer: Springer ProductGroup: Book Binding: Hardcover Similar Items:
Accessories:
ASIN: 0387251464 |
Book Description
Bioconductor is a widely used open source and open development software project for the analysis and comprehension of data arising from high-throughput experimentation in genomics and molecular biology. Bioconductor is rooted in the open source statistical computing environment R.
This volume's coverage is broad and ranges across most of the key capabilities of the Bioconductor project, including importation and preprocessing of high-throughput data from microarray, proteomic, and flow cytometry platforms:
Curation and delivery of biological metadata for use in statistical modeling and interpretation
Statistical analysis of high-throughput data, including machine learning and visualization
Modeling and visualization of graphs and networks
The developers of the software, who are in many cases leading academic researchers, jointly authored chapters. All methods are illustrated with publicly available data, and a major section of the book is devoted to exposition of fully worked case studies.
This book is more than a static collection of descriptive text, figures, and code examples that were run by the authors to produce the text; it is a dynamic document. Code underlying all of the computations that are shown is made available on a companion website, and readers can reproduce every number, figure, and table on their own computers.
Customer Reviews:
technically accurate but pedagogically flawed.......2007-02-09
Book contains many chapters to help get you started.......2006-06-30
Average customer rating:
|
Bioinformatics for Geneticists: A Bioinformatics Primer for the Analysis of Genetic Data
Michael R., Ed. Barnes Manufacturer: John Wiley & Sons ProductGroup: Book Binding: Paperback Similar Items:
ASIN: 0470026200 |
Book Description
A fully revised version of the successful First Edition, this one-stop reference book enables all geneticists to improve the efficiency of their research.The study of human genetics is moving into a challenging new era. New technologies and data resources such as the HapMap are enabling genome-wide studies, which could potentially identify most common genetic determinants of human health, disease and drug response. With these tremendous new data resources at hand, more than ever care is required in their use. Faced with the sheer volume of genetics and genomic data, bioinformatics is essential to avoid drowning true signal in noise. Considering these challenges, Bioinformatics for Geneticists, Second Edition works at multiple levels: firstly, for the occasional user who simply wants to extract or analyse specific data; secondly, at the level of the advanced user providing explanations of how and why a tool works and how it can be used to greatest effect. Finally experts from fields allied to genetics give insight into the best genomics tools and data to enhance a genetic experiment.
Hallmark Features of the Second Edition:
Praise for the First Edition:
”…a very valuable and important resource for bringing bioinformatics into the work practice of geneticists… we strongly recommend this book…” CLINICAL CHEMISTRY, 2004
”…a useful addition to the library of a seasoned scientist… a useful resource in itself, cataloguing the ‘how’ and ‘why’…” BRIEFINGS IN BIOINFORMATICS, June 2004
Bioinformatics for Geneticists, Second Edition describes the key bioinformatics and genetic analysis processes that are needed to identify human genetic determinants. The book is based upon the combined practical experience of domain experts from academic and industrial research environments and is of interest to a broad audience, including students, researchers and clinicians working in the human genetics domain.
Customer Reviews:
A Timely Primer for Researchers about the Analysis of Genetic Data.......2007-08-23
Average customer rating: |
Analysis of Phylogenetics and Evolution with R (Use R)
Emmanuel Paradis Manufacturer: Springer ProductGroup: Book Binding: Paperback Similar Items:
Accessories:
ASIN: 0387329145 |
Book Description
The increasing availability of molecular and genetic databases coupled with the growing power of computers gives biologists opportunities to address new issues, such as the patterns of molecular evolution, and re-assess old ones, such as the role of adaptation in species diversification.
This book integrates a wide variety of data analysis methods into a single and flexible interface: the R language. This open source language is available for a wide range of computer systems and has been adopted as a computational environment by many authors of statistical software. Adopting R as a main tool for phylogenetic analyses will ease the workflow in biologists' data analyses, ensure greater scientific repeatability, and enhance the exchange of ideas and methodological developments.
Graduate students and researchers in evolutionary biology can use this book as a reference for data analyses, whereas researchers in bioinformatics interested in evolutionary analyses will learn how to implement these methods in R. The book starts with a presentation of different R packages and gives a short introduction to R for phylogeneticists unfamiliar with this language. The basic phylogenetic topics are covered: manipulation of phylogenetic data, phylogeny estimation, tree drawing, phylogenetic comparative methods, and estimation of ancestral characters. The chapter on tree drawing uses R's powerful graphical environment. A section deals with the analysis of diversification with phylogenies, one of the author's favorite research topics. The last chapter is devoted to the development of phylogenetic methods with R and interfaces with other languages (C and C++). Some exercises conclude these chapters.
Average customer rating:
|
Statistical Methods in Bioinformatics
Warren J. Ewens , and Gregory R. Grant Manufacturer: Springer ProductGroup: Book Binding: Hardcover Similar Items: Accessories:
ASIN: 0387952292 |
Book Description
Advances in computers and biotechnology have had an immense impact on the biomedical fields, with broad consequences for humanity. Correspondingly, new areas of probability and statistics are being developed specifically to meet the needs of this area. There is now a necessity for a text that introduces probability and statistics in the bioinformatics context. This book also describes some of the main statistical applications in the field, including BLAST, gene finding, and evolutionary inference, much of which has not yet been summarized in an introductory textbook format. This book grew out of a need to teach bioinformatics to graduate students at the University of Pennsylvania. At the same time however, it is organized to appeal to a wider audience. In particular it should appeal to any biologist or computer scientist who wants to know more about the statistical methods of the field, as well as to a trained statistician who wishes to become involved in bioinformatics. The earlier chapters introduce the concepts of probability and statistics at an elementary level, and will be accessible to students who have only had introductory calculus and linear algebra. Later chapters are immediately accessible to the trained statistician. Only a basic understanding of biological concepts is assumed, and all concepts are explained when used or can be understood from the context. Several chapters contain material independent of that in other chapters, so that the reader interested in certain areas can proceed directly to those areas.Warren Ewens is Professor of Biology at the University of Pennsylvania. He is the author of two books, Population Genetics and Mathematical Population Genetics, and has served on the editorial boards of Theoretical Population Biology, GENETICS, Proceeding of the Royal Society B and SIAM Journal in Mathematical Biology. He was recently awarded the Gold Medal of the Australian Statistical Society and elected as Fellow of the Royal Society. His research interests are in evolutionary population genetics, linkage analysis for human diseases, and bioinformatics.
Gregory Grant is a bioinformatics researcher at the University of Pennsylvania in the Computational Biology and Informatics Laboratory (CBIL), where he has been since 1998. In 1995 he received a Ph.D. in Mathematics from the University of Maryland and in 1999 a Masters in Computer Science from the University of Pennsylvania. His research interests are in bioinformatics in general and in particular in the statistical analysis of gene expression data and significance testing methods for IBD-mapping.
Customer Reviews:
Misleading title!.......2004-12-12
Great all-around review of probability .......2004-08-17
Disappointing overview.......2003-11-12
A topic such as the two-sample t-statistic is scattered throughout the book, with the main part not even cited in the index!
Unfortunately there are not a lot of books in the field of Statistics in Bioinformatics. However, I would recommend "The Elements of Statistical Learning" (Hastie et al.) for classifiers etc (Duda and Hart's classic is also good). I would recommend "Biostatistical Analysis" by Zar for a general coverage, and Terry Speed's "stat Labs: Mathematical Statistics ..." which is not comprehensive but has good lab examples with associated statistical analysis.
Pretty good overview.......2002-09-19
Chapter one begins, appropriately, with an introduction to probability theory, with a consideration of discrete probability distributions of one variable beginning the chapter. The Bernoulli, binomial, uniform, geometric, generalized geometric, and Poisson distributions are discussed. The authors point out the use of geometric-like distributions in the BLAST application. The also caution the reader as to the difference between the mean and the average of a random variable. They then move on to consider continuous distributions, discussing briefly the uniform, Normal, exponential, gamma, and beta distributions. Moment-generating functions are also introduced, and they prove a "convexity" theorem for these functions that is important in the BLAST application. The authors also introduce the relative entropy and generalized support statistics, the later also being used in BLAST.
The next chapter is an overview of probability theory in many random variables. The results in chapter one are discussed in this context, and the authors give an interesting application to the sequencing of EST libraries. The authors also point out that the variance of the maximum of a collection random variables is finite as the number of variables increases, a fact that is used quite often in bioinformatics. Transformations of random variables are also discussed, with the goal of showing how these can be used to find the density function of a single random variable, this also being important in BLAST.
The most important subject of the book begins in chapter 3, wherein the authors introduce statistical inference. They begin with a very brief discussion of the differences between the frequentist and Bayesian approaches to statistical inference and then move on to classical hypothesis testing and nonparametric tests. This chapter is of great value to those readers, for example biologists/would-be bioinformaticists who are approaching statistics for the first time.
Chapter 4 introduces concepts that are of upmost importance in probabilistic computational biology, namely Markov chains. The discussion in this chapter sets up the strategies used in the next chapter on analyzing a single DNA sequence and a latter chapter on hidden Markov models. Shotgun sequencing is discussed as a tool to determine the an actual DNA sequence, and the authors discuss the probabilistic issues that arise in the reconstruction of long DNA sequences from shorter sequences. Missing in this chapter is a mathematical analysis of the advantages/disadvantages between shotgun and whole genome sequencing strategies.
Chapter 6 then generalizes the analysis of chapter 5 to multiple DNA and protein sequences. It is here that one begins to talk about alignments between sequences, which bring about some very subtle mathematical problems in computational biology. The computational complexity of the (global) alignment problem entails the use of softer techniques, such as dynamic programming, which is discussed in this chapter. The (local) alignment problem is also discussed in some detail, using the linear gap model. The alignment problem and the issues with scoring for protein sequences are also discussed in detail. The reader first encounters the famous PAM and BLOSUM matrices in this chapter. The authors do not discuss any connections with the protein folding problem, unfortunately.
The next chapter introduces the basic probability theory behind the BLAST algorithm, namely random walks. They do so with emphasis on moment generating functions, which might be a little abstract for the biologist reader.
The authors return to tatistical estimation and hypothesis testing in chapter 8, with maximum liklihood and fixed sample size tests discussed in some detail. Again connecting with the BLAST algorithm, the sequential probability ratio test is treated.
The authors finally get down to the BLAST algorithm in chapter 9, using an older version of the software (1.4). The connection of the algorithm with random walks and how to assign scores is immediately apparent, as is the ability of BLAST to do database queries against a chosen sequence. The algorithm is compared with the sequential analysis discussed in the last chapter.
The authors return to Markov chains in chapter 10, and give some numerical examples. In addition, they treat the important topic of Markov chain Monte Carlo via the Hastings-Metropolis algorithm, Gibbs sampling, and simulated annealing. An application of simulated annealing to the double digest problem is described. The authors also spend a litte time discussing continuous-time Markov chains.
Hidden Markov models are finally discussed in chapter 11. These have been the most effective tools in sequence analysis and the authors give a nice overview of their construction and properties in this chapter. The Pfam package is discussed as a software implementation of HMMs for determining protein domains. Unfortunately, they do not discuss the excellent package HMMER for implementing HMMs in sequence analysis.
Chapter 12 discusses computationally intensive methods in classical inference. One of these methods, the bootstrap procedure, which is used for large sample sizes, is described. Used to estimate confidence intervals in situations where there is not enough information to employ classical methods, the authors detail a method using quantiles to estimate the confidence interval for the standard deviation of the expression intensity of a gene. This is followed by a return to the multiple testing problem of chapter 3 in the context of the data analysis of expression arrays.
I did not read the last two chapters on evolutionary models and phylogenetic tree estimation so I will omit their review.
guide into the right direction.......2001-09-06
This book is the first exception I know of. It builds, and rests on, solid foundations of genetic stochastic processes and still goes all the way to real-life problems. Let me illustrate this by means of an example, rather than enumerating all the topics in the book.
Chap. 14, entitled `phylogenetic tree estimation' (as opposed to the more common term `phylogenetic tree reconstruction' - not without reason, I presume) builds on, and is firmly interlaced with, Chap. 13 about `evolutionary models', which systematizes the zoo (if not jungle) of substitution models in both discrete and continuous time. On this basis, the overview of tree-building methods makes a lot of sense. Even better, it does not stop here, but presents an application (to real sequence data), followed by a careful analysis of where the various methods agree, and where - and maybe why - they disagree. This way, it clears away some common misconceptions; in particular, it presents a careful analysis of what bootstrap does and what it does not in this context. The chapter closes with a discussion of unresolved problems (like inhomogeneity of substitution rates), and methods and possible pitfalls related to testing of nested and non-nested hypotheses in tree estimation.
The book is written in an informal style without being imprecise, which makes it pleasant reading. It is particularly suitable for teaching at a high level. This is enhanced by realistic (and even real-life) examples that furnish the text, as well as carefully chosen exercises at the end of each chapter.
Certainly, this first edition of `Statistical Methods in Bioinformatics' cannot be the last word in this fast-moving field. But it is an excellent guide into the `right' direction.
Average customer rating:
|
Biometric Technologies and Verification Systems
John R. Vacca Manufacturer: Butterworth-Heinemann ProductGroup: Book Binding: Paperback Similar Items:
ASIN: 0750679670 |
Book Description
Biometric Technologies and Verification Systems is organized into nine parts composed of 30 chapters, including an extensive glossary of biometric terms and acronyms. It discusses the current state-of-the-art in biometric verification/authentication, identification and system design principles. It also provides a step-by-step discussion of how biometrics works; how biometric data in human beings can be collected and analyzed in a number of ways; how biometrics are currently being used as a method of personal identification in which people are recognized by their own unique corporal or behavioral characteristics; and how to create detailed menus for designing a biometric verification system.Customer Reviews:
Security with a 21st century flair.......2007-10-09
Great source of security information.......2007-10-08
fascinating topic.......2007-10-05
James Bond technology is now common place........2007-09-25
The best book yet on Biometric Technology!.......2007-09-15
Average customer rating:
|
Instant Notes in Bioinformatics (Instant Notes)
D. R. Westhead Manufacturer: BIOS Scientific Publishers ProductGroup: Book Binding: Paperback Similar Items:
ASIN: 1859962726 |
Book Description
Instant Notes titles focus on core information and are designed to help undergraduate students come to grips with a subject quickly and easily.
Instant Notes in Bioinformatics describes what is possible, and the strengths, limitations and potential pitfalls of methods and analyses. The book begins by describing data generation and databases. It also discusses the newer bioinformatics problems associated with structures, expression, proteomics, interactions and pathways. The important areas in bioinformatics are covered to encourage easy learning and brushing-up.
Customer Reviews:
Concise primer. Not bad........2003-12-09
Compared to other primers such as "Developing Bioinformatics Computer Skills", this book contains less unnecessasary figures (e.g., central dogma, etc.), covers wider range of topics, tries to be less verbose.
A drawback is that there is little description at an algorithmic level (e.g., dynamic programming). However, the book does a pretty good job in conveying the main ideas about what such algorithms do and why they are needed. I like this book's concise and accurate presentation style much better than lengthy and confusing style found in many other books (e.g., Bioinformatics - David Mount). Another drawback is that font is small.
Overall, this book is not bad. I think this book's preface tells you what you can expect from this book, so below I excerpted a paragraph.
"We will tell you how to do things, but this is not a software manual for commonly used packages. They have their own manuals that are (mostly) much better than anything we could provide. Many of the methods we describe rely on quite complex mathematical, statistical or computational techniques. Often we choose not to describe these at all, but where we do we have aimed for a simple conceptual understanding."
Average customer rating: |
Evolutionary Bioinformatics
Donald R. Forsdyke Manufacturer: Springer, Inc. ProductGroup: Book Binding: Hardcover ASIN: 0387334181 |
Book Description
Books on bioinformatics which began appearing in the mid 80s primarily served gene-hunters, and biologists who wished to construct family trees showing tidy lines of descent. Given the great pharmaceutical industry interest in genes, this trend has continued in most subsequent texts. These deal extensively with the exciting topic of gene discovery and searching databases, but hardly consider genomes as information channels through which multiple forms and levels of information, including genic information, have passed through the generations. This book identifies the types of information that genomes transmit, shows how competition between different types is resolved in the genomes of different organisms, and identifies the evolutionary forces involved. The early chapters relate the form of information with which we are most familiar, namely written texts, to the DNA text that is our genome. This lends itself well to introducing historical aspects dating back to the nineteenth century.
Average customer rating: |
Advanced Data Mining and Applications: Third International Conference, ADMA 2007, Harbin, China, August 6-8, 2007 Proceedings (Lecture Notes in Computer Science)
Manufacturer: Springer ProductGroup: Book Binding: Paperback ASIN: 3540738703 |
Book Description
This book constitutes the refereed proceedings of the Third International Conference on Advanced Data Mining and Applications, ADMA 2007, held in Harbin, China in August 2007.
The 44 revised full papers and 15 revised short papers presented together with the abstract of 1 invited lecture were carefully reviewed and selected from about 200 submissions. The papers focus on advancements in data mining and peculiarities and challenges of real world applications using data mining. The major theme of the conference encompasses the innovative applications of data mining approaches to real-world problems that involve large data sets, incomplete and noisy data, or demand optimal solutions.
Average customer rating: |
Advances in Intelligent Data Analysis V: 5th International Symposium on Intelligent Data Analysis, IDA 2003, Berlin, Germany, August 28-30, 2003, Proceedings (Lecture Notes in Computer Science)
Manufacturer: Springer ProductGroup: Book Binding: Paperback ASIN: 3540408134 |
Book Description
This book constitutes the refereed proceedings of the 5th International Conference on Intelligent Data Analysis, IDA 2003, held in Berlin, Germany in August 2003.
The 56 revised papers presented were carefully reviewed and selected from 180 submissions. The papers are organized in topical sections on machine learning, probability and topology, classification and pattern recognition, clustering, applications, modeling, and data processing.
Average customer rating: |
Advances in Intelligent Data Analysis VII: 7th International Symposium on Intelligent Data Analysis, IDA 2007, Ljubljana, Slovenia, September 6-8, 2007, Proceedings (Lecture Notes in Computer Science)
Manufacturer: Springer ProductGroup: Book Binding: Paperback ASIN: 3540748245 |
Book Description
This book constitutes the refereed proceedings of the 7th International Conference on Intelligent Data Analysis, IDA 2007, held in Ljubljana, Slovenia, September 6-8, 2007.
The 33 revised papers presented were carefully reviewed and selected from almost 100 submissions. All current aspects of this interdisciplinary field are addressed; the areas covered include statistics, machine learning, data mining, classification and pattern recognition, clustering, applications, modeling, and interactive dynamic data visualization.
Books:
Recommended Books