The rapidly increasing wealth of genomic data has driven the development of tools to assist in the task of representing and processing information about genes, their products and their functions. One of the most important of these tools is the GeneOntology (GO), which is being developed in tandem with work on a variety of bioinformatics databases. An examination of the structure of GO, however, reveals a number of problems, which we believe can be resolved by taking account (...) of certain organizing principles drawn from philosophical ontology. We shall explore the results of applying such principles to GO with a view to improving GO’s consistency and coherence and thus its future applicability in the automated processing of biological data. (shrink)
The GeneOntology is an important tool for the representation and processing of information about gene products and functions. It provides controlled vocabularies for the designations of cellular components, molecular functions, and biological processes used in the annotation of genes and gene products. These constitute three separate ontologies, of cellular components), molecular functions and biological processes, respectively. The question we address here is: how are the terms in these three separate ontologies related to each other? We (...) use statistical methods and formal ontological principles as a first step towards finding answers to this question. (shrink)
Formal principles governing best practices in classification and definition have for too long been neglected in the construction of biomedical ontologies, in ways which have important negative consequences for data integration and ontology alignment. We argue that the use of such principles in ontology construction can serve as a valuable tool in error-detection and also in supporting reliable manual curation. We argue also that such principles are a prerequisite for the successful application of advanced data integration techniques such (...) as ontology-based multi-database querying, automated ontology alignment and ontology-based text-mining. These theses are illustrated by means of a case study of the GeneOntology, a project of increasing importance within the field of biomedical data integration. (shrink)
The automatic integration of information resources in the life sciences is one of the most challenging goals facing biomedical informatics today. Controlled vocabularies have played an important role in realizing this goal, by making it possible to draw together information from heterogeneous sources secure in the knowledge that the same terms will also represent the same entities on all occasions of use. One of the most impressive achievements in this regard is the GeneOntology (GO), which is rapidly (...) acquiring the status of a de facto standard in the field of gene and gene product annotations, and whose methodology has been much intimated in attempts to develop controlled vocabularies for shared use in different domains of biology. The GO Consortium has recognized, however, that its controlled vocabulary as currently constituted is marked by several problematic features - features which are characteristic of much recent work in bioinformatics and which are destined to raise increasingly serious obstacles to the automatic integration of biomedical information in the future. Here, we survey some of these problematic features, focusing especially on issues of compositionality and syntactic regimentation. (shrink)
The Unified Medical Language System and the GeneOntology are among the most widely used terminology resources in the biomedical domain. However, when we evaluate them in the light of simple principles for wellconstructed ontologies we find a number of characteristic inadequacies. Employing the theory of granular partitions, a new approach to the understanding of ontologies and of the relationships ontologies bear to instances in reality, we provide an application of this theory in relation to an example drawn (...) from the context of the pathophysiology of hypertension. This exercise is designed to demonstrate how, by taking ontological principles into account we can create more realistic biomedical ontologies which will also bring advantages in terms of efficiency and robustness of associated software applications. (shrink)
Increasingly, in data-intensive areas of the life sciences, experimental results are being described in algorithmically useful ways with the help of ontologies. Such ontologies are authored and maintained by scientists to support the retrieval, integration and analysis of their data. The proposition to be defended here is that ontologies of this type – the GeneOntology (GO) being the most conspicuous example – are a part of science. Initial evidence for the truth of this proposition (which some will (...) find self-evident) is the increasing recognition of the importance of empirically-based methods of evaluation to the ontology development work being undertaken in support of scientific research. The ontologies created by scientists must, of course, be associated with implementations satisfying the requirements of software engineering. But these ontologies are not themselves engineering artifacts, and to conceive them as such brings grievous consequences. Rather, we shall argue, ontologies such as the GO are comparable to scientific theories, to scientific databases, or to scientific journal publications. Such a view implies a radically new conception of what is involved in the authoring, maintenance and application of ontologies in scientific contexts, and therewith also a radically new approach to the evaluation of ontologies and to the training of ontologists. (shrink)
We begin at the beginning, with an outline of Aristotle’s views on ontology and with a discussion of the influence of these views on Linnaeus. We move from there to consider the data standardization initiatives launched in the 19th century, and then turn to investigate how the idea of computational ontologies developed in the AI and knowledge representation communities in the closing decades of the 20th century. We show how aspects of this idea, particularly those relating to the use (...) of the term 'concept' in ontology development, influenced SNOMED CT and other medical terminologies. Against this background we then show how the Foundational Model of Anatomy, the GeneOntology, Basic Formal Ontology and other OBO Foundry ontologies came into existence and discuss their role in the development of contemporary biomedical informatics. (shrink)
This paper defends a view of the GeneOntology (GO) and of Basic Formal Ontology (BFO) as examples of what the manufacturing industry calls product-service systems. This means that they are products (the ontologies) bundled with a range of ontology services such as updates, training, help desk, and permanent identifiers. The paper argues that GO and BFO are contrasted in this respect with DOLCE, which approximates more closely to a scientific theory or a scientific publication. The (...) paper provides a detailed overview of ontology services and concludes with a discussion of some implications of the product-service system approach for the understanding of the nature of applied ontology. Ontology developer communities are compared in this respect with developers of scientific theories and of standards (such as W3C). For each of these we can ask: what kinds of products do they develop and what kinds of services do they provide for the users of these products? (shrink)
In recent years, sequencing technologies have enabled the identification of a wide range of non-coding RNAs (ncRNAs). Unfortunately, annotation and integration of ncRNA data has lagged behind their identification. Given the large quantity of information being obtained in this area, there emerges an urgent need to integrate what is being discovered by a broad range of relevant communities. To this end, the Non-Coding RNA Ontology (NCRO) is being developed to provide a systematically structured and precisely defined controlled vocabulary for (...) the domain of ncRNAs, thereby facilitating the discovery, curation, analysis, exchange, and reasoning of data about structures of ncRNAs, their molecular and cellular functions, and their impacts upon phenotypes. The goal of NCRO is to serve as a common resource for annotations of diverse research in a way that will significantly enhance integrative and comparative analysis of the myriad resources currently housed in disparate sources. It is our belief that the NCRO ontology can perform an important role in the comprehensive unification of ncRNA biology and, indeed, fill a critical gap in both the Open Biological and Biomedical Ontologies (OBO) Library and the National Center for Biomedical Ontology (NCBO) BioPortal. Our initial focus is on the ontological representation of small regulatory ncRNAs, which we see as the first step in providing a resource for the annotation of data about all forms of ncRNAs. (shrink)
Two senses of ‘ontology’ can be distinguished in the current literature. First is the sense favored by information scientists, who view ontologies as software implementations designed to capture in some formal way the consensus conceptualization shared by those working on information systems or databases in a given domain. [Gruber 1993] Second is the sense favored by philosophers, who regard ontologies as theories of different types of entities (objects, processes, relations, functions) [Smith 2003]. Where information systems ontologists seek to maximize (...) reasoning efficiency even at the price of simplifications on the side of representation, philosophical ontologists argue that representational adequacy can bring benefits for the stability and resistance to error of an ontological framework and also for its extendibility in the future. In bioinformatics, however, a third sense of ‘ontology’ has established itself, above all as a result of the successes of the GeneOntology (hereafter: GO), which is a tool for the representation and processing of information about gene products and their biological functions [GeneOntology Consortium 2000]. We show how Basic Formal Ontology (BFO) has established itself as an overarching ontology drawing on all three of the strands distinguished above, and describe applications of BFO especially in the treatment of biological granularity. (shrink)
We have begun work on two separate but related ontologies for the study of neurological diseases. The first, the Neurological Disease Ontology (ND), is intended to provide a set of controlled, logically connected classes to describe the range of neurological diseases and their associated signs and symptoms, assessments, diagnoses, and interventions that are encountered in the course of clinical practice. ND is built as an extension of the Ontology for General Medical Sciences — a high-level candidate OBO Foundry (...)ontology that provides a set of general classes that can be used to describe general aspects of medical science. ND is being built with classes utilizing both textual and axiomatized definitions that describe and formalize the relations between instances of other classes within the ontology itself as well as to external ontologies such as the GeneOntology, Cell Ontology, Protein Ontology, and Chemical Entities of Biological Interest. In addition, references to similar or associated terms in external ontologies, vocabularies and terminologies are included when possible. Initial work on ND is focused on the areas of Alzheimer’s and other diseases associated with dementia, multiple sclerosis, and stroke and cerebrovascular disease. Extensions to additional groups of neurological diseases are planned. The second ontology, the Neuro-Psychological Testing Ontology (NPT), is intended to provide a set of classes for the annotation of neuropsychological testing data. The intention of this ontology is to allow for the integration of results from a variety of neuropsychological tests that assay similar measures of cognitive functioning. Neuro-psychological testing is an important component in developing the clinical picture used in the diagnosis of patients with a range of neurological diseases, such as Alzheimer’s disease and multiple sclerosis, and following stroke or traumatic brain injury. NPT is being developed as an extension to the Ontology for Biomedical Investigations. (shrink)
The Protein Ontology (PRO; http://proconsortium.org) formally defines protein entities and explicitly represents their major forms and interrelations. Protein entities represented in PRO corresponding to single amino acid chains are categorized by level of specificity into family, gene, sequence and modification metaclasses, and there is a separate metaclass for protein complexes. All metaclasses also have organism-specific derivatives. PRO complements established sequence databases such as UniProtKB, and interoperates with other biomedical and biological ontologies such as the GeneOntology (...) (GO). PRO relates to UniProtKB in that PRO’s organism-specific classes of proteins encoded by a specific gene correspond to entities documented in UniProtKB entries. PRO relates to the GO in that PRO’s representations of organism-specific protein complexes are subclasses of the organism-agnostic protein complex terms in the GO Cellular Component Ontology. The past few years have seen growth and changes to the PRO, as well as new points of access to the data and new applications of PRO in immunology and proteomics. Here we describe some of these developments. (shrink)
Where humans can manipulate and integrate the information they receive in subtle and ever-changing ways from context to context, computers need structured and context-free background information of a sort which ontologies can help to provide. A domain ontology captures the stable, highly general and commonly accepted core knowledge for an application domain. The domain at issue here is that of the life sciences, in particular molecular biology and bioinformatics. Contemporary life science research includes components drawn from physics, chemistry, mathematics, (...) medicine and many other areas, and all of these dimensions, as well as fundamental philosophical issues, must be taken into account in the construction of a domain ontology. Here we describe the basic features of domain ontologies in the life sciences and show how they can be used. (shrink)
This paper is trying to put together two different researches, from theology and from genetics, about a general and undetermined topic, death. It is undetermined because no one can say something demonstrable and unequivocal about it, since no person alive can cross over the edge of life and come back from the domain of death with information about it. But we can discuss nevertheless things that are obvious and possible to be reasonably inferred about death even by livings. In this (...) regard Theology will provide the mainline of what is to be known as death for religion in general, while Genetics will try to come with its research to sustain or contradict the general premise: death is not an ontological behavior of living matter, but an imposed attribute after the sin occurred into the world. (shrink)
The usual interpretation of Republic 10 takes it as Socrates’ multilevel philosophical demonstration of the untruth and dangerousness of mimesis and its required excision from a well ordered polity. Such readings miss the play of the Platonic mimesis which has within it precisely ordered antistrophes which turn its oft remarked strophes perfectly around. First, this argument, famously concluding to the unreliability of image-makers for producing knowledge begins with two images—the mirror (596e) and the painter. I will show both undercut the (...) argument they introduce. Secondly, Socrates repeats the “three removes” argument three times. Each has its own object and philosophical axis. The “bed” argument (596a-598d) concerns the ontological status of images vis-à-vis human makers and the divine idea. The bit and bridle,” 601b-601d) emphasizes the epistemological status of image-making as beneath the human maker of bridles (having correct opinion) and the human user, who knows. This second takes away the ontological distinction which was the point of the first. Thus, any human being could be in any of the three positions—user, maker, imitator. This dance might bring one to doubt the conclusion that the imitator has “neither knowledge nor right opinion” (602a). Plato leads into his concluding psychological and moral argument (602c-605c) after a third variation of the 3 removes. This last gives us the image of the flute player (601d-602) as the one who knows and so can order the flute maker. We will conclude by considering what it is this flutist could know, and given that his art is itself a mimesis, who the imitator of this imitative artist could be. Thus attention to the differences among these examples opens a defense of the arts, which reverses many of the claims made against the arts within the ostensible “Platonic” argument. (shrink)
The Plant Ontology (PO) (http://www.plantontology.org) (Jaiswal et al., 2005; Avraham et al., 2008) was designed to facilitate cross-database querying and to foster consistent use of plant-specific terminology in annotation. As new data are generated from the ever-expanding list of plant genome projects, the need for a consistent, cross-taxon vocabulary has grown. To meet this need, the PO is being expanded to represent all plants. This is the first ontology designed to encompass anatomical structures as well as growth and (...) developmental stages across such a broad taxonomic range. While other ontologies such as the GeneOntology (GO) (The GeneOntology Consortium, 2010) or Cell Type Ontology (CL) (Bard et al., 2005) cover all living organisms, they are confined to structures at the cellular level and below. The diversity of growth forms and life histories within plants presents a challenge, but also provides unique opportunities to study developmental and evolutionary homology across organisms. (shrink)
Biologists explain organisms’ behavior not only as having been programmed by genes and shaped by natural selection, but also as the result of an organism’s agency: the capacity to react to environmental changes in goal-driven ways. The use of such ‘agential explanations’ reopens old questions about how justified it is to ascribe agency to entities like bacteria or plants that obviously lack rationality and even a nervous system. Is organismic agency genuinely ‘real’ or is it just a useful fiction? In (...) this paper we focus on two questions: whether agential explanations are to be interpreted ontically, and whether they can be reduced to non-agential explanations (thereby dispensing with agency). The Kantian approach we identify interprets agential explanations non-ontically, yet holds agency to be indispensable. Attributing agency to organisms is not to be taken literally in the way we attribute physical properties such as mass or acceleration, but nor is it a mere heuristic or predictive tool. Rather, it is an inevitable consequence of our own rational capacity: as long as we are rational agents ourselves, we cannot avoid seeing agency in organisms. (shrink)
The Plant Ontology (PO; http://www.plantontology.org/) is a publicly-available, collaborative effort to develop and maintain a controlled, structured vocabulary (“ontology”) of terms to describe plant anatomy, morphology and the stages of plant development. The goals of the PO are to link (annotate) gene expression and phenotype data to plant structures and stages of plant development, using the data model adopted by the GeneOntology. From its original design covering only rice, maize and Arabidopsis, the scope of (...) the PO has been expanded to include all green plants. The PO was the first multi-species anatomy ontology developed for the annotation of genes and phenotypes. Also, to our knowledge, it was one of the first biological ontologies that provides translations (via synonyms) in non-English languages such as Japanese and Spanish. There are about 2.2 million annotations linking PO terms to over 110,000 unique data objects representing genes or gene models, proteins, RNAs, germplasm and Quantitative Traits Loci (QTLs) from 22 plant species. In this paper, we focus on the plant anatomical entity branch of the PO, describing the organizing principles, resources available to users, and examples of how the PO is integrated into other plant genomics databases and web portals. We also provide two examples of comparative analyses, demonstrating how the ontology structure and PO-annotated data can be used to discover the patterns of expression of the LEAFY (LFY) and terpene synthase (TPS) gene homologs. (shrink)
Biomedical ontologies are emerging as critical tools in genomic and proteomic research where complex data in disparate resources need to be integrated. A number of ontologies exist that describe the properties that can be attributed to proteins; for example, protein functions are described by GeneOntology, while human diseases are described by Disease Ontology. There is, however, a gap in the current set of ontologies—one that describes the protein entities themselves and their relationships. We have designed a (...) PRotein Ontology (PRO) to facilitate protein annotation and to guide new experiments. The components of PRO extend from the classification of proteins on the basis of evolutionary relationships to the representation of the multiple protein forms of a gene (products generated by genetic variation, alternative splicing, proteolytic cleavage, and other post-translational modification). PRO will allow the specification of relationships between PRO, GO and other OBO Foundry ontologies. Here we describe the initial development of PRO, illustrated using human proteins from the TGF-beta signaling pathway. (shrink)
The Protein Ontology (PRO; http://purl.obolibrary.org/obo/pr) formally defines and describes taxon-specific and taxon-neutral protein-related entities in three major areas: proteins related by evolution; proteins produced from a given gene; and protein-containing complexes. PRO thus serves as a tool for referencing protein entities at any level of specificity. To enhance this ability, and to facilitate the comparison of such entities described in different resources, we developed a standardized representation of proteoforms using UniProtKB as a sequence reference and PSI-MOD as a (...) post-translational modification reference. We illustrate its use in facilitating an alignment between PRO and Reactome protein entities. We also address issues of scalability, describing our first steps into the use of text mining to identify protein-related entities, the large-scale import of proteoform information from expert curated resources, and our ability to dynamically generate PRO terms. Web views for individual terms are now more informative about closely-related terms, including for example an interactive multiple sequence alignment. Finally, we describe recent improvement in semantic utility, with PRO now represented in OWL and as a SPARQL endpoint. These developments will further support the anticipated growth of PRO and facilitate discoverability of and allow aggregation of data relating to protein entities. (shrink)
Successful biomedical data mining and information extraction require a complete picture of biological phenomena such as genes, biological processes, and diseases; as these exist on different levels of granularity. To realize this goal, several freely available heterogeneous databases as well as proprietary structured datasets have to be integrated into a single global customizable scheme. We will present a tool to integrate different biological data sources by mapping them to a proprietary biomedical ontology that has been developed for the purposes (...) of making computers understand medical natural language. (shrink)
The Plant Ontology (PO) is a community resource consisting of standardized terms, definitions, and logical relations describing plant structures and development stages, augmented by a large database of annotations from genomic and phenomic studies. This paper describes the structure of the ontology and the design principles we used in constructing PO terms for plant development stages. It also provides details of the methodology and rationale behind our revision and expansion of the PO to cover development stages for all (...) plants, particularly the land plants (bryophytes through angiosperms). As a case study to illustrate the general approach, we examine variation in gene expression across embryo development stages in Arabidopsis and maize, demonstrating how the PO can be used to compare patterns of expression across stages and in developmentally different species. Although many genes appear to be active throughout embryo development, we identified a small set of uniquely expressed genes for each stage of embryo development and also between the two species. Evaluating the different sets of genes expressed during embryo development in Arabidopsis or maize may inform future studies of the divergent developmental pathways observed in monocotyledonous versus dicotyledonous species. The PO and its annotation databasemake plant data for any species more discoverable and accessible through common formats, thus providing support for applications in plant pathology, image analysis, and comparative development and evolution. (shrink)
The automatic integration of rapidly expanding information resources in the life sciences is one of the most challenging goals facing biomedical research today. Controlled vocabularies, terminologies, and coding systems play an important role in realizing this goal, by making it possible to draw together information from heterogeneous sources – for example pertaining to genes and proteins, drugs and diseases – secure in the knowledge that the same terms will also represent the same entities on all occasions of use. In the (...) naming of genes, proteins, and other molecular structures, considerable efforts are under way to reduce the effects of the different naming conventions which have been spawned by different groups of researchers. Electronic patient records, too, increasingly involve the use of standardized terminologies, and tremendous efforts are currently being devoted to the creation of terminology resources that can meet the needs of a future era of personalized medicine, in which genomic and clinical data can be aligned in such a way that the corresponding information systems become interoperable. (shrink)
The study of biodiversity spans many disciplines and includes data pertaining to species distributions and abundances, genetic sequences, trait measurements, and ecological niches, complemented by information on collection and measurement protocols. A review of the current landscape of metadata standards and ontologies in biodiversity science suggests that existing standards such as the Darwin Core terminology are inadequate for describing biodiversity data in a semantically meaningful and computationally useful way. Existing ontologies, such as the GeneOntology and others in (...) the Open Biological and Biomedical Ontologies (OBO) Foundry library, provide a semantic structure but lack many of the necessary terms to describe biodiversity data in all its dimensions. In this paper, we describe the motivation for and ongoing development of a new Biological Collections Ontology, the Environment Ontology, and the Population and Community Ontology. These ontologies share the aim of improving data aggregation and integration across the biodiversity domain and can be used to describe physical samples and sampling processes (for example, collection, extraction, and preservation techniques), as well as biodiversity observations that involve no physical sampling. Together they encompass studies of: 1) individual organisms, including voucher specimens from ecological studies and museum specimens, 2) bulk or environmental samples (e.g., gut contents, soil, water) that include DNA, other molecules, and potentially many organisms, especially microbes, and 3) survey-based ecological observations. We discuss how these ontologies can be applied to biodiversity use cases that span genetic, organismal, and ecosystem levels of organization. We argue that if adopted as a standard and rigorously applied and enriched by the biodiversity community, these ontologies would significantly reduce barriers to data discovery, integration, and exchange among biodiversity resources and researchers. (shrink)
The Planteome project provides a suite of reference and species-specific ontologies for plants and annotations to genes and phenotypes. Ontologies serve as common standards for semantic integration of a large and growing corpus of plant genomics, phenomics and genetics data. The reference ontologies include the Plant Ontology, Plant Trait Ontology, and the Plant Experimental Conditions Ontology developed by the Planteome project, along with the GeneOntology, Chemical Entities of Biological Interest, Phenotype and Attribute Ontology, (...) and others. The project also provides access to species-specific Crop Ontologies developed by various plant breeding and research communities from around the world. We provide integrated data on plant traits, phenotypes, and gene function and expression from 95 plant taxa, annotated with reference ontology terms. (shrink)
In previous work on biomedical ontologies we showed how the provision of formal definitions for relations such as is_a and part_of can support new types of auto-mated reasoning about biomedical phenomena. We here extend this approach to the transformation_of characteristic of pathologies.
The Common Anatomy Reference Ontology (CARO) is being developed to facilitate interoperability between existing anatomy ontologies for different species, and will provide a template for building new anatomy ontologies. CARO has a structural axis of classification based on the top-level nodes of the Foundational Model of Anatomy. CARO will complement the developmental process sub-ontology of the GO Biological Process ontology, using it to ensure the coherent treatment of developmental stages, and to provide a common framework for the (...) model organism communities to classify developmental structures. Definitions for the types and relationships are being generated by a consortium of investigators from diverse backgrounds to ensure applicability to all organisms. CARO will support the coordination of cross-species ontologies at all levels of anatomical granularity by cross-referencing types within the cell type ontology (CL) and the GeneOntology (GO) Cellular Component ontology. A complete cross-species CARO could be utilized in other ontologies for cross-product generation. (shrink)
Background: Ontologies and taxonomies are among the most important computational resources for molecular biology and bioinformatics. A series of recent papers has shown that the GeneOntology (GO), the most prominent taxonomic resource in these fields, is marked by flaws of certain characteristic types, which flow from a failure to address basic ontological principles. As yet, no methods have been proposed which would allow ontology curators to pinpoint flawed terms or definitions in ontologies in a systematic way. (...) Results: We present computational methods that automatically identify terms and definitions which are defined in a circular or unintelligible way. We further demonstrate the potential of these methods by applying them to isolate a subset of 6001 problematic GO terms. By automatically aligning GO with other ontologies and taxonomies we were able to propose alternative synonyms and definitions for some of these problematic terms. This allows us to demonstrate that these other resources do not contain definitions superior to those supplied by GO. Conclusion: Our methods provide reliable indications of the quality of terms and definitions in ontologies and taxonomies. Further, they are well suited to assist ontology curators in drawing their attention to those terms that are ill-defined. We have further shown the limitations of ontology mapping and alignment in assisting ontology curators in rectifying problems, thus pointing to the need for manual curation. (shrink)
Representing species-specific proteins and protein complexes in ontologies that are both human and machine-readable facilitates the retrieval, analysis, and interpretation of genome-scale data sets. Although existing protin-centric informatics resources provide the biomedical research community with well-curated compendia of protein sequence and structure, these resources lack formal ontological representations of the relationships among the proteins themselves. The Protein Ontology (PRO) Consortium is filling this informatics resource gap by developing ontological representations and relationships among proteins and their variants and modified forms. (...) Because proteins are often functional only as members of stable protein complexes, the PRO Consortium, in collaboration with existing protein and pathway databases, has launched a new initiative to implement logical and consistent representation of protein complexes. We describe here how the PRO Consortium is meeting the challenge of representing species-specific protein complexes, how protein complex representation in PRO supports annotation of protein complexes and comparative biology, and how PRO is being integrated into existing community bioinformatics resources. The PRO resource is accessible at http://pir.georgetown.edu/pro/. (shrink)
The organism is neither a discovery like the circulation of the blood or the glycogenic function of the liver, nor a particular biological theory like epigenesis or preformationism. It is rather a concept which plays a series of roles – sometimes overt, sometimes masked – throughout the history of biology, and frequently in very normative ways, also shifting between the biological and the social. Indeed, it has often been presented as a key-concept in life science and the ‘theorization’ of Life, (...) but conversely has also been the target of influential rejections: as just an instrument of transmission for the selfish gene, but also, historiographically, as part of an outdated ‘vitalism’. Indeed, the organism, perhaps because it is experientially closer to the ‘body’ than to the ‘molecule’, is often the object of quasi-affective theoretical investments presenting it as essential, sometimes even as the pivot of a science or a particular approach to nature, while other approaches reject or attack it with equal force, assimilating it to a mysterious ‘vitalist’ ontology of extra-causal forces, or other pseudo-scientific doctrines. This paper does not seek to adjudicate between these debates, either in terms of scientific validity or historical coherence; nor does it return to the well-studied issue of the organism-mechanism tension in biology. Recent scholarship has begun to focus on the emergence and transformation of the concept of organism, but has not emphasized so much the way in which organism is a shifting, ‘go-between’ concept – invoked as ‘natural’ by some thinkers to justify their metaphysics, but then presented as value-laden by others, over and against the natural world. The organism as go-between concept is also a hybrid, a boundary concept or an epistemic limit case, all of which partly overlap with the idea of ‘nomadic concepts’. Thereby the concept of organism continues to function in different contexts – as a heuristic, an explanatory challenge, a model of order, of regulation, etc. – despite having frequently been pronounced irrelevant and reduced to molecules or genes. Yet this perpetuation is far removed from any ‘metaphysics of organism’, or organismic biology. (shrink)
Ontologies are being developed throughout the biomedical sciences to address standardization, integration, classification and reasoning needs against the background of an increasingly data-driven research paradigm. In particular, ontologies facilitate the translation of basic research into benefits for the patient by making research results more discoverable and by facilitating knowledge transfer across disciplinary boundaries. Addressing and adequately treating mental illness is one of our most pressing public health challenges. Primary research across multiple disciplines such as psychology, psychiatry, biology, neuroscience and pharmacology (...) needs to be integrated in order to promote a more comprehensive understanding of underlying processes and mechanisms, and this need for integration only becomes more pressing with our increase in understanding of differences among individuals and populations at the molecular level concerning susceptibility to specific illnesses. Substance addiction is a particularly relevant public health challenge in the developed world, affecting a substantial percentage of the population, often co-morbid with other illnesses such as mood disorders. Currently, however, there is no straightforward automated method to combine data of relevance to the study of substance addiction across multiple disciplines and populations. In this contribution, we describe a framework of interlinked, interoperable bio-ontologies for the annotation of primary research data relating to substance addiction, and discuss how this framework enables easy integration of results across disciplinary boundaries. We describe entities and relationships relevant for the description of addiction within the Mental Functioning Ontology, Chemical Entities of Biological Interest Ontology, Protein Ontology, GeneOntology and the Neuroscience Information Framework ontologies. (shrink)
The collaboration of Language and Computing nv (L&C) and the Institute for Formal Ontology and Medical Information Science (IFOMIS) is guided by the hypothesis that quality constraints on ontologies for software ap-plication purposes closely parallel the constraints salient to the design of sound philosophical theories. The extent of this parallel has been poorly appreciated in the informatics community, and it turns out that importing the benefits of phi-losophical insight and methodology into application domains yields a variety of improvements. L&C’s (...) LinKBase® is one of the world’s largest medical domain ontologies. Its current primary use pertains to natural language processing ap-plications, but it also supports intelligent navigation through a range of struc-tured medical and bioinformatics information resources, such as SNOMED-CT, Swiss-Prot, and the GeneOntology (GO). In this report we discuss how and why philosophical methods improve both the internal coherence of LinKBase®, and its capacity to serve as a translation hub, improving the interoperability of the ontologies through which it navigates. (shrink)
Research has indicated that microRNAs (miRNAs), a special class of non-coding RNAs (ncRNAs), can perform important roles in different biological and pathological processes. miRNAs’ functions are realized by regulating their respective target genes (targets). It is thus critical to identify and analyze miRNA-target interactions for a better understanding and delineation of miRNAs’ functions. However, conventional knowledge discovery and acquisition methods have many limitations. Fortunately, semantic technologies that are based on domain ontologies can render great assistance in this regard. In our (...) previous investigations, we developed a miRNA domain-specific application ontology, Ontology for MIcroRNA Target (OMIT), to provide the community with common data elements and data exchange standards in the miRNA research. This paper describes (1) our continuing efforts in the OMIT ontology development and (2) the application of the OMIT to enable a semantic approach for knowledge capture of miRNA-target interactions. (shrink)
Vaccine research, as well as the development, testing, clinical trials, and commercial uses of vaccines involve complex processes with various biological data that include gene and protein expression, analysis of molecular and cellular interactions, study of tissue and whole body responses, and extensive epidemiological modeling. Although many data resources are available to meet different aspects of vaccine needs, it remains a challenge how we are to standardize vaccine annotation, integrate data about varied vaccine types and resources, and support advanced (...) vaccine data analysis and inference. To address these problems, the community-based Vaccine Ontology (VO) has been developed through collaboration with vaccine researchers and many national and international centers and programs, including the National Center for Biomedical Ontology (NCBO), the Infectious Disease Ontology (IDO) Initiative, and the Ontology for Biomedical Investigations (OBI). VO utilizes the Basic Formal Ontology (BFO) as the top ontology and the Relation Ontology (RO) for definition of term relationships. VO is represented in the Web Ontology Language (OWL) and edited using the Protégé-OWL. Currently VO contains more than 2000 terms and relationships. VO emphasizes on classification of vaccines and vaccine components, vaccine quality and phenotypes, and host immune response to vaccines. These reflect different aspects of vaccine composition and biology and can thus be used to model individual vaccines. More than 200 licensed vaccines and many vaccine candidates in research or clinical trials have been modeled in VO. VO is being used for vaccine literature mining through collaboration with the National Center for Integrative Biomedical Informatics (NCIBI). Multiple VO applications will be presented. (shrink)
The Emotion Ontology is an ontology covering all aspects of emotional and affective mental functioning. It is being developed following the principles of the OBO Foundry and Ontological Realism. This means that in compiling the ontology, we emphasize the importance of the nature of the entities in reality that the ontology is describing. One of the ways in which realism-based ontologies are being successfully used within biomedical science is in the annotation of scientific research results in (...) publicly available databases. Such annotation enables several objectives, including searching, browsing and cross-database data integration. A key benefit conferred by realismbased ontology is that suitably annotated research results are able to be aggregated and compared in a fashion that is based on the underlying reality that the science is studying. This has the potential of increasing the power of statistical analysis and meta-analysis in data-driven science. This aspect has been fruitfully exploited in the investigation of the functions of genes in molecular biology. Cognitive neuroscience uses functional neuroimaging to investigate the brain correlates of areas of mental functioning such as memory, planning and emotion. The use of functional neuroimaging to study affective phenomena such as the emotions is called ‘affective neuroscience’. BrainMap is the largest curated database of coordinates and metadata for studies in cognitive neuroscience, including affective neuroscience (Laird et al., 2005). BrainMap data is already classified and indexed using a terminology for classification, called the ‘Cognitive Paradigm Ontology’ (CogPO), that has been developed to facilitate searching and browsing. However, CogPO has been developed specifically for the BrainMap database, and the data are thus far not annotated to a realism-based ontology which would allow the discovery of interrelationships between research results across different databases on the basis of what the research is about. In this contribution, we describe ongoing work that aims to annotate affective neuroscience data, starting with the BrainMap database, using the Emotion Ontology. We describe our objectives and technical approach to the annotation, and mention some of the challenges. (shrink)
The Protein Ontology (PRO) is designed as a formal and principled Open Biomedical Ontologies (OBO) Foundry ontology for proteins. The components of PRO extend from a classification of proteins on the basis of evolutionary relationships at the homeomorphic level to the representation of the multiple protein forms of a gene, including those resulting from alternative splicing, cleavage and/or posttranslational modifications. Focusing specifically on the TGF-beta signaling proteins, we describe the building, curation, usage and dissemination of PRO. PRO (...) provides a framework for the formal representation of protein classes and protein forms in the OBO Foundry. It is designed to enable data retrieval and integration and machine reasoning at the molecular level of proteins, thereby facilitating cross-species comparisons, pathway analysis, disease modeling and the generation of new hypotheses. (shrink)
Darwin’s theory of evolution argued that the human race evolved from the same original cell as all other animals. Biological principles such as randomness, adaption and natural selection led to the evolution of different species including the human species. Based on this evolutionary sameness, Donald R. Griffin (1915-2003) challenged the behaviourist claim that animal communication is characterized as merely groans of pain. This paper argues that (1) all animals are embedded in a social system. (2) However, that does not mean (...) that all animals are social animals. (3) That the human social ontology remains to be unique due to a gene-cultural co-evolution. -/- . (shrink)
Recent work on the quality assurance of the GeneOntology (GO, GeneOntology Consortium 2004) from the perspective of both linguistic and ontological organization has made it clear that GO lacks the kind of formalism needed to support logic-based reasoning. At the same time it is no less clear that GO has proven itself to be an excellent terminological resource that can serve to combine together a variety of biomedical database and information systems. Given the strengths (...) of GO, it is worth investigating whether, by overcoming some of its weaknesses from the point of view of formal-ontological principles, we might not be able to enhance a version of GO which can come even closer to serving the needs of the various communities of biomedical researchers and practitioners. It is accepted that clinical and bioinformatics need to find common ground if the results of data-intensive biomedical research are to be harvested to the full. It is also widely accepted that no single method will be sufficient to create the needed common framework. We believe that the principles-based approach to life-science data integration and knowledge representation must be one of the methods applied. Indeed in dealing with the ontological representation of carcinomas, and specifically of colon carcinomas, we have established that, had GO (and related biomedical ontologies) followed some of the basic formal-ontological principles we have identified (Smith et al. 2004, Ceusters et al. 2004), then the effort required to navigate successfully between clinical and bioinformatics systems would have been reduced. We point here to the sources of ontologically-related errors in GO, and also provide arguments as to why and how such errors need to be resolved. (shrink)
While Darwin is commonly supposed to have demonstrated the inapplicability of the Aristotelian ontology of species to biological science, recent developments, especially in the wake of the Human Genome Project, have given rise to a new golden age of classification in which ontological ideas -- as for example in the GeneOntology, the Cell Ontology, the Protein Ontology, and so forth -- are once again playing an important role. In regard to species, on the other (...) hand, matters are more complex. We provide a brief overview of recent proposals concerning the ontology of species, dealing with species as sets, classes and as collections, the views of Ernst Meyr, and mereological views. (shrink)
This essay is a response to Luis M. Augusto’s intriguing paper on the rift between mainstream and formal ontology. I will show that there are in fact two questions at issue here: 1. concerning the links between mainstream and formal approaches within philosophy, and 2. concerning the application of philosophy (and especially philosophical ontology) in support of information-driven research for example in the life sciences.
We discuss the role of perceptron (or threshold) connectives in the context of Description Logic, and in particular their possible use as a bridge between statistical learning of models from data and logical reasoning over knowledge bases. We prove that such connectives can be added to the language of most forms of Description Logic without increasing the complexity of the corresponding inference problem. We show, with a practical example over the GeneOntology, how even simple instances of perceptron (...) connectives are expressive enough to represent learned, complex concepts derived from real use cases. This opens up the possibility to import concepts learnt from data into existing ontologies. (shrink)
that can serve as a foundation for more refined ontologies in the field of proteomics. Standard data sources classify proteins in terms of just one or two specific aspects. Thus SCOP (Structural Classification of Proteins) is described as classifying proteins on the basis of structural features; SWISSPROT annotates proteins on the basis of their structure and of parameters like post-translational modifications. Such data sources are connected to each other by pairwise term-to-term mappings. However, there are obstacles which stand in the (...) way of combining them together to form a robust meta-classification of the needed sort. We discuss some formal ontological principles which should be taken into account within the existing datasources in order to make such a metaclassification possible, taking into account also the GeneOntology (GO) and its application to the annotation of proteins. (shrink)
Throughout the biological and biomedical sciences there is a growing need for, prescriptive ‘minimum information’ (MI) checklists specifying the key information to include when reporting experimental results are beginning to find favor with experimentalists, analysts, publishers and funders alike. Such checklists aim to ensure that methods, data, analyses and results are described to a level sufficient to support the unambiguous interpretation, sophisticated search, reanalysis and experimental corroboration and reuse of data sets, facilitating the extraction of maximum value from data sets (...) them. However, such ‘minimum information’ MI checklists are usually developed independently by groups working within representatives of particular biologically- or technologically-delineated domains. Consequently, an overview of the full range of checklists can be difficult to establish without intensive searching, and even tracking thetheir individual evolution of single checklists may be a non-trivial exercise. Checklists are also inevitably partially redundant when measured one against another, and where they overlap is far from straightforward. Furthermore, conflicts in scope and arbitrary decisions on wording and sub-structuring make integration difficult. This presents inhibit their use in combination. Overall, these issues present significant difficulties for the users of checklists, especially those in areas such as systems biology, who routinely combine information from multiple biological domains and technology platforms. To address all of the above, we present MIBBI (Minimum Information for Biological and Biomedical Investigations); a web-based communal resource for such checklists, designed to act as a ‘one-stop shop’ for those exploring the range of extant checklist projects, and to foster collaborative, integrative development and ultimately promote gradual integration of checklists. (shrink)
A theoretical framework is provided to explore teleonomy as a problem of self-causation, distinct from upward, downward and reticulate causation. Causality theories in biology are often formulated within hierarchy theories, where causation is conceptualized as running up or down the rungs of a ladder-like hierarchy or, more recently, as moving between multiple hierarchies. Research on the genealogy of cosmologies demonstrates that in addition to hierarchy theories, causality theories also depend upon ideas of time. This paper explores the roots and impact (...) of both time and hierarchy thinking on causal reasoning in the evolutionary sciences. Within evolutionary biology, the Neodarwinian synthesis adheres to a linear notion of time associated with linear hierarchies that portray upward causation. Eco-evo-devo schools recognize the importance of downward causation and consequently receive resistance from the standard view because downward causation is sometimes understood as backward causation, considered impossible by adherents of a linear time model. In contrast, downward causation works with a spatial or presential time notion. Hybridization, lateral gene transfer, infective heredity, symbiosis and symbiogenesis require recognition of reticulate causation occurring in both space and time, or spacetime, between distinct and interacting ontological hierarchies. Teleonomy is distinct from these types of causation because it invokes the problem of self-causation. By asking how the focal level in a hierarchy can persist through time, self-causation raises philosophical concerns on the nature of duration, identity and individuality. (shrink)
Ever since Darwin a great deal of the conceptual history of biology may be read as a struggle between two philosophical positions: reductionism and holism. On the one hand, we have the reductionist claim that evolution has to be understood in terms of changes at the fundamental causal level of the gene. As Richard Dawkins famously put it, organisms are just ‘lumbering robots’ in the service of their genetic masters. On the other hand, there is a long holistic tradition (...) that focuses on the complexity of developmental systems, on the non-linearity of gene– environment interactions, and on multi-level selective processes to argue that the full story of biology is a bit more complicated than that. Reductionism can marshal on its behalf the spectacular successes of genetics and molecular biology throughout the 20th and 21st centuries. Holism has built on the development of entirely new disciplines and conceptual frameworks over the past few decades, including evo-devo and phenotypic plasticity. Yet, a number of biologists are still actively looking for a way out of the reductionism–holism counterposition, often mentioning the word ‘emergence’ as a way to deal with the conundrum. This paper briefly examines the philosophical history of the concept of emergence, distinguishes between epistemic and ontological accounts of it, and comments on conceptions of emergence that can actually be useful for practising evolutionary biologists. (shrink)
Vitalism was long viewed as the most grotesque view in biological theory: appeals to a mysterious life-force, Romantic insistence on the autonomy of life, or worse, a metaphysics of an entirely living universe. In the early twentieth century, attempts were made to present a revised, lighter version that was not weighted down by revisionary metaphysics: “organicism”. And mainstream philosophers of science criticized Driesch and Bergson’s “neovitalism” as a too-strong ontological commitment to the existence of certain entities or “forces”, over and (...) above the system of causal relations studied by mechanistic science, rejecting the weaker form, organicism, as well. But there has been some significant scholarly “push-back” against this orthodox attitude, notably pointing to the 18th-century Montpellier vitalists to show that there are different historical forms of vitalism, including how they relate to mainstream scientific practice. Additionally, some trends in recent biology that run counter to genetic reductionism and the informational model of the gene present themselves as organicist. Here, we examine some cases of vitalism in the twentieth century and today, not just as a historical form but as a significant metaphysical and scientific model. We argue for vitalism’s conceptual originality without either reducing it to mainstream models of science or presenting it as an alternate model of science, by focusing on historical forms of vitalism, logical empiricist critiques thereof and the impact of synthetic biology on current theorizing of vitalism. (shrink)
I start with some famous comments by the philosopher (psychologist) Ludwig Wittgenstein because Pinker shares with most people (due to the default settings of our evolved innate psychology) certain prejudices about the functioning of the mind and because Wittgenstein offers unique and profound insights into the workings of language, thought and reality (which he viewed as more or less coextensive) not found anywhere else. The last quote is the only reference Pinker makes to Wittgenstein in this volume, which is most (...) unfortunate considering that he was one of the most brilliant and original analysts of language. -/- In the last chapter, using the famous metaphor of Plato’s cave, he beautifully summarizes the book with an overview of how the mind (language, thought, intentional psychology) –a product of blind selfishness, moderated only slightly by automated altruism for close relatives carrying copies of our genes--works automatically, but tries to end on an upbeat note by giving us hope that we can nevertheless employ its vast capabilities to cooperate and make the world a decent place to live. -/- Pinker is certainly aware of but says little about the fact that far more about our psychology is left out than included. Among windows into human nature that are left out or given minimal attention are math and geometry, music and sounds, images, events and causality, ontology (classes of things), dispositions (believing, thinking, judging, intending etc) and the rest of intentional psychology of action, neurotransmitters and entheogens, spiritual states (e.g, satori and enlightenment, brain stimulation and recording, brain damage and behavioral deficits and disorders, games and sports, decision theory (incl. game theory and behavioral economics), animal behavior (very little language but a billion years of shared genetics). Many books have been written about each of these areas of intentional psychology. The data in this book are descriptions, not explanations that show why our brains do it this way or how it is done. How do we know to use the sentences in their various way (i.e., know all their meanings)? This is evolutionary psychology that operates at a more basic level –the level where Wittgenstein is most active. And there is scant attention to context. -/- Nevertheless this is a classic work and with these cautions is still well worth reading. -/- Those wishing a comprehensive up to date framework for human behavior from the modern two systems view may consult my book ‘The Logical Structure of Philosophy, Psychology, Mind and Language in Ludwig Wittgenstein and John Searle’ 2nd ed (2019). Those interested in more of my writings may see ‘Talking Monkeys--Philosophy, Psychology, Science, Religion and Politics on a Doomed Planet--Articles and Reviews 2006-2019 3rd ed (2019), The Logical Structure of Human Behavior (2019), and Suicidal Utopian Delusions in the 21st Century 4th ed (2019). (shrink)
Memetics is a research approach which applies evolutionary ideas and terminology to cultural phenomena. The core idea of memetics is the existence of the units of cultural evolution which are attributed autonomous replicating goals. Of course, such a controversial concept has gained many devoted adherents as well as its determined opponents. The paper discusses the theoretical difficulties of memetics. The first part discusses the analogy of genes and memes. The theme of the second part is the ontology of a (...) cultural replicator. Finally, the third part presents the main issues related to manners of transmission and the adaptation of memes. In short, the aim of the paper is an evaluation of the explanatory potential of memetics. The explanatory potential of memetics will be evaluated in terms of its intrinsic consistency, the degree of its confirmation, the falsifiability of the theses which were elaborated on its basis and the heuristic value of this approach. (shrink)
As the international genomic research community moves from the tool-making efforts of the Human Genome Project into biomedical applications of those tools, new metaphors are being suggested as useful to understanding how our genes work – and for understanding who we are as biological organisms. In this essay we focus on the Human Microbiome Project as one such translational initiative. The HMP is a new ‘metagenomic’ research effort to sequence the genomes of human microbiological flora, in order to pursue the (...) interesting hypothesis that our ‘microbiome’ plays a vital and interactive role with our human genome in normal human physiology. Rather than describing the human genome as the ‘blueprint’ for human nature, the promoters of the HMP stress the ways in which our primate lineage DNA is interdependent with the genomes of our microbiological flora. They argue that the human body should be understood as an ecosystem with multiple ecological niches and habitats in which a variety of cellular species collaborate and compete, and that human beings should be understood as ‘superorganisms’ that incorporate multiple symbiotic cell species into a single individual with very blurry boundaries. These metaphors carry interesting philosophical messages, but their inspiration is not entirely ideological. Instead, part of their cachet within genome science stems from the ways in which they are rooted in genomic research techniques, in what philosophers of science have called a ‘tools-to-theory’ heuristic. Their emergence within genome science illustrates the complexity of conceptual change in translational research, by showing how it reflects both aspirational and methodological influences. (shrink)
The theory of evolution of complex and comprising of human systems and algorithm for its constructing are the synthesis of evolutionary epistemology, philosophical anthropology and concrete scientific empirical basis in modern (transdisciplinary) science. «Trans-disciplinary» in the context is interpreted as a completely new epistemological situation, which is fraught with the initiation of a civilizational crisis. Philosophy and ideology of technogenic civilization is based on the possibility of unambiguous demarcation of public value and descriptive scientific discourses (1), and the object and (...) subject of the cognitive process (2). Both of these attributes are no longer valid. For mass, everyday consciousness and institutional philosophical tradition it is intuitively obvious that having the ability to control the evolutionary process, Homo sapiens came close to the borders of their own biological and cultural identity. The spontaneous coevolutionary process of interaction between the «subject» (rational living organisms) and the «object» (material world), is the teleological trend of the movement towards the complete rationalization of the World as It Is, its merger with the World of Due. The stratification of the global evolutionary process into selective and semantic (teleological) coevolutionary and therefore ontologically inseparable components follows. With the entry of anthropogenic civilization into the stage of the information society, firsty, the post-academic phase of the historical evolution of scientific rationality began, the attributes of which are the specific methodology of scientific knowledge, scientific ethos and ontology. Bioethics as a phenomenon of intellectual culture represents a natural philosophical core of modern post- academic (human-dimensional) science, in which the ethical neutrality of scientific theory principle is inapplicable, and elements of public-axiological and scientific-descriptive discourses are integrated into a single logic construction. As result, hermeneutics precedes epistemology not only methodologically, but also meaningfully, and natural philosophy is regaining the status of the backbone of the theory of evolution – in an explicit form. (shrink)
Create an account to enable off-campus access through your institution's proxy server.
Monitor this page
Be alerted of all new items appearing on this page. Choose how you want to monitor it:
Email
RSS feed
About us
Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.