Bioinformatic Harvester
You don't need to be Editor-In-Chief to add or edit content to WikiDoc. You can begin to add to or edit text on this WikiDoc page by clicking on the edit button at the top of this page. Next enter or edit the information that you would like to appear here. Once you are done editing, scroll down and click the Save page button at the bottom of the page.
The Bioinformatic-Harvester is a bioinformatic meta search engine at KIT Karlsruhe Institute of Technology for genes and protein-associated information. Harvester currently works for human, mouse, rat, zebrafish, drosophila and arabidopsis thaliana based information. Harvester cross-links >28 popular bioinformatic resources and allows cross searches. A ranking system similar to Google pagerank sorts the search results and displays the more relevant information. Harvester serves 10.000s of pages every day to scientists and physicians.
Contents |
How Harvester works
Harvester collects information from protein and gene databases along with information from so called "prediction servers." Prediction server e.g provide online sequence analysis for a single protein. Harvesters search index is based on the IPI and UniProt protein information collection. The collections consists of:
- ~68.000 human, ~53.000 mouse, ~42.000 rat, ~51.000 zebrafish, ~35.000 arabidopsis and ~33.000 drosophila protein pages, which are curated and updated on a regular basis.
Harvester collects several types of information
Text based information
...from the following databases:
- UniProt, world largest protein database
- SOURCE, convenient gene information overview
- Simple Modular Architecture Research Tool (SMART),
- SOSUI, predicts transmembrane domains
- PSORT, predicts protein localisation
- Homologene, compares proteins from different species
- gfp-cdna, protein localisation with fluorescence microscopy
- International Protein Index (IPI).
Databases rich in graphical elements
...are not collected, but crosslinked via iframes. Iframes are transparent windows within a HTML pages. The iframe windows allows up-to-date viewing of the "iframed," linked databases. Several such iframes are combined on a Harvester protein page. This method allows convenient comparison of information from several databases.
- NCBI-BLAST, an algorithm for comparing biological sequences NCBI.
- Ensembl, automatic gene annotation. EMBL-EBI and Sanger-Institute
- FlyBase is a database of model organism Drosophila melanogaster.
- GoPubMed is a knowledge-based search engine for biomedical texts.
- iHOP, information hyperlinked over proteins via gene/protein synonyms
- Mendelian_Inheritance_in_Man project catalogues all the known diseases.
- RZPD, German resources Center for genome research in Berlin/Heidelberg.
- STRING, Search Tool for the Retrieval of Interacting Genes/Proteins EMBL.
- Zebrafish_Information_Network.
- LOCATE subcellular localization database (mouse).
"linkouts"
- Genome_browser, working draft assemblies for genomes UCSC
- Google_Scholar
- Mitocheck
- PolyMeta, meta search engine for Google, Yahoo, MSN, Ask, Exalead, AllTheWeb, GigaBlast
What one can find
Harvester allows a combination of different search terms and single words.
Search Examples:
- Gene-name: "golga3"
- Gene-alias: "ADAP-S ADAS ADHAPS ADPS" (one gene name is sufficient)
- Gene-Ontologies: "Enzyme linked receptor protein signaling pathway"
- Unigene-Cluster: "Hs.449360"
- Go-annotation: "intra-Golgi transport"
- Molecular function: "protein kinase binding"
- Protein: "Q9NPD3"
- Protein domain: "SH2 sar"
- Protein Localisation: "endoplasmic reticulum"
- Chromosome: "2q31"
- Disease relevant: use the word "diseaselink"
- Combinations: "golgi diseaselink" (finds all golgi proteins associated with a disease)
- mRNA: "AL136897"
- Word: "Cancer"
- Comment: "highly expressed in heart"
- Author: "Merkel, Schmidt"
- Publication or project: "cDNA sequencing project"
| Databases supported by Bioinformatic Harvester |
| NCBI-BLAST | CDD | Ensembl | Entrez | Flybase | Flymine | GFP-cDNA | Genome_browser | GeneCard | Google_Scholar | GoPubMed | HomoloGene | iHOP | IPI | OMIM | Mitocheck | PSORT | PolyMeta | UniProt | SOURCE | SOSUI | RZPD | Sciencenet | STRING | SMART | ZFIN | |
Acknowledgement and Attribution Regarding Sources of Content
Some of the initial content on this page may be incorporated in part from copyleft sources in the public domain including wikis such as Wikipedia and AskDrWiki. Drug information for patients came from the The National Library of Medicine. Infectious disease information may have come from the Centers for Disease Control (CDC). Differential Diagnoses are drawn from clinicians as well as an amalgamation of 3 sources: 1.The Disease Database; 2. Kahan, Scott, Smith, Ellen G. In A Page: Signs and Symptoms. Malden, Massachusetts: Blackwell Publishing, 2004:3; 3. Sailer, Christian, Wasner, Susanne. Differential Diagnosis Pocket. Hermosa Beach, CA: Borm Bruckmeir Publishing LLC, 2002:7 .

