Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0464 |
Symbol | |
ID | 7979441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 514847 |
End bp | 517810 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644797441 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002948641 |
Protein GI | 239826017 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAC AATTGATAAC GATTACAATA AACAGAAGCG ATTACAGCGC AAAACAAGGA ACGACGATTT TGGAAGTCAT TAATGAAAAC AACATCCCAC ATCCGCAAGT TTGCTATACT CCTGAGCTTG GAGCCATTCA AACGTGCGAC ACATGCATTG TTGAAGTGGA TGGGAAACTG ATGCGCGCCT GTTCTACTCC CGTTAAGGAC GGCATGAACA TCGAGCTTTC TTCCGAACGG GCGAAAACGG CGCAAAAAGA AGCGATGGAC CGCATCCTAG AAAACCACCT ATTATATTGT ACTGTCTGTG ATAACAATAA CGGAAACTGT AAATTGCACA ATACGGTGGA ACTGATGGGA ATTGAGCATC AATCGTATCC ATACCGCCCG AAAGTAGATC CATCGGAAGT CGACATGTCC CATCCATTCT ACCGCTATGA TCCAAATCAA TGCATCGCCT GTGGGCAATG CGTAGAAGTA TGTCAAAATT TACAAGTAAA CGAAACATTA TCTATTGACT GGGAAGCGGA ACGCCCTCGT GTCATTTGGG ATAACGGTGT TCCGATTAAC GAATCTTCCT GTGTCAGTTG CGGACAATGC GTTACCGTTT GCCCATGTAA TGCATTAATG GAAAAATCGA TGTTAGGCGA GGCCGGCTTT ATGACCGGTT TAGATAAAGA AATTTTAAAT CCAATGATCG ATTTTGTAAA GGAAGTCGAA CCTGGCTACA GCAGTATTAT TGCCATTTCG GAAATCGAAG CGGCGATGCG CAAACAGCGC ATCAAAAAGA CGAAAACGGT TTGTACGTTC TGCGGCGTCG GCTGTTCGTT TGAAGTATGG ACAAAAGGAC GTAAAATTTT AAAAATCCAG CCTGTCTCTG AAGCACCAGT CAACGCCATT TCCACATGTG TAAAAGGAAA ATTCGGTTGG GATTTCGTCA ACAGCGAAGA ACGCCTGACC AAACCGCTTA TCCGCAAAGG TGACGTGTTT GTCGAATCCA CATGGGAAGA AGCGCTTTCG CTTGTTGCAG AAAAACTTGG CGAAATCAAA CGGAAATACG GCGGCAACGC CATCGGCTTT ATTTCATCGT CGAAAACATC CAATGAAGAA AATTATTTAA TGCAAAAACT TGCGCGGCAA GTGTTTGAAA CGAACAACGT CGACAACTGT TCGCGCTACT GCCAGTCGCC GGCAACAGAC GGATTGTTCC GCACAGTAGG AATGGGCGGC GATTCCGGAA CGATTCATGA TATCGCATCC GCGGGATTAG TGATTATTAT CGGCGCCAAT CCGGCGGAAG GACATCCGGT TTTGGCGACT CGCGTCAAAC GTGCCCATAA ACTATTTGGT CAAAAATTAA TTGTTGCTGA CTTGCGCAAA AATGAAATGG CGGAACGCGC CGATTTATTT ATCCGACCAA AACAGGGAAC GGATCAAGTA TGGTTGATGG CCGTCACGAA ATATATCATT GACCAAGGCT GGCATGACGA AGCGTTCATT CGCGAACGCG TTCATTTCTT CGATGAATTC AAAGAAGTGC TCGAAAAATA TACGCTCGAT TACGCGGAAG AAGTAACAGG CATTGCGAAA GCAGACTTAA TCCGCATCGC TGAAATGATC CACAAAGCAG ATGGAACATG TATCCTCTGG GGAATGGGCG TTACGCAAAA CACGGGAGGA AGCGATACAT CAGCGGCCAT CTCGAACTTA TTGCTCGCAA CAGGCAACTA TGGACGCCCA GGCGCTGGAG CGTTCCCGCT CCGCGGCCAT AACAACGTGC AAGGTGCTTG TGATATGGGG TCCCTTCCTT CCTGGCTTCC AGGATACCAG CATATTACGG ATGATATCGC GCGGGCAAAA TTTGAAAAAG CGTACGGTGT CCGCATCGAT GGAAAACCTG GTCTTGATAA CATCCAAATG ATCGAAGCAG CCGAACAAGG AAAATTAAAG GCAATGTATA TCGTCGGCGA AGATATGGCA CTTGTGGACT GCAACGCCAA TCATGTCCAA AAAGTGCTAT CAGAGTTAGA CTTCTTGGTA GTACAGGACA TTTTCTTATC GAAAACAGCG CAATTTGCTG ACGTTGTTTT GCCAGCAGCT CCAAGCTTAG AAAAAGAAGG AACGTTTACC AACACAGAGC GGCGCATTCA ACGTTTTTAC CAAGCGCTTG AACCGCTCGG CGATTCAAAA CCGGACTGGT GGATCATTCA AGAAATCGCC AAACGGATGG GTGCTGATTG GAACTACGCC GGACCGAAAG AAATTATGGA CGAAATCGCA AGTCTTGCTC CGCTTTTTTC ACAAGCGCAT TACGAAAACT TGGAAGGCTG GAAAAGCCTT TGCTGGGGCA GCTACGACGG TGCAGATACG CCGATTCTTT ATAAAGAACG CTTCAACTTC CCGGATGGAA AAGCGCGCTT TGCACTCGCC GACTGGGTAC AGCCGGCAGA ATATCCGGAA GAATACGATT TGCTTGTGAA CAACGGACGA TTGCTCGAAC ATTTCCATGA AGGAAACTTA ACATACAAAT CAGAAGGCAT TCAAAGAAAA TTCCCTGAAA TATTCGTTGA AGTATCTCCG GAATTAGCGA AAGAACGCGG CATCAAAGAC GGCTCGCTCG TTCGTTTAGA ATCACCATTC GGCAGAGTAA AAGTGCGCGT ACTCGTCACT GACCGTGTCA AAGGAAAAGA ATTATTCTTG CCAATGCACT CGGCAACAAA CGAAAGCGCC ATTAACATAC TTACCGGTCC TGCCACTGAC CATCGCACGA ATACACCGGC GTTTAAACAA GCAAGAGTGC GCATGCAAGT GTTGGAAGTC GACGGAGAAT CTCCGCTCCC GCGCACCAAT CCGCGATTTA AAAAGCGGAA TCCAAAAAGA GGCGTCGAAG TCGATCGAAA ATGGAAGCGT CCGGATTACG TTCCTTTAAC GGATGAATGG AAGGAGGCAC AGCCGCGTGG CTAA
|
Protein sequence | MSEQLITITI NRSDYSAKQG TTILEVINEN NIPHPQVCYT PELGAIQTCD TCIVEVDGKL MRACSTPVKD GMNIELSSER AKTAQKEAMD RILENHLLYC TVCDNNNGNC KLHNTVELMG IEHQSYPYRP KVDPSEVDMS HPFYRYDPNQ CIACGQCVEV CQNLQVNETL SIDWEAERPR VIWDNGVPIN ESSCVSCGQC VTVCPCNALM EKSMLGEAGF MTGLDKEILN PMIDFVKEVE PGYSSIIAIS EIEAAMRKQR IKKTKTVCTF CGVGCSFEVW TKGRKILKIQ PVSEAPVNAI STCVKGKFGW DFVNSEERLT KPLIRKGDVF VESTWEEALS LVAEKLGEIK RKYGGNAIGF ISSSKTSNEE NYLMQKLARQ VFETNNVDNC SRYCQSPATD GLFRTVGMGG DSGTIHDIAS AGLVIIIGAN PAEGHPVLAT RVKRAHKLFG QKLIVADLRK NEMAERADLF IRPKQGTDQV WLMAVTKYII DQGWHDEAFI RERVHFFDEF KEVLEKYTLD YAEEVTGIAK ADLIRIAEMI HKADGTCILW GMGVTQNTGG SDTSAAISNL LLATGNYGRP GAGAFPLRGH NNVQGACDMG SLPSWLPGYQ HITDDIARAK FEKAYGVRID GKPGLDNIQM IEAAEQGKLK AMYIVGEDMA LVDCNANHVQ KVLSELDFLV VQDIFLSKTA QFADVVLPAA PSLEKEGTFT NTERRIQRFY QALEPLGDSK PDWWIIQEIA KRMGADWNYA GPKEIMDEIA SLAPLFSQAH YENLEGWKSL CWGSYDGADT PILYKERFNF PDGKARFALA DWVQPAEYPE EYDLLVNNGR LLEHFHEGNL TYKSEGIQRK FPEIFVEVSP ELAKERGIKD GSLVRLESPF GRVKVRVLVT DRVKGKELFL PMHSATNESA INILTGPATD HRTNTPAFKQ ARVRMQVLEV DGESPLPRTN PRFKKRNPKR GVEVDRKWKR PDYVPLTDEW KEAQPRG
|
| |