Gene GWCH70_0464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0464 
Symbol 
ID7979441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp514847 
End bp517810 
Gene Length2964 bp 
Protein Length987 aa 
Translation table11 
GC content47% 
IMG OID644797441 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_002948641 
Protein GI239826017 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAC AATTGATAAC GATTACAATA AACAGAAGCG ATTACAGCGC AAAACAAGGA 
ACGACGATTT TGGAAGTCAT TAATGAAAAC AACATCCCAC ATCCGCAAGT TTGCTATACT
CCTGAGCTTG GAGCCATTCA AACGTGCGAC ACATGCATTG TTGAAGTGGA TGGGAAACTG
ATGCGCGCCT GTTCTACTCC CGTTAAGGAC GGCATGAACA TCGAGCTTTC TTCCGAACGG
GCGAAAACGG CGCAAAAAGA AGCGATGGAC CGCATCCTAG AAAACCACCT ATTATATTGT
ACTGTCTGTG ATAACAATAA CGGAAACTGT AAATTGCACA ATACGGTGGA ACTGATGGGA
ATTGAGCATC AATCGTATCC ATACCGCCCG AAAGTAGATC CATCGGAAGT CGACATGTCC
CATCCATTCT ACCGCTATGA TCCAAATCAA TGCATCGCCT GTGGGCAATG CGTAGAAGTA
TGTCAAAATT TACAAGTAAA CGAAACATTA TCTATTGACT GGGAAGCGGA ACGCCCTCGT
GTCATTTGGG ATAACGGTGT TCCGATTAAC GAATCTTCCT GTGTCAGTTG CGGACAATGC
GTTACCGTTT GCCCATGTAA TGCATTAATG GAAAAATCGA TGTTAGGCGA GGCCGGCTTT
ATGACCGGTT TAGATAAAGA AATTTTAAAT CCAATGATCG ATTTTGTAAA GGAAGTCGAA
CCTGGCTACA GCAGTATTAT TGCCATTTCG GAAATCGAAG CGGCGATGCG CAAACAGCGC
ATCAAAAAGA CGAAAACGGT TTGTACGTTC TGCGGCGTCG GCTGTTCGTT TGAAGTATGG
ACAAAAGGAC GTAAAATTTT AAAAATCCAG CCTGTCTCTG AAGCACCAGT CAACGCCATT
TCCACATGTG TAAAAGGAAA ATTCGGTTGG GATTTCGTCA ACAGCGAAGA ACGCCTGACC
AAACCGCTTA TCCGCAAAGG TGACGTGTTT GTCGAATCCA CATGGGAAGA AGCGCTTTCG
CTTGTTGCAG AAAAACTTGG CGAAATCAAA CGGAAATACG GCGGCAACGC CATCGGCTTT
ATTTCATCGT CGAAAACATC CAATGAAGAA AATTATTTAA TGCAAAAACT TGCGCGGCAA
GTGTTTGAAA CGAACAACGT CGACAACTGT TCGCGCTACT GCCAGTCGCC GGCAACAGAC
GGATTGTTCC GCACAGTAGG AATGGGCGGC GATTCCGGAA CGATTCATGA TATCGCATCC
GCGGGATTAG TGATTATTAT CGGCGCCAAT CCGGCGGAAG GACATCCGGT TTTGGCGACT
CGCGTCAAAC GTGCCCATAA ACTATTTGGT CAAAAATTAA TTGTTGCTGA CTTGCGCAAA
AATGAAATGG CGGAACGCGC CGATTTATTT ATCCGACCAA AACAGGGAAC GGATCAAGTA
TGGTTGATGG CCGTCACGAA ATATATCATT GACCAAGGCT GGCATGACGA AGCGTTCATT
CGCGAACGCG TTCATTTCTT CGATGAATTC AAAGAAGTGC TCGAAAAATA TACGCTCGAT
TACGCGGAAG AAGTAACAGG CATTGCGAAA GCAGACTTAA TCCGCATCGC TGAAATGATC
CACAAAGCAG ATGGAACATG TATCCTCTGG GGAATGGGCG TTACGCAAAA CACGGGAGGA
AGCGATACAT CAGCGGCCAT CTCGAACTTA TTGCTCGCAA CAGGCAACTA TGGACGCCCA
GGCGCTGGAG CGTTCCCGCT CCGCGGCCAT AACAACGTGC AAGGTGCTTG TGATATGGGG
TCCCTTCCTT CCTGGCTTCC AGGATACCAG CATATTACGG ATGATATCGC GCGGGCAAAA
TTTGAAAAAG CGTACGGTGT CCGCATCGAT GGAAAACCTG GTCTTGATAA CATCCAAATG
ATCGAAGCAG CCGAACAAGG AAAATTAAAG GCAATGTATA TCGTCGGCGA AGATATGGCA
CTTGTGGACT GCAACGCCAA TCATGTCCAA AAAGTGCTAT CAGAGTTAGA CTTCTTGGTA
GTACAGGACA TTTTCTTATC GAAAACAGCG CAATTTGCTG ACGTTGTTTT GCCAGCAGCT
CCAAGCTTAG AAAAAGAAGG AACGTTTACC AACACAGAGC GGCGCATTCA ACGTTTTTAC
CAAGCGCTTG AACCGCTCGG CGATTCAAAA CCGGACTGGT GGATCATTCA AGAAATCGCC
AAACGGATGG GTGCTGATTG GAACTACGCC GGACCGAAAG AAATTATGGA CGAAATCGCA
AGTCTTGCTC CGCTTTTTTC ACAAGCGCAT TACGAAAACT TGGAAGGCTG GAAAAGCCTT
TGCTGGGGCA GCTACGACGG TGCAGATACG CCGATTCTTT ATAAAGAACG CTTCAACTTC
CCGGATGGAA AAGCGCGCTT TGCACTCGCC GACTGGGTAC AGCCGGCAGA ATATCCGGAA
GAATACGATT TGCTTGTGAA CAACGGACGA TTGCTCGAAC ATTTCCATGA AGGAAACTTA
ACATACAAAT CAGAAGGCAT TCAAAGAAAA TTCCCTGAAA TATTCGTTGA AGTATCTCCG
GAATTAGCGA AAGAACGCGG CATCAAAGAC GGCTCGCTCG TTCGTTTAGA ATCACCATTC
GGCAGAGTAA AAGTGCGCGT ACTCGTCACT GACCGTGTCA AAGGAAAAGA ATTATTCTTG
CCAATGCACT CGGCAACAAA CGAAAGCGCC ATTAACATAC TTACCGGTCC TGCCACTGAC
CATCGCACGA ATACACCGGC GTTTAAACAA GCAAGAGTGC GCATGCAAGT GTTGGAAGTC
GACGGAGAAT CTCCGCTCCC GCGCACCAAT CCGCGATTTA AAAAGCGGAA TCCAAAAAGA
GGCGTCGAAG TCGATCGAAA ATGGAAGCGT CCGGATTACG TTCCTTTAAC GGATGAATGG
AAGGAGGCAC AGCCGCGTGG CTAA
 
Protein sequence
MSEQLITITI NRSDYSAKQG TTILEVINEN NIPHPQVCYT PELGAIQTCD TCIVEVDGKL 
MRACSTPVKD GMNIELSSER AKTAQKEAMD RILENHLLYC TVCDNNNGNC KLHNTVELMG
IEHQSYPYRP KVDPSEVDMS HPFYRYDPNQ CIACGQCVEV CQNLQVNETL SIDWEAERPR
VIWDNGVPIN ESSCVSCGQC VTVCPCNALM EKSMLGEAGF MTGLDKEILN PMIDFVKEVE
PGYSSIIAIS EIEAAMRKQR IKKTKTVCTF CGVGCSFEVW TKGRKILKIQ PVSEAPVNAI
STCVKGKFGW DFVNSEERLT KPLIRKGDVF VESTWEEALS LVAEKLGEIK RKYGGNAIGF
ISSSKTSNEE NYLMQKLARQ VFETNNVDNC SRYCQSPATD GLFRTVGMGG DSGTIHDIAS
AGLVIIIGAN PAEGHPVLAT RVKRAHKLFG QKLIVADLRK NEMAERADLF IRPKQGTDQV
WLMAVTKYII DQGWHDEAFI RERVHFFDEF KEVLEKYTLD YAEEVTGIAK ADLIRIAEMI
HKADGTCILW GMGVTQNTGG SDTSAAISNL LLATGNYGRP GAGAFPLRGH NNVQGACDMG
SLPSWLPGYQ HITDDIARAK FEKAYGVRID GKPGLDNIQM IEAAEQGKLK AMYIVGEDMA
LVDCNANHVQ KVLSELDFLV VQDIFLSKTA QFADVVLPAA PSLEKEGTFT NTERRIQRFY
QALEPLGDSK PDWWIIQEIA KRMGADWNYA GPKEIMDEIA SLAPLFSQAH YENLEGWKSL
CWGSYDGADT PILYKERFNF PDGKARFALA DWVQPAEYPE EYDLLVNNGR LLEHFHEGNL
TYKSEGIQRK FPEIFVEVSP ELAKERGIKD GSLVRLESPF GRVKVRVLVT DRVKGKELFL
PMHSATNESA INILTGPATD HRTNTPAFKQ ARVRMQVLEV DGESPLPRTN PRFKKRNPKR
GVEVDRKWKR PDYVPLTDEW KEAQPRG