Gene Teth514_2138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_2138 
Symbol 
ID5876212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp2141808 
End bp2143553 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content37% 
IMG OID641542492 
Producthydrogenase, Fe-only 
Protein accessionYP_001663746 
Protein GI167040761 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G)
[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR02512] hydrogenases, Fe-only 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000865974 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAAAG TTCGTGTTAT GATAGACAGC ATCACTGTAG AAGTTCCCTC CCATTACACG 
GTATTGGAAG CAGCAAAAGA AGCAGGCATA GATATTCCTA CACTGTGTTA CCTCAAGGAA
ATTAATCAAA TTGGCGCTTG TCGTATATGT GTAGTTGAAA TAGAAGGAGT TAGAAATTTA
CAAACCTCCT GCACTTATCC GGTATTTGAT GGTATGAAAG TGTATACAAA TACACCTAAA
GTTAGAGAAG CGCGGAAATT AAATCTTGAG CTTATACTTT CAAATCATGA TAGAAGCTGT
TTGACTTGTA TTAGAAATAC TAACTGTGAG CTTCAATCAT TGTCTAAAAA ATTAGGAGTA
GATGAAATAA GGTTTGAAGG TGAAAATATA AAGTATTCTA TAGATAATGC TTCTCCTTCT
ATTGTAAGGG ACCCAAATAA GTGTGTGCTT TGTAGAAGAT GTGTATCAGT ATGTTCAGAA
GTGCAAAACG TATTTGCTAT TGGAATGGTA AATAGAGGAT TTAATACAAT GGTAGCACCT
TCTTTTGGCA GAAGTTTAAA AGATTCTCCT TGTATAAGCT GTGGACAGTG TATTGAGGTA
TGTCCTGTTG GAGCTATATA TGAAAAAGAC CACACAAAGA GAGTTTATGA GGCTTTAGCT
GATGAGAAAA AATATGTAGT AGCTCAAACA GCTCCAGCTG TAAGAGTAGC ATTAGGTGAA
GAGTTTGGCA TGCCAATAGG AAGTATTGTT ACAGGAAAAA TGGTAGCAGC TTTGAGAAGA
ATGGGTTTTG ATGCGATATT TGATACGAAT TTTGCAGCAG ACTTGACTAT AATGGAGGAA
GGTTCGGAAC TTCTTGAAAG GCTTAAAAAT GGCGGAAAAC TTCCTATGAT AACTTCCTGT
AGCCCTGGTT GGATAGCTTT CTGTGAAAAA TATTATCCAG AATTTATAGA TAATCTTTCA
ACTTGTAAGT CTCCTCACAT GATGATGGGG GCATTAGTAA AGAGTTATTA TGCAGAAAAG
AAAGGGCTTA ATCCTGAGGA TATATATATA GTATCTATTA TGCCATGTAC TGCTAAAAAA
CTAGAGATTG AAAGGCCAGA AATGCAGCAT AATGGAATAA AAGATGTAGA TGCTGTCCTT
ACCACAAGAG AATTGGCGAG AATGATAAAA GAAATGGGCA TTGACTTTGT AAATCTTCCT
GATGAAGAAT ATGACGAACC TCTTGGAATG TCCACGGGTG CGGCAGTAAT ATTTGGAGCT
ACAGGTGGAG TTATGGAGGC AGCTTTAAGG ACAGTTGCGG ATATTGTAGA AGGGAAAGAT
TTAGACAAAT TTGATTATGA AGAAGTGAGA GGGTTAGAAG GTGTTAGAGA GGCTACTATA
AGAATAGATG GTAAGGACAT AAAAGTTGCC ATAGCAAATG GAACAGGAAA TGCGAAGAAA
CTTCTTGACA AAGTAAAAGC TGGCGAGGTA GAGTATCACT TCATAGAGGT AATGGGCTGC
CCAGGAGGAT GTATAATGGG AGGCGGTCAG CCGATTCACA ATCCAAATGA AATGGAAAAA
GTAAAAGAAT TAAGAGCGAA GGCTATTTAC GAAGCAGATA AAAATTTGCC TATCAGAAAA
TCTCACAAAA ATCCTGCAAT ACAAAAACTC TATGAAGAAT TTTTGGGCAG TCCTTTAAGC
GAAAAGTCTC ATCACTTGCT CCACACTCAT TATTCCAAGA AGGAGTTGTA TCCTCTGGTA
AAATAA
 
Protein sequence
MDKVRVMIDS ITVEVPSHYT VLEAAKEAGI DIPTLCYLKE INQIGACRIC VVEIEGVRNL 
QTSCTYPVFD GMKVYTNTPK VREARKLNLE LILSNHDRSC LTCIRNTNCE LQSLSKKLGV
DEIRFEGENI KYSIDNASPS IVRDPNKCVL CRRCVSVCSE VQNVFAIGMV NRGFNTMVAP
SFGRSLKDSP CISCGQCIEV CPVGAIYEKD HTKRVYEALA DEKKYVVAQT APAVRVALGE
EFGMPIGSIV TGKMVAALRR MGFDAIFDTN FAADLTIMEE GSELLERLKN GGKLPMITSC
SPGWIAFCEK YYPEFIDNLS TCKSPHMMMG ALVKSYYAEK KGLNPEDIYI VSIMPCTAKK
LEIERPEMQH NGIKDVDAVL TTRELARMIK EMGIDFVNLP DEEYDEPLGM STGAAVIFGA
TGGVMEAALR TVADIVEGKD LDKFDYEEVR GLEGVREATI RIDGKDIKVA IANGTGNAKK
LLDKVKAGEV EYHFIEVMGC PGGCIMGGGQ PIHNPNEMEK VKELRAKAIY EADKNLPIRK
SHKNPAIQKL YEEFLGSPLS EKSHHLLHTH YSKKELYPLV K