Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_2138 |
Symbol | |
ID | 5876212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | - |
Start bp | 2141808 |
End bp | 2143553 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641542492 |
Product | hydrogenase, Fe-only |
Protein accession | YP_001663746 |
Protein GI | 167040761 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | [TIGR02512] hydrogenases, Fe-only |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000865974 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAAAG TTCGTGTTAT GATAGACAGC ATCACTGTAG AAGTTCCCTC CCATTACACG GTATTGGAAG CAGCAAAAGA AGCAGGCATA GATATTCCTA CACTGTGTTA CCTCAAGGAA ATTAATCAAA TTGGCGCTTG TCGTATATGT GTAGTTGAAA TAGAAGGAGT TAGAAATTTA CAAACCTCCT GCACTTATCC GGTATTTGAT GGTATGAAAG TGTATACAAA TACACCTAAA GTTAGAGAAG CGCGGAAATT AAATCTTGAG CTTATACTTT CAAATCATGA TAGAAGCTGT TTGACTTGTA TTAGAAATAC TAACTGTGAG CTTCAATCAT TGTCTAAAAA ATTAGGAGTA GATGAAATAA GGTTTGAAGG TGAAAATATA AAGTATTCTA TAGATAATGC TTCTCCTTCT ATTGTAAGGG ACCCAAATAA GTGTGTGCTT TGTAGAAGAT GTGTATCAGT ATGTTCAGAA GTGCAAAACG TATTTGCTAT TGGAATGGTA AATAGAGGAT TTAATACAAT GGTAGCACCT TCTTTTGGCA GAAGTTTAAA AGATTCTCCT TGTATAAGCT GTGGACAGTG TATTGAGGTA TGTCCTGTTG GAGCTATATA TGAAAAAGAC CACACAAAGA GAGTTTATGA GGCTTTAGCT GATGAGAAAA AATATGTAGT AGCTCAAACA GCTCCAGCTG TAAGAGTAGC ATTAGGTGAA GAGTTTGGCA TGCCAATAGG AAGTATTGTT ACAGGAAAAA TGGTAGCAGC TTTGAGAAGA ATGGGTTTTG ATGCGATATT TGATACGAAT TTTGCAGCAG ACTTGACTAT AATGGAGGAA GGTTCGGAAC TTCTTGAAAG GCTTAAAAAT GGCGGAAAAC TTCCTATGAT AACTTCCTGT AGCCCTGGTT GGATAGCTTT CTGTGAAAAA TATTATCCAG AATTTATAGA TAATCTTTCA ACTTGTAAGT CTCCTCACAT GATGATGGGG GCATTAGTAA AGAGTTATTA TGCAGAAAAG AAAGGGCTTA ATCCTGAGGA TATATATATA GTATCTATTA TGCCATGTAC TGCTAAAAAA CTAGAGATTG AAAGGCCAGA AATGCAGCAT AATGGAATAA AAGATGTAGA TGCTGTCCTT ACCACAAGAG AATTGGCGAG AATGATAAAA GAAATGGGCA TTGACTTTGT AAATCTTCCT GATGAAGAAT ATGACGAACC TCTTGGAATG TCCACGGGTG CGGCAGTAAT ATTTGGAGCT ACAGGTGGAG TTATGGAGGC AGCTTTAAGG ACAGTTGCGG ATATTGTAGA AGGGAAAGAT TTAGACAAAT TTGATTATGA AGAAGTGAGA GGGTTAGAAG GTGTTAGAGA GGCTACTATA AGAATAGATG GTAAGGACAT AAAAGTTGCC ATAGCAAATG GAACAGGAAA TGCGAAGAAA CTTCTTGACA AAGTAAAAGC TGGCGAGGTA GAGTATCACT TCATAGAGGT AATGGGCTGC CCAGGAGGAT GTATAATGGG AGGCGGTCAG CCGATTCACA ATCCAAATGA AATGGAAAAA GTAAAAGAAT TAAGAGCGAA GGCTATTTAC GAAGCAGATA AAAATTTGCC TATCAGAAAA TCTCACAAAA ATCCTGCAAT ACAAAAACTC TATGAAGAAT TTTTGGGCAG TCCTTTAAGC GAAAAGTCTC ATCACTTGCT CCACACTCAT TATTCCAAGA AGGAGTTGTA TCCTCTGGTA AAATAA
|
Protein sequence | MDKVRVMIDS ITVEVPSHYT VLEAAKEAGI DIPTLCYLKE INQIGACRIC VVEIEGVRNL QTSCTYPVFD GMKVYTNTPK VREARKLNLE LILSNHDRSC LTCIRNTNCE LQSLSKKLGV DEIRFEGENI KYSIDNASPS IVRDPNKCVL CRRCVSVCSE VQNVFAIGMV NRGFNTMVAP SFGRSLKDSP CISCGQCIEV CPVGAIYEKD HTKRVYEALA DEKKYVVAQT APAVRVALGE EFGMPIGSIV TGKMVAALRR MGFDAIFDTN FAADLTIMEE GSELLERLKN GGKLPMITSC SPGWIAFCEK YYPEFIDNLS TCKSPHMMMG ALVKSYYAEK KGLNPEDIYI VSIMPCTAKK LEIERPEMQH NGIKDVDAVL TTRELARMIK EMGIDFVNLP DEEYDEPLGM STGAAVIFGA TGGVMEAALR TVADIVEGKD LDKFDYEEVR GLEGVREATI RIDGKDIKVA IANGTGNAKK LLDKVKAGEV EYHFIEVMGC PGGCIMGGGQ PIHNPNEMEK VKELRAKAIY EADKNLPIRK SHKNPAIQKL YEEFLGSPLS EKSHHLLHTH YSKKELYPLV K
|
| |