Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0342 |
Symbol | |
ID | 4808491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 430223 |
End bp | 431971 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105756 |
Product | hydrogenase, Fe-only |
Protein accession | YP_001036773 |
Protein GI | 125972863 |
COG category | [R] General function prediction only |
COG ID | [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | [TIGR02512] hydrogenases, Fe-only |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATGG TAAATGTTAC TATAGATAAT TGCAAGATAC AGGTACCTGC CAATTATACC GTGTTGGAAG CTGCAAAACA GGCTAACATA GACATACCTA CTCTTTGCTT CCTCAAGGAT ATAAATGAAG TAGGTGCCTG CCGTATGTGC GTTGTTGAGG TAAAAGGTGC CAGAAGCTTA CAGGCGGCCT GTGTATATCC GGTGTCCGAA GGTCTTGAGG TGTACACTCA GACACCGGCG GTAAGGGAAG CCAGGAAAGT GACTTTGGAA CTTATACTGT CAAACCATGA AAAGAAATGT TTGACCTGTG TAAGAAGTGA AAACTGCGAA TTGCAAAGAC TGGCAAAAGA TCTGAATGTA AAAGATATCA GATTTGAAGG TGAAATGAGC AATTTGCCGA TAGATGATCT TTCGCCTTCT GTTGTAAGGG ATCCCAACAA GTGTGTTTTG TGCAGACGCT GTGTCAGCAT GTGCAAGAAT GTTCAGACCG TTGGAGCCAT TGATGTTACT GAAAGAGGAT TCCGTACCAC CGTATCAACG GCCTTTAACA AACCTCTCAG TGAAGTACCC TGCGTAAACT GCGGACAGTG TATCAATGTA TGTCCTGTGG GAGCATTGAG AGAAAAGGAC GATATTGACA AGGTTTGGGA AGCTCTTGCA AATCCTGAGC TTCATGTAGT CGTTCAGACG GCTCCTGCAG TCAGGGTTGC ATTGGGAGAA GAGTTTGGAA TGCCTATCGG CTCAAGAGTG ACCGGTAAAA TGGTGGCAGC ATTGAGTCGA CTGGGCTTTA AAAAGGTATT TGATACAGAT ACGGCTGCCG ACCTTACAAT AATGGAGGAA GGTACTGAGC TTATAAACAG GATTAAAAAC GGCGGCAAGC TTCCTTTGAT AACTTCCTGC AGCCCGGGAT GGATAAAGTT CTGCGAACAC AACTATCCTG AGTTTTTAGA CAATCTGTCC AGCTGCAAAT CGCCTCACGA AATGTTTGGT GCGGTTTTGA AATCCTACTA TGCACAGAAA AACGGAATTG ATCCTTCAAA AGTATTTGTT GTATCAATAA TGCCATGTAC GGCAAAGAAG TTTGAGGCTC AAAGGCCGGA GCTTTCTTCA ACGGGTTATC CTGATGTGGA TGTTGTTCTT ACCACAAGAG AGCTTGCAAG AATGATAAAA GAAACGGGTA TTGATTTTAA TTCCCTTCCG GATAAACAGT TTGATGATCC TATGGGTGAG GCATCCGGAG CAGGTGTTAT TTTTGGTGCC ACCGGAGGAG TTATGGAGGC TGCCATCAGG ACCGTCGGTG AATTATTGAG CGGCAAACCT GCAGACAAGA TTGAATATAC TGAGGTAAGA GGTCTTGACG GTATAAAAGA GGCTTCCATA GAACTTGACG GTTTTACTCT GAAGGCTGCT GTTGCCCATG GTCTTGGCAA CGCAAGAAAG CTTCTTGACA AAATAAAAGC CGGAGAGGCG GATTATCATT TCATTGAAAT AATGGCCTGT CCCGGTGGTT GTATAAACGG TGGAGGACAG CCCATACAGC CGTCATCTGT GAGAAACTGG AAAGATATAA GATGCGAGAG GGCGAAAGCT ATTTACGAAG AGGATGAGTC CTTGCCTATA AGAAAATCTC ATGAAAATCC AAAGATAAAG ATGCTGTATG AAGAATTCTT TGGTGAACCG GGCAGTCATA AAGCTCACGA GCTTTTGCAC ACTCATTATG AGAAGAGGGA AAACTACCCT GTTAAATGA
|
Protein sequence | MQMVNVTIDN CKIQVPANYT VLEAAKQANI DIPTLCFLKD INEVGACRMC VVEVKGARSL QAACVYPVSE GLEVYTQTPA VREARKVTLE LILSNHEKKC LTCVRSENCE LQRLAKDLNV KDIRFEGEMS NLPIDDLSPS VVRDPNKCVL CRRCVSMCKN VQTVGAIDVT ERGFRTTVST AFNKPLSEVP CVNCGQCINV CPVGALREKD DIDKVWEALA NPELHVVVQT APAVRVALGE EFGMPIGSRV TGKMVAALSR LGFKKVFDTD TAADLTIMEE GTELINRIKN GGKLPLITSC SPGWIKFCEH NYPEFLDNLS SCKSPHEMFG AVLKSYYAQK NGIDPSKVFV VSIMPCTAKK FEAQRPELSS TGYPDVDVVL TTRELARMIK ETGIDFNSLP DKQFDDPMGE ASGAGVIFGA TGGVMEAAIR TVGELLSGKP ADKIEYTEVR GLDGIKEASI ELDGFTLKAA VAHGLGNARK LLDKIKAGEA DYHFIEIMAC PGGCINGGGQ PIQPSSVRNW KDIRCERAKA IYEEDESLPI RKSHENPKIK MLYEEFFGEP GSHKAHELLH THYEKRENYP VK
|
| |