Gene Cthe_0342 details

Gene Information

Locus tag: Cthe_0342
Symbol:
ID: 4808491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 430223
End bp: 431971
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 44%
IMG OID: 640105756
Product: hydrogenase, Fe-only
Protein accession: YP_001036773
Protein GI: 125972863
COG category: [R] General function prediction only
COG ID: [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID: [TIGR02512] hydrogenases, Fe-only
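
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp):
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length(430223, 431971)
print(length)                           # 1749
print(expected_protein_length(length))  # 582
```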


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAATGG TAAATGTTAC TATAGATAAT TGCAAGATAC AGGTACCTGC CAATTATACC 
GTGTTGGAAG CTGCAAAACA GGCTAACATA GACATACCTA CTCTTTGCTT CCTCAAGGAT
ATAAATGAAG TAGGTGCCTG CCGTATGTGC GTTGTTGAGG TAAAAGGTGC CAGAAGCTTA
CAGGCGGCCT GTGTATATCC GGTGTCCGAA GGTCTTGAGG TGTACACTCA GACACCGGCG
GTAAGGGAAG CCAGGAAAGT GACTTTGGAA CTTATACTGT CAAACCATGA AAAGAAATGT
TTGACCTGTG TAAGAAGTGA AAACTGCGAA TTGCAAAGAC TGGCAAAAGA TCTGAATGTA
AAAGATATCA GATTTGAAGG TGAAATGAGC AATTTGCCGA TAGATGATCT TTCGCCTTCT
GTTGTAAGGG ATCCCAACAA GTGTGTTTTG TGCAGACGCT GTGTCAGCAT GTGCAAGAAT
GTTCAGACCG TTGGAGCCAT TGATGTTACT GAAAGAGGAT TCCGTACCAC CGTATCAACG
GCCTTTAACA AACCTCTCAG TGAAGTACCC TGCGTAAACT GCGGACAGTG TATCAATGTA
TGTCCTGTGG GAGCATTGAG AGAAAAGGAC GATATTGACA AGGTTTGGGA AGCTCTTGCA
AATCCTGAGC TTCATGTAGT CGTTCAGACG GCTCCTGCAG TCAGGGTTGC ATTGGGAGAA
GAGTTTGGAA TGCCTATCGG CTCAAGAGTG ACCGGTAAAA TGGTGGCAGC ATTGAGTCGA
CTGGGCTTTA AAAAGGTATT TGATACAGAT ACGGCTGCCG ACCTTACAAT AATGGAGGAA
GGTACTGAGC TTATAAACAG GATTAAAAAC GGCGGCAAGC TTCCTTTGAT AACTTCCTGC
AGCCCGGGAT GGATAAAGTT CTGCGAACAC AACTATCCTG AGTTTTTAGA CAATCTGTCC
AGCTGCAAAT CGCCTCACGA AATGTTTGGT GCGGTTTTGA AATCCTACTA TGCACAGAAA
AACGGAATTG ATCCTTCAAA AGTATTTGTT GTATCAATAA TGCCATGTAC GGCAAAGAAG
TTTGAGGCTC AAAGGCCGGA GCTTTCTTCA ACGGGTTATC CTGATGTGGA TGTTGTTCTT
ACCACAAGAG AGCTTGCAAG AATGATAAAA GAAACGGGTA TTGATTTTAA TTCCCTTCCG
GATAAACAGT TTGATGATCC TATGGGTGAG GCATCCGGAG CAGGTGTTAT TTTTGGTGCC
ACCGGAGGAG TTATGGAGGC TGCCATCAGG ACCGTCGGTG AATTATTGAG CGGCAAACCT
GCAGACAAGA TTGAATATAC TGAGGTAAGA GGTCTTGACG GTATAAAAGA GGCTTCCATA
GAACTTGACG GTTTTACTCT GAAGGCTGCT GTTGCCCATG GTCTTGGCAA CGCAAGAAAG
CTTCTTGACA AAATAAAAGC CGGAGAGGCG GATTATCATT TCATTGAAAT AATGGCCTGT
CCCGGTGGTT GTATAAACGG TGGAGGACAG CCCATACAGC CGTCATCTGT GAGAAACTGG
AAAGATATAA GATGCGAGAG GGCGAAAGCT ATTTACGAAG AGGATGAGTC CTTGCCTATA
AGAAAATCTC ATGAAAATCC AAAGATAAAG ATGCTGTATG AAGAATTCTT TGGTGAACCG
GGCAGTCATA AAGCTCACGA GCTTTTGCAC ACTCATTATG AGAAGAGGGA AAACTACCCT
GTTAAATGA
 
Protein sequence
MQMVNVTIDN CKIQVPANYT VLEAAKQANI DIPTLCFLKD INEVGACRMC VVEVKGARSL 
QAACVYPVSE GLEVYTQTPA VREARKVTLE LILSNHEKKC LTCVRSENCE LQRLAKDLNV
KDIRFEGEMS NLPIDDLSPS VVRDPNKCVL CRRCVSMCKN VQTVGAIDVT ERGFRTTVST
AFNKPLSEVP CVNCGQCINV CPVGALREKD DIDKVWEALA NPELHVVVQT APAVRVALGE
EFGMPIGSRV TGKMVAALSR LGFKKVFDTD TAADLTIMEE GTELINRIKN GGKLPLITSC
SPGWIKFCEH NYPEFLDNLS SCKSPHEMFG AVLKSYYAQK NGIDPSKVFV VSIMPCTAKK
FEAQRPELSS TGYPDVDVVL TTRELARMIK ETGIDFNSLP DKQFDDPMGE ASGAGVIFGA
TGGVMEAAIR TVGELLSGKP ADKIEYTEVR GLDGIKEASI ELDGFTLKAA VAHGLGNARK
LLDKIKAGEA DYHFIEIMAC PGGCINGGGQ PIQPSSVRNW KDIRCERAKA IYEEDESLPI
RKSHENPKIK MLYEEFFGEP GSHKAHELLH THYEKRENYP VK
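
The protein sequence above can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the tables differ in which start codons they permit), and that the spaces between the 10-base groups are removed before translation. `translate` is a hypothetical helper name, not a function from any particular library.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCAAATGG TAAATGTTAC TATAGATAAT"))  # MQMVNVTIDN
```

Passing the full gene sequence to the same function should reproduce the listed protein sequence if the record is internally consistent.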