Gene Cthe_0335 details


Gene Information

Locus tag: Cthe_0335
Symbol:
ID: 4808484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 424321
End bp: 425667
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 44%
IMG OID: 640105749
Product: hydrogenase large subunit-like protein
Protein accession: YP_001036766
Protein GI: 125972856
COG category: [R] General function prediction only
COG ID: [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID:
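
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information fields above.
start_bp = 424321
end_bp = 425667

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1347
print(protein_length_aa)  # 448
```

Both results agree with the Gene Length and Protein Length fields.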


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCCGT ACTTCCACTC GGTGACTCTG GACGAGGTCA AATGTAAAGG TTGTACCAAT 
TGCATAAAAA GATGCCCCAC GGAAGCAATC AGAGTAAGAA AGAGCAAAGC CAGAATCATA
AATGAAAGAT GCATAGATTG CGGAGAGTGT ATAAGGGTTT GTCCTTATCA TGCCAAAAAG
GCGATAACCG ACCCTATTGG CCTGATAAAT GACTATAAAT ACAAAGTTGC GATACCTGCC
CCTTCCCTGT ATGGACAGCT AAAGTCAGCG CCTTCAAGGG ATCATATCCT CACTGCTCTT
AAAATGTGCG GATTTGACGA TGTTTTTGAG GTTGCAAGGG CAGCCGAGAT TATTACCAGT
GAGACTAAAA AAATACTTGC ACAGCAAAGA TTTGAAAAGC CCCTTATATC TTCAGCATGC
CCTGCTGTAG TAAGGCTCAT ACAAGTGAGG TTTCCAAATC TTATCGATAA TATATTAAAG
CTAAAGTCCC CCATGGAGGT TGCGGCCAAA ATCGCAAAAG ATGAAATATC ACGAAACAGG
AAAATTCGCC AGGAGGACAT CGGCGTATTT TTTATTACTC CCTGTGCCGC AAAAGTGACC
AGCATAAAAG CTCCTTTGGA CAAGGAGAAG TCCTTTGTGG ACGGAGCAAT TTCAATTTCG
GAAATATATT TAAAAATACT TTCTGCCCTG GGGAAAATTG ATAAACCTGA AAAGCTTTCA
AGAGCCGGGT TTGTCGGAGT ACGCTGGGCA AATTCCGGAG GAGAAAGCAT CGCCCTTGGA
ACGGACAAAT TTCTTGCGGT TGACGGTATT CACAATGTAA TTGCGATTCT TGAGGAAATT
GAGGACGAAA AGCTGGAGGA TATAGATTTT GTTGAAGCTC TGGCATGTCA GGGAGGCTGT
CTTGGCGGTC CACTTAACGT GGAAAATATG TATGTTGCCA GAAAAACAAT AAAGAAATTT
ATTGATGATG CAAAAGAAAA AGGGTTGGAA ACGGCGGCAG ACGAAAATTG CAATTATGAT
ATTGCCTGGA CGGGAAAGGT TGAATACAAA CCTGTCATGA AGCTGGACAA GGATTTTGAC
GTGGCAATGA AGAAGCTGGA GACTCTTAAT GTCATAAACA GCGGACTTCC CGGACTGGAT
TGCGGTGCGT GCGGTGCACC GAGCTGCAGA GCCCTTGCGG AAGACATAGT AAGGGGGAAT
GCCAACGAGA CGGATTGCAT ATTTAAACTG AGAGAAAAAG TAAGAGACCT TGCATTTCAG
ATGATGGAGC TGGAAGCAAA AATGCCTCCT GTGCTGGACA AGGATGAAAC TTCGGAAAAC
AAATCAAAGA AAAGGGGAGA GTTTTGA
 
Protein sequence
MSPYFHSVTL DEVKCKGCTN CIKRCPTEAI RVRKSKARII NERCIDCGEC IRVCPYHAKK 
AITDPIGLIN DYKYKVAIPA PSLYGQLKSA PSRDHILTAL KMCGFDDVFE VARAAEIITS
ETKKILAQQR FEKPLISSAC PAVVRLIQVR FPNLIDNILK LKSPMEVAAK IAKDEISRNR
KIRQEDIGVF FITPCAAKVT SIKAPLDKEK SFVDGAISIS EIYLKILSAL GKIDKPEKLS
RAGFVGVRWA NSGGESIALG TDKFLAVDGI HNVIAILEEI EDEKLEDIDF VEALACQGGC
LGGPLNVENM YVARKTIKKF IDDAKEKGLE TAADENCNYD IAWTGKVEYK PVMKLDKDFD
VAMKKLETLN VINSGLPGLD CGACGAPSCR ALAEDIVRGN ANETDCIFKL REKVRDLAFQ
MMELEAKMPP VLDKDETSEN KSKKRGEF
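
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: the `translate` helper and the table constants are hypothetical names, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in which start codons it permits). Only the first and last blocks of the gene sequence are used here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence above.
print(translate("ATGAGTCCGT ACTTCCACTC GGTGACTCTG"))  # MSPYFHSVTL
print(translate("AAATCAAAGA AAAGGGGAGA GTTTTGA"))     # KSKKRGEF
```

The two outputs match the first ten and last eight residues of the protein sequence. Passing the full gene sequence to the same function would allow the whole translation to be compared.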