Gene Cthe_0609 details


Gene Information

Locus tag: Cthe_0609
Symbol:
ID: 4808211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 746423
End bp: 747400
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 44%
IMG OID: 640106023
Product: peptidase M42
Protein accession: YP_001037037
Protein GI: 125973127
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
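
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(746423, 747400)
print(length, "bp")                      # 978 bp
print(protein_length_aa(length), "aa")   # 325 aa
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of the record.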


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.489423
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGACT TGCTCAAAAA GTTTACAGGA ATCGTTGGAG TATCCGGAAA CGAAGAAGAA 
ATAAGGGAAG CTATAATTGA GGAAATTAAA GAATGTGTTG ACGAAATAAA AGTTGATACT
TTGGGAAACC TTATTGCCGT CAAAAAAGGC AAGGGCAAAA AAATCATGGT GGCGGCTCAT
ATGGATGAGA TTGGCGTAAT GGTTACATAT ATAGACGACA AGGGCTTTTT AAGGTTTTCT
GCCGTCGGAG GGGTCAGCCG CTATGACTGT ATAGGCCAGA GGGTGAAGTT TAAAAACGGA
GTTGTCGGAG CTGTTTATTA CGAGGAAAAA CTTGAGGATA TGAAGAATCT CCAGCTTTCC
AAAATGTATA TAGATATTGG AGCAAGAAGC AGAGAGGAAG CCCTGAAGAT GGTAAATATC
GGAGATGTCG CCTGCTTTGT CGGAGATGCG GTGCTTCAGG GGGATACCGT GATATCGAAG
GCATTGGACA ACAGAAGCGG CTGTGCGGTG GTTGTAAAGG CGATAAAAGA GTTGAAAAAG
ACGGATAATG AAATATATTT TGTGTTTACG GTTCAGGAAG AGGTCGGTTT GAGAGGAGCA
AAAACCGCGG CTTTCAGTAT AAAGCCTGAT ATAGCCATAG CTGTGGATGT TACAATGACG
GGAGACACAC CGGAATCGCA TCCTATGGAG GTTAAGTGCG GCGGCGGGCC TGCAATTAAA
GTAAAGGATC GTTCCGTCAT TTGTCATCCG GAGGTAAGAA AACTTTTGGA AGAGTCAGCA
AAAAGGAATA ATATTCCTTA TCAGTTGGAA ATACTTGAAG CAGGAGGGAG CGACCCGGGC
TCAATACATT TGACGGCGGG AGGAATACCT TCGGGTGCGA TATCCATACC GGTGAGGTAT
GTTCACAGTC CGGTAGAGAC CGCCAGCATG TCGGATATTA ATAATGCCGT AAAATTGTTG
GTTGAAGCCA TTTGCTGA
 
Protein sequence
MFDLLKKFTG IVGVSGNEEE IREAIIEEIK ECVDEIKVDT LGNLIAVKKG KGKKIMVAAH 
MDEIGVMVTY IDDKGFLRFS AVGGVSRYDC IGQRVKFKNG VVGAVYYEEK LEDMKNLQLS
KMYIDIGARS REEALKMVNI GDVACFVGDA VLQGDTVISK ALDNRSGCAV VVKAIKELKK
TDNEIYFVFT VQEEVGLRGA KTAAFSIKPD IAIAVDVTMT GDTPESHPME VKCGGGPAIK
VKDRSVICHP EVRKLLEESA KRNNIPYQLE ILEAGGSDPG SIHLTAGGIP SGAISIPVRY
VHSPVETASM SDINNAVKLL VEAIC
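
The gene and protein sequences above are related through the translation table listed in the record (table 11), and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It assumes that the sense-codon assignments of table 11 match the standard genetic code (to the best of our knowledge the two tables differ only in their permitted start codons), and the helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
# Assumption: table 11 assigns the same amino acids to the 64 codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTTTGACT TGCTCAAAAA GTTTACAGGA"))  # MFDLLKKFTG
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared with the protein sequence and the GC content field of the record.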