Gene Cthe_0608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0608 
Symbol 
ID4808210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp745327 
End bp746376 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content43% 
IMG OID640106022 
Productpeptidase M42 
Protein accessionYP_001037036 
Protein GI125973126 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0364864 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGATAA AAGAATTGAC GGAGTTAAAC GGGGTATCCG GAAATGAGGA TGAGGTAAGA 
AAATTTATAA AAGAAGAGGC GCAAAAGTAT GCAGACAGCA TAACCGAGGA CTCAATGGGA
AATTTGATTT GCTATAAAAA AGGCGGATCC TCAAAATACC GCGTAATGTT GTCCGCCCAC
ATGGACGAGG TCGGATTTAT GGTAACGGGG TATGATGACG GTCTGATCAA ATTTGCAAGT
ATTGGTGGAA TAGATGAGAG AATACTTCCG GGAAAGAGGG TTTTGGTCGG GGAAAAGCGG
ATTCCCGGTG TTATAGGTTC AAAGCCCATT CATCTGCAGG AAAAGGCTGA AAGGGGAAAT
AATATCAAGC TGAAAAACAT GTATATAGAC ATAGGTGCCG AAAAAAAGGA AGAAGCCGAA
AAACTTGCTC CTTTAGGTGA ATACATAGCC TTTTACAGCA TGTATACTGA GTTTGGTGAC
GGCTGTATAA AGGCAAAGGC TTTGGATGAC CGTGTCGGTT GTGCCATACT TCTTGAAATA
TTGAAAGAGA GGTATGGATT TGATTTGTAT GTATGCTTTA CCGTTCAGGA GGAGATAGGA
TTAAGAGGCG CAGGCGTTGC TGCGTTCAGG GTAAATCCTG ATATTGCAAT TGTTGTTGAA
GGAACCACCT GCTCTGATGT GCCGGGAGCC CGCGAACATG AGTATTCAAC GGTAATGGGA
AATGGAGCGG CACTAACTAT AATGGACAGA ACTTCCTATT CCAATAAAAA GCTGGTTGAC
TTTATGTACA AGACGGCGAA AGATAAAAAC ATACCGGTCC AGTACAAGCA AACCGCAACC
GGCGGAAATG ATGCCGGAAA GATACAGTTA ACCCGTGAGG GAGTTGTGGT GGCATCGGTA
TCTGTTCCCT GCAGGTATAT ACATTCCCCT GTGTCGGTAA TGAACCGAAG AGACTATGAA
AGTTGCTTGA ATCTCGTAAA AGCTGTGCTG GAGGAGTTTG ACAACAATGA GAGCTTAATT
GAAAGTTTTA AATTGCACAA TGTGAAGTAA
 
Protein sequence
MLIKELTELN GVSGNEDEVR KFIKEEAQKY ADSITEDSMG NLICYKKGGS SKYRVMLSAH 
MDEVGFMVTG YDDGLIKFAS IGGIDERILP GKRVLVGEKR IPGVIGSKPI HLQEKAERGN
NIKLKNMYID IGAEKKEEAE KLAPLGEYIA FYSMYTEFGD GCIKAKALDD RVGCAILLEI
LKERYGFDLY VCFTVQEEIG LRGAGVAAFR VNPDIAIVVE GTTCSDVPGA REHEYSTVMG
NGAALTIMDR TSYSNKKLVD FMYKTAKDKN IPVQYKQTAT GGNDAGKIQL TREGVVVASV
SVPCRYIHSP VSVMNRRDYE SCLNLVKAVL EEFDNNESLI ESFKLHNVK