Gene Cthe_1775 details


Gene Information

Locus tag: Cthe_1775
Symbol:
ID: 4810020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2097361
End bp: 2098071
Gene Length: 711 bp
Protein Length: 236 aa
Translation table: 11
GC content: 46%
IMG OID: 640107189
Product: peptidase M22, glycoprotease
Protein accession: YP_001038189
Protein GI: 125974279
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1214] Inactive homolog of metal-dependent proteases, putative molecular chaperone
TIGRFAM ID:
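
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive coordinates and that the CDS ends with a stop codon; the record does not state either point explicitly. The function names are illustrative only.

```python
# Values copied from the Gene Information fields.
START_BP = 2097361
END_BP = 2098071


def gene_length(start: int, end: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (711 bp) and Protein Length (236 aa) fields.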


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0438332
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAC TGGCTTTGGA TACATCGGCG CTGGTTGCTG CCGTTGCGGT GATGGAGGAT 
GACAGGCTTC TTGCGGAATA TATGCTAAAC CACAGAAAAA CCCATTCCCA GCAGCTTGTA
GCGATGATCA GGGAGGTTCT TGCCTCATTG GAACTGGCGC CGAAAGACAT AGATGTTTTT
GCGGCCTCCA CAGGTCCGGG CTCTTTTACG GGACTGAGAA TCGGGGTTAC CACCGTAAAG
GCCATGGCCT ATGCGACGGG AAAACCTGTC GTAAGTGTGC CGACATTGGA TGCAATAGCA
TATAATATTC CGATGAACAG TTTTACAATA TGCCCGGTCA TGGATGCAAG AAACAACCAG
GTGTATACCG CACTTTACGA TTGGGATGAG AACGGGCAGA AGAGGATTAC GGATTATATG
GGGATACCGG TGTCCGAATT GGTACAGCTT ATAAAGGACA TGGGCAAAAA AGTCATTTTT
GCCGGGGATG CTGCAAAAAT GCATGAAGAA TATTTTACAC AGGAGCTTGG AGACGACTGT
AAAATTGCTC CGGGAAACCT TCTTCTTCAG AGGGCTTCAT CGGTTGCCCG TCTTGCTTAT
TTAAAAGCAA TGAACAATGA ACTGGAAAGC TGTTTTGACA TGGTGCCTTT CTATCTTAGA
AAGTCCCAGG CTGAAAGAGA ATATGAGAAA AAGCTTTGCA AGGACTGTTA A
 
Protein sequence
MKILALDTSA LVAAVAVMED DRLLAEYMLN HRKTHSQQLV AMIREVLASL ELAPKDIDVF 
AASTGPGSFT GLRIGVTTVK AMAYATGKPV VSVPTLDAIA YNIPMNSFTI CPVMDARNNQ
VYTALYDWDE NGQKRITDYM GIPVSELVQL IKDMGKKVIF AGDAAKMHEE YFTQELGDDC
KIAPGNLLLQ RASSVARLAY LKAMNNELES CFDMVPFYLR KSQAEREYEK KLCKDC
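
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the database's own pipeline: it builds the codon table from the amino-acid string that NCBI lists for translation table 11 (identical to the standard code for amino-acid assignments), ignores alternative start codons, and stops at the first stop codon. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of the input."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above; both begin on a codon
# boundary because each full line holds 60 bases.
first_line = "ATGAAAATAC TGGCTTTGGA TACATCGGCG CTGGTTGCTG CCGTTGCGGT GATGGAGGAT"
last_line = "AAGTCCCAGG CTGAAAGAGA ATATGAGAAA AAGCTTTGCA AGGACTGTTA A"
print(translate(first_line))
print(translate(last_line))
```

Passing the full gene sequence to `translate` in the same way would allow a comparison against the complete protein sequence listed above.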