Gene Cthe_1858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1858 
Symbol 
ID4809409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2203047 
End bp2204174 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content42% 
IMG OID640107277 
Productpeptidase M23B 
Protein accessionYP_001038272 
Protein GI125974362 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000219165 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAAGG CAATATTGGT GGTTGTAGCG TTTGCGTTGA TTTTGTCGTC ATTGATGATA 
CCTGTATTTG CCAAAACCAT TTCCGATGTA CAAAAGGAGA AAAATACCGT TGACAGCAAA
TTAAACAGCA TTACGAAACA AAAGAAAGAG GAAAAGCAAA AACTCAGTAA TATTGAGAGT
GAAAAGAAAA AAATAGAGTC ACAGCAGGCG GAAAAGACCA GGGAATATAA TTCACTGAAT
CAGCAGGTTG AAGAACTGAA CAAACATATA GAGGAAATAG ATGCCGCGAT AAAGGAAAGC
GAAGACAGAT ACAACAAGCA GTTGGAACTG TTGAAAGTAA GAATCAATGT AATGTATCAG
AATTCCGGAG CTACATATAT TCAAACACTG GCAGAATCGA AAAACTTTAT TGATTTTCTG
AACAAACTTG AACTTGTTGC AGCCATAAGC AAAAGGGACA AAGAGATAAT TGAGGACCTC
AAACAGGCTA AAGCGGATGT GGAGTTTAAA AAGAAACTGG CCGTTGAGAA GCGGGATACT
GTTAAAGAGA AAGCGGAACA ATCGTTGAAG GCGTTAAATG AGCTCAGTGT CGCAAGATCA
AAGCTTGACA GCCAAATAAA CAGTATAAAT GCCCAACTTA AGAAACTTGA ACAACAGGAA
AATGAGTTGA TAAAACAGTC AAATGAACTT GCCGGTCAGA TAAGAAAACT TCAGCAAAGC
GGAAGCTACG CCGGTGGAAC CATGCGCTGG CCTTTGCCTG GCAGTACCAA AATTTCATCT
TACTTTGGCA ACAGGCTTCA TCCCATACTC AAAGTGTATA AGATGCATAC AGGTATAGAT
ATTTCGGCAG CCACCGGAAC ATCGATAGTA GCCGCCAACA AAGGCGTGGT CATAATGTCA
GGGTGGCAGA ACGGATACGG CTATACAGTG GTTGTGGACC ATGGAGGCGG AATTTCCACA
TTGTATGCCC ATTGCAGCAA ACTGCTTGTC AAAGTGGGCG ATTCGGTTAA TGCCGGGGAT
ACGATTGCAA AAGTAGGAAG CACCGGACTT GCCACCGGGC CTCACTTGCA CTTTGAGGTA
AGAAAGAACG GCACTCCGGT AAATCCGCTC GACTATGTAA AGCCGTAA
 
Protein sequence
MKKAILVVVA FALILSSLMI PVFAKTISDV QKEKNTVDSK LNSITKQKKE EKQKLSNIES 
EKKKIESQQA EKTREYNSLN QQVEELNKHI EEIDAAIKES EDRYNKQLEL LKVRINVMYQ
NSGATYIQTL AESKNFIDFL NKLELVAAIS KRDKEIIEDL KQAKADVEFK KKLAVEKRDT
VKEKAEQSLK ALNELSVARS KLDSQINSIN AQLKKLEQQE NELIKQSNEL AGQIRKLQQS
GSYAGGTMRW PLPGSTKISS YFGNRLHPIL KVYKMHTGID ISAATGTSIV AANKGVVIMS
GWQNGYGYTV VVDHGGGIST LYAHCSKLLV KVGDSVNAGD TIAKVGSTGL ATGPHLHFEV
RKNGTPVNPL DYVKP