Gene Cthe_0734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0734 
Symbol 
ID4810352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp891758 
End bp892879 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content37% 
IMG OID640106151 
Productpeptidase M23B 
Protein accessionYP_001037162 
Protein GI125973252 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0522246 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAGGC TTTTTATTAT TGTACTGATA GCGTCTTTGA CGGCGGCTTT GATTTTTCCT 
GTTATGGCTG ACAATAATTC TTTAAAAGAT AAAATGAGTG AAATAGACGA TAAATTGAAT
GATATTAGCA AGCAAAAAGT AGAGATAGAT AAAGAAAAAA AGGAACTTGA ACGTGAGAAA
AAAGAATTAA TAAATGCCGA AAATGAAGCA AACTTGGAGT ACCAGAATTT GGTTTCTGAA
CTGGAGGCAT TGGACAGTCA AATAGAAAGC TATGAGACTT CGTTAAAAGA TACGGAAGAA
AGATACGCCA AAACTTTAAA AGCCTTTGAA GAACGTCTTG TAACGATGTA CAGAAATTCA
TATGTTTCAT ACCTTAATAT ATTAGCCGAT TCGGAGAATC TGATTAATTT TTTTGAAAGA
CTTGAGTTGA TTTCATCAAT TGCGAAAAAA GATAAGGAAA TTGTAAAGAA AGTTCAGGAT
ATAAAAAAGG ATCTTACATA TAAAAAGCAA TTGGTTCAAT ATATAAAAGG TGCAAAGCAA
CTTGAATTAA TCAGAAAGAA AAACAATATT GATTCATTGG TTGCATCCAG GAGTGGGCTT
GAAAACAAAA TCAAAGAAAG AGAAGAAGAA ATCAGACGTT TGGAAGAGCA GGAAGACAAG
CTGATTCAGC AATCTTACGA AATAGCAAAT CAAATAAGGA GAAGTACCGG AAGTATCAAA
AATTATGCCG GCGGTACAAT GGTGTGGCCG GTACCAAGTT CAAGAAAGAT AGATTCTCGA
TTTGGCACGA GACTTCATCC TATATTCAAA AAGTATAAAA TGCATACCGG TGTTGACATT
GATGCGGCTT ACGGAGCTTC AATAGTTGCT GCCAATAATG GAATTGTGAT TTTTTCGGGC
TGGGAGGATG GATACGGTTA TACGGTTATT ATCGACCATG GCGGCGGAAT AACCACGCTG
TATGCTCATT GCAGCAAGCT TCTTGTTAAC AAAGGTGACA AGGTTCGAAA GGGTCAAACC
ATAGCCCAGG CCGGCAGTAC CGGAACGGCT ACAGGATCGC ATCTACATTT TGAAGTTCGA
ATAGACGGAA ATGTAACCAA TCCTTTGGAT TATATTAAAT GA
 
Protein sequence
MKRLFIIVLI ASLTAALIFP VMADNNSLKD KMSEIDDKLN DISKQKVEID KEKKELEREK 
KELINAENEA NLEYQNLVSE LEALDSQIES YETSLKDTEE RYAKTLKAFE ERLVTMYRNS
YVSYLNILAD SENLINFFER LELISSIAKK DKEIVKKVQD IKKDLTYKKQ LVQYIKGAKQ
LELIRKKNNI DSLVASRSGL ENKIKEREEE IRRLEEQEDK LIQQSYEIAN QIRRSTGSIK
NYAGGTMVWP VPSSRKIDSR FGTRLHPIFK KYKMHTGVDI DAAYGASIVA ANNGIVIFSG
WEDGYGYTVI IDHGGGITTL YAHCSKLLVN KGDKVRKGQT IAQAGSTGTA TGSHLHFEVR
IDGNVTNPLD YIK