Gene Cthe_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1231 
Symbol 
ID4809923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1473277 
End bp1474575 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content40% 
IMG OID640106654 
ProductSerine-type D-Ala-D-Ala carboxypeptidase 
Protein accessionYP_001037656 
Protein GI125973746 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGGC TATTGTTACT TTTAATCATG ACAATCATTC TGTTTTCAAA TATTGTGACT 
TTAAAAGCAC AACCATTGGA TATTAATGCA CAAGCATACA TTTTGATAGA TTCCAAAACT
GGTCAGGTTC TTGCTGAACA CAATCCGGAT CTTAGAACTT ATCCTGCCAG CACCACTAAA
ATAATGACAG CAATACTGGC ACTCGAACTT GGAGATCTCA ATCAAATAAT GACTGTCAGC
CAGTCTGCCA TAGATGACAT AGGTCCTGGC GGCATGCATA TCGGTTTGCT GCCGGGCGAA
CAACTGGAGC TCAGATACTT ACTGGATGCT CTCCTGGTGA GATCAGCCAA TGAGACTGCT
TATGTTATTG CCGAAAACCT CTGCTCCTCC CGCGAGGAAT TTTACAGACT TATGAACGAA
AAGGCAAGGG AGCTTGGGGC TACCAATACA AATTTTGTAA ATCCCTGCGG TATTGACAAT
GGAGAAAAGG GAAAAAATCA TCTTACCACG GCAAGAGATC TCGCCAAAAT AGCGCAGTAT
GCAATGACGA TACCGGAATT TAGGGAAATC GTTCAAAAAA CTATTATCAA AATACCTCCT
ACAAACAAGC ATGCTGAAGA GGTTATTGTC GGTACTACCA ATAAATTGCT GCTCTACAGC
AACTCAAAAT ACAAATCGGA ACACTATACA AAAATAACCG GTATAAAAAC GGGTTATACC
GACAGGGCCC TTAACAACTT GGTTTCTTCC GCCGTCAACG ATGAAGGAAC GGAATTGATT
GCTGTGGTTC TCGGCGTTGA GAATTATGAC ATGGTGTTCG AATATTCCAA AATGCTGTTG
GAATACGGTT TCAAAAACTA CTCCGTTCAG CCTGTTATTG CACCGAACTC GTATATAACC
TCTGTACCCG TTTTAAAAGC AGCGGGAAAT CACAACTTGG ATATTCTGGC ATCGCCGGAG
GGACTCAAAT GCCTGCTGCC CAACAATTCA ACTAAAAATG ATTATGAAAT TGAACAACAT
ATTTTAGAAA ACATAGAAGC TCCGGTAAAA AAAGGGGATG TTCTCGGATA CATCGAGGTT
AAAAAAGACG GTGTCACCAT CGGAAAAATA GATGCAGTCG CTTCAAGGGA TGTTGAAAAA
CTTGAGCCGC CGGTTGAACC TCAAAACATA ATTATTAAAA CAGCAAACGA TCCGATTTTG
AAAAAAGTTA CAACAGGAGC ATTGATCTTC CTGTTAATGT TCCTTATGTT AAGATTTACT
TTGCGCAGAA TTTCACGAAG CCTTCATTCA AAAAGATAA
 
Protein sequence
MKRLLLLLIM TIILFSNIVT LKAQPLDINA QAYILIDSKT GQVLAEHNPD LRTYPASTTK 
IMTAILALEL GDLNQIMTVS QSAIDDIGPG GMHIGLLPGE QLELRYLLDA LLVRSANETA
YVIAENLCSS REEFYRLMNE KARELGATNT NFVNPCGIDN GEKGKNHLTT ARDLAKIAQY
AMTIPEFREI VQKTIIKIPP TNKHAEEVIV GTTNKLLLYS NSKYKSEHYT KITGIKTGYT
DRALNNLVSS AVNDEGTELI AVVLGVENYD MVFEYSKMLL EYGFKNYSVQ PVIAPNSYIT
SVPVLKAAGN HNLDILASPE GLKCLLPNNS TKNDYEIEQH ILENIEAPVK KGDVLGYIEV
KKDGVTIGKI DAVASRDVEK LEPPVEPQNI IIKTANDPIL KKVTTGALIF LLMFLMLRFT
LRRISRSLHS KR