Gene Cthe_1171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1171 
Symbol 
ID4810123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1397667 
End bp1398914 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content40% 
IMG OID640106593 
ProductSerine-type D-Ala-D-Ala carboxypeptidase 
Protein accessionYP_001037596 
Protein GI125973686 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.987269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAGAA GAGTCTTGAT ACAAATTCAG TGCTTTACAG TAGCAATGAT GATTTTGTTT 
TTCTCTCAAA GTCCGGTTTT CGCAGTTGCG GAACCTCCGG AAATCAAGGC ACCTTCCGCT
ATTTTGATGG AAGTGCAGAG GGGACAGATA CTCTATCAAA AGAATCCAAA ATTAAAACTT
CATGTTTCGT GTGCAAATAA AATTATGACC GGACTCATTG CTTTGGAAAA AATGCAGAAT
CAACTGAACA CCAATATCAC TGTCAGCAAG AAGGCGGTTT CTGTTGAAGG AGCTGTGTTA
AATCTCGAGG TCGGCGGAAA ATACCCGGTT GAAGATTTGA TATATTCGGT TTTGTTAGGA
TCCGCCAATG ACAGTGCCAA TGTTCTGGCT GAGTATATAG GTGGAGACGA GAAGGGTTTT
GTTGAGCTTA TGAATAAAAA GGCCCAGGAA CTTGAGATGA AGGATACTTA TTTTACAAAT
CCCACGGGTC TTTATGATGA AAAACAATAT ACAACGGCGT ATGACCTGGC CGTTTTAATA
AGATATGCTC TGACAAAATC CAGCACTTTC AATGAGATGT TTTCGGCTAA GGCCAGACCA
TGGGTTGACG GAACGCAGAT TTTAATAAAC AGCAATGAGT TGTTCTGGAG CTATGACGGC
GTTGACGGTG GAAAGACCGG ATATAACGAA ATAGACCGTC AAACGGCAAT TACCACTGCC
ACAAGAAACG GGCAAAGGTT GATATGCATA GTTCTTGATT CACCGGAAGA AAGCATGTAT
GACGATTCGG TAAAGCTTCT GGACTATGGT TTTTTAAATT TCAGGACAGG CATTCTGGTA
TCAATGGGAC AACCTTTGAA GAAAGTTACC GTCGGCGATA AAGTTATAGA TTTGGTTAGC
ATAGGTGACT ATTACTATAC TTACCCTGCC GGGGAAAATT ATATAAAGAA TATTGAATTT
AAAGTTCCTG AAAAGTTTGA TCCTCCTGTA CTGAAAAGTG ATGTTTTAGG CATTGCAAAG
TATACTTTGG AGGATGGAAC GGTTATTGAA GTAAGTCTGC ATCCGGCGGT TGATGTTTAC
TCTTCGATGG GCTTGTTTGA GTCGTTGATA AATCAAGTGA AGGAATACAG GGATATAGTA
ATATTGCTGT GTATTCTTTT GGTAATAGAA TTATTTATTG CGGTTTATCA TATAGTGAGG
CTGATAAAGC GGCTGTTTCT AAAGCTTGTT TACAAGCCTG GGAAATAA
 
Protein sequence
MYRRVLIQIQ CFTVAMMILF FSQSPVFAVA EPPEIKAPSA ILMEVQRGQI LYQKNPKLKL 
HVSCANKIMT GLIALEKMQN QLNTNITVSK KAVSVEGAVL NLEVGGKYPV EDLIYSVLLG
SANDSANVLA EYIGGDEKGF VELMNKKAQE LEMKDTYFTN PTGLYDEKQY TTAYDLAVLI
RYALTKSSTF NEMFSAKARP WVDGTQILIN SNELFWSYDG VDGGKTGYNE IDRQTAITTA
TRNGQRLICI VLDSPEESMY DDSVKLLDYG FLNFRTGILV SMGQPLKKVT VGDKVIDLVS
IGDYYYTYPA GENYIKNIEF KVPEKFDPPV LKSDVLGIAK YTLEDGTVIE VSLHPAVDVY
SSMGLFESLI NQVKEYRDIV ILLCILLVIE LFIAVYHIVR LIKRLFLKLV YKPGK