Gene Cthe_3179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3179 
Symbol 
ID4809630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3757051 
End bp3758307 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content46% 
IMG OID640108613 
ProductSerine-type D-Ala-D-Ala carboxypeptidase 
Protein accessionYP_001039567 
Protein GI125975657 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGTTTGA AAGGAAATAT AAAGATAATA TTTATTTTTT TGTTGTGTTT AATATTGGCG 
GTAAACTTGA ATGAAAGGGT ATACGGAGAT GAATTTGTTG AAGATGAGTT TGCATCAGAG
GTTGTTATTG AGTCTGTAAA TGTGGATGAG AGTGAGGAGT TAAAGCCTCC CAGGATAGAA
GCGAAAGCGG CGATAGTAAT TGATGCCGAT ACCGGGAGGG TGTTGTATGA AAAGGATGCG
TACTCAAGAA GGGCCATAGC CAGTACCACC AAGATAATGA CGGCAATTGT GGCGATAGAG
AACGGTAATC TCGATGACAA GGTCAAAGTC AGCAGCAGGG CTGCCAGTAT CTGGGGTTCC
ACAATCAAGC TCAAACCGGG TGAGGAACTC ACACTAAAGG AGCTTCTTTA CGGTATGATG
CTAAGGTCGG GCAATGATGC GGCATTGGCA GTGGCCGAGC ATGTCGGGGG AACGGTGGAG
AACTTCGTCA AGATGATGAA CGACAAGGCA AGGGAGCTGG GACTTAAAAA TACGGCGTTT
AAAACACCTC ATGGCCTGGA TGTGGAAGGC CACTACTCCA CGGCTTACGA GCTGGCCATG
CTCACCCGGT ATGCTTTACA GAATCCTGTT TTTGCCCAAA TTGTGGCAAC CAAGAGTACG
ACAATCACCA ACCGCAGCTT ATATACAACA AACGAAATGC TTTCTCTGTA TCCCGGTGCG
GACGGGGTAA AGACAGGATA TACGGGAAAG GCAGGAAGAT GCCTTGTGAC ATCGGCAACA
AGGGATGGAT TTAAGATAAT TTCGGTGGTT TTGAACTGCT CAAGCAGAAG TAAAAGGGCG
GAAAGCAGCA AGGCTATTCT TGATTATGCC TTCAACAATT ACAAGCCTTA TGAGCTTTTG
AGGGCAAACC AGGAACTGGG ACGAGTGAAG GTGTACAAGG GAAAGAAGGA TTCGGTGCCC
GTTGTTGCCG TGGAAAGCAT CAAAATGCCT TTGAGCAGGG AAGAAAAGGA GAAACTTAGG
ACGGAGCTTA CATTGGATGA GACAATAAAG GCGCCGGTGT ACAAGGGGGT TGAAGTGGGG
AAGATAGAGT TTTTTGTCGA CGGAAAGCTT ATTGGACGGT CGGCTGTAAA GACGGCCGAA
GCAGTGCCGG AGAAAAGTTA TGGGGATTAT TTCAGGGAGA TTCTGGATAT GTGGTTTAAA
CTGGCAAGAT TAAATCTTTC AGGTGTCTTT GCGAAATCCT TAAATATAAT GCAATAG
 
Protein sequence
MCLKGNIKII FIFLLCLILA VNLNERVYGD EFVEDEFASE VVIESVNVDE SEELKPPRIE 
AKAAIVIDAD TGRVLYEKDA YSRRAIASTT KIMTAIVAIE NGNLDDKVKV SSRAASIWGS
TIKLKPGEEL TLKELLYGMM LRSGNDAALA VAEHVGGTVE NFVKMMNDKA RELGLKNTAF
KTPHGLDVEG HYSTAYELAM LTRYALQNPV FAQIVATKST TITNRSLYTT NEMLSLYPGA
DGVKTGYTGK AGRCLVTSAT RDGFKIISVV LNCSSRSKRA ESSKAILDYA FNNYKPYELL
RANQELGRVK VYKGKKDSVP VVAVESIKMP LSREEKEKLR TELTLDETIK APVYKGVEVG
KIEFFVDGKL IGRSAVKTAE AVPEKSYGDY FREILDMWFK LARLNLSGVF AKSLNIMQ