Gene Cthe_1938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1938 
Symbolddl 
ID4810796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2313294 
End bp2314424 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content41% 
IMG OID640107354 
ProductD-alanyl-alanine synthetase A 
Protein accessionYP_001038349 
Protein GI125974439 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01205] D-alanine--D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00320777 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGATA AAAAAAGAGT ACTGGTTATT TTTGGCGGAC AATCGTCGGA ACACGAAGTT 
TCAAGAATAT CGGCAACATC CATACTGAAG AACATTAATT TGGATAAATT CGATGTTTCA
ATGATAGGAA TCACAAAAGA CGGAAAGTGG CTTTATTATG ACGGGCCTAT TGATAAAATT
CCTTCCGGAG AGTGGGAGGA AATCGCACTG AAAGATGGGA CAAGAAGTAT TGCCGACAGA
GTGAGCCTGT TCGACAACAT AATAAGTTGC AAAAATAACG CATGCGGCCT TGAAAAGGCC
TCAGAGAACG AAAAAAGCAA AAAGATAGAT GTGGTTTTTC CGGTTCTGCA CGGCTGCAAC
GGTGAAGACG GGACCATCCA GGGACTTTTT GAACTGGCGG GCATTCCTTA TGTGGGCTGC
GGTGTGCTGG CTTCAGCAGT CGGAATGGAT AAGATTTATG CAAAGATAAT TTTTGAAAAA
GCCGGAATAC CCCAGGCGGA TTATCTGTAT TTCACAAGAA AAGAAATTTA CGGGGATGTT
GAGGGTGTGG TTGACAAAAT AGAGGAGAAA TTTTCATATC CTGTATTTGT AAAACCGTCC
AATGCCGGTT CTTCCGTAGG TGTGTCAAAG GCGCATGATA AAAATGAGCT TAAAGAGGCA
TTGATTTATG CCGCCAGGTA TGATAGAAAA GTACTGATTG AGGAATTTAT CAACGGAAGA
GAAGTTGAGT GTGCCGTGCT GGGGAATGAT GATCCTGTGG CATCAACGGT GGGAGAAATC
ATTCCGGGAA ATGAATTTTA CGACTACAAG GCAAAATACA TTGAAAATAC TTCCAAAATA
AAAATTCCCG CGGATCTTCC AGAAGAGACC GTGGAACAAA TAAGAAATTA TGCAGTAAAG
GCATTCAAGG CTTTGGATTG TTCGGGACTT GCAAGAGTTG ACTTTTTTGT GCACAAGGAA
ACCGGAAAAG TTTATATAAA TGAAATTAAT ACAATGCCGG GATTTACAAG TATAAGCATG
TATCCCATGC TTTGGGAGGA ATCCGGCATT TCCTATCCGG AACTTATTGA AAAGCTGATT
GACTTGGCTG TTCAAAGATA CAATGACAAT CTCAAAGAAT ATGATGAGTA G
 
Protein sequence
MGDKKRVLVI FGGQSSEHEV SRISATSILK NINLDKFDVS MIGITKDGKW LYYDGPIDKI 
PSGEWEEIAL KDGTRSIADR VSLFDNIISC KNNACGLEKA SENEKSKKID VVFPVLHGCN
GEDGTIQGLF ELAGIPYVGC GVLASAVGMD KIYAKIIFEK AGIPQADYLY FTRKEIYGDV
EGVVDKIEEK FSYPVFVKPS NAGSSVGVSK AHDKNELKEA LIYAARYDRK VLIEEFINGR
EVECAVLGND DPVASTVGEI IPGNEFYDYK AKYIENTSKI KIPADLPEET VEQIRNYAVK
AFKALDCSGL ARVDFFVHKE TGKVYINEIN TMPGFTSISM YPMLWEESGI SYPELIEKLI
DLAVQRYNDN LKEYDE