Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1938 |
Symbol | ddl |
ID | 4810796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2313294 |
End bp | 2314424 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107354 |
Product | D-alanyl-alanine synthetase A |
Protein accession | YP_001038349 |
Protein GI | 125974439 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes |
TIGRFAM ID | [TIGR01205] D-alanine--D-alanine ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00320777 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGATA AAAAAAGAGT ACTGGTTATT TTTGGCGGAC AATCGTCGGA ACACGAAGTT TCAAGAATAT CGGCAACATC CATACTGAAG AACATTAATT TGGATAAATT CGATGTTTCA ATGATAGGAA TCACAAAAGA CGGAAAGTGG CTTTATTATG ACGGGCCTAT TGATAAAATT CCTTCCGGAG AGTGGGAGGA AATCGCACTG AAAGATGGGA CAAGAAGTAT TGCCGACAGA GTGAGCCTGT TCGACAACAT AATAAGTTGC AAAAATAACG CATGCGGCCT TGAAAAGGCC TCAGAGAACG AAAAAAGCAA AAAGATAGAT GTGGTTTTTC CGGTTCTGCA CGGCTGCAAC GGTGAAGACG GGACCATCCA GGGACTTTTT GAACTGGCGG GCATTCCTTA TGTGGGCTGC GGTGTGCTGG CTTCAGCAGT CGGAATGGAT AAGATTTATG CAAAGATAAT TTTTGAAAAA GCCGGAATAC CCCAGGCGGA TTATCTGTAT TTCACAAGAA AAGAAATTTA CGGGGATGTT GAGGGTGTGG TTGACAAAAT AGAGGAGAAA TTTTCATATC CTGTATTTGT AAAACCGTCC AATGCCGGTT CTTCCGTAGG TGTGTCAAAG GCGCATGATA AAAATGAGCT TAAAGAGGCA TTGATTTATG CCGCCAGGTA TGATAGAAAA GTACTGATTG AGGAATTTAT CAACGGAAGA GAAGTTGAGT GTGCCGTGCT GGGGAATGAT GATCCTGTGG CATCAACGGT GGGAGAAATC ATTCCGGGAA ATGAATTTTA CGACTACAAG GCAAAATACA TTGAAAATAC TTCCAAAATA AAAATTCCCG CGGATCTTCC AGAAGAGACC GTGGAACAAA TAAGAAATTA TGCAGTAAAG GCATTCAAGG CTTTGGATTG TTCGGGACTT GCAAGAGTTG ACTTTTTTGT GCACAAGGAA ACCGGAAAAG TTTATATAAA TGAAATTAAT ACAATGCCGG GATTTACAAG TATAAGCATG TATCCCATGC TTTGGGAGGA ATCCGGCATT TCCTATCCGG AACTTATTGA AAAGCTGATT GACTTGGCTG TTCAAAGATA CAATGACAAT CTCAAAGAAT ATGATGAGTA G
|
Protein sequence | MGDKKRVLVI FGGQSSEHEV SRISATSILK NINLDKFDVS MIGITKDGKW LYYDGPIDKI PSGEWEEIAL KDGTRSIADR VSLFDNIISC KNNACGLEKA SENEKSKKID VVFPVLHGCN GEDGTIQGLF ELAGIPYVGC GVLASAVGMD KIYAKIIFEK AGIPQADYLY FTRKEIYGDV EGVVDKIEEK FSYPVFVKPS NAGSSVGVSK AHDKNELKEA LIYAARYDRK VLIEEFINGR EVECAVLGND DPVASTVGEI IPGNEFYDYK AKYIENTSKI KIPADLPEET VEQIRNYAVK AFKALDCSGL ARVDFFVHKE TGKVYINEIN TMPGFTSISM YPMLWEESGI SYPELIEKLI DLAVQRYNDN LKEYDE
|
| |