Gene Information |
Locus tag | Cthe_1041 |
Symbol | murD |
ID | 4811338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1244775 |
End bp | 1246175 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106462 |
Product | UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase |
Protein accession | YP_001037466 |
Protein GI | 125973556 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase |
TIGRFAM ID | [TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase |
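
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1244775, 1246175)
print(length_bp)                  # 1401
print(protein_length(length_bp))  # 466
```

Both results agree with the Gene Length and Protein Length fields of the record.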
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000538444 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAATACCA AATTAAATGA CTTTAAAAAG AAAATTAAAA ATAAAAAAGT TGCCGTACTG GGTATTGGAA TCAGCCATAC TCCGTTAATT TCATATTTGT ACCGGCTTGG GGCGGACATA ACGGCTTTCG ACAAAGCGGA TGAGGTTAAG CTTAAGTCCA CATTGGAACA GTTTAAAGGA ATGGATATTA AATATAGCCT GGGCGAAGGT TATCTTGACA ATCTGAAAGG CTTTGACATA TTGTTCAGGA CTCCTGGTAT GAGGTATGAT ATACCTGAGA TTCTTGCCGC AAAGGAGGAA GGAACTGAGG TAACTTCAGA AATGGAAGTA TTTTTTGAAC TTTGCCCGGC CGAAATTTTT GCCGTGACAG GCAGCGACGG TAAGACCACA ACCACCACGC TTATATACAA CATGCTGAAA GAGCAGGGAT ATAAGTGCTG GCTTGGTGGA AATATAGGCA TACCTTTGCT CAGCAAAATT GAAGAGATAA AGGATACGGA CAAAGTTGTC CTGGAGCTTA GCAGTTTTCA GCTGCACACC ATGACTAAAA GCCCGAATGT TGCGGTTGTG ACAAATGTTT CGCCCAACCA CCTGGATGTA CATAAATCCA TGGAGGAGTA TGTATCTGCC AAAAAGAATA TTTTTAGATA CCAGTCGGCG GAGGATAGGC TGGTACTTAA TTTTGACAAT GACATTACCC GGGAATTTGC CGGTGAGGCA AAAGGAGATG TAGTTTATTT CAGCAGAAAA ACCGTACTTG AAAAGGGTGC TATGCTAAAA GACGATATGC TGGTTTTCAG AGACGGAGAA ACTGAAACTG AAATTGCTAA AGCAAGTGAC ATCGTAATTC CGGGAGTCCA TAACGTTGAA AACTTTCTTG CGGCCACCGC CGCGGTAATA GACTGTGTCG ACAGGGATGT CATAAGAAAA GTTGCAACCA CCTTTACCGG GGTTGAACAC AGGATTGAAC TTGTAAGGGA GATCAACGGT GTAAAATTTT ACAATGATTC CATAGCAAGT AGCCCCACCA GAACGATTGC AGGATTGAAT TCATTTAAAG ATAAAGTAAT TCTGATTGCC GGAGGCTATG ATAAAAAAAT ACCTTATGAT GCTTTAGGAC CGGTAATTGC GGAAAAGGTA AAGTGCCTTG TGTTAATTGG ACAGACGGCT CCGAAAATTG AAAAGGTATT AAGGGATGAG ACGGAAAGGT CGGGAAAAGG CTCCGATATT CCGATAAAAA AATGTACCTC TCTTGAGGAA GCCGTTAAGG TGGCTTACCG TTTTGCTTCC GTCGGGGACG TAGTAATTTT GTCTCCTGCA AGCGCCAGCT TTGATATGTT TAAGAATTTT GAAGAGAGGG GCAATAGATT TAAAGAAATT GTCAACTCAA TTGAAGCCTG A
|
Protein sequence | MNTKLNDFKK KIKNKKVAVL GIGISHTPLI SYLYRLGADI TAFDKADEVK LKSTLEQFKG MDIKYSLGEG YLDNLKGFDI LFRTPGMRYD IPEILAAKEE GTEVTSEMEV FFELCPAEIF AVTGSDGKTT TTTLIYNMLK EQGYKCWLGG NIGIPLLSKI EEIKDTDKVV LELSSFQLHT MTKSPNVAVV TNVSPNHLDV HKSMEEYVSA KKNIFRYQSA EDRLVLNFDN DITREFAGEA KGDVVYFSRK TVLEKGAMLK DDMLVFRDGE TETEIAKASD IVIPGVHNVE NFLAATAAVI DCVDRDVIRK VATTFTGVEH RIELVREING VKFYNDSIAS SPTRTIAGLN SFKDKVILIA GGYDKKIPYD ALGPVIAEKV KCLVLIGQTA PKIEKVLRDE TERSGKGSDI PIKKCTSLEE AVKVAYRFAS VGDVVILSPA SASFDMFKNF EERGNRFKEI VNSIEA
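
The gene sequence shown starts with TTG and ends with TGA, so it is already in coding orientation even though the gene lies on the minus strand. The sketch below illustrates how the protein sequence relates to the gene sequence under translation table 11, where an alternative start codon such as TTG is read as methionine in the first position. It is a self-contained illustration, not the database's own pipeline: the codon table is built from the standard codon ordering, and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses the same amino acid
# assignments as the standard code and differs in its allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS)
)

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding-orientation CDS; whitespace in the input is ignored."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        residue = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # an initiating codon is read as methionine
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
fragment = "TTGAATACCA AATTAAATGA CTTTAAAAAG"
print(translate_cds(fragment))  # MNTKLNDFKK
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to the same function would allow the whole 466 aa product to be compared in the same way.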
|
| |
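
The GC content field can in principle be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the fraction of G and C bases over the full gene length (the record does not state how the 42% figure was derived); `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence; whitespace is ignored."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First 30 bases of the gene sequence in this record, used only as a small demo.
fragment = "TTGAATACCA AATTAAATGA CTTTAAAAAG"
print(f"{gc_fraction(fragment):.0%}")  # 20%
```

The 20% here applies only to the short fragment. Running the function on the full 1401 bp gene sequence gives the value to compare with the GC content field.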