Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_0951 |
Symbol | |
ID | 8446543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 1044546 |
End bp | 1045556 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 645040087 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_003200350 |
Protein GI | 258651194 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR01927] o-succinylbenzoic acid (OSB) synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGAGG ACCGCGGGCC GGCCGATTCG CCCGCCGAGC TGCTGGCCGG GGCCCGCGTG TTCTCGGTGC CGATGGTGCA CCGGTTCCGT GGCGTGACGG TGCGCGAGGG CGTGCTGCTG CCCGGGCCGG CCGGCTGGGG CGAGTTCGCG CCGTTCCGCG ACTACTCCGA CGAGCAGTGC GTGCCGTGGC TGCGGGCGGC GATCGAGTCG GCCACCGTCG GCTGGCCCGA GCCGCGGCGC GACCGGGTGC CGGTCAACAT CATCGTCCCG GTCATCGACC CGGCCCGGGC GGCGGATCTG GTCGCCGCGT CGGGGTGCCG GACGGCCAAG GTCAAGGTGG CCGACCCGGG GATGGAACTG GCGGCCGACC TGGACCGGGT GGCGGCGGTG CGCCGGGCGC TCGGCCCGAC CGGGGCGATC CGGATCGACG CCAACGGGGC CTGGACCGTG GCCGAGGCGA TCCCGGCCAT CGCCCGGCTG GACGAGGCCG CCGAGGGGCT GGAGTACGTC GAGCAGCCCT GCCGCACCCT GGCCGAGCTG GCCGAGCTGC GGCTGCGGGT CCGGCCCAAG ATCGCCGCCG ACGAGTCGAT CCGCGGGGCC GCCGACCCGG CCGCGGTGCG GCTGGCCGAC GCGGCCGACG TGGCCATCGT CAAGGTCGCG CCGCTGGGCG GGGTGCGCGC GGCGCTGCGG ATCGCGCAGG GCGCCGGAGT GCCCGCGGTG GTGTCCTCGG CGGTGGACAG CGCGGTCGGG CTGACCGCCG GCATCGCCCT GGCCGCGGCG TTGCCCGAGT TGCCCTACGC CTGCGGGCTC GGCACGGGCG TGCTGCTGGC CGACGACGTG TGCGGCCGGC CGCCCCGGCC CGTGGACGGG GCGCTCGAGG TCGTCACCCG GGCACCGGAT CCGGATCGGA TCGACGAGGT GGCCGCCGGC GCCGAGGCGA CGGCGTACTG GCGGGACCGG CTGCGCCGGG TGGCGCTGAT CACCCAGGCG CTCACGCAGG GCAGGACTTA G
|
Protein sequence | MTEDRGPADS PAELLAGARV FSVPMVHRFR GVTVREGVLL PGPAGWGEFA PFRDYSDEQC VPWLRAAIES ATVGWPEPRR DRVPVNIIVP VIDPARAADL VAASGCRTAK VKVADPGMEL AADLDRVAAV RRALGPTGAI RIDANGAWTV AEAIPAIARL DEAAEGLEYV EQPCRTLAEL AELRLRVRPK IAADESIRGA ADPAAVRLAD AADVAIVKVA PLGGVRAALR IAQGAGVPAV VSSAVDSAVG LTAGIALAAA LPELPYACGL GTGVLLADDV CGRPPRPVDG ALEVVTRAPD PDRIDEVAAG AEATAYWRDR LRRVALITQA LTQGRT
|
| |