Gene Namu_0951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0951 
Symbol 
ID8446543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1044546 
End bp1045556 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content78% 
IMG OID645040087 
ProductMandelate racemase/muconate lactonizing protein 
Protein accessionYP_003200350 
Protein GI258651194 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID[TIGR01927] o-succinylbenzoic acid (OSB) synthetase 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGAGG ACCGCGGGCC GGCCGATTCG CCCGCCGAGC TGCTGGCCGG GGCCCGCGTG 
TTCTCGGTGC CGATGGTGCA CCGGTTCCGT GGCGTGACGG TGCGCGAGGG CGTGCTGCTG
CCCGGGCCGG CCGGCTGGGG CGAGTTCGCG CCGTTCCGCG ACTACTCCGA CGAGCAGTGC
GTGCCGTGGC TGCGGGCGGC GATCGAGTCG GCCACCGTCG GCTGGCCCGA GCCGCGGCGC
GACCGGGTGC CGGTCAACAT CATCGTCCCG GTCATCGACC CGGCCCGGGC GGCGGATCTG
GTCGCCGCGT CGGGGTGCCG GACGGCCAAG GTCAAGGTGG CCGACCCGGG GATGGAACTG
GCGGCCGACC TGGACCGGGT GGCGGCGGTG CGCCGGGCGC TCGGCCCGAC CGGGGCGATC
CGGATCGACG CCAACGGGGC CTGGACCGTG GCCGAGGCGA TCCCGGCCAT CGCCCGGCTG
GACGAGGCCG CCGAGGGGCT GGAGTACGTC GAGCAGCCCT GCCGCACCCT GGCCGAGCTG
GCCGAGCTGC GGCTGCGGGT CCGGCCCAAG ATCGCCGCCG ACGAGTCGAT CCGCGGGGCC
GCCGACCCGG CCGCGGTGCG GCTGGCCGAC GCGGCCGACG TGGCCATCGT CAAGGTCGCG
CCGCTGGGCG GGGTGCGCGC GGCGCTGCGG ATCGCGCAGG GCGCCGGAGT GCCCGCGGTG
GTGTCCTCGG CGGTGGACAG CGCGGTCGGG CTGACCGCCG GCATCGCCCT GGCCGCGGCG
TTGCCCGAGT TGCCCTACGC CTGCGGGCTC GGCACGGGCG TGCTGCTGGC CGACGACGTG
TGCGGCCGGC CGCCCCGGCC CGTGGACGGG GCGCTCGAGG TCGTCACCCG GGCACCGGAT
CCGGATCGGA TCGACGAGGT GGCCGCCGGC GCCGAGGCGA CGGCGTACTG GCGGGACCGG
CTGCGCCGGG TGGCGCTGAT CACCCAGGCG CTCACGCAGG GCAGGACTTA G
 
Protein sequence
MTEDRGPADS PAELLAGARV FSVPMVHRFR GVTVREGVLL PGPAGWGEFA PFRDYSDEQC 
VPWLRAAIES ATVGWPEPRR DRVPVNIIVP VIDPARAADL VAASGCRTAK VKVADPGMEL
AADLDRVAAV RRALGPTGAI RIDANGAWTV AEAIPAIARL DEAAEGLEYV EQPCRTLAEL
AELRLRVRPK IAADESIRGA ADPAAVRLAD AADVAIVKVA PLGGVRAALR IAQGAGVPAV
VSSAVDSAVG LTAGIALAAA LPELPYACGL GTGVLLADDV CGRPPRPVDG ALEVVTRAPD
PDRIDEVAAG AEATAYWRDR LRRVALITQA LTQGRT