Gene Smed_5102 details


Gene Information

Locus tag: Smed_5102
Symbol:
ID: 5319404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 49375
End bp: 50499
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 62%
IMG OID: 640776880
Product: mandelate racemase/muconate lactonizing protein
Protein accession: YP_001313812
Protein GI: 150377217
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
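
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 49375
end_bp = 50499

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1125
print(protein_length)  # 374
```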


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.401813
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCAAAA TATCAGGCAT CCGCGTCCGT CCGCTGGTTC TGCCGCTGAA ACAGCCTTAT 
CACTGGTCCT ACGGCATTCG CGAATCCTTT GCGGTCAACC TTATCGAGAT CGAGGCGGAT
GACGGCTCGG TCGGGATCGG CGAATGCACG GTGGCGCCCG ATCAGGCCGG TACGGCGGCC
ATTCTTCATC GCCTTGCCGG ACATCTCATC GGCCATTCGC CCCATGATGT GGCGCCGCTC
ATCGCGCGCA TCTTCCACCA GGAATATCTC GGGCACGGCG CCAATATCAT GCGTGCGGCC
AATCAGGTGT TCTCCGGTAT CGACATGGCC ATGTGGGATC TGCAGGGCAA GCTCGCCGGC
TTGCCCGTGC ACCAGCTGCT GGGCGGCGCA CACCGGAAGG CGGTCGGCTA TTTCTACTTC
CTCCAGGGAG AAACCGCCGA AGAGCTTGCG CGGGATGCCG CCGCCGGTCG GGCCCGGGGC
GAGCGGGTCT TCTATCTCAA GGTCGGCCGA GGCGAGAAGA CCGACCTGGA GATCACCGCT
GCGGTTCGCC GCGAGATCGG CGACGCGCGC CTTCGCCTGG ACGCGAACGA AGGCTGGAGC
GTGCATGACG CGATCAACAT GTGCCGCAAG CTGGAAAAAT ACGACATCGA GTTCATCGAG
CAGCCGACGG TCAGCTGGAG TATTCCGGCC ATGGCGCATG TCCGCGAGAA GGTCGGTATT
CCGATCGTCG CGGATCAGGC CGCCTTCACG CTCTACGACG TCTATGAGAT ATGCCGGCAG
CGTGCTGCGG ACATGATCTG CATCGGCCCG CGCGAAATCG GCGGGATACA GCCGATGATG
AAGGCGGCAG CCGTGGCGGA GGCCGCCGGG CTGAAGATCT GCATCCATTC CTCCTTCACG
ACCGGCATCA CCACCTGCGC GGAGCACCAT ATCGGGCTTG CCATTCCCAA TCTCGATGAC
GGTAACCAGA TCATGTGGCA GCTCGTTCAG AAGGATATCG TTTCCTCGCC GGATCTGGCG
CCCAGGAACG GCTGGCTCGA TGCCTTCAAG AAGCCGGGAC TGGGCTTCCA ACTCGCCGAA
GACCTGATCG CCGACGGCGA AAGACGCTTT GCGGCGAGCC GATGA
 
Protein sequence
MVKISGIRVR PLVLPLKQPY HWSYGIRESF AVNLIEIEAD DGSVGIGECT VAPDQAGTAA 
ILHRLAGHLI GHSPHDVAPL IARIFHQEYL GHGANIMRAA NQVFSGIDMA MWDLQGKLAG
LPVHQLLGGA HRKAVGYFYF LQGETAEELA RDAAAGRARG ERVFYLKVGR GEKTDLEITA
AVRREIGDAR LRLDANEGWS VHDAINMCRK LEKYDIEFIE QPTVSWSIPA MAHVREKVGI
PIVADQAAFT LYDVYEICRQ RAADMICIGP REIGGIQPMM KAAAVAEAAG LKICIHSSFT
TGITTCAEHH IGLAIPNLDD GNQIMWQLVQ KDIVSSPDLA PRNGWLDAFK KPGLGFQLAE
DLIADGERRF AASR
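
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a minimal example using only the Python standard library, not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGTCAAAA TATCAGGCAT CCGCGTCCGT"
print(translate(first_blocks))  # MVKISGIRVR
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same input can be compared with the GC content field.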