Gene Smed_5844 details

Gene Information

Locus tag: Smed_5844
Symbol:
ID: 5320146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 807741
End bp: 808937
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 60%
IMG OID: 640777539
Product: N-acylglucosamine 2-epimerase
Protein accession: YP_001314471
Protein GI: 150377876
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2942] N-acyl-D-glucosamine 2-epimerase
TIGRFAM ID:
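
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The Python sketch below is only an illustration: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a complete CDS that ends with a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(807741, 808937)
print(length, protein_length_aa(length))  # prints: 1197 398
```

Under these assumptions the computed values match the listed Gene Length (1197 bp) and Protein Length (398 aa).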


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.742486
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACCAT TCCCCGATTT CCGCTCGAAA GACTTCCTGC TCGCCCATAT GCGCGAGATC 
ATGGATTTCT ATCATCCGAT CTGCCTCAAC AGGGAACACG GCGGCTACCA TAACGAATAT
CGTGACGACG GCTTCATCAC CGATCGGAAG ACCCAGCACC TCGTCTCAAC CACCCGCTTC
ATCTTCAACT ATGCGACAGC CTCCGTCCTC TTCCAGAGGC CGGATTACGC AGAGGCGGCT
GCCCATGGCG TCAGATATCT CGACGAGGTC CACCGCGATT CCGAGCACGG CGGCTATTTC
TGGGTAATGC ACGGGCGCGA GGCAGCGGAT ACGACGAAGC ACTGCTACGG CCACGCCTTC
GTTTTGCTCG CTTACGCGGC CGCCATGAAG GCCGGTATTC CAGGCATGGG CGCGCGGATT
TCGGACACAT GGGGCCTTCT CGAAAACCGC TTCTGGGAGC CGGAGCGCGA GCTCTACAAG
GACGAGATCA GCCGCGACTG GCAGAAGATC TCGCCCTATC GGGGCCAGAA CGCCAACATG
CATATGACAG AGGCGATGCT GGCGGCCTAT GAGGCGACCG GTGAGATTCG CTATCTCGAC
CGTGCCGAAA CGCTCGCCCG GCGTATCTGT GTGGAACTTG CCGCCACCGC TCAAGGTGTG
GTCTGGGAGC ATTACCGCGC GGACTGGTCG ATCGACTGGG ATTACAACAA GGACGATCCG
AAGCACCTGT TCCGACCCTA CGGCTATCTG CCAGGCCATA TGACGGAATG GACCAAGCTG
CTGCTGATCC TCGAGCGCTA CCGACCGCAG GACTGGATCC TGCCGAAAGC CATTCTCCTC
TACGAGACGG CCCTGGCAAA CAGCGCCGAT CTCGAATTCG GGGGCATGCA TTACACTTAC
GGTCCGGACG GAAGGCTCTA CGATCCCGAT AAGTATCATT GGGTCCATTG CGAAACGCTG
GCCGCCGCGG CAGCACTTGC CGGGCGCACC GGCCAGGAGC GTTACTGGCA GGATTACGAC
AGGCTCTGGC GCTACAGCTG GCGGCACCTG ATCGACCATG AATATGGCTG CTGGTTCCGC
ATACTCTCGC CGGAGGGCGT GAAGCAGAGC GATATCAAAA GCCCTTCGGG CAAGACCGAC
TACCATCCAT TCGGGGCCTG CTACGAAATT CTGCGCGTGC TTGGGGAAGC GAAGTAG
 
Protein sequence
MRPFPDFRSK DFLLAHMREI MDFYHPICLN REHGGYHNEY RDDGFITDRK TQHLVSTTRF 
IFNYATASVL FQRPDYAEAA AHGVRYLDEV HRDSEHGGYF WVMHGREAAD TTKHCYGHAF
VLLAYAAAMK AGIPGMGARI SDTWGLLENR FWEPERELYK DEISRDWQKI SPYRGQNANM
HMTEAMLAAY EATGEIRYLD RAETLARRIC VELAATAQGV VWEHYRADWS IDWDYNKDDP
KHLFRPYGYL PGHMTEWTKL LLILERYRPQ DWILPKAILL YETALANSAD LEFGGMHYTY
GPDGRLYDPD KYHWVHCETL AAAAALAGRT GQERYWQDYD RLWRYSWRHL IDHEYGCWFR
ILSPEGVKQS DIKSPSGKTD YHPFGACYEI LRVLGEAK
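
The relationship between the gene sequence and the protein sequence can be checked by stripping the 10-base grouping and translating codon by codon. The following Python sketch is a minimal illustration, not the method used to produce this record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It relies on translation table 11 sharing the codon-to-amino-acid assignments of the standard genetic code, and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (T, C, A, G at each position);
# table 11 uses the same assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAGACCAT TCCCCGATTT CCGCTCGAAA GACTTCCTGC TCGCCCATAT GCGCGAGATC"
print(translate(first_line))  # prints: MRPFPDFRSKDFLLAHMREI
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.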