Gene Smed_0206 details


Gene Information

Locus tag: Smed_0206
Symbol:
ID: 5321037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 233241
End bp: 234533
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 65%
IMG OID: 640789140
Product: UDP-N-acetylglucosamine 1-carboxyvinyltransferase
Protein accession: YP_001325900
Protein GI: 150395433
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase
TIGRFAM ID: [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase
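
The start, end, gene length, and protein length fields are tied together by simple arithmetic, which the sketch below checks. It assumes 1-based inclusive coordinates (the record does not state its convention) and a single terminal stop codon; the function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Return True if the coordinate and length fields agree with each other."""
    # Assumption: coordinates are 1-based and inclusive, so the span
    # covers end - start + 1 bases.
    span = end_bp - start_bp + 1
    # A complete CDS is a whole number of codons; the last one is a stop
    # codon and contributes no amino acid.
    codons, remainder = divmod(gene_len_bp, 3)
    return span == gene_len_bp and remainder == 0 and codons - 1 == protein_len_aa


# Values copied from the record for Smed_0206.
print(check_cds_lengths(233241, 234533, 1293, 430))
```

For this record the span is 234533 - 233241 + 1 = 1293 bp, which is 431 codons: 430 amino acids plus the stop codon.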


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCGCA TCAGGATTGT AGGCGGAAAT GAACTCCACG GGGTGATCCC CATCTCCGGC 
GCGAAGAACG CCGCCTTGCC GCTGATGATC GCGTCGCTCC TGACCGATGA CACGCTGACG
CTCGAAAATG TGCCGCATCT CGCCGATGTC GAGCAATTGA TCCGCATCCT CGGCAATCAT
GGTGCCGACA TTTCCGTCAA TGGCCGGCGC GAGCGTCAGG GCGAGAGCTA CGCCCGCACG
GTCCATTTCA CCAGCCGCAA CATCGTTTCG ACGACTGCAC CCTATGAGCT CGTCTCGAAG
ATGCGCGCGA GCTTCTGGGT CATCGGGCCG CTGCTCGCGC GTGAGGGCAG GGCGCGCGTG
TCGCTGCCCG GCGGTTGCGC CATCGGAACG CGCCCGGTTG ATCTCTTCAT CGAGGGGCTG
ACCGCGCTTG GCGCCAGCAT TGAGATCGAC GGCGGCTACG TCAATGCAAC GGCACCGGCG
GGCGGGCTCA TCGGCGGGCG TTACACCTTC CCGAAAGTTT CCGTCGGCGC GACCCATGTG
CTGATGATGG CGGCAACGCT TGCCAATGGC ACGACGGTGC TCGGCAACGC CGCGCGTGAG
CCCGAAGTGG TGGACCTTGC CAAATGCCTG AACGCCATGG GCGCGAAGAT CAGCGGCCAG
GGGACGAGCA CGATCACCAT CGAGGGCGTG CGCTCGCTCT CGGGCGCCCG CCACCGGGTG
CTGCCCGATC GCATCGAGAC AGGGACCTAT GCGATGGCCG TCGCCATGGC GGGCGGCGAC
GTCATTCTCG AAGACACCGA GGCGAGCCTC CTCGATACAG CGCTTGAAGC GATCCGCCGC
GCCGGCGCCG AGATCAGCGA CACGAACAAC GGCATCCGGA TCGTCCGCAA CGGCGCCGGC
ATCAGGCCGG TCGACATCGT CACCGATCCC TTCCCCGGCT TCCCGACCGA CCTTCAGGCG
CAGTTCATGG GGTTGATGAC CCGGTCAAGC GGCGTTTCCC ACATCACCGA GACGATCTTT
GAAAACCGCT TCATGCATGT TCAGGAGCTG GCGCGGCTCG GCGCCAAGAT ATCGCTCTCC
GGCCAGACGG CGAAGGTCGA GGGTGTATCG CGGCTGAAGG GCGCACCGGT CATGGCAACG
GACCTCAGGG CTTCCGTCTC GCTCGTCATT GCGGGCCTCG CGGCCGAGGG CGAAACCATG
GTTTCGCGGG TTTACCACCT CGACCGCGGC TTCGAGCGCC TGGAAGAGAA GCTCACGCGT
TGCGGCGCCC ATGTCGAGCG CGTCAGCGAC TGA
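
The GC content figure reported for this gene can be recomputed from the sequence text. The sketch below is a minimal illustration; the helper name `gc_content` is hypothetical. It strips the spaces and line breaks that group the bases into blocks of ten before counting. The example call uses only the first line of the gene sequence, so its result will differ from the whole-gene value; paste in all lines to compare against the record.

```python
def gc_content(seq):
    """Return the percentage of G and C bases in a nucleotide string."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence only (60 of the 1293 bases).
first_line = "ATGGATCGCA TCAGGATTGT AGGCGGAAAT GAACTCCACG GGGTGATCCC CATCTCCGGC"
print(round(gc_content(first_line), 1))
```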
 
Protein sequence
MDRIRIVGGN ELHGVIPISG AKNAALPLMI ASLLTDDTLT LENVPHLADV EQLIRILGNH 
GADISVNGRR ERQGESYART VHFTSRNIVS TTAPYELVSK MRASFWVIGP LLAREGRARV
SLPGGCAIGT RPVDLFIEGL TALGASIEID GGYVNATAPA GGLIGGRYTF PKVSVGATHV
LMMAATLANG TTVLGNAARE PEVVDLAKCL NAMGAKISGQ GTSTITIEGV RSLSGARHRV
LPDRIETGTY AMAVAMAGGD VILEDTEASL LDTALEAIRR AGAEISDTNN GIRIVRNGAG
IRPVDIVTDP FPGFPTDLQA QFMGLMTRSS GVSHITETIF ENRFMHVQEL ARLGAKISLS
GQTAKVEGVS RLKGAPVMAT DLRASVSLVI AGLAAEGETM VSRVYHLDRG FERLEEKLTR
CGAHVERVSD
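
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own method; the names `CODON_TABLE` and `translate` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may start a gene, so the sketch uses the standard assignments. It does not handle alternative start codons such as GTG or TTG, which would be read as M at the first position; this gene starts with ATG, so that case does not arise here.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order for each position.
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq):
    """Translate a nucleotide string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGATCGCA TCAGGATTGT AGGCGGAAAT"))      # MDRIRIVGGN
# Last line of the gene sequence; it begins at base 1261, so it is in frame.
print(translate("TGCGGCGCCC ATGTCGAGCG CGTCAGCGAC TGA"))  # CGAHVERVSD
```

Both outputs match the first and last ten residues of the protein sequence listed above.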