Gene Smed_0851 details


Gene Information

Locus tag: Smed_0851
Symbol:
ID: 5321689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 909123
End bp: 910766
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 59%
IMG OID: 640789788
Product: undecaprenyl-phosphate galactose phosphotransferase
Protein accession: YP_001326541
Protein GI: 150396074
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
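
The length fields above follow from the coordinates. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions (an assumption, although it is the reading that reproduces the listed 1644 bp) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(909123, 910766)
print(length)                  # 1644
print(protein_length(length))  # 547
```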


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.985925
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.189703
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGGCT GCGTACAACG TAAGCGTTTC TTAGTGGCTT TCTGCTATCC GGACGGAGCA 
ATTCAGCGCC GGAAAGCGAC CATGAACCAG TTCGACAGGC CAGAGACCTT CGACCCCGAA
GCGCTTCGAA AGAAGGTCTC GGAGATCCGG GAGACCACGA CCGAACACAG GAACAGAGCA
AGAGGGGATG CCGGGCTCAA CCCGCTTGCC AGGCAGATCG CCCTTCAGTT CCGCGCGGAC
ACCTATACGC CGGCGATGAT CACCGGCCTG ATCCGGCTTC TGGATTTCTG TGCGCTTTTC
GCCATAGGCT ACGGCATCAA TGCCCAATAT GTCGCGCCGA CGTTGGAGCA ACTGCCGGTC
AACCTTCTTA TCCTCGCCGG CGCGCCCGCG CTCTCCGTTG CCGTCATGCA GTTCGCCGAC
GCCTACCAGG TGCCGGCTCT GCGTGCCTGG CTCAGGATGG CGCCACGCAT TCTGGGCGCC
TGGACGGTGG CCTTCGGCAT GATTGCGCTC GGCCTGTTCT TTCTCAAATC GGGCCACCTC
TACTCGCGTT TCTGGATCGG CGCGTGGTAC CTCGCCGGCG CGGTCTTTCT CATCGCAGAA
CGCGCGTTCA TCGCCTATTC CATCCGCCAC TGGTCGCGAG ACGGCACCAT GGAGCGGCGA
GCCGTCGTCG TCGGCGGCGG CCAGTCTGCA AAGGATCTCA TTCGAAAAAT AGAACATCAG
CCGGACAACG ATATCCGGAT TTGCGGAATT TTCGACGATC GCGACGAGCG CAGGTCACCG
AACGTGATCG CCGGTTATCC GAAGCTCGGA ACGGTCGACG AACTGGTCGA GTTCGCCCGT
CTAGCCCGGA TCGACATGCT GATCATCTCG CTGCCGCTGA CGGCGGAAAA TCGCATCCTC
GAGCTCCTCA GGAAGCTGTG GGTCCTTCCC GTCGATATTC GGCTAGCGGC GCATGCCAAC
AGCCTTCGCT TTCGGCCGCG CAGCTATTCG CATGTCGGAC AGGTTCCGAT GCTGGACATC
TTCGACAAGC CGATCGCCGA TTGGGATAAT GTCGCAAAGC GCTGCTTCGA CGTCTTTTTC
AGTCTCGTCG CACTCGCCCT GCTCTGGCCG GTCATGCTCG CCGCCGCCAT TGCCGTGAAA
GTTACCTCGC CGGGTCCGAT CATCTTCAAA CAGCACCGGC ACGGCTTCAA CAATGAAACC
GTCGAGGTCT ATAAATTTCG CTCCATGTAC ACGCATATGA GCGACCCGAG CGCGCGTAAC
GCCGTAACCA AGAACGACCC GCGAGTGACA CCCGTCGGGC GCTTCCTGCG CAAATCTTCG
ATCGATGAGT TGCCACAATT CTTCAATGTG CTGAAAGGCG AGCTCTCACT CGTTGGGCCG
CGTCCTCACG CGGTGCTCGC ACAGACCCAG GATCGAAGCT ATTCCGATGT CGTCGAAGGC
TATTTCGCCC GCCATCGCGT GAAGCCGGGC GTGACCGGCT GGGCACAGAT CAACGGCTGG
CGCGGTGAAA TCGACAATGA CGAAAAGATC CGGTTCCGGA CGGCATTCGA CCTCTATTAT
ATCGAGAACT GGTCACTGCT CCTCGATCTG AAGATTCTCA TCCTGACACC GTTCCGGCTG
ATCAATACGG AAAACGCCTA TTGA
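
The gene sequence is listed in space-separated blocks of 10 bases, so it has to be joined before any length or composition check. A minimal sketch of that cleanup and of a GC percentage calculation follows. The helper names `clean` and `gc_percent` are hypothetical, and the sample is only the final line of the listing, not the whole gene.

```python
def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq_block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Final line of the gene sequence listing above, used as a small sample.
last_line = clean("ATCAATACGG AAAACGCCTA TTGA")
print(len(last_line))         # 24
print(gc_percent(last_line))  # 37.5
```

Running the same two helpers on the full listing gives values that can be compared against the Gene Length and GC content fields of the record.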
 
Protein sequence
MGGCVQRKRF LVAFCYPDGA IQRRKATMNQ FDRPETFDPE ALRKKVSEIR ETTTEHRNRA 
RGDAGLNPLA RQIALQFRAD TYTPAMITGL IRLLDFCALF AIGYGINAQY VAPTLEQLPV
NLLILAGAPA LSVAVMQFAD AYQVPALRAW LRMAPRILGA WTVAFGMIAL GLFFLKSGHL
YSRFWIGAWY LAGAVFLIAE RAFIAYSIRH WSRDGTMERR AVVVGGGQSA KDLIRKIEHQ
PDNDIRICGI FDDRDERRSP NVIAGYPKLG TVDELVEFAR LARIDMLIIS LPLTAENRIL
ELLRKLWVLP VDIRLAAHAN SLRFRPRSYS HVGQVPMLDI FDKPIADWDN VAKRCFDVFF
SLVALALLWP VMLAAAIAVK VTSPGPIIFK QHRHGFNNET VEVYKFRSMY THMSDPSARN
AVTKNDPRVT PVGRFLRKSS IDELPQFFNV LKGELSLVGP RPHAVLAQTQ DRSYSDVVEG
YFARHRVKPG VTGWAQINGW RGEIDNDEKI RFRTAFDLYY IENWSLLLDL KILILTPFRL
INTENAY
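
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below builds the codon table by hand instead of relying on a library. It uses the fact that translation table 11 assigns the same amino acids as the standard code and differs only in the set of permitted start codons. The function name `translate` is illustrative, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code; only the set of
# permitted start codons differs.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace is ignored. Alternative start codons (for example GTG or TTG)
    are NOT converted to M here, and an ambiguous base such as N raises
    KeyError.
    """
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and final line of the gene sequence listing above.
print(translate("ATGGGCGGCT GCGTACAACG TAAGCGTTTC"))  # MGGCVQRKRF
print(translate("ATCAATACGG AAAACGCCTA TTGA"))        # INTENAY
```

Both fragments agree with the start and end of the listed protein sequence. This gene begins with ATG, so the missing start-codon handling does not affect it.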