Gene Rleg_1310 details

Gene Information

Locus tag: Rleg_1310
Symbol:
ID: 8012409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1290058
End bp: 1291620
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 61%
IMG OID: 644823892
Product: exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
Protein accession: YP_002975141
Protein GI: 241204045
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
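
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming 1-based inclusive coordinates (the record does not state its convention) and assuming the last codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1290058
END_BP = 1291620
GENE_LENGTH_BP = 1563
PROTEIN_LENGTH_AA = 520


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


print(span_length(START_BP, END_BP))   # 1563
print(protein_length(GENE_LENGTH_BP))  # 520
```

Under these assumptions both results agree with the Gene Length and Protein Length fields reported in the record.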


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGC TGGAAAAAGG CGATCAATTC GACGTGGACG CACTGCGCAA GCAGGTTTCC 
GACATCGAGG TGCGCGGCGA AGCGGGTCAG GAGAAGCCTT CCGATCCGAC TGAGATCAAC
CCCTATGCCC GGCAGATCGC CGAACAGTTC CGTGACGGAA CCCGTTCGCC AACGATCATC
ATCGGGCAAT TGCGGCTGCT CGAATTCCTG GCGCTCTTTG CGATAGCTCT GATCACCTAT
TATTTCAGCC CAGGCGACGG CGGCGATTCC CCGTTGATGC GGGCGGGCAT GGCGGCGATT
GCCTCGGCCT TGACCGTCAT CAGCCTGCAG CTTGCCGATA CTTACACAAT TCCGGCACTT
CGCTCCAAGC TGCGTCTGAT ACCGCGCATC CTCGTTGCGT GGACGATCGC GTTTGTCCTC
ACCACCGGCC TGTTCGCACT TCTTCGCGGC ACGACATCGG CGATGGTCGA TGCCTATATC
GTCTGGTTTG CTGCGGGCGC TCTCTTCCTC GCGGCCGAGC GCTTCCTCGT CGCCTACGGC
ATCCGCAACT GGGCGCGCAA CGGCATCATG GAACGCCGTG CCGTCATCGT CGGCGGCGGC
GAGCCGGCCA AGGATCTGAT CCGCGTCCTC GAACAGCAGG CCGACAACGA CATCCGCATC
TGCGGCATCT TCGACGACCG CGGCGAGAAA CGCTCGCCGA TCATGGTCGC CGGTTATCCG
AAGCTCGGCA CCGTCGCCGA ACTCGTCGAA TTCGTGCGGC TAACGCGCAT CGACATGCTG
ATCATCGCCC TGCCGCTGTC GGCCGAGGCC CGTATCTACG ACCTTTTGAA GAAGCTTTGG
GTTCTGCCGG TCGATATCCG CCTTGCCGCG CATGCCAACC GGCTGCGCTT CCGGCCGCGC
GCCTATTCGC ATGTCGGCTC GGTGCCGATG CTCGATATTT TCAAGAAGCC GATCCGCGAC
TGGGATTCCG TCGCCAAGCG CGGTTTCGAC ATCTTCTTCA CGATCGTCGC ACTTGCGCTG
CTCTGGCCGA TCATGGTCGC GACCGCGATC GCCATCAAGG CAACGTCGGA AGGCCCGATC
TTCTTCATGC AGAAGCGCCA CGGCTTCAAC AATGAAATCA TCAACGTCTT CAAGTTCCGC
TCGATGTACA CCAACATGGC CGACCCCACC GGCAAGGCAG CGGTGACCAA GGGCGATCCG
CGTGTCACCC GCGTCGGCCG CTTCATCCGC AAGACCTCGA TCGATGAACT GCCGCAGCTT
TTCAACGTGC TGAGGGGAGA TCTCTCGCTC GTCGGCCCGC GTCCGCACGC CGTTCTCGCT
CAGGCGCGCG ACCGCGCCTT CGGCGATGTC GTCGAGGGTT ATTTCGCCCG CCATCGCGTC
AAGCCGGGCG TCACCGGCTG GGCGCAGATC AACGGCTGGC GCGGCGAAGT CGACAATGAC
GAGAAGATCA AGTTCCGCAC GGCCTACGAC CTCTATTACA TCGAGAACTG GTCGCTCTGG
TTCGATCTCA AGATCCTGTT CCTGACACCG ATCCGGCTGC TCAACACGGA AAACGCCTAT
TGA
 
Protein sequence
MNKLEKGDQF DVDALRKQVS DIEVRGEAGQ EKPSDPTEIN PYARQIAEQF RDGTRSPTII 
IGQLRLLEFL ALFAIALITY YFSPGDGGDS PLMRAGMAAI ASALTVISLQ LADTYTIPAL
RSKLRLIPRI LVAWTIAFVL TTGLFALLRG TTSAMVDAYI VWFAAGALFL AAERFLVAYG
IRNWARNGIM ERRAVIVGGG EPAKDLIRVL EQQADNDIRI CGIFDDRGEK RSPIMVAGYP
KLGTVAELVE FVRLTRIDML IIALPLSAEA RIYDLLKKLW VLPVDIRLAA HANRLRFRPR
AYSHVGSVPM LDIFKKPIRD WDSVAKRGFD IFFTIVALAL LWPIMVATAI AIKATSEGPI
FFMQKRHGFN NEIINVFKFR SMYTNMADPT GKAAVTKGDP RVTRVGRFIR KTSIDELPQL
FNVLRGDLSL VGPRPHAVLA QARDRAFGDV VEGYFARHRV KPGVTGWAQI NGWRGEVDND
EKIKFRTAYD LYYIENWSLW FDLKILFLTP IRLLNTENAY
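
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal Python sketch of that step, together with a GC-fraction calculation and a codon split. The helper names are hypothetical, and the demo input is only the first line of the gene sequence, so the GC value it prints is not expected to equal the whole-gene figure in the record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str) -> list[str]:
    """Split a sequence into complete codons, dropping any trailing bases."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First line of the gene sequence above, used as a short demo input.
first_line = "ATGAACAAGC TGGAAAAAGG CGATCAATTC GACGTGGACG CACTGCGCAA GCAGGTTTCC"
seq = join_blocks(first_line)

print(len(seq))                   # 60
print(codons(seq)[:5])            # ['ATG', 'AAC', 'AAG', 'CTG', 'GAA']
print(f"{gc_fraction(seq):.2f}")  # GC fraction of this 60-base excerpt only
```

The first five codons (ATG, AAC, AAG, CTG, GAA) encode M, N, K, L, E, which matches the start of the protein sequence listed above.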