Gene Rleg2_1222 details

Gene Information

Locus tag: Rleg2_1222
Symbol:
ID: 6979943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1233838
End bp: 1235403
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 62%
IMG OID: 643395936
Product: exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
Protein accession: YP_002280742
Protein GI: 209548825
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase; [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
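
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, but one that matches the listed gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1233838, 1235403)
print(length, protein_length(length))  # prints: 1566 521
```

The 1566 bp correspond to 522 codons, of which 521 encode amino acids and the last is the stop codon.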


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.957103
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGC TGGAAAAAGG CGATCAATTC GACGTGGAAG CACTGCGCAA GCAGGTCTCC 
GACATCGAAG TGCGCGGCAA TGCGGGTCAG GAAAAATCGT CCGACCCGAC TGAAATCAAT
CCCTATGCCC GGCAGATCGC CGAACAGTTC CGTGACGGAA CCAAGTCGCC GGTGATCATC
ATCGGGCAGT TGCGGCTGCT GGAATTCCTG GCGCTCTTCG CGATCGCCCT GATCGCCCAC
TATTTTTGGC CCGGCGACGG CTACGATTCA TCGCTGGTGC GAACCGGCAT GGCGGCGATC
GCCTCTGCCC TGACGGTCAT CGGCCTGCAG CTTGGCGACA CCTACACGAT TCCGGCACTT
CGGGCCAAGC TGCGGCTGAT CCCCCGCATC CTGGGCTCGT GGACGGTCGC GCTCCTCCTG
ACCGTCGGCC TGTTCGCGCT GGTTCGCGGC AGCACATGGG CGATGGTCGC CGACGCCTAT
CTGCCCTGGT TTGCGTCTGG CGCGCTCTTC CTGGCGGCCG AGCGTTTCCT CGTCGCCTAC
GGCATCCGCA ACTGGGCGCG CAACGGCATC ATGGAGCGCC GCGCCGTGAT CGTCGGCGGC
GGCGAGCCGG CCAAGGAACT GATCCGCATC CTCGAGCAGC AGGATGACAA TGACATCCGC
ATCTGCGGCA TCTTCGACGA TCGCGGCGAA AAGCGCTCGC CGATCATGGT CGCCGGTTAT
CCGAAGCTCG GCACCGTGGC CGAACTGGTC GAATTCGTGC GATTGACGCG CATCGACATG
CTGATCATCG CTCTGCCGCT TTCGGCCGAG GCGCGTATCT ACGACCTTTT GAAGAAGCTC
TGGGTTCTGC CGGTCGATAT CCGCCTTGCC GCGCATGCCA ACCGGCTGCG CTTCCGGCCG
CGCGCCTATT CGCATGTCGG CACGGTGCCG ATGCTCGACA TTTTCAAGAT GCCGATCCGC
GACTGGGATT CCGTTGCCAA GCGCGGCTTC GACATCTTCT TCACCGTAAT CGCGCTGGCG
CTGCTCTGGC CGCTGATGGT CGCGACCGCC ATCGCCATCA AGGTGACCTC CGAAGGCCCG
GTTTTCTTCA TGCAGAAGCG CCACGGCTTC AACAATGAAA TCATCAACGT CTTCAAGTTC
CGTTCGATGT ACACCAACAT GTCCGACCCC ACGGGCAAGG CTGCGGTGAC CAAGGGCGAT
CCGCGCGTCA CCCGCGTCGG CCGCTTCATC CGCAAGACCT CGATCGACGA ATTGCCGCAG
CTTTTCAACG TGCTGAGGGG CGAGCTTTCG CTGGTCGGGC CGCGCCCGCA TGCCGTTCTC
GCCCAGGCCC GTGACCGCGC CTTCGGCGAT GTCGTCGAGG GCTATTTCGC CCGCCACCGC
GTCAAGCCAG GGGTCACCGG CTGGGCGCAG ATCAACGGCT GGCGCGGCGA AGTGGACAAT
GACGAGAAGA TCAAGTTCCG CACCGCCTAC GACCTCTATT ACATCGAGAA CTGGTCGCTC
TGGTTCGATC TCAAGATCCT GTTCCTGACG CCGATCCGGC TGCTCAACAC GGAAAATGCC
TATTGA
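
The GC content listed in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as text in the 10-base block layout used here, so whitespace has to be removed first. The `gc_content` helper is hypothetical, not part of any database tooling.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Example on the first row of the gene sequence only; pass the full
# sequence text to obtain the value for the whole gene.
first_row = "ATGAACAAGC TGGAAAAAGG CGATCAATTC GACGTGGAAG CACTGCGCAA GCAGGTCTCC"
print(f"{gc_content(first_row):.0%}")
```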
 
Protein sequence
MNKLEKGDQF DVEALRKQVS DIEVRGNAGQ EKSSDPTEIN PYARQIAEQF RDGTKSPVII 
IGQLRLLEFL ALFAIALIAH YFWPGDGYDS SLVRTGMAAI ASALTVIGLQ LGDTYTIPAL
RAKLRLIPRI LGSWTVALLL TVGLFALVRG STWAMVADAY LPWFASGALF LAAERFLVAY
GIRNWARNGI MERRAVIVGG GEPAKELIRI LEQQDDNDIR ICGIFDDRGE KRSPIMVAGY
PKLGTVAELV EFVRLTRIDM LIIALPLSAE ARIYDLLKKL WVLPVDIRLA AHANRLRFRP
RAYSHVGTVP MLDIFKMPIR DWDSVAKRGF DIFFTVIALA LLWPLMVATA IAIKVTSEGP
VFFMQKRHGF NNEIINVFKF RSMYTNMSDP TGKAAVTKGD PRVTRVGRFI RKTSIDELPQ
LFNVLRGELS LVGPRPHAVL AQARDRAFGD VVEGYFARHR VKPGVTGWAQ INGWRGEVDN
DEKIKFRTAY DLYYIENWSL WFDLKILFLT PIRLLNTENA Y
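
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The sketch below is illustrative only: it builds the codon table from the conventional TCAG ordering and relies on translation table 11 using the same amino acid assignments as the standard code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are assumptions made for this example.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGAACAAGC TGGAAAAAGG C"))  # prints: MNKLEKG
print(translate("GAAAATGCCTATTGA"))          # prints: ENAY
```

The first call covers the first seven codons of the gene and the second covers its last five, including the TGA stop codon. The results match the start (MNKLEKG) and end (ENAY) of the protein sequence above.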