Gene Rleg_3217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3217 
Symbol 
ID8015776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3219891 
End bp3220895 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content58% 
IMG OID644825778 
Productpolysaccharide pyruvyl transferase 
Protein accessionYP_002977005 
Protein GI241205909 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5039] Exopolysaccharide biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCC AATCCAGAAC GGAGCTGATC GCAAAACTTA ATGGTATGAT CCACGATTGC 
CTGAAAGACT ATGTCTCACG CGATGAACCG CTTGCGATCC TCGATTTTCC CGACATCCGC
AATTGCGGCG ACTCCGCAAT CTGGATGGGC GAAATGGCCT ATCTGAAGGA TCGCTTCGGC
AAACGCCCGG ACTATGTTTC GAGAACGACG GACTTCTCCG CAGATGAGCT GAAAAAGCGC
GTGCCGACCG GTCCGATTTT CATTCATGGT GGCGGCAATT TCGGCGACAT CTGGGTTTCC
CACCAGGATT TCCGCGAAGC CATCATGGAG CGTTTCCCGG ATCGGCAGAT CGTCCAGTTT
CCGCAGTCTA TTCACTACAG TTCACCTGAG CGCATCGAGC AATCCGCACG CGCAATCGCG
CGTCATAAGA ATTTCGTGCT GCTCGTGCGC GACGAAGAAT CGAAGGAATT TTCCGAGAAG
CACTTCGACT GCACCGTACG GCTATGCCCC GACATGGCCT TCGCCATCGG GCCGCTGCCG
GACAGGGCAA CGCAGATTTC CGTCCTCGCC ATGCTGCGAG AGGATGCGGA ACGGGTTGGC
GGCACCGATC GCAAGATCCC TTCTGATATT CCTGTGGAGG ACTGGATCAC CGAGTCGAAA
CGCAAGGTTG ATATCGCCAA GAAATTGGGG GCAGCTTCGG CTTTCCTGGC ACTGAAGCCG
AGCGAAGTGG CATTGCGCAA GCTGGACGCG GCGGCGCACA ACCGCTTCGA GCGCGGCATC
AGCCAGATTT CACGCGCCCG CGCCATCGTC ACCGATCGCC TGCACGTTCA TATCTGCTCG
CTGCTGCTTG GTCGGCCGCA TGCCGTATTG GACAACAGCT ACGGAAAGAT CCGCCGCTTC
ATGAATGCCT TCTCCGGCGG GACGGACCTT TCTTACAAGG CAACGTCGCT TGAAGATGGA
ATCGAATGGG CGCGTCACCA GGCCGGGCAA GGAGCTCGCG GATGA
 
Protein sequence
MTSQSRTELI AKLNGMIHDC LKDYVSRDEP LAILDFPDIR NCGDSAIWMG EMAYLKDRFG 
KRPDYVSRTT DFSADELKKR VPTGPIFIHG GGNFGDIWVS HQDFREAIME RFPDRQIVQF
PQSIHYSSPE RIEQSARAIA RHKNFVLLVR DEESKEFSEK HFDCTVRLCP DMAFAIGPLP
DRATQISVLA MLREDAERVG GTDRKIPSDI PVEDWITESK RKVDIAKKLG AASAFLALKP
SEVALRKLDA AAHNRFERGI SQISRARAIV TDRLHVHICS LLLGRPHAVL DNSYGKIRRF
MNAFSGGTDL SYKATSLEDG IEWARHQAGQ GARG