Gene Rleg2_2968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2968 
Symbol 
ID6981713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3028613 
End bp3029644 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content55% 
IMG OID643397678 
Productpolysaccharide pyruvyl transferase 
Protein accessionYP_002282461 
Protein GI209550544 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5039] Exopolysaccharide biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.41225 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCC AATCCAGATC GGAACTGATT TCGAAACTCA ATGGCATGAT CCACGATTGC 
CTGAGGGATT ACGTGTCACC TGAGGAACCG CTCGCAATTC TCGACTTCCC CGACATCCGC
AATTGCGGGG ACTCGGCAAT CTGGTTGGGC GAAATGGCGT ATCTCAAGGA TCGTTTCGGC
AAACGTCCGG ACTATGTCTC GAGAATACAC GATTTCTCTG CTGATGAACT TAACCAACGC
GTTCCAACAG GTCCGATTTT CATTCATGGT GGTGGCAATT TCGGCGATAT CTGGGTTGCT
CACCAGGATT TCCGTGAAGC GATCATGGAG CGTTTTCCGG ATCGACAGAT CATCCAGTTC
CCGCAGTCGA TCCACTACAG TTCGCCTGAG CGTATCGAGC AGTCCAAACG GGCAATCGGG
CATCACAAGA ATTTCGTTCT GCTCGTGCGG GACGAAGAGT CGAAGGAATT TTCCGAAAAA
CACTTCGACT GCACCGTACG GTTATGCCCT GATATGGCCT TCGCCATCGG CGCCCTTCCG
GACAGAGCAA CTCAATTTTC GGTGCTCGCC ATGTTGCGAG AAGATGCGGA ACGGGTCGGG
AGTTCCGATC GAAAAATCCC CTCAGATATA CCGGTAGAGG ATTGGATATC GGAATCGAAA
CGGAAAGTGG AGATCGCCAA GAAGTTGGGG GCAGCGTCGG CGTTTCTCGC GTTGAAGCCA
AGGGAATTGG CTCTGCGAAC GTTGGACGCA GCGGCGCACA ACCGCTTCGA GCGCGGCATC
AGTCAGATTT CGCGCGCCCG CGCAATTGTC ACCGATCGCC TGCACGTCCA TATCTGCTCG
CTGCTTCTCG GCCGCCCTCA CGCGGTGCTG GACAACAGCT ACGGAAAGAT CCGCCGCTTC
ATGAACGCTT TCTCCGGCGG AACGGATCTT TCATACAAGG CAACCTCACT CGAAGACGGA
ATTGACTGGG CACGTCATCA GGCGGCCAAG ATGCGTGTCG ATGAAAAAGA AGCTGTGAGC
TTGCGGGCCT AG
 
Protein sequence
MTSQSRSELI SKLNGMIHDC LRDYVSPEEP LAILDFPDIR NCGDSAIWLG EMAYLKDRFG 
KRPDYVSRIH DFSADELNQR VPTGPIFIHG GGNFGDIWVA HQDFREAIME RFPDRQIIQF
PQSIHYSSPE RIEQSKRAIG HHKNFVLLVR DEESKEFSEK HFDCTVRLCP DMAFAIGALP
DRATQFSVLA MLREDAERVG SSDRKIPSDI PVEDWISESK RKVEIAKKLG AASAFLALKP
RELALRTLDA AAHNRFERGI SQISRARAIV TDRLHVHICS LLLGRPHAVL DNSYGKIRRF
MNAFSGGTDL SYKATSLEDG IDWARHQAAK MRVDEKEAVS LRA