Gene Rleg_5609 details

Gene Information

Locus tag: Rleg_5609
Symbol:
ID: 8016835
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012853
Strand:
Start bp: 191899
End bp: 193065
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 57%
IMG OID: 644827774
Product: glycosyl transferase group 1
Protein accession: YP_002978974
Protein GI: 241518346
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
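
The coordinates and lengths listed above are mutually consistent, assuming the usual convention that the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. A minimal Python sketch of that arithmetic (the variable names are illustrative, not database fields):

```python
# Values copied from the Gene Information fields above.
start_bp = 191899
end_bp = 193065

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an uninterrupted reading frame ending in one stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1  # the stop codon encodes no amino acid

print(gene_length)     # 1167
print(protein_length)  # 388
```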


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.000732254
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGTTGCGA TGCAAGCACC AAACATACCA CGCCGAATAG TTGTCATCAG CGACTTCCAG 
AACGGCGATT GGCGCGACGC CCAATTCAAG ACGATAGCCT GGTTTCGGCA GCAATGGTTG
TCGGGAAATC GCGGGCCTCT CCCAGAGATG ACCGTCACCC CGAGACAGAA CGTCGAGGTC
GTGGGGGGCT ACATCGAGCG GCTCCCTCTT GCAGCTCTCA AAGGAGGCCT TGCCGACCGC
GCCGAAATTT GGACCCACAG CCGAGGTGCA TCGCCAAAGA TGCATCGGCC CGATGGCAGT
CCGTTCTTGA CGCGCCGGTC GTTCCAGATG AATGGGCCTG AAGCGCCATT CGCATCAAAT
GATATGCTGG GCCATATCGA GGCTTTCGGA CCACCGTCAA TCTTGTGCGT ATGGGGCCTT
GGCGTTAGCG AAGACATATT GCTGGCCTGC CCCGACAGTT TCAAAATTTA CAATTCGATC
GACGCACCGG CGTTGCGCGT GCCATCTGAG GTGAGCCGCC ATTTCGACCT TATTCTCACC
GGCGCCGCCT GGCAGTCGGA AGCAGTGCGC GTTTTATATC CCGACAAGCG CGTCGCCGTT
ATGCCGATTG GACCGGAGTT TGCTTCTGAG GTAACGTTCA GGCCTCTGGG TCTGGAAAAG
ATCTATGACG TGATCTACGT CGCTGCTGCT CAGGCTTACA AGCGGCATGA CATTCTGTTC
AACGCGCTGA GCCAACTGCC CCGTTCGCTG CGTGCATTGT GCGTCTGCGG CTATGGCGAG
ATGATGGAAG CGCTACGCCG GCACGCAGGA GAACTCAACA TCGACGTCGA TTTCATCGAT
CCGCCCGGTG TACCATTTGC CGAAGTGAAC AGGCTTATGA ACCAGGCCCG GATCGGCGTC
GTTTGCGGCG TCGATGATGG CGCACCGGCC ATTTTGACGG AGTATATGCT GGCTGGAATA
CCCGTTCTTG CAAACAGTGA GCTGAGGTGC GGACTGCAAT ACATCACGCC GAAGACGGGG
CGCGCTGCCT CGGCCGATGA ATTTCACGCG GGTATCCGCG ACATGCTTGG AGGGCTGCAA
AGCTTCGATC CACGTCAGGT CGTCTTGGAT AACTGGACAT GGCCGCATAG CCTCAGGACG
CTCAAGAGCC TTATCGAGAT AACTTAG
 
Protein sequence
MVAMQAPNIP RRIVVISDFQ NGDWRDAQFK TIAWFRQQWL SGNRGPLPEM TVTPRQNVEV 
VGGYIERLPL AALKGGLADR AEIWTHSRGA SPKMHRPDGS PFLTRRSFQM NGPEAPFASN
DMLGHIEAFG PPSILCVWGL GVSEDILLAC PDSFKIYNSI DAPALRVPSE VSRHFDLILT
GAAWQSEAVR VLYPDKRVAV MPIGPEFASE VTFRPLGLEK IYDVIYVAAA QAYKRHDILF
NALSQLPRSL RALCVCGYGE MMEALRRHAG ELNIDVDFID PPGVPFAEVN RLMNQARIGV
VCGVDDGAPA ILTEYMLAGI PVLANSELRC GLQYITPKTG RAASADEFHA GIRDMLGGLQ
SFDPRQVVLD NWTWPHSLRT LKSLIEIT
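
As a rough check that the protein sequence corresponds to the gene sequence, the sketch below translates DNA using the amino acid assignments of translation table 11 (these match the standard code; table 11 differs in which start codons it permits) and computes GC content. The function names are illustrative and not part of any database API, and only the first and last rows of the gene sequence are used here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last rows of the gene sequence above; both start on a codon boundary.
first_row = "ATGGTTGCGA TGCAAGCACC AAACATACCA CGCCGAATAG TTGTCATCAG CGACTTCCAG"
last_row = "CTCAAGAGCC TTATCGAGAT AACTTAG"

print(translate(first_row))  # MVAMQAPNIPRRIVVISDFQ
print(translate(last_row))   # LKSLIEIT
```

Passing the full gene sequence to `translate` and `gc_content` in the same way would allow a comparison against the protein sequence and the GC content listed in the record.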