Gene Rleg_5581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5581 
Symbol 
ID8016472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp163418 
End bp164524 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content65% 
IMG OID644827747 
Productglycosyl transferase group 1 
Protein accessionYP_002978947 
Protein GI241518319 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.641889 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.0686664 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGTC GACGAAACCG AATATTGATG ACCCTCGACG CGGTTGGCGG TGTCTGGCGC 
TATGCGATGG ATCTCGGTGC CGGGCTTCGG CGCGAGGGGA TGGAAATCGT TTTCGCGGGC
CTCGGACCCG CACCGTCGGC AACACAGACC AGCGAAGCGA CAGCGCTTGG GCAACTGGTG
TGGCTCGATG CCCCTCTCGA CTGGATGGCG GCGAGCAGGG CCGAAATATC TGCCGCGCCC
GCCGAAATCT CTCGGATCGC CAGAGACCAT GGCGCCGATC TGCTGCATCT CAACCTGCCG
TCCCAGGCCG CCGGGATCGG TACGCCGCTG CCCGTTGTCG TGGTTTCCCA TTCCTGCGTC
GTCACGTGGT TTGCCGCGGT GCGCGGGACG CCGGCGCCCC CCGACTGGGC CTGGCAGAGC
GACGCGAACC GTGAAGGTTT CGACCGCGCC GATGCCGTGC TTGCGCCAAG CAGGAGCCAT
GCGGATGCCC TGGAGGCCGC CTATGGTCCT CTCTCCCGGC TGAAGGTGGT CCACAACGCC
AGTCGCGTCG GGTCCGACCC GCGTCCAAAG AAGATTTTCG TCTTCGCGGC CGGGCGCTGG
TGGGACGAGG GCAAGAATGG CGCGGTGCTC GACAGGTCGG CAGCGGTGAT GCCCCTTCCC
GTCGTCATGG TAGGTTCCTG CTCGGGTCCT AACGGACAGC GACTGCAGCT CGACGACGCC
GACGATCGCG GGCCGCTGCC TTATTCGAAG ACCATCGCCT TGATGCGGCG CGCGCAAATC
GTCGTATCGC CATCCATCTA CGAGCCTTTC GGCCTGACCG TCCTGGAGGC GGCGCGATGC
GGCGCGGCCC TGGCGCTGTC CGATATCCCG ACCTATCGCG AGCTATGGGA TGGATGCGCG
CTGTTCTTCG ATCCGCATGA TCCGAAGGCC TTGGCAGCCG CGTGCATGCG CCTCAGCGAA
GACGAGCAAT TGCGCGCCGA ACTCGTCGTG CGATCGCTGG AGCGCTCGCG AGCCTTCAGT
TTGGAACGGC ATGCGGCAGC GGTGCTCGAA ACCTATGCAC GACTGATGAA CGACAAATTT
AACCTGGTAG CGGCGGAGCA ATCATGA
 
Protein sequence
MIRRRNRILM TLDAVGGVWR YAMDLGAGLR REGMEIVFAG LGPAPSATQT SEATALGQLV 
WLDAPLDWMA ASRAEISAAP AEISRIARDH GADLLHLNLP SQAAGIGTPL PVVVVSHSCV
VTWFAAVRGT PAPPDWAWQS DANREGFDRA DAVLAPSRSH ADALEAAYGP LSRLKVVHNA
SRVGSDPRPK KIFVFAAGRW WDEGKNGAVL DRSAAVMPLP VVMVGSCSGP NGQRLQLDDA
DDRGPLPYSK TIALMRRAQI VVSPSIYEPF GLTVLEAARC GAALALSDIP TYRELWDGCA
LFFDPHDPKA LAAACMRLSE DEQLRAELVV RSLERSRAFS LERHAAAVLE TYARLMNDKF
NLVAAEQS