Gene Rleg_5942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5942 
Symbol 
ID8016362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp486158 
End bp487399 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content61% 
IMG OID644828055 
Productglycosyl transferase family 28 
Protein accessionYP_002979255 
Protein GI241518627 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0547483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGTAG CCATTCATGC GCTCGGCACG CGCGGAGACG TTCAACCCTA TGTCGCTCTG 
GCATTGGGAT TGATCGAGCG AGGACATCGA GTACAGCTCG CTGCTCCGGT TCAGTTCGAG
AGCATGGTGC AAGACCACGG CATCGCATTT GCCCCCCTGC CTGGAGAGTT TCTCGCTCTT
CTCGATACTC CGGAAGGAAA GGCGGCGATC GCCGGCAGCA AGGGCTTCAG TGCGGGTTTG
AAGCTGCTAA AGTACGTCCG TCCGATGATG CGAACCCTGC TAGACGCGGA ATGGAGAGCA
GCGCAGGCCT TCAACCCCGA CATCTTCGTG CATCATCCGA AGGCAATCGC GGTGCCACAC
ATGGCGGAGG CGCTTCAGTG CCCATTTATT CTGGCCTCGC CCCTGCCTGG CTTTACGCCG
ACCGCCACTT TTCCCAGCCC GATGTTGCCT TTCAGAGATC TGGGCTGGTT CAACCGGATC
AGCCATATCG CGGCGATCAG GGGCGCGGAA CTTCTGTTCG GCACGTTGCT CTCGACCTGG
AGGGTGGAAC AGCTTGGTCT GGCGCGACGC AGGACGCCAG CTATCGCTTC GAATGGCACG
CTCTACGCCT ATAGTCGCCA TGTCGTGCCG GTCCCTCCGG ACTGGGGCAG TGACGTGCTG
GTAAGTGGCT ACTGGTTTCT CGACAGCAAG AACTGGCGAC CTCCAGACGA TTTGGCAGCA
TTCCTCGCGG ATGGGAAGCC GCCAATCTAC GTTGGCTTCG GAAGCATGCC GGGCGTCGAT
CCGGGCCGAA TGACAGCCAC TGTTGTCGAG GCCCTCGCAA GGCAGGGCAA GCGGGGTATC
TTGGCTTTGG GAGGCGGTGC TCTGGCCGCG GACCATAAAT CCGGTCATGT CCACGTCGTC
CGCGACGCCC CCCACGACTG GTTGTTTCCC GAGGTGAGCG CGGTCATCCA CCACGGCGGC
GCCGGAACGA CCGCGGCCGC TCTTCGGGCC GGCAAGCCTA TGATCATTTG CCCATTTTTC
GGCGATCAAC CGTTCTGGGC AAGGCGTGTA ACAGACCTCG GCGTCGGACT GTCACTCGAT
CGCAGAGCAT TGACCGTCGA GAGCCTGACA GATGCACTCG CAGCCATGGA CGATCCACAT
ATGCGACGCC AGGCAGATGC CCTTGGCTCT AGGATTCGGG ACGAAGATGG GGTTGCGAAC
GCAGTCGGTT TCATCGAGGC TGCTGCGGAC AAACTGCATT GA
 
Protein sequence
MRVAIHALGT RGDVQPYVAL ALGLIERGHR VQLAAPVQFE SMVQDHGIAF APLPGEFLAL 
LDTPEGKAAI AGSKGFSAGL KLLKYVRPMM RTLLDAEWRA AQAFNPDIFV HHPKAIAVPH
MAEALQCPFI LASPLPGFTP TATFPSPMLP FRDLGWFNRI SHIAAIRGAE LLFGTLLSTW
RVEQLGLARR RTPAIASNGT LYAYSRHVVP VPPDWGSDVL VSGYWFLDSK NWRPPDDLAA
FLADGKPPIY VGFGSMPGVD PGRMTATVVE ALARQGKRGI LALGGGALAA DHKSGHVHVV
RDAPHDWLFP EVSAVIHHGG AGTTAAALRA GKPMIICPFF GDQPFWARRV TDLGVGLSLD
RRALTVESLT DALAAMDDPH MRRQADALGS RIRDEDGVAN AVGFIEAAAD KLH