Gene Rleg_3402 details

Gene Information

Locus tag: Rleg_3402
Symbol:
ID: 8014278
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3419862
End bp: 3421208
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 62%
IMG OID: 644825960
Product: glycosyl transferase group 1
Protein accession: YP_002977187
Protein GI: 241206091
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
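
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both are assumptions, though they agree with the values listed in this record. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3419862, 3421208)
print(length)                              # 1347
print(expected_protein_length_aa(length))  # 448
```

Under these assumptions, the reported gene length (1347 bp) and protein length (448 aa) are consistent with the start and end coordinates.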


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.204154
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCCGT CGCTTCCAGA GGTCCTGCGA TCCGCCGCAA GCGTCGGCCG GTTTCCCGGC 
CCCGGCATGA CAAAGATCGA TCGCGTGGTC ATCATCGACG ACTATTCAGT CGCCAGAGGC
GGTGCGACGG CGCTGGCCGT GCTGTCCGCC AAGCTTTTTC GGGAACTCGA CATCCCCGTG
ACTTATATTT GCGGGGATGA CGCCTCCAAT GCGGAGCTTG TCGCCCTCGG AGTCTCAATG
GTCGGGCTGA ACAGCCGCGA CCTGCTCAGC GCCGAACGTG CGAAGGCTTT CGTGACCGGC
ATTCACAATG GCGCCGCCGT CCGTATGGTC GCAAACTGGA TTGCCGCAAA CGACACTGCC
AATACCGTCT ACCATGTGCA TGGCTGGCAC CAGATTCTGT CTCCTGCAAT TTTCAGGGCG
TTGCTGCCGG TCGCCAGACG CTGCGTGGTG CACGCGCATG ATTTCTTCAC GGCCTGCCCC
AACGGCGCCT TCTTCGACTA TCAGGCGCAG GAAATCTGCC TTCGACGCCC GCTCGGCGGA
AGCTGCATTG CGACAGCCTG CGACAAGAGA AGTTATTCGC ACAAATTGTG GCGGGCTGCC
CGCGGCTCCA ATATCCTCCG GCTGCTGAAG GATCGGGCCG ATTTCGGCCG GATCATCCTG
CTGCACGAGA AGATGGCAAG CTTCCTCGTC GGCGCCGGAT ATCGGCCCGA ACGATTGACG
ACGATCCGCA ATCCCGTCGC TCCCCTATCC ATCAAACGCA TCGAGGCGGA GACCAACGAC
GAGTTCGTCT TCATCGGGCG GTTGGACGAG GAGAAGGGCA TAGAGGACGC CGTGGCCGCC
ACGCGCAAAG CCGGCGTTCG GCTCTGCGTG ATCGGGGACG GGCCGCTGAT GCCGCTGGTT
GCGGCTTCGG GGGATCACGT CAGGGCCGTC GGCTGGCAGT CGCATGCGGA GATCGGCCCG
ACCATCCGCA AGGCGCGTGC ATTGTTGATG CCGTCTCGCT ATCCCGAGCC ATTCGGCCTC
GTCGCTATCG AAGCGGCCAG GAGCGGTCTG CCGGTCATCA TGTCGCGCAG CGCCTTTCTT
GCCGAAGAAA TGCAAAGAGC CGGCATGGCG ATCGCCTGCG ATACGGCTGA CGAAAGCGCC
TTTGCCGATA CGTTGACGAG ATTTAGCCAA ATGCCGAGCC ATGAGGTCCG CGCCATGAGC
GAGCAGGCTT TCCTGAAGTC GCCGGATCTC GCCTCGACAC ACGAGGAATG GCGCGACGCG
CTTCTTTCCG AATACCACAG CCTGATTTCG ACGAATGTGG TTTCCCAGCT GACAGACGGT
GTGGCGATAC AAGGAGTATT GAGTTGA
 
Protein sequence
MRPSLPEVLR SAASVGRFPG PGMTKIDRVV IIDDYSVARG GATALAVLSA KLFRELDIPV 
TYICGDDASN AELVALGVSM VGLNSRDLLS AERAKAFVTG IHNGAAVRMV ANWIAANDTA
NTVYHVHGWH QILSPAIFRA LLPVARRCVV HAHDFFTACP NGAFFDYQAQ EICLRRPLGG
SCIATACDKR SYSHKLWRAA RGSNILRLLK DRADFGRIIL LHEKMASFLV GAGYRPERLT
TIRNPVAPLS IKRIEAETND EFVFIGRLDE EKGIEDAVAA TRKAGVRLCV IGDGPLMPLV
AASGDHVRAV GWQSHAEIGP TIRKARALLM PSRYPEPFGL VAIEAARSGL PVIMSRSAFL
AEEMQRAGMA IACDTADESA FADTLTRFSQ MPSHEVRAMS EQAFLKSPDL ASTHEEWRDA
LLSEYHSLIS TNVVSQLTDG VAIQGVLS
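
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the method used to produce this record: it assumes the gene sequence is given 5' to 3' on the coding strand, and it relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string in frame 1, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*" and to_stop:
            break  # stop codon: end of the coding sequence
        protein.append(amino_acid)
    return "".join(protein)


# First and last blocks of the gene sequence listed above.
print(translate("ATGCGCCCGT CGCTTCCAGA GGTCCTGCGA"))  # MRPSLPEVLR
print(translate("GTGGCGATAC AAGGAGTATT GAGTTGA"))     # VAIQGVLS
```

The two fragments translate to the first and last residues of the protein sequence shown above. The second fragment can be translated on its own because it begins at position 1321, which falls on a codon boundary.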