Gene Rleg_4118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4118 
Symbol 
ID8014916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4196912 
End bp4198090 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content63% 
IMG OID644826688 
Productglycosyl transferase group 1 
Protein accessionYP_002977898 
Protein GI241206802 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCG TCGAGGCAGA TTGGGCGCGC GCGAAGCCGC AGTTGGAAAC AGCCCCGCTG 
ATCGTCCATG TCGTGCGCCA GTTCCTGCCC AACCGCGGCG GCCTCGAAGA CGTGGTCGCC
AATCTCGCCC GCCAGACGGT ACGCCGCGGT TATCGCGTGC GCGTCGTCAC GCTGGATTCG
CTCTTCACCG CCCCAGAGGA TAAGCTGCCG CCTCGCGAAG GTATCGACGG TATCGAGGTG
GTGCGCATTC CCTGGTTCGG CACCAGCCGT TATCCGCTGG CGCCTGAGGT TTTCCGCCAT
CTCGCCGATG CCGATCTCGT CCATGTCCAT GCCATCGACT TCTTCTTCGA TGCGCTCGCC
TGGGGCCGGC TGCTGCACGG CAAGCCGATG ATCGTCACCA CTCATGGCGG CTTCTTCCAC
ACGCGGAAAT ACGCGACGAT CAAGAAGATC TGGTTCCGGA CGCTGACCCG CGCTTCGGCG
ATGGCTTACC GCCGCGTCGT CTGCTGCAGC GCCTCCGACC TCAAGCAGTT TTCCGAGATC
GTGCCCGACA GCCTTCTGAT CGAAAATGGC GCTGATATCG CCAAATTCGC CGACACCGCT
TCGCGTCGGG CAAAGCGCCG CATCGTCACG ATCGGCCGGT TTTCGGTAAA CAAACGGCTG
GATCACCTGC TCGATGCGAT GGCCAAGCTG AAGACCCGCG ACCCGGAATG GCATCTCGAC
ATTGTCGGCG CCGAATCCGA CCTGAACCGG GCGGATGTCG AAGGCGCAAT CGAAAGCCGT
CATCTTTCCG GCCGCGTCAC CTTGCATGTG TCGCCCGAGA ACGACACCAT CCGGCGCATC
ATCGCAGAAG CCTCGCTCTT CGCCTCCGCC TCGGAATATG AAGGTTTCGG CCTGGTGGCG
CTGGAGGCGA TGAGCGCCGG CCTCCTGCCG GTGCTGAACG CCAACGATGC CTTTGCGACG
CTTGCCGCCC GGCATCCCGC AATCATGCTT GCCGATTTCA CCAATCCTGA GAGCGCCGCC
ACGGCGATCG AAGCGGCCTA TGAAGGCCTT TCGCGCCAGC CGGAGACCGT TCGCACCGAG
CTTCTCGACG CCGCCCGCGG CTATTCCTGG GATATCGTCG CCGGACGTTA TATCGATCTC
TACAGATCGC TTGATGTCGT TGCCGCGGAA AGCAGCTGA
 
Protein sequence
MSIVEADWAR AKPQLETAPL IVHVVRQFLP NRGGLEDVVA NLARQTVRRG YRVRVVTLDS 
LFTAPEDKLP PREGIDGIEV VRIPWFGTSR YPLAPEVFRH LADADLVHVH AIDFFFDALA
WGRLLHGKPM IVTTHGGFFH TRKYATIKKI WFRTLTRASA MAYRRVVCCS ASDLKQFSEI
VPDSLLIENG ADIAKFADTA SRRAKRRIVT IGRFSVNKRL DHLLDAMAKL KTRDPEWHLD
IVGAESDLNR ADVEGAIESR HLSGRVTLHV SPENDTIRRI IAEASLFASA SEYEGFGLVA
LEAMSAGLLP VLNANDAFAT LAARHPAIML ADFTNPESAA TAIEAAYEGL SRQPETVRTE
LLDAARGYSW DIVAGRYIDL YRSLDVVAAE SS