Gene Rleg_1106 details

Gene Information

Locus tag: Rleg_1106
Symbol:
ID: 8012228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1089305
End bp: 1090321
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 62%
IMG OID: 644823689
Product: glycosyl transferase family 2
Protein accession: YP_002974940
Protein GI: 241203844
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
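
The start, end, gene length and protein length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one untranslated stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the gene record.
start_bp = 1089305
end_bp = 1090321
gene_length_bp = 1017
protein_length_aa = 338

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp == gene_length_bp)
print(expected_protein_aa, expected_protein_aa == protein_length_aa)
```

Under these assumptions the span is 1017 bp, which divides evenly into 339 codons, and dropping the stop codon gives the 338 aa listed for the protein.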


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.263485
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.233647
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTGCAGACAA CCGTAGAGCC CATTCGCGGT ACGAATGATC CGGTACAATC GCTCGAACTG 
TCGCTGGTCG TGCCCATCTT CAACGAAGAG CAAAGCGTCG GCCCGCTTGT CGAGCGCGTC
GCGGCTGCGA TGGTTAGCTA CCCCCATCGC TGGGAGCTGA TCCTCGTCGA CGACGGCAGC
ACGGATGCGA CGCTCGTCAA CGCCCGCAAG TATGTGGGAC GCGAGGGGCT GGCGCTCAGG
ATCGTCGAGC TGCAGCGCAA CTTCGGCCAG ACAGCCGCCA TGCAGGCCGG CATCGATACC
GCCCGTGGCC GCCTGATCGC GACGATGGAC GGCGACCTGC AGAACGACCC GAAGGACATT
CCTTCGATGG TCTCCGAGCT GGAACGGCGT GAACTCGACC TCTTGGTCGG CTGGCGCAAG
AACCGCAAAG ACGGCCTGTT CCTGCGCAAG ATCCCCTCCT GGTGCGCCAA CTACCTGATC
GGCCGCATCA CCGGCGTCAA GCTGCATGAT TACGGCTGCA GCCTGAAGAT CTACCGTGCC
TCGATCATCA AGCAGGTGAA GCTGATGGGC GAGATGCACC GCTTCATCCC CGCCTGGGTC
GCCGGTGTCG TCCCGAGCTC GCGCATCGGC GAGATGGCCG TCACCCACCA TGCCCGGGAG
CACGGCGTTT CGAAATACGG CATTTCGCGC ACCTTCCGCG TCATCCTCGA TCTGCTGTCG
GTGATGTTCT TCATGCGCTA CAAGGCGCGG CCGGGGCATT TCTTCGGTTC GCTGGGTCTC
GGCCTCGGCG CGCTCGCCAT GCTGATCCTG CTCTATCTCG GCTTCGACAA ATTCATCCTG
GGCAACGACA TCGGCACGCG ACCGATGCTG ATGGTCGGCG TCGTGCTGCT GCTGTCGTCG
GTACAGATGA TCACCACCGG CATCCTGGCG GAAATGATCG CGCGCACCTA TTACCGCGAC
GATGCCTCTC CGAATTATAT CGTGCGGCAG ATCTTCGACG ATCAAAGCCA AGCCTAA
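
The gene sequence is printed in space-separated blocks of ten bases, so the layout has to be stripped before the length or GC content can be checked against the record. A minimal sketch of that clean-up follows, assuming plain Python with no external libraries; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed.
first_line = "TTGCAGACAA CCGTAGAGCC CATTCGCGGT ACGAATGATC CGGTACAATC GCTCGAACTG"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on that line
print(round(gc_fraction(seq), 3))  # GC fraction of that line only
```

Passing the full sequence to the same two functions gives values that can be compared with the gene length and GC content fields of the record.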
 
Protein sequence
MQTTVEPIRG TNDPVQSLEL SLVVPIFNEE QSVGPLVERV AAAMVSYPHR WELILVDDGS 
TDATLVNARK YVGREGLALR IVELQRNFGQ TAAMQAGIDT ARGRLIATMD GDLQNDPKDI
PSMVSELERR ELDLLVGWRK NRKDGLFLRK IPSWCANYLI GRITGVKLHD YGCSLKIYRA
SIIKQVKLMG EMHRFIPAWV AGVVPSSRIG EMAVTHHARE HGVSKYGISR TFRVILDLLS
VMFFMRYKAR PGHFFGSLGL GLGALAMLIL LYLGFDKFIL GNDIGTRPML MVGVVLLLSS
VQMITTGILA EMIARTYYRD DASPNYIVRQ IFDDQSQA
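
The gene sequence begins with TTG while the protein begins with M. Translation table 11 (the bacterial, archaeal and plant plastid code) accepts TTG as an initiation codon, and an initiation codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It is a minimal illustration and not the database's own pipeline; `translate_cds` is a hypothetical helper, and it assumes the input is an in-frame coding sequence containing only the bases A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Table 11 shares these
# codon assignments with the standard code; it differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An initiation codon is read as methionine.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied as printed.
print(translate_cds("TTGCAGACAA CCGTAGAGCC CATTCGCGGT"))  # MQTTVEPIRG
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence the same way allows the remaining residues to be compared with the listed protein.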