Gene Rleg_1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1104 
Symbol 
ID8012226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1086304 
End bp1087791 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content63% 
IMG OID644823687 
Productputative glycosyltransferase protein 
Protein accessionYP_002974938 
Protein GI241203842 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.414123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.408395 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGA GCAACCGTCG CGGCCTGAGT TGGATCTTTG TCCTGCTGGC GGCCTATTTC 
GTGCTTCAGG TCGGCGTGCG GCTCGCAACC TCGCATTCTC TCGATCTCGA CGAAGCCGAA
CAGGCCTTCC GCTCGCAATG GCTTGCCGCC GGCTACGGCC CGCAGCCGCC CTTCTACAAC
TGGCTGCAAT ACACCGTCTT CCAGTTCGCC GGCGTCTCGC TAACCGCCCT TTCGGTGGTG
AAGAACCTAC TGCTGTTCAG CTCCTACGTG CTCTACAGCC TGACCGCGCG GCTTATCCTG
CGCGACAAGG CGCTGGTGGC GATCGCCACG CTCGGACTGC TGACCATCCC GCAGATGGCT
TTCGAGATGC AGCGCGACCT GACGCACACG GTCGCCGTGT TCTTCTCGGC CAGCATCTTC
TTCTACGGCT TCATCCGCAG CCTGAAGCAG CCGAGCCTTG CCTCCTATCT CATCGCCGGC
ATCGGCATCG GTTTCGGCCT GCTTGCTAAA TATAATTTCG CGATCCTGCC GGCGGCCGCC
CTGATTGCCG CGCTTTCGGA TGCGCGCCTG CGGCCGCGGA TCTTCGACTG GCGGCTGGTG
CTGACGGCGG CGGTAGCGCT CGTCATCATC CTGCCGCATC TCTTCTGGCT GAAGGACAAT
CTCGATTTCG CCACCGCACG CACCCTGGAG AAGATGACCG CGAGCGGCCA TGCGAGCTAT
CTCACGCAGG TGGCCATGGG CGTCAGTTCT CTGGCTCTCG CCATCATCAG CTTTGCCGGA
TTGACTGTGG CGGTGTTCGC GATCGTCTTC GGCAAGAGCC TTCGTCCGGC GCTGACCGCC
GGTTCGGAAT GGACGCGGCT GTTCGAGCGG ATGATGCTCG TCTTCCTCGC CGGCATTCTG
CTTCTGATCG TCTTCGGCGG CGCGGCCGGC ATCAAGGATC GCTGGCTGGT GCCGATGCTC
TTCATCCTGC CGCTCTATTT CTGCCTGAAG ATCGAGGCAG TGGGCGTCGC GACAGACAGG
GCGTTCAGGC GTTTCATGCC CATCGTCGCC GTCATCATGA TCGGCGTGCC GGCGGCCCTT
TACGGCAGCG TCGCGGCGGC ACGTATCACC GGTCATTACG AGCGGCTGAA CAGGCCTTAT
GCCGGAATGC TGGAAACCTT GCGCAAACAG GCCGAACCGG CGGCGATCCT TGCCGGGGAC
AGCCTGCTCG CCGGCAATCT CAGGCAGGAT ATTCCCGGCG TGCCGATCCT CTCGGTGGAT
TATCCTGGCT TCCACCCGGA TCTTACCGGC CGGCGACCAC TTCTCCTGGT GTGGTTCCTC
CCGCAGAGGG GGGGAAGCGA AGCTCTTCCG CCTGATATGG CTGAATGGCT GCAGACCCAT
CTCGGCGTGT CCGCACCGCA GGCGTCGGTG ATCGACGTGC CCTATCTCTA TGGGCGCGGC
GACGACCGCT ACCGTTTCGG CTATGCTTGG GTCAACCAGC CGGGCTGA
 
Protein sequence
MTESNRRGLS WIFVLLAAYF VLQVGVRLAT SHSLDLDEAE QAFRSQWLAA GYGPQPPFYN 
WLQYTVFQFA GVSLTALSVV KNLLLFSSYV LYSLTARLIL RDKALVAIAT LGLLTIPQMA
FEMQRDLTHT VAVFFSASIF FYGFIRSLKQ PSLASYLIAG IGIGFGLLAK YNFAILPAAA
LIAALSDARL RPRIFDWRLV LTAAVALVII LPHLFWLKDN LDFATARTLE KMTASGHASY
LTQVAMGVSS LALAIISFAG LTVAVFAIVF GKSLRPALTA GSEWTRLFER MMLVFLAGIL
LLIVFGGAAG IKDRWLVPML FILPLYFCLK IEAVGVATDR AFRRFMPIVA VIMIGVPAAL
YGSVAAARIT GHYERLNRPY AGMLETLRKQ AEPAAILAGD SLLAGNLRQD IPGVPILSVD
YPGFHPDLTG RRPLLLVWFL PQRGGSEALP PDMAEWLQTH LGVSAPQASV IDVPYLYGRG
DDRYRFGYAW VNQPG