Gene Rleg_1107 details

Gene Information

Locus tag: Rleg_1107
Symbol:
ID: 8012229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1090459
End bp: 1091949
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 60%
IMG OID: 644823690
Product: glycosyl transferase family 39
Protein accession: YP_002974941
Protein GI: 241203845
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
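
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS span includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1090459, 1091949)
print(length)                  # 1491
print(protein_length(length))  # 496
```

Under these assumptions the results agree with the Gene Length (1491 bp) and Protein Length (496 aa) fields.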


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0699757
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0751827
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGAGCGGA TTACCAAAAG CATTACGAGC GCAAGTATTT TTCTGGCAGG CTATTTCCTG 
CTGAACATCG CGCTTCGCAT CGCCTTGCCG CATACGCTCG ATCTCGACGA GGCGGAGCAA
TCCTTCTACT CGCAATACCT GCTTGCCGGC TACGGCCCGC AGCCGCCCTT CTACAACTGG
ATCCAATATG CCATCGTTTC GGTGACCGGC ATATCGATGT GGGTGCTCTC GGTGCCCAAG
AACATCATCC TCTTCGGCTG TTATCTCTTC TACGGGCTTG CTGCCCGCGA GGTACTGAAG
AGCCGTTCGC TCGCAGCCGT CGCCATGTTG AGCCTGATTA CTCTGCCGCA GGTCGGCCTG
ATGGCGCAGC GCGAACTGAC CCATACGGTT GCCCTGCTGT TTGCGACCTC GCTCTTCCTC
TTCGGTTTTT TCCGCACGCT GCGCCAGCCG ACGATCGGGA GCTATCTCCT CATCGGCATC
GCCACGGGCA TCGGGCTCAT CTCGAAGTAT AATTTCGCCA TCCTGCCATT TGCTGCTCTC
GTCGCCGTGC TGCCAGAGCG GGAATGGCGC AGCCGGCTCA TCGACTGGCG TTTGCTGCCG
GCCGCCGTCC TTGCCATTCT GATCGTGCTG CCGCATGCGC TCTGGCTGCC TGACAATCTT
GCCAGCGCCT CTGCGCCGAC GCTGGAGCGG ATGACTGCCG AACACCTGGC CCCGGCCGGC
CTCCCCCGCA TCGGGCAAGG ACTGCTGTCT CTCGTCATCG CCGTCCTCGG CTTCGTCGCA
TTGCCGATCG TTCTGATCGC GGCCGCCTTC CGGCGCAACT TCTTTCGCGC GCTCTCCTCT
TCCAGCCCGA TGATCCGGGT GATCGAGCGG ATGATGGTCA TCAGCCTGCT CGCCTTCGTC
GGCGTGATCC TCTTCGCCGG CGCGAGCGAT ATCCACGAGC GCTGGCTCGA CCCATGCCTG
CTCGTCCTGC TGATCTATCT GTTCCTGAAA CTGGAAACCG CAGACCTCGA TCTTTCCGCC
GGTCTTGCGC GCTTCCGGCC GGTGGTGCCG GTCTTCATGG TCGTCATCCT GTCGATCCTT
CTTTTCCGGA TCGCCGGCAT TCAATATATC GGCACTTATA CGAGAACGAA CGTACCCTTT
TCCGGCTACG TTGCTGAATT GACCGCGACC CGCAAGCCGG TTCTGATCGT GGCGGGAACC
AAGTTCGTTG CCGGCAACAT GCGGCTAAAG TTTCCCGACG TTCCCGTCGT GATCCCGTTC
TTCCCCGGTC CCGGAGTTCC CGAATATGCT GACGCGAAGG GGCCGGTGCT GGTTATCTGG
CGCGGCGAGA CCGCAGATGA TCCAACAATT TCCCCCGGCT TCGCCAATGA CCTCGTCAAA
TCGGGCATTC ATCTGCCAGA GTTGAAGACG CTGACGCTGC CCTATCTCTT CGGTGACGGC
AAACGCAGCT TCTCCATTGG TTACTCCTGG GTGGACGGCG GCGCGAAATA G
 
Protein sequence
MERITKSITS ASIFLAGYFL LNIALRIALP HTLDLDEAEQ SFYSQYLLAG YGPQPPFYNW 
IQYAIVSVTG ISMWVLSVPK NIILFGCYLF YGLAAREVLK SRSLAAVAML SLITLPQVGL
MAQRELTHTV ALLFATSLFL FGFFRTLRQP TIGSYLLIGI ATGIGLISKY NFAILPFAAL
VAVLPEREWR SRLIDWRLLP AAVLAILIVL PHALWLPDNL ASASAPTLER MTAEHLAPAG
LPRIGQGLLS LVIAVLGFVA LPIVLIAAAF RRNFFRALSS SSPMIRVIER MMVISLLAFV
GVILFAGASD IHERWLDPCL LVLLIYLFLK LETADLDLSA GLARFRPVVP VFMVVILSIL
LFRIAGIQYI GTYTRTNVPF SGYVAELTAT RKPVLIVAGT KFVAGNMRLK FPDVPVVIPF
FPGPGVPEYA DAKGPVLVIW RGETADDPTI SPGFANDLVK SGIHLPELKT LTLPYLFGDG
KRSFSIGYSW VDGGAK
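
The GC content field in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown above; `gc_content` is a hypothetical helper name, not part of the source database.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces, line breaks, and any other characters are ignored, so the
    10-base blocks from the record can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence from this record; paste the remaining
# lines to reproduce the whole-gene figure.
first_line = "TTGGAGCGGA TTACCAAAAG CATTACGAGC GCAAGTATTT TTCTGGCAGG CTATTTCCTG"
print(f"{gc_content(first_line):.1f}%")
```

Running it on the full 1491 bp sequence should give a value close to the reported 60%, assuming that field is rounded to a whole percent.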