Gene Rleg_1105 details


Gene Information

Locus tag: Rleg_1105
Symbol:
ID: 8012227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1087793
End bp: 1089292
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 64%
IMG OID: 644823688
Product: putative glycosyltransferase protein
Protein accession: YP_002974939
Protein GI: 241203843
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
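
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1087793
END_BP = 1089292


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the CDS includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1500
print(protein_length(length_bp))  # 499
```

Both results agree with the Gene Length (1500 bp) and Protein Length (499 aa) fields.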


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00589779
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.39875
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGAGC GCGCGACGAG GACGATCAGA ACCGCGGGCC TGCTGCTTGC AGCCTATTTC 
GTGCTCAACA TCGCGCTGCG CATCGCGCTT CCGCATTCGC TGGAGCTCGA CGAGGCCGAG
CAATCCTTCT TCTCGCAATA TCTGCTGGCC GGCTACGGCC CGCAGCCGCC CTTCTACAAC
TGGATGCAAT ATGCCGTCGT TTCGGTGACG GGCATGTCGA TCGGCGCGCT GATCGTTCCG
AAAAACATCC TGCTGTTCCT ATCCTATCTC TTCTATGGCC TTGCCGGGCG GCGCGCGCTG
AAGGACGAGG CGCTTGCCGC CGTCGGCATG CTGGCGCTGA TCACCCTGCC CCAGGTCTCC
TACATGGCCC AGCAGGATCT GACCCACACG ACGGCGCTGC TCTTTGCCAG TTCGCTGTTT
CTCTACGGCT TCTTCCGCAC GCTCGACCGG CCCGATATGG CGAGCTACCT GCTGCTCGGG
CTTGCGACCG GCATCGGGCT GATCTCGAAA TATAATTTCG CCCTGATGCC TGTCGTCGCC
TTGATCGCCA TCCTGCCCGA TGCCGAATGG CGGCGCCGGG CGCTCGACTG GCGCATGCTG
GCAGCGATCG CTGTTGCGCT CGTCATCATC CTGCCGCACG CGATCTGGCT GCAGGGCAAT
CTCGCATTCG CCTCCTCCGA CACTCTGGTC AAGATGGCCG CCGGCAGCGA ACCCGCCGGC
GCAGTGCGGA TCGGCAAAGG CCTTCTCGCC TTTCTCGTCG CCATCATCGC CTTTGCGGCG
CTGCCGGTCA CTATCTTCGC CGCCACCTTC CGCCGGGATT TTGTCCGGGC GCTTTCGGCG
GGCAACCGCT GGACTGCGAT GATGGAGCGG ATGATGCTCG CAAGTCTTGC CGGCATCGTT
CTGATCGTGC TGTTCACCGG CTCTACCACG GTGCGCGAGC GCTGGCTCGA CCCCTTCCTG
CTAGTGCTGC CGATCTATTT CCTGGCGAAG ATGCAGGCGG CCGGGCTCGA CCTTTCCGCC
GGGCTGCGCC GCTTCCGGCC GGTGTTGCCG GTGCTGATGG CCTGCGTGCT GATCGCACTC
GGCTTCCGCG TCGTCGGCGC CGGGCTGATC GGCACTTACA GCCGGCCGAA TGTGCCGATG
GCGGGTTTTT CGCGCGAGAT GACGCAACAG GCACAACCGG CGCTGGTGAT CGCTTCCGAC
ACCTATATCG GCGGAAACAT GCGGCTGCAA TTTCCCGATG TGCCGGTGGT GATCCCGGAT
TTTCCGGCAC CGGGCATTCC GGCCTATGCC GAGGCCAAGG GGCCGGTGCT GATCGTCTGG
CGCGGCAAGA AGACGGCGAC GGCTGCCGAT GCGGTGATGC CGGAGCGTTT TTCTTCGGCG
CTGACGGCGG CTGATATCGC GCTGCAGGAG ATCGGCTCGC TGTCGCTTCC CTATTACTTC
GGCCGCCAGG GCGACAATTT CGCGCTCGGC TACGCCTGGG TTCGGCCGGA GAGCAAATAG
 
Protein sequence
MLERATRTIR TAGLLLAAYF VLNIALRIAL PHSLELDEAE QSFFSQYLLA GYGPQPPFYN 
WMQYAVVSVT GMSIGALIVP KNILLFLSYL FYGLAGRRAL KDEALAAVGM LALITLPQVS
YMAQQDLTHT TALLFASSLF LYGFFRTLDR PDMASYLLLG LATGIGLISK YNFALMPVVA
LIAILPDAEW RRRALDWRML AAIAVALVII LPHAIWLQGN LAFASSDTLV KMAAGSEPAG
AVRIGKGLLA FLVAIIAFAA LPVTIFAATF RRDFVRALSA GNRWTAMMER MMLASLAGIV
LIVLFTGSTT VRERWLDPFL LVLPIYFLAK MQAAGLDLSA GLRRFRPVLP VLMACVLIAL
GFRVVGAGLI GTYSRPNVPM AGFSREMTQQ AQPALVIASD TYIGGNMRLQ FPDVPVVIPD
FPAPGIPAYA EAKGPVLIVW RGKKTATAAD AVMPERFSSA LTAADIALQE IGSLSLPYYF
GRQGDNFALG YAWVRPESK
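
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch rather than the database's own method: it uses only the Python standard library, builds the codon table from the compact NCBI-style amino acid string, and checks just the first and last lines of the gene sequence shown above. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; since this gene starts with ATG, no special handling of the first codon is needed here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest,
# paired with the amino acid string shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate DNA to protein; whitespace is ignored, '*' marks a stop."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGCTGGAGC GCGCGACGAG GACGATCAGA ACCGCGGGCC TGCTGCTTGC AGCCTATTTC"
last_line = "GGCCGCCAGG GCGACAATTT CGCGCTCGGC TACGCCTGGG TTCGGCCGGA GAGCAAATAG"

print(translate(first_line))  # MLERATRTIRTAGLLLAAYF
print(translate(last_line))   # GRQGDNFALGYAWVRPESK*
```

The two outputs match the start and end of the protein sequence above, with the trailing `*` corresponding to the TAG stop codon that is not part of the 499-residue protein.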