Gene Rleg2_2737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2737 
Symbol 
ID6981481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2783027 
End bp2784700 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content60% 
IMG OID643397450 
Productglycosyl transferase family 39 
Protein accessionYP_002282234 
Protein GI209550317 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0489967 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAGC CGCCACACGA GCCTTCCAAG CCGCTGCGTA TCGCGCCGGA CGACCCGGAA 
CTGAACCAGG CCGAAGCTCA CCAATATCAG CGGCAGACGG CCGACCGGGA GATGGCTGTT
CGCAACTACA TGAGCCTCGA TACGGCGATC TTGCTGGGGA TCCTGTTGAT TGCGATCGTC
TTCAGGTTCC ACAAGATCAC CTTACCGCTG GTCGACGGCT TCAGCTGGCG CGAGATAAGC
ACCGCAATGA TGGCCGACAA TTTCCGAGAG CGCAGCTGGA ACATCTTCTT TCCGGAGGTC
AGCTGGACCG GGCCCGGGCC AAGCTATCAG GGCCGCGAGT TCCAGATCGT CAGCTATCTC
ACAGCCCTGC TCTACCAACT CTTCGGCTGG CACGACTGGT TTGGCCGAGT GGTTGCGGCC
TGCTTCGGTC TGGTGACGGT GTTTTCGCTG CACAGGCTGA CGGCGCTATG CTGGGACAAG
ATGCATGCCC ACGCGGCGGC ACTCGCCTAC GCGCTGATGC CGGCGGCGGT CATGATCGAC
AGCTCGTTTC TTCCCGATCC CGCGATGCTG GCCTTGGTGA CCCTTGGCGT CTGGCTGTTT
GCCAAATATT GGACCGGCGG CAGCGGCTGG CTTTTGCCGC TCGCCACGGT CAGCTTCTCG
CTCGGCGTGC TGTCAAAACC ACCAGGCATC GCCGCCGGCG CCATCATCTT CTATCTGATG
GTCTGCTGGA TTCTGGAGAA CAGGCGAAAG CAGGCGGCCT TGGTCTTCCT GTCGGGGCTT
TTGAGCCTCG CTATCATCGG CGCTTATTTC AGTTGGGCGA TTTATCTCGC CCGCAGCTAT
CCGCCGTTTC ATATGGCCGG CAGCGGCGGC TATATCTGGG ATTCCGGCTT CTGGACCTAC
GTCAGGGAGA GATTCTATTT CAAATCCGCA TGGAACACCT CGGTTTTGTG GTTCTACGGC
TACCCATTCC TGGTACTGTT CGCGGTCGGC TTATGGATGC CGCCCGAACC TGCCGAAGAC
GAGAAGCAGC GTACCCTTTC GGCCATTCCC TATGTCTGGC TGACTGCGGC CACGATCCTC
TATCTGGCGG CGGCGGGCGA GATCACCAGC AATGTGTGGA ACTTCCACAT CTTCCATGTA
CCGATCGCGA TATTCTCAGG CCATGGCGCG CTTCTTCTGG CAAGGCTTTC ATCGAGAACC
GTTTCCACGC TGGCGGTCGT GCTTCGCGCA ATATGCATCG TGGCCGTCAC GCTGGCCTGG
TCGACCTTTC CCCTCGTCAG GACGATGAAG AAGCCAATCG CCATAAAGGG CAAGCTGCTT
GGCGAGGAAC TGGCGCGGCT GGCGCAACCG GGCGACCTCG TCGTTGCCAT CGCGCCCGAG
GTTGGCGATC CGGTCGCAGT CTACTATAGC AGGACGCGCG GCTGGGTGTT CCCGCCCGGC
GGAGGCGATA CCGAATGGTC GAAATTCGTC GCGGATGACG CCACCGCGAT CACGCAGCTC
GAAGAACTGC GCGCGCAGGG CGCGGATCTG TTCGGCGTCG CCAAGAATGC CACCGACAAG
CAGGACCTGC TGTTCATCGA GCATCACGAC GGGGTTGCCG ACTATCTGGA CAAGACAGCA
ACCAAGCTCG TGGATTCGGA CGATCTGCTG GTCTATCGGA TCACCCGTCC ATGA
 
Protein sequence
MTKPPHEPSK PLRIAPDDPE LNQAEAHQYQ RQTADREMAV RNYMSLDTAI LLGILLIAIV 
FRFHKITLPL VDGFSWREIS TAMMADNFRE RSWNIFFPEV SWTGPGPSYQ GREFQIVSYL
TALLYQLFGW HDWFGRVVAA CFGLVTVFSL HRLTALCWDK MHAHAAALAY ALMPAAVMID
SSFLPDPAML ALVTLGVWLF AKYWTGGSGW LLPLATVSFS LGVLSKPPGI AAGAIIFYLM
VCWILENRRK QAALVFLSGL LSLAIIGAYF SWAIYLARSY PPFHMAGSGG YIWDSGFWTY
VRERFYFKSA WNTSVLWFYG YPFLVLFAVG LWMPPEPAED EKQRTLSAIP YVWLTAATIL
YLAAAGEITS NVWNFHIFHV PIAIFSGHGA LLLARLSSRT VSTLAVVLRA ICIVAVTLAW
STFPLVRTMK KPIAIKGKLL GEELARLAQP GDLVVAIAPE VGDPVAVYYS RTRGWVFPPG
GGDTEWSKFV ADDATAITQL EELRAQGADL FGVAKNATDK QDLLFIEHHD GVADYLDKTA
TKLVDSDDLL VYRITRP