Gene Rleg_3198 details

Gene Information

Locus tag: Rleg_3198
Symbol:
ID: 8014094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3200120
End bp: 3203167
Gene Length: 3048 bp
Protein Length: 1015 aa
Translation table: 11
GC content: 58%
IMG OID: 644825761
Product: glycosyl transferase family 2
Protein accession: YP_002976988
Protein GI: 241205892
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
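
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive (the page does not state its coordinate convention) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields.
START_BP = 3200120
END_BP = 3203167
GENE_LENGTH_BP = 3048
PROTEIN_LENGTH_AA = 1015


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 3048, matches Gene Length
    print(protein_length(GENE_LENGTH_BP))  # 1015, matches Protein Length
```

Under these assumptions, both derived values agree with the reported gene and protein lengths.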


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATGG TTTCCATCAT CACACCGGCG CATGGAGCGG AAACAACGAT TTCTTCGACT 
TTGGAGAGCC TGATAGCGCA AAGTCATCCC GACTGGGAAT GCTTCATCAT TGATGATGGT
TCAACCGATC AGACCGCTGA GATCGCGGAC CGTTTCGCGG CTCGGGATAC GCGCTTCCGT
GTGCTACGGC AGGAGCAGAG CGGCGTTTCG GCTGCTCGCA ATGCGGGACT GGCACAGGCG
AAAGGCGACT GGGTCGTCTT TCTCGACTCT GACGACGCGC TGGCACCCTT CCATCTCGAA
ACCATGCTGA ATCACACGCG CACTTTGCCG CAGGCCGACA TTCTGCATTG CGGTTGGCGC
CGTCTGAAAA ATGGTGCTCC CTGGTGGCAG TCTCACCCCG CGGTGAAAAT GGACAATCCG
TTTGCCGTAG CGGCGCGATA TTGCCCCTTC GCGATCCACG CCGCCTTGCT TCGCCGGTCC
CGGCTTGCAG AAGTCGGCGG CTTCAATCCA GAGATGAAAA TGTGCGAGGA CTGGGACCTA
TGGCAGCGCC TCGCCCGAGC TGGCGCCGAA TTTGCGCCGG TCGAAGGCCT GATGGCTGAC
GTTAGCGTAG AGGCCGGTTC GCTGTCGTCA AACAGGGTTA AGCATCTCGA TTTTGGTCTT
AAGGTGATCC GGCGCGGCCA CCATTCCGAT CCGCGCATCG CCTATCCGGT GCCGGCGCTC
GCCCAGGGAA TAACAAAAGC TGCTTTGGGA ATGGCGTGCT GGTATCTCGC GATCTGGCTT
GTCGGCGCCT CAATAGGTCA GGGCGACAAT CCTGTCGAAC TGCTTGACCG GGTGGCAGAG
CCTATCCCAC CTGACGCCGA TGTCTACGCG TTCTGCGCAA TAATGATAGA CGGGATTGTG
GTCGGCGCCT TCCCCGACGA GCCGATCTGG CCCGGACTCT GGTCGAAGAT CCAGGACAAG
CTTCCGGACC TAGAGGCTTG GCTTGATCGG CATGCTCCCC AAGAAGCTTT TGGTTCCCTG
TTGGTTCGTT CACTGGAGCG AAAGGTCGCT GACCAGCTGC CGACCAACGT GCCCACCACG
ATCGGGAGAA CAAGGATACA GCCAATTGAT TTGGATCAAC CTATCGTCGA CCTTATCTTG
CCCGGCATTG AGCGGTTGCG CTGTTGCATC TGGCGAGGTT CGACGCTGCT GGGGCAACTG
GAGTTTGTTG TCTTCGGAGC AATCGAAGCG GATGCTATAC GAAATCGACT TAGGAGGGAG
TTCGGCCCGG AGTTCGACGA TCTGAACCAC CCAGCAGCAG CTTTGTTGGC AGAAGATGGC
TTCGTCAATC TTTCTGATCA GGATGACGGG CGCGCCGCCG CCAGCATGGC AGCGCCAAGC
TACAGCGGAG TAGCTCAACA GGTGCTGAAG CTCATGGCGA AAGTATCATA TTGGGCCGAA
GCAAAGACAG TAACGGGTTG GTGGAGCAAG AAAGAGCCGA CACCAACACG TGTCGACTCG
ATATCAGGCA TGGCGCACGC CGGGCGCGCC GAATTCGATC GAATCGTTGA AGAGGAAAGC
CAGCGGGCGG TTGCAGCCTC AACGCAGATG TCCATGGAGA TGCCGTCGAA AAAAACCGGC
CCAGTGGGCG ATGAGGTGCC GAGATACGAT ACGGAGGCGT ACTGGGAGGA GCTGTTTTCC
CAGGTAGACC CTTGGGACTA CCGAAACAAC TATGAGACTG TGAAATATCT TCAAACACTC
TCGTTGTTGG ACGATCGGCG CTTTTCAAAC GGCCTTGAGC TGGCCTGCGC GGAAGGAACC
TTTACACGGA TGTTGGCCCC CCGCGTCGAC AACCTGCTGG CGACCGACAT TTCGGCGTCG
GCAGTCGCTC GAGCGGCCTC GCTTCAGGAC CATGGTTCCG CGGTAGCCTA TCGGCAGTTG
GATCTTTTGA GCGACGCGCT AGAAGGGCCT TACGATCTCA TCGTCTGCAG TGAAGTTCTG
TACTATTTCG AGAACAGAGA AAAACTCCAG CAGATAGTCG ATAAGATTGC GGGCAGTCTT
CGCACGGGTG GTTGGTTCGT TACTGCCCAC GCCAACCTCC TGATTGATAT GCCGCATGAG
ACAGGATTTG GGTGGCCGCA TGAATTCGGT GCGGTCGGCA TAGGTGAGAT GTTTGACCAG
CATCCGGATT TAACTCTCTC GGTCGAAGCC AAGAGTCCGC TCTATAGAAT CCAAAGGTTC
GAGAAGGTGG AGCTCGGGGG CTTCCTTGAG CCCACACGCG TGGAAGTAAA CACGGCACAG
CCCTTGCCTC TTCAGGTCGC CAGCCAAGTA CGCTGGAGAG GGGGGCAAGT TGTGGAAGCG
GCCGCCGACT GGAACGATGT CCCGATCTTG ATGTATCATC AGGTTTCGGA CGACGGTGCT
GAACAGCTTG CTCGATACCG CCAGTCTCCG GAGGCTTTCG AAACTCAACT GGCCTTCCTG
CGCGACGCCG GATGGCGCGG GATGACTTTG GATCGGCTGC TTGCTTGTTT CGATGAAGGT
GCCAAACCGC CCGAAAAGAC ATTAGTTCTG ACCTTTGACG ACGCTACACG CGATTTCATG
ACGCATGCCC TCCCACTGCT CCATCGATAC GGCTTCCCAT CCTCACTCTT TGTCCCAACC
GATCGGGTCG GCGGCTCGGC AATATGGGAT TCGGCCTATG GATCGCCAGC TCCACTGCTA
ACTTGGGAGG AACTTGCGGC AGTCGCGAAC AGCGACGTGA CGCTGGGCGC CCACGGTGTC
CGGCACGTGC GTTTATCTGC CCTGGCACCG GAGAGCCTAT TGCGAGAGCT TGCTGGCTCC
AAAGCAATGC TTGAAAAACG TCTAGGTCGA GAAGTGCTGG CGGTCGCTTA TCCGTACGGC
GACTTCGACC CCGCTATCCG GGACATAGCC GAACAATGTG GTTACCGGAT CGGGTTAAGC
TGCGTCGGTG GCACGGTGCG GGCGGATGCC GATAAGCTCG CATTGAAAAG ACAGGAGGTC
TTTCGCGGGA TCAGCCAGTC AGAGTTTGCA AATCTGCTAT TTGGCTGA
 
Protein sequence
MPMVSIITPA HGAETTISST LESLIAQSHP DWECFIIDDG STDQTAEIAD RFAARDTRFR 
VLRQEQSGVS AARNAGLAQA KGDWVVFLDS DDALAPFHLE TMLNHTRTLP QADILHCGWR
RLKNGAPWWQ SHPAVKMDNP FAVAARYCPF AIHAALLRRS RLAEVGGFNP EMKMCEDWDL
WQRLARAGAE FAPVEGLMAD VSVEAGSLSS NRVKHLDFGL KVIRRGHHSD PRIAYPVPAL
AQGITKAALG MACWYLAIWL VGASIGQGDN PVELLDRVAE PIPPDADVYA FCAIMIDGIV
VGAFPDEPIW PGLWSKIQDK LPDLEAWLDR HAPQEAFGSL LVRSLERKVA DQLPTNVPTT
IGRTRIQPID LDQPIVDLIL PGIERLRCCI WRGSTLLGQL EFVVFGAIEA DAIRNRLRRE
FGPEFDDLNH PAAALLAEDG FVNLSDQDDG RAAASMAAPS YSGVAQQVLK LMAKVSYWAE
AKTVTGWWSK KEPTPTRVDS ISGMAHAGRA EFDRIVEEES QRAVAASTQM SMEMPSKKTG
PVGDEVPRYD TEAYWEELFS QVDPWDYRNN YETVKYLQTL SLLDDRRFSN GLELACAEGT
FTRMLAPRVD NLLATDISAS AVARAASLQD HGSAVAYRQL DLLSDALEGP YDLIVCSEVL
YYFENREKLQ QIVDKIAGSL RTGGWFVTAH ANLLIDMPHE TGFGWPHEFG AVGIGEMFDQ
HPDLTLSVEA KSPLYRIQRF EKVELGGFLE PTRVEVNTAQ PLPLQVASQV RWRGGQVVEA
AADWNDVPIL MYHQVSDDGA EQLARYRQSP EAFETQLAFL RDAGWRGMTL DRLLACFDEG
AKPPEKTLVL TFDDATRDFM THALPLLHRY GFPSSLFVPT DRVGGSAIWD SAYGSPAPLL
TWEELAAVAN SDVTLGAHGV RHVRLSALAP ESLLRELAGS KAMLEKRLGR EVLAVAYPYG
DFDPAIRDIA EQCGYRIGLS CVGGTVRADA DKLALKRQEV FRGISQSEFA NLLFG
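
The sequences above are printed in blocks of 10 characters, 60 per line, so the spaces and line breaks need to be removed before any analysis. The sketch below is an illustration only: the helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample is just the first line of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, used here as a small sample.
SAMPLE = "ATGCCGATGG TTTCCATCAT CACACCGGCG CATGGAGCGG AAACAACGAT TTCTTCGACT"

if __name__ == "__main__":
    dna = clean_sequence(SAMPLE)
    print(len(dna))   # 60 bases in the sample line
    print(dna[:3])    # ATG, the start codon
    print(round(gc_percent(dna), 1))
```

Pasting the full gene sequence in place of `SAMPLE` allows the resulting length and GC percentage to be compared with the Gene Length and GC content fields reported in the record.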