Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3198 |
Symbol | |
ID | 8014094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3200120 |
End bp | 3203167 |
Gene Length | 3048 bp |
Protein Length | 1015 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644825761 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002976988 |
Protein GI | 241205892 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATGG TTTCCATCAT CACACCGGCG CATGGAGCGG AAACAACGAT TTCTTCGACT TTGGAGAGCC TGATAGCGCA AAGTCATCCC GACTGGGAAT GCTTCATCAT TGATGATGGT TCAACCGATC AGACCGCTGA GATCGCGGAC CGTTTCGCGG CTCGGGATAC GCGCTTCCGT GTGCTACGGC AGGAGCAGAG CGGCGTTTCG GCTGCTCGCA ATGCGGGACT GGCACAGGCG AAAGGCGACT GGGTCGTCTT TCTCGACTCT GACGACGCGC TGGCACCCTT CCATCTCGAA ACCATGCTGA ATCACACGCG CACTTTGCCG CAGGCCGACA TTCTGCATTG CGGTTGGCGC CGTCTGAAAA ATGGTGCTCC CTGGTGGCAG TCTCACCCCG CGGTGAAAAT GGACAATCCG TTTGCCGTAG CGGCGCGATA TTGCCCCTTC GCGATCCACG CCGCCTTGCT TCGCCGGTCC CGGCTTGCAG AAGTCGGCGG CTTCAATCCA GAGATGAAAA TGTGCGAGGA CTGGGACCTA TGGCAGCGCC TCGCCCGAGC TGGCGCCGAA TTTGCGCCGG TCGAAGGCCT GATGGCTGAC GTTAGCGTAG AGGCCGGTTC GCTGTCGTCA AACAGGGTTA AGCATCTCGA TTTTGGTCTT AAGGTGATCC GGCGCGGCCA CCATTCCGAT CCGCGCATCG CCTATCCGGT GCCGGCGCTC GCCCAGGGAA TAACAAAAGC TGCTTTGGGA ATGGCGTGCT GGTATCTCGC GATCTGGCTT GTCGGCGCCT CAATAGGTCA GGGCGACAAT CCTGTCGAAC TGCTTGACCG GGTGGCAGAG CCTATCCCAC CTGACGCCGA TGTCTACGCG TTCTGCGCAA TAATGATAGA CGGGATTGTG GTCGGCGCCT TCCCCGACGA GCCGATCTGG CCCGGACTCT GGTCGAAGAT CCAGGACAAG CTTCCGGACC TAGAGGCTTG GCTTGATCGG CATGCTCCCC AAGAAGCTTT TGGTTCCCTG TTGGTTCGTT CACTGGAGCG AAAGGTCGCT GACCAGCTGC CGACCAACGT GCCCACCACG ATCGGGAGAA CAAGGATACA GCCAATTGAT TTGGATCAAC CTATCGTCGA CCTTATCTTG CCCGGCATTG AGCGGTTGCG CTGTTGCATC TGGCGAGGTT CGACGCTGCT GGGGCAACTG GAGTTTGTTG TCTTCGGAGC AATCGAAGCG GATGCTATAC GAAATCGACT TAGGAGGGAG TTCGGCCCGG AGTTCGACGA TCTGAACCAC CCAGCAGCAG CTTTGTTGGC AGAAGATGGC TTCGTCAATC TTTCTGATCA GGATGACGGG CGCGCCGCCG CCAGCATGGC AGCGCCAAGC TACAGCGGAG TAGCTCAACA GGTGCTGAAG CTCATGGCGA AAGTATCATA TTGGGCCGAA GCAAAGACAG TAACGGGTTG GTGGAGCAAG AAAGAGCCGA CACCAACACG TGTCGACTCG ATATCAGGCA TGGCGCACGC CGGGCGCGCC GAATTCGATC GAATCGTTGA AGAGGAAAGC CAGCGGGCGG TTGCAGCCTC AACGCAGATG TCCATGGAGA TGCCGTCGAA AAAAACCGGC CCAGTGGGCG ATGAGGTGCC GAGATACGAT ACGGAGGCGT ACTGGGAGGA GCTGTTTTCC CAGGTAGACC CTTGGGACTA CCGAAACAAC TATGAGACTG TGAAATATCT TCAAACACTC TCGTTGTTGG ACGATCGGCG CTTTTCAAAC GGCCTTGAGC TGGCCTGCGC GGAAGGAACC TTTACACGGA TGTTGGCCCC CCGCGTCGAC AACCTGCTGG CGACCGACAT TTCGGCGTCG GCAGTCGCTC GAGCGGCCTC GCTTCAGGAC CATGGTTCCG CGGTAGCCTA TCGGCAGTTG GATCTTTTGA GCGACGCGCT AGAAGGGCCT TACGATCTCA TCGTCTGCAG TGAAGTTCTG TACTATTTCG AGAACAGAGA AAAACTCCAG CAGATAGTCG ATAAGATTGC GGGCAGTCTT CGCACGGGTG GTTGGTTCGT TACTGCCCAC GCCAACCTCC TGATTGATAT GCCGCATGAG ACAGGATTTG GGTGGCCGCA TGAATTCGGT GCGGTCGGCA TAGGTGAGAT GTTTGACCAG CATCCGGATT TAACTCTCTC GGTCGAAGCC AAGAGTCCGC TCTATAGAAT CCAAAGGTTC GAGAAGGTGG AGCTCGGGGG CTTCCTTGAG CCCACACGCG TGGAAGTAAA CACGGCACAG CCCTTGCCTC TTCAGGTCGC CAGCCAAGTA CGCTGGAGAG GGGGGCAAGT TGTGGAAGCG GCCGCCGACT GGAACGATGT CCCGATCTTG ATGTATCATC AGGTTTCGGA CGACGGTGCT GAACAGCTTG CTCGATACCG CCAGTCTCCG GAGGCTTTCG AAACTCAACT GGCCTTCCTG CGCGACGCCG GATGGCGCGG GATGACTTTG GATCGGCTGC TTGCTTGTTT CGATGAAGGT GCCAAACCGC CCGAAAAGAC ATTAGTTCTG ACCTTTGACG ACGCTACACG CGATTTCATG ACGCATGCCC TCCCACTGCT CCATCGATAC GGCTTCCCAT CCTCACTCTT TGTCCCAACC GATCGGGTCG GCGGCTCGGC AATATGGGAT TCGGCCTATG GATCGCCAGC TCCACTGCTA ACTTGGGAGG AACTTGCGGC AGTCGCGAAC AGCGACGTGA CGCTGGGCGC CCACGGTGTC CGGCACGTGC GTTTATCTGC CCTGGCACCG GAGAGCCTAT TGCGAGAGCT TGCTGGCTCC AAAGCAATGC TTGAAAAACG TCTAGGTCGA GAAGTGCTGG CGGTCGCTTA TCCGTACGGC GACTTCGACC CCGCTATCCG GGACATAGCC GAACAATGTG GTTACCGGAT CGGGTTAAGC TGCGTCGGTG GCACGGTGCG GGCGGATGCC GATAAGCTCG CATTGAAAAG ACAGGAGGTC TTTCGCGGGA TCAGCCAGTC AGAGTTTGCA AATCTGCTAT TTGGCTGA
|
Protein sequence | MPMVSIITPA HGAETTISST LESLIAQSHP DWECFIIDDG STDQTAEIAD RFAARDTRFR VLRQEQSGVS AARNAGLAQA KGDWVVFLDS DDALAPFHLE TMLNHTRTLP QADILHCGWR RLKNGAPWWQ SHPAVKMDNP FAVAARYCPF AIHAALLRRS RLAEVGGFNP EMKMCEDWDL WQRLARAGAE FAPVEGLMAD VSVEAGSLSS NRVKHLDFGL KVIRRGHHSD PRIAYPVPAL AQGITKAALG MACWYLAIWL VGASIGQGDN PVELLDRVAE PIPPDADVYA FCAIMIDGIV VGAFPDEPIW PGLWSKIQDK LPDLEAWLDR HAPQEAFGSL LVRSLERKVA DQLPTNVPTT IGRTRIQPID LDQPIVDLIL PGIERLRCCI WRGSTLLGQL EFVVFGAIEA DAIRNRLRRE FGPEFDDLNH PAAALLAEDG FVNLSDQDDG RAAASMAAPS YSGVAQQVLK LMAKVSYWAE AKTVTGWWSK KEPTPTRVDS ISGMAHAGRA EFDRIVEEES QRAVAASTQM SMEMPSKKTG PVGDEVPRYD TEAYWEELFS QVDPWDYRNN YETVKYLQTL SLLDDRRFSN GLELACAEGT FTRMLAPRVD NLLATDISAS AVARAASLQD HGSAVAYRQL DLLSDALEGP YDLIVCSEVL YYFENREKLQ QIVDKIAGSL RTGGWFVTAH ANLLIDMPHE TGFGWPHEFG AVGIGEMFDQ HPDLTLSVEA KSPLYRIQRF EKVELGGFLE PTRVEVNTAQ PLPLQVASQV RWRGGQVVEA AADWNDVPIL MYHQVSDDGA EQLARYRQSP EAFETQLAFL RDAGWRGMTL DRLLACFDEG AKPPEKTLVL TFDDATRDFM THALPLLHRY GFPSSLFVPT DRVGGSAIWD SAYGSPAPLL TWEELAAVAN SDVTLGAHGV RHVRLSALAP ESLLRELAGS KAMLEKRLGR EVLAVAYPYG DFDPAIRDIA EQCGYRIGLS CVGGTVRADA DKLALKRQEV FRGISQSEFA NLLFG
|
| |