Gene Information |
Locus tag | Rleg_6232 |
Symbol | |
ID | 8016244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | - |
Start bp | 290549 |
End bp | 291874 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644827537 |
Product | O-antigen polymerase |
Protein accession | YP_002978737 |
Protein GI | 241258853 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
|
|
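The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption that fits the listed numbers but is not stated in the record) and that the CDS ends with a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 290549
END_BP = 291874
GENE_LENGTH_BP = 1326
PROTEIN_LENGTH_AA = 441


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With these assumptions, 291874 - 290549 + 1 gives 1326 bp, and 1326 / 3 - 1 gives 441 aa.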
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.394036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.503415 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGCGT ACACGCTTAG AATAGCGAAT GGGCCGACTG CCAGTTGGCG AACAATAGAG TGCTGGGGCG CCGGCCTCTG CCTTTTTCTT CAGACAGGCG CGCTTTTCCC GCTAATGTTG GCGGATGCCG ACGGAGGCCT TAGCGATCAT GCCAGGTCGA TATTGCGTCT GCTTTGTCTA CCCGTATACG GGTTTACATT GCTGATGCTG GCACGAAATT TTCCGCACTT CATCACAGCG CTGAAGCGAA ATTGGTTTGT CCCGCTGATG GTCGCTATGC CCTTCTTGTC CGTCTTCTGG TCGGTTGGCC CGTCGACGAC TTTCAGGCGT GCCATCGGCC TGCTCTTCAC GGTTCTTTTG GCTTATGTAT TGGCAATACG CTTTACGCCG AGGCAGCTGC TTCTCATTGC ATTTGCGACC TTCGGAACAT GTATCGTCCT CAGCCTTTTG CTTCTTGTGG TGTCACCGGG GCTCGCTCGC ATGCCCACGG ACAGCGCGGT GCGCGGCATA TTCATCCACA AGAACAGCCT TGGTTGGTAC GCCGGCATGA TGATACTGGT GTCGACCGCC GTCGTAATGG ACGGCAACCG AGGCTTGCGG CGAACCGCTT TGATCTTGCT GATGGCGGGC GGGGCTTGTC TCTTCCTTTC CGGATCGATG ACCGCGACAA TTGCGACGGT ATCGGCATAT TGCCTGATCG GATTCTACTC CATGTTGCAG CGAATCCGTG GCATCGGTCG GATCGTCTTC ATCCTGTTTT TTGTTCAAAT GTTCGTCGGG ATCCTCCTTT TGCTTCATGA ATTCCTGGTC CCTTTCCTTG AGGCGCTTGG CAAGGATGCC ACGTTGACGG GCCGAGTGCC CTTGTGGGAA CTTGTCGACG GTCAGATCGC CGATCATCTC CTGCTCGGCT TCGGCTACCA GGCGTTTTGG ACAGAAGCGA ATCCCGAAGC GTGGATTATC TGGTCAAAGA TCCAATGGAT GGCTCCCCAC GCCCATAATG GATTCCGCGA TACGCTGCTG AGCTTCGGAA TAAGCGGAAT GACTTTGTTC GCTCTGATGC TTTTGCAGGC ACTCCGCCAA GGGGCGGCCC TACAATGCGG GGACCCGCGT TATGGCTGGC TCTGGCTGAA CGTCTTCACG GTTGTGGTTT TGGTGATGAA CCTGACCGAG ACAATCTTCC TTATCCAGAA TGACGCGATT TTCATTCTGT TCACAACAGC CATCATCATG TTCTCTCTAT ACAAGCCTGT TGTTGTTTCG ACGGCTCCCG GCCGGCAACT GCGAGCATCG GCCCGCAGCC CAACGGCGGA GCTGCAAATC TCATGA
|
Protein sequence | MRAYTLRIAN GPTASWRTIE CWGAGLCLFL QTGALFPLML ADADGGLSDH ARSILRLLCL PVYGFTLLML ARNFPHFITA LKRNWFVPLM VAMPFLSVFW SVGPSTTFRR AIGLLFTVLL AYVLAIRFTP RQLLLIAFAT FGTCIVLSLL LLVVSPGLAR MPTDSAVRGI FIHKNSLGWY AGMMILVSTA VVMDGNRGLR RTALILLMAG GACLFLSGSM TATIATVSAY CLIGFYSMLQ RIRGIGRIVF ILFFVQMFVG ILLLLHEFLV PFLEALGKDA TLTGRVPLWE LVDGQIADHL LLGFGYQAFW TEANPEAWII WSKIQWMAPH AHNGFRDTLL SFGISGMTLF ALMLLQALRQ GAALQCGDPR YGWLWLNVFT VVVLVMNLTE TIFLIQNDAI FILFTTAIIM FSLYKPVVVS TAPGRQLRAS ARSPTAELQI S
|
| |
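The sequences above are printed in space-separated groups of ten characters, so they need cleaning before any computation. The sketch below shows one way to strip the grouping, estimate GC content, and translate a coding sequence. The listed gene sequence begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the strand is "-"; the sketch assumes that and does no reverse-complementing. The codon assignments are those of the standard table, which translation table 11 shares; table 11 differs in its alternative start codons, which this sketch does not handle. All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for grouping; upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three groups of the gene sequence listed above.
    head = clean_sequence("ATGAGAGCGT ACACGCTTAG AATAGCGAAT")
    print(translate(head))
    print(round(gc_fraction(head), 2))
```

Applied to the first three groups of the gene sequence, `translate` yields the first ten residues of the listed protein sequence, MRAYTLRIAN. Running `gc_fraction` on the full cleaned gene sequence gives a value that can be compared with the GC content field in the record.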