Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1107 |
Symbol | |
ID | 8012229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1090459 |
End bp | 1091949 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644823690 |
Product | glycosyl transferase family 39 |
Protein accession | YP_002974941 |
Protein GI | 241203845 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0699757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0751827 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAGCGGA TTACCAAAAG CATTACGAGC GCAAGTATTT TTCTGGCAGG CTATTTCCTG CTGAACATCG CGCTTCGCAT CGCCTTGCCG CATACGCTCG ATCTCGACGA GGCGGAGCAA TCCTTCTACT CGCAATACCT GCTTGCCGGC TACGGCCCGC AGCCGCCCTT CTACAACTGG ATCCAATATG CCATCGTTTC GGTGACCGGC ATATCGATGT GGGTGCTCTC GGTGCCCAAG AACATCATCC TCTTCGGCTG TTATCTCTTC TACGGGCTTG CTGCCCGCGA GGTACTGAAG AGCCGTTCGC TCGCAGCCGT CGCCATGTTG AGCCTGATTA CTCTGCCGCA GGTCGGCCTG ATGGCGCAGC GCGAACTGAC CCATACGGTT GCCCTGCTGT TTGCGACCTC GCTCTTCCTC TTCGGTTTTT TCCGCACGCT GCGCCAGCCG ACGATCGGGA GCTATCTCCT CATCGGCATC GCCACGGGCA TCGGGCTCAT CTCGAAGTAT AATTTCGCCA TCCTGCCATT TGCTGCTCTC GTCGCCGTGC TGCCAGAGCG GGAATGGCGC AGCCGGCTCA TCGACTGGCG TTTGCTGCCG GCCGCCGTCC TTGCCATTCT GATCGTGCTG CCGCATGCGC TCTGGCTGCC TGACAATCTT GCCAGCGCCT CTGCGCCGAC GCTGGAGCGG ATGACTGCCG AACACCTGGC CCCGGCCGGC CTCCCCCGCA TCGGGCAAGG ACTGCTGTCT CTCGTCATCG CCGTCCTCGG CTTCGTCGCA TTGCCGATCG TTCTGATCGC GGCCGCCTTC CGGCGCAACT TCTTTCGCGC GCTCTCCTCT TCCAGCCCGA TGATCCGGGT GATCGAGCGG ATGATGGTCA TCAGCCTGCT CGCCTTCGTC GGCGTGATCC TCTTCGCCGG CGCGAGCGAT ATCCACGAGC GCTGGCTCGA CCCATGCCTG CTCGTCCTGC TGATCTATCT GTTCCTGAAA CTGGAAACCG CAGACCTCGA TCTTTCCGCC GGTCTTGCGC GCTTCCGGCC GGTGGTGCCG GTCTTCATGG TCGTCATCCT GTCGATCCTT CTTTTCCGGA TCGCCGGCAT TCAATATATC GGCACTTATA CGAGAACGAA CGTACCCTTT TCCGGCTACG TTGCTGAATT GACCGCGACC CGCAAGCCGG TTCTGATCGT GGCGGGAACC AAGTTCGTTG CCGGCAACAT GCGGCTAAAG TTTCCCGACG TTCCCGTCGT GATCCCGTTC TTCCCCGGTC CCGGAGTTCC CGAATATGCT GACGCGAAGG GGCCGGTGCT GGTTATCTGG CGCGGCGAGA CCGCAGATGA TCCAACAATT TCCCCCGGCT TCGCCAATGA CCTCGTCAAA TCGGGCATTC ATCTGCCAGA GTTGAAGACG CTGACGCTGC CCTATCTCTT CGGTGACGGC AAACGCAGCT TCTCCATTGG TTACTCCTGG GTGGACGGCG GCGCGAAATA G
|
Protein sequence | MERITKSITS ASIFLAGYFL LNIALRIALP HTLDLDEAEQ SFYSQYLLAG YGPQPPFYNW IQYAIVSVTG ISMWVLSVPK NIILFGCYLF YGLAAREVLK SRSLAAVAML SLITLPQVGL MAQRELTHTV ALLFATSLFL FGFFRTLRQP TIGSYLLIGI ATGIGLISKY NFAILPFAAL VAVLPEREWR SRLIDWRLLP AAVLAILIVL PHALWLPDNL ASASAPTLER MTAEHLAPAG LPRIGQGLLS LVIAVLGFVA LPIVLIAAAF RRNFFRALSS SSPMIRVIER MMVISLLAFV GVILFAGASD IHERWLDPCL LVLLIYLFLK LETADLDLSA GLARFRPVVP VFMVVILSIL LFRIAGIQYI GTYTRTNVPF SGYVAELTAT RKPVLIVAGT KFVAGNMRLK FPDVPVVIPF FPGPGVPEYA DAKGPVLVIW RGETADDPTI SPGFANDLVK SGIHLPELKT LTLPYLFGDG KRSFSIGYSW VDGGAK
|
| |