Gene Information |
Locus tag | Rleg2_0953 |
Symbol | |
ID | 6979671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 977141 |
End bp | 978628 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643395664 |
Product | putative glycosyltransferase protein |
Protein accession | YP_002280473 |
Protein GI | 209548556 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.452893 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAA CCGACAGCCG CGACATCCGC TGGATTTTCC TCCTGCTGGC CGCCTATTTC GTGCTCCAGA TCGGCGTGCG GCTGGCGACC TCGCATTCAC TCGACCTCGA CGAAGCCGAA CAGGCCTTCC GCTCGCAATG GCTTGCCGCC GGCTACGGCC CGCAGCCGCC CTTCTACAAC TGGCTTCAAT ACACCGTCTT CCAATTTACA GGCGTCTCGC TTGCAGCACT TTCTCTCGTG AAGAACCTTC TGCTGTTCAT GTCCTACCTG CTCTATGGCC TGACCGCGCG TCTCCTGCTG CGCGACAAGG CGCTGGTGGC GATGGCGACG CTCGGGCTGC TGACGATCCC GCAGATGGCT TTCGAGATGC AGCGCGACCT CACGCATACG GTCGCGGTGT TCTTCTCGGC CAGCATCTTC TTTTGCGGCT TCATCCGCAG CCTGAAGCAG CCAAGCCTTG GCTCCTATCT GATCGCCGGC ATCGGCATCG GCTTCGGGCT GCTCGCCAAA TATAATTTCG CGATCCTGCC GGCGGCGGCG CTGATTGCCG CACTGGCGGA TGCGCGTTGG CGAGCGCGGA TCTTCGATTG GCGGCTGGTA CTGACCGCTG TCGTGGCGCT GATCATCGTC CTGCCGCATC TCTTCTGGCT GAAGGACAAT CTCGATTTCG CCACCGCCCG CACGCTGGAG AAGATGACCG CGAGCGGCGA TGCGAGCTAT CTCATGCAGG TGGCGATGGG GGTCAGCTCG CTTGCTCTCG CCATCATCAG CTTTGCCGGA TTGACGGTGG CGGTCTTTGC CATCGTCTTC GGCAAGAGCC TTCGTCCGGC GCTCACCGCC GGCTCGGAAT GGACGCGGCT TATCGAGCGG ATGATGCTCG TCTTCCTCAT CGGCATTCTA TTTCTGATCG TCTTCGGTGG CGCGGCCGGC ATCAAGGATC GCTGGCTGGT GCCCATGCTC TTCATCCTGC CGCTCTATTT CTGCCTGAAG ATCGAGGCCG CCGGCGTCAC GACAGGCCGG GCCTTCCGGC GTTTCATGGT CGCCGTCGCC ATCATCATGA TCGGCGTGCC GGCCGCCCTT TACGGCAGCG TCGCGGCCGC ACGCTTCACC GGTCATTACG AACGATTGAA CAGGCCCTAT GCCACGATGC TGGAAAACCT GCGCAAACAG GCCGAGCCGA TGGCGATCCT AGCCGGCGAC AGCCTGCTTG CCGGCAATCT CCGGCAGGAT ATTCCCGGCG TGCCGGTCCT CTCCGTCGAT TATCCCGGCT TCCATCCGGA TCTTGCCGGC CGGCGACCGC TTCTCCTGGT CTGGCTGATA CCGCCGAAGG GCGGCAGCGA AGCACTCCCG CCGGCCATGG CCCAATGGCT GCAAGCCAAT CTCGGCGCCT CCGCGCCTGA GGGATTGGTG ATCGACGTGC CCTATTTCTA TCAGCGCGGC GAGGACCGCT ACCGATTCGG CTATGCTTGG ATCAACCAGC CAGGCTGA
|
Protein sequence | MTETDSRDIR WIFLLLAAYF VLQIGVRLAT SHSLDLDEAE QAFRSQWLAA GYGPQPPFYN WLQYTVFQFT GVSLAALSLV KNLLLFMSYL LYGLTARLLL RDKALVAMAT LGLLTIPQMA FEMQRDLTHT VAVFFSASIF FCGFIRSLKQ PSLGSYLIAG IGIGFGLLAK YNFAILPAAA LIAALADARW RARIFDWRLV LTAVVALIIV LPHLFWLKDN LDFATARTLE KMTASGDASY LMQVAMGVSS LALAIISFAG LTVAVFAIVF GKSLRPALTA GSEWTRLIER MMLVFLIGIL FLIVFGGAAG IKDRWLVPML FILPLYFCLK IEAAGVTTGR AFRRFMVAVA IIMIGVPAAL YGSVAAARFT GHYERLNRPY ATMLENLRKQ AEPMAILAGD SLLAGNLRQD IPGVPVLSVD YPGFHPDLAG RRPLLLVWLI PPKGGSEALP PAMAQWLQAN LGASAPEGLV IDVPYFYQRG EDRYRFGYAW INQPG
|
| |
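The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; both assumptions are consistent with the values listed above. The function names are hypothetical helpers.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the stop codon is part of the gene."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces and other non-ACGT symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


# Values copied from the record above.
length_bp = gene_length(977141, 978628)   # 1488, matching "Gene Length"
residues = protein_length(length_bp)      # 495, matching "Protein Length"
```

Passing the Gene sequence field (with its spaces left in) to `gc_fraction` gives a value that can be compared with the listed GC content.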