Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4716 |
Symbol | |
ID | 8007191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 85828 |
End bp | 87126 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644821649 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002972909 |
Protein GI | 241113074 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGCG CCACCGTCAG CGCATTTGCG CTGAGCACCA TGTTATTTTC AGCATCTACG TCATTTGCCC AGGAACTTGC CACGAAGGAC AGGATCGGCC TTGCCGATGC GCCCAAATCC CTCGTCGTCC GCTTGACGAA CGACAGTCCC AACAATTCGG ATCCGGCGAT CGCCGAGGGC TATCAGAAGC TCTTCGTCGA TTTCATCAAG AAGCATCCGG ACTGGAAATT GCAGATGCAA TTCATGTCCT CGGACATCGG CACCGAACAG GCCAAGATGC TGGAGCAGGC CAAGGCCGGC AACGCGCCTG ACTGCGCCGC TGTCGACTCC TTCGTCCTCT CGCAGTTCAT GGTCAATAAA GTTCTTGCGG ACTTCACGCC GTATTTTTCG AAGGAGGAAG TGGATGACCT GTTCCCCTTC ATCCGCAGCG GCATCACCGA TAAGGACCAG ACGGTCCGCG CTTGGTGGTG GGATACCGAT CTTCGTGTGC TCTACCGCAA CAAGTCGATC GTTGCGGACG CGCCGCAGAC ATGGGACGAC CTGAAAAAGG CCGCGCTCGC CTCCACCAAG GAGGGCATGG AAGGTGTCCT CTTCAATGGC GGGCGCTGGG AAGGCACGAC GTTCGACTGG CTCGCGAATT ATTGGGCGCT GGGCGGCAAG CTTGTCGACG ATTCAGGCAA GCCGGTATTC GGCGAAGGCG AGAACAAGGA GAAATTCCTG AAGGCCCTGA ATTATTTCAA GGATCTCGTC GATTCGGGTG CCGCGCCCAA GCGCGTCAGC ACGATCGCGA ATTATGACGA TTTGAATGCT GCGGCCGCCG CGGCGACGAC GGCTCTCTTC ATCGGCGGCA ACTGGCAATA TGCGCAGTTG AAGGCCACGC TCGACGAGGA CGAGTTCAAC AACTGGACAT TCTCGCCGAT CCCCGGCCCC AGCGCCGATC AGCGTTCGAC CGGCACCGGC GGTTGGACGA TCGCCTCCTT CAGTAAGGAC AAGGATAAGG TCGAGATGTG CGCCAACCTC GCCCGCGAGG TTTATATGGG GCCGGCAAAC GCGCTGCAGC AGCAGTTGCC GACCAGGAAG TCGCTGTTCG ACAAATATGA GGTCTTCTCT ACCGAAGCCA ACAAGACCTT CGCCAAGGCT CTGGTCGACG GACAGGCACG CCCGGGTGTG CCGATCTATC CCGAAATCTC CAACCAGATC CAGATCATGA TGGGCGACGT GCTCTCGGGG ACCAAAAAGC CGGAAGACGC GTTGAATGCC GCCTTCAACG CGGCGCTGGA GGCCTACAAG CGTCTCTAG
|
Protein sequence | MKSATVSAFA LSTMLFSAST SFAQELATKD RIGLADAPKS LVVRLTNDSP NNSDPAIAEG YQKLFVDFIK KHPDWKLQMQ FMSSDIGTEQ AKMLEQAKAG NAPDCAAVDS FVLSQFMVNK VLADFTPYFS KEEVDDLFPF IRSGITDKDQ TVRAWWWDTD LRVLYRNKSI VADAPQTWDD LKKAALASTK EGMEGVLFNG GRWEGTTFDW LANYWALGGK LVDDSGKPVF GEGENKEKFL KALNYFKDLV DSGAAPKRVS TIANYDDLNA AAAAATTALF IGGNWQYAQL KATLDEDEFN NWTFSPIPGP SADQRSTGTG GWTIASFSKD KDKVEMCANL AREVYMGPAN ALQQQLPTRK SLFDKYEVFS TEANKTFAKA LVDGQARPGV PIYPEISNQI QIMMGDVLSG TKKPEDALNA AFNAALEAYK RL
|
| |