Gene Information |
Locus tag | Rleg_4707 |
Symbol | |
ID | 8007182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 75310 |
End bp | 76569 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644821640 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002972900 |
Protein GI | 241113065 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
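The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 75310
END_BP = 76569
GENE_LENGTH_BP = 1260
PROTEIN_LENGTH_AA = 419


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Compare the computed values with the lengths reported in the record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons agree with the record: 76569 - 75310 + 1 gives 1260 bp, and 1260 / 3 - 1 gives 419 aa.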
Plasmid Coverage Information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAATT TGCTTGGTGC CAGCGCACTT GCGCTTGTTT TCATAACCGC CACTGCCAAT GCCGAAACCA TCAACATTCT GGTGGAGGGC GGCGGCGAAA TGTTGCAGAA GGCGGTCGCC GAGAAGTTTA CTGCCGAAAC AGGCATCAAA GTGAACTTCA CGACCGTTCC CTATCAGGGT GTCTTTGACA AGTTCTCAGC CGAAATCGCC TCAGGCTCTT CCGCGTTCGA TGTCGTGACG ATCGACGTCG TGTGGAATGC GAAGTTTGCA AGCCACGTCG AGGATCTCTC GGCTCTTTTC ACCGATGCGG TTCGCGCGGA TCTGCCGCCT GTCTTGCTTG CCGACGCGAA GGTTGGTGAC AAACTGATCG GCATGCCTGC CTGGGCCAAT GCCGAGATCG TGTTCTATCG CAAGGACCTG TTTGATAAGG CTGAGGAAAA GGAGGCTTTT CAGGCCAAGT ATGGCTATCC TCTCGCGCCG CCCAAAACCT GGCAGCAGTG GCGCGACATT GCGAAATTCT TCACGCGTGA CACTGACGGC GATGGAAAGA CCGACTTCTG GGGCACCGAC ACCATCGGCA CGTTTTCAGA GGAATGGATG GCGCATGTGC TGCAAGCGGG TTCGCCAGGG GTGATCCTCG ATAAGGACGG GCAGGTCATC ATCGACAACG AGGCGCACAA AAAGGCACTG GAATTCTACA TCGCGCCACA CTGCATTGAT CATTCCGTTC CTGAAAACGT GAACGAAATC GGCTGGGGCG AGGCGCAGAA CCTGTTCTAT CAGGGCAAAA CAGCGATGAT GAAGTTCTGG GCGCACGCCT ACAAGATGAC GCCTCCGGAT TCAAAGGTCA GCGGCAAGGT CGGCGTGGTG CCGATGCTGG CCGGCGACGC CGGGATCGCA GCTGTTCCTG GCCCTTGGTA CAACGTCGTT CCCTCGACAT CCGAGCACAA GGATGCAGCG AAAAAATTCA TCTCGTTTGC CATCGCCAAT AATGCCCTGG GTATCGAAGC TCCGCTCGGC CTTGCCGCGA CGAATTCCGC CTATCGCAGC TATTCAGGCA AGGCCGGCTA TGAGCACTTC CCCCCACTTC TTGAGACGCT GAGCGCGCCT GCCACCCAGG GCCGGCCGAT CAATGAAAAA TATCAGGAAA TCGTCGATGA AGCTGTGCTG CCGGCTATCC AGCAGGCACT CACCTGCAAG GCGGATATCG GGGAGGTCCT GACGGAAGCC AAGGAAACGA TCGAGGACAT TCTCAACTAG
|
Protein sequence | MKNLLGASAL ALVFITATAN AETINILVEG GGEMLQKAVA EKFTAETGIK VNFTTVPYQG VFDKFSAEIA SGSSAFDVVT IDVVWNAKFA SHVEDLSALF TDAVRADLPP VLLADAKVGD KLIGMPAWAN AEIVFYRKDL FDKAEEKEAF QAKYGYPLAP PKTWQQWRDI AKFFTRDTDG DGKTDFWGTD TIGTFSEEWM AHVLQAGSPG VILDKDGQVI IDNEAHKKAL EFYIAPHCID HSVPENVNEI GWGEAQNLFY QGKTAMMKFW AHAYKMTPPD SKVSGKVGVV PMLAGDAGIA AVPGPWYNVV PSTSEHKDAA KKFISFAIAN NALGIEAPLG LAATNSAYRS YSGKAGYEHF PPLLETLSAP ATQGRPINEK YQEIVDEAVL PAIQQALTCK ADIGEVLTEA KETIEDILN