Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3491 |
Symbol | |
ID | 6982245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3608403 |
End bp | 3609662 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643398209 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002282984 |
Protein GI | 209551067 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAGT TTTTGAGTTC GGCGGCGGTT GCCGTCGTGA TGATGGCTGG CCTCGGTGCG GCGCAGGCAG CCGACGTGAA GGAAGTTCAG ATGCTGCACT GGTGGACATC CGGCGGCGAG GCGGCGGCTT TGAACGTTCT GAAGCAGGAT CTGTCGAAGG AAGGTTTTGC CTGGAAGGAC GTTCCGGTGG CCGGCGGCGG CGGTGATGCG GCGATGACGG CGCTGAAGGC GATGGTTGCG GCCGGCACCT ATCCCACAGC TTCGCAGATG CTGGGCTATA CCGTGCTCGA TTATGCCCAG GCCGGCGTCA TGGGCGATCT GACCGAGACG GCGAAGAAGG AAGGCTGGGA CAAGTCGGTG CCGGCGGCGC TGCAGAAGTT CTCGGTCTAT GACGGCAAAT GGGTCGCAGC TCCGGTCAAC GTGCACTCGG TCAACTGGCT GTGGATCAAC AAGGCGGTGA TGGACAAGAT CGGCGGCACC CAGCCGAAGA CCTTCGACGA TCTGATCGCG CTGCTCGACA AGGCCAAGGC CGCAGGTGTC ATCCCCTTGG CGCTCGGCGG CCAGAACTGG CAGGAAGCGA CGATGTTCGA TTCCATCGTG CTGTCGACCG GCGGGCCGGA ATTCTACAAG AAGGCCTTCA ACGATCTCGA TGAGGAGTCG CTGAAGTCGG ACACGATGAA GAAGTCCTTC GATAATCTGG CGACGATCAT CAAATATGTC GATCCGAACT TCTCCGGCCG CGACTGGAAC CTGGCGACCG CCATGGTCAT CAAGGGTGAT GCGCTGGTGC AGGTAATGGG CGACTGGGCC AAGGGCGAAT TCGTCGCCGC CAAGAAGACC CCGGACAAGG ACTTCCTCTG CTACCGCTTC CCCGGCACCG AAGGCAGCGT CGTCTATAAC TCCGACATGT TCGGCATGTT CAACGTCCCC GACGACCGCA AGGCGGCCCA GGTGGCGCTG GCAACGGCGA CGCTGTCGAA GAGCTTCCAG TCGGCCTTCA ACGTCGTCAA GGGTTCGGTG CCGGCCCGCA CCGACGTTCC CGACACCGAC TTCGACGCCT GCGGCAAGAA GGGCATCGCC GATCTGAAGG CGGCCAATGA GGGCGGCACG CTGTTCGGTT CGCTGGCCCA AGGTTACGGC GCGCCTCCGG CGATTGCCAA TGCCTATAAG GACGTGGTCT CGAAGTTCGT CCACGGCCAG ATCAAGACCT CCGACGAAGC CGTCAAGCAG CTCGTCCAGG CGATCGACGA CGCTCGCTGA
|
Protein sequence | MRKFLSSAAV AVVMMAGLGA AQAADVKEVQ MLHWWTSGGE AAALNVLKQD LSKEGFAWKD VPVAGGGGDA AMTALKAMVA AGTYPTASQM LGYTVLDYAQ AGVMGDLTET AKKEGWDKSV PAALQKFSVY DGKWVAAPVN VHSVNWLWIN KAVMDKIGGT QPKTFDDLIA LLDKAKAAGV IPLALGGQNW QEATMFDSIV LSTGGPEFYK KAFNDLDEES LKSDTMKKSF DNLATIIKYV DPNFSGRDWN LATAMVIKGD ALVQVMGDWA KGEFVAAKKT PDKDFLCYRF PGTEGSVVYN SDMFGMFNVP DDRKAAQVAL ATATLSKSFQ SAFNVVKGSV PARTDVPDTD FDACGKKGIA DLKAANEGGT LFGSLAQGYG APPAIANAYK DVVSKFVHGQ IKTSDEAVKQ LVQAIDDAR
|
| |