Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3786 |
Symbol | |
ID | 8014612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3843144 |
End bp | 3844403 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826349 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002977568 |
Protein GI | 241206472 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGT TTTTGAGCTC GGCAGCTGTT GCTGTCGTGA TGATGGCTGG CCTCAGTGCT GCCCACGCAG CCGACGTCAA GGAAGTGCAG ATGCTGCACT GGTGGACATC GGGCGGCGAG GCGGCGGCAC TGAACGTTCT GAAGCAGGAT CTTTCGAAGG AAGGTTTTGC CTGGAAGGAT GTGCCGGTTG CCGGCGGTGG CGGCGATGCG GCGATGACGG CGCTGAAGGC GATGGTTGCG GCCGGCACCT ATCCGACAGC CTCGCAGATG CTGGGCTATA CCGTGCTCGA TTATGCTCAG GCCGGGGTCA TGGGCGATCT GACGGAGACG GCGAAGAAGG AAGGCTGGGA CAAGTCGGTT CCGGCAGCGC TGCAGAAGTT CTCGGTCTAT GACGGCAAGT GGGTCGCAGC CCCCGTCAAC GTCCACTCGG TCAACTGGCT GTGGATCAAC AAGGCTGTGA TGGACAAGAT CGGCGGCACC CAGCCGAAGA CCTTCGACGA GCTGATCGCC CTGCTCGACA AGGCGAAGGC CGCAGGCGTC ATCCCGCTGG CTCTCGGCGG CCAGAACTGG CAGGAGGCGA CGATGTTCGA TTCCATCGTG CTGTCGACCG GCGGGCCGGA GTTCTACAAG AAGGCCTTCA ACGACCTCGA CGAGGAATCG CTGAAGTCCG ACACGATGAA GAAGTCGTTC GACAATCTGG CGACGATCAT CAAATATGTC GACCCGAACT TCTCGGGCCG CGACTGGAAC CTGGCAACCG CCATGGTCAT CAAGGGTGAC GCGCTAGTGC AGGTGATGGG CGACTGGGCC AAGGGCGAAT TCGTCGCCGC CAAGAAGACG CCGGATACCG ACTTCCTGTG CTACCGCTTC CCCGGCACCG ACGGCAGCGT CGTCTACAAC TCCGACATGT TCGGCATGTT CAACGTTCCC GACGACCGCA AGGCGGCCCA GGTGGCGCTG GCGACCGCAA CGCTGTCGAA GAGCTTCCAG TCGGCCTTCA ACGTCGTCAA GGGTTCGGTG CCGGCTCGTA CCGACGTTCC CGATACCGAC TTCGATGCTT GCGGCAAGAA GGGCATCGCC GACCTGAAGG CAGCCAACGA AGGCGGCACG CTGTTCGGCT CGCTGGCACA GGGCTACGGC GCTCCTCCGG CGATCGCCAA TGCCTACAAG GATGTCGTCT CGAAGTTCGT CCACGGCCAG ATCAAGACCT CCGACGAAGC CGTCAAGCAG CTCGTCCAGG CGATCGACGA CGCCCGCTGA
|
Protein sequence | MNKFLSSAAV AVVMMAGLSA AHAADVKEVQ MLHWWTSGGE AAALNVLKQD LSKEGFAWKD VPVAGGGGDA AMTALKAMVA AGTYPTASQM LGYTVLDYAQ AGVMGDLTET AKKEGWDKSV PAALQKFSVY DGKWVAAPVN VHSVNWLWIN KAVMDKIGGT QPKTFDELIA LLDKAKAAGV IPLALGGQNW QEATMFDSIV LSTGGPEFYK KAFNDLDEES LKSDTMKKSF DNLATIIKYV DPNFSGRDWN LATAMVIKGD ALVQVMGDWA KGEFVAAKKT PDTDFLCYRF PGTDGSVVYN SDMFGMFNVP DDRKAAQVAL ATATLSKSFQ SAFNVVKGSV PARTDVPDTD FDACGKKGIA DLKAANEGGT LFGSLAQGYG APPAIANAYK DVVSKFVHGQ IKTSDEAVKQ LVQAIDDAR
|
| |