Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5673 |
Symbol | |
ID | 8016899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | + |
Start bp | 251020 |
End bp | 252327 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644827826 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002979026 |
Protein GI | 241518398 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0285739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.0925972 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAACA GCAGTGGATT CACGCATGAG ATACGTCGCC GCACCCTGCT TGCGGGTGCG GGAGGTGCCG CATTGATGGC GTTGGTGGGC AGCGCCAAAG CGCAGGATAT CGTCAGTGGC GAACTTGTGG TGCTGAATTG GCTCGGCGGT TCCGAGCTCG ACATGATGCA TAAGATCCAG ACTGCATTTA CCGCGAAATA TCCGAAAGTG ACGATCAGGG AAGTTGCCAT TACCGGCCAA GGAGACATGC GCGGTGGCAT CCGCACGGCA CTCATGGGCG GCGAAGTGGT CGATGTCCTC TACAACACCT GGCCGGCCTT CCGGAAGGAA CTGCTCGATG CCGGCATGCT GCGGCCGATC GACGACCAAT GGAAGTCCTT CGGCTGGGAC AGGCTCATCA GCCAATCCTG GAAGGATCTT GGCGCTATAG CCGGCAAGAC CTATGGCCTG ACCTACACCT TCGGCGACCG CTCCGGCATC TGGTACAAGA AGGAGCACCT TGCCAGGGCG GGCATCACCG AGTCGCCGAG GAGCTGGGAC GAGTTCGTCG CCAGCTTTGC GAAGCTTACA AAAGCTGGGT TCGCGGCTCC GGTCGCTATT CCCGGCAAAT ACTGGGCGCA TGCCGAATGG TTTGAAACGC TGCTGCTGAG AACGGCAGGC GTCGAGACGG CTTCGAAGCT AGGCGCGCAC GAAATTTCGT GGACCGATCC GGCCGTGAAG AATGCGCTTA CAAGATATGC CGAAATGTTG ACCGCGGGCT GCTGCGGAGC GCCGAACAGC ATGCTCGCCA ACGACTGGGA CGGGGAAGCC GACCAAATCT TCCAGGCGAA TGCCAAAAAT TACCTGCTGA TCGGCATGTG GATGAATAAC CGCGCCAAGA ACGACTACAA ACTCACCGAA GGTAAGGATT ACGGTCTCTT CCAGTTCCCC GCCCTCGGGA TGGGTCATGA CGACACGTCG AGCGTCGATA CCAAGGAACT GCTCGTCACG GCAAACGGCC CCAATCCGAA GGCAGCAGAC GCCTTCCTCG ATTTCTGGAC AAGTGCCGAG GCCGCCAACA TTCTTGCCAA GAACGGCTAT GCGTCACCAA GCAGCAATAC CGACACGTCG CTCTATGGCG AGACGCAGAA GGTGGCGACA TCAGCGGTCG CAAGCTCGAA GCTGCAATTC GTGCTCGGAG ATCTCTTGCC CGGCGATCTC GTCGATGAAT ATCGGGTGCA ACTGCAGAAA TTTCTCCAGG ATCCCTCGGC TGCCAATATC GATACCGTCC TTGCGGCAAT CGAAACCAAG GCCCAGGGAT CCTACTGA
|
Protein sequence | MTNSSGFTHE IRRRTLLAGA GGAALMALVG SAKAQDIVSG ELVVLNWLGG SELDMMHKIQ TAFTAKYPKV TIREVAITGQ GDMRGGIRTA LMGGEVVDVL YNTWPAFRKE LLDAGMLRPI DDQWKSFGWD RLISQSWKDL GAIAGKTYGL TYTFGDRSGI WYKKEHLARA GITESPRSWD EFVASFAKLT KAGFAAPVAI PGKYWAHAEW FETLLLRTAG VETASKLGAH EISWTDPAVK NALTRYAEML TAGCCGAPNS MLANDWDGEA DQIFQANAKN YLLIGMWMNN RAKNDYKLTE GKDYGLFQFP ALGMGHDDTS SVDTKELLVT ANGPNPKAAD AFLDFWTSAE AANILAKNGY ASPSSNTDTS LYGETQKVAT SAVASSKLQF VLGDLLPGDL VDEYRVQLQK FLQDPSAANI DTVLAAIETK AQGSY
|
| |