Gene Information |
Locus tag | Rleg_5204 |
Symbol | |
ID | 8007099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 615295 |
End bp | 616617 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644822113 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002973373 |
Protein GI | 241113538 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
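The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, but both are consistent with its numbers. The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(615295, 616617)
print(length)                  # 1323, matches "Gene Length"
print(protein_length(length))  # 440, matches "Protein Length"
```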
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.306263 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGA TCGCTTTTTT CCTGTCCGCA GGATTGACCA GCCTATCCTT ATCCGTTCCG GCAATGGCGG CCACGAAGAT CCAGTGGTGG CACGCAATGG GCGGCGAGAA TGGCGCGAAG CTTGAACAGA TCGCCAAGGG GTTCAACGCA TCTCAATCTG ACTATGAGAT CGTCCCGGTA TTCAAAGGCA CCTATGACGA GACGCTGACG GGCGCCATTG CTGCGTTCCG CGCCAACCAG CAGCCGGCAA TCGTGCAGGT CTATGAAGTC GGTACCGGCA CGATGATGGC AGCACAGGGC GCGATCTATC CCGTCTACCA GTTGATGAAG GACCAAGGGG AGGCCTGGGA CCAGAGCAAA TTCATTGCTC CGGTCGTCGG TTACTACTCA GACACCAGCG GCAACGTTCT GTCGTTGCCG TTCAATTCCT CGACGCCGAT CATGTATTAC AACAAGGATG TCTTCAAAAA GGCGGGGCTT GATCCGGAAA CACCGCCGAA AACATGGGCG GACGTTGAAG CCTTTTCGCG GACGATCATG AAGTCCGGCG CTGCGAAGTG TGGCTTTACC AGCGCCTGGA TCTCCTGGAT CCAGACTGAA AACCTCAATG CTTTGCACGA CAAGCCCTAC TCCACCAAGG CCAACGGCTT TGGCGGCTTG GATGCGGAGT TCACCTTCAA CAACGATCTC ACGATCCGCC ATTGGGGCAA CTTGAAGAAG TGGCAGGACG AGGGGCTCTT CAAATTCGGC GGGCCTGGCG GCGGCGATAA TGCTCCTCCG ATGTTCTATT CGCAGGAATG CGCGATGTAC ATGAACTCGT CGGCCGGCCG GGCAGGCGTT ATCAATAACG CAAAGGCTTT CAAGGTCGGG TTTGCGCCGC TTCCCTACTA TGACGACGTC ATTACGCAGC CGCTCAACTC GATTATTGGC GGCGCCACGC TCTGGACACT GAAAGGTCGC CCAGAAGAGG AATACAAGGG TGTCGCGAAG TTCTTCACCT ACCTGCAGAA GCCGGAAGTG CAAGCCGATT GGCATCAGTT CTCCGGCTAC CTGCCGATAA CCGAGGCTGC CTATAAGCTG GGCCAGGATC AGGGCTATTA CGAGAAGAAT CCTGGAGCAG ATATCGGCAT CAAGCAGCTG ACGCGGGTGA CACCCACCGA TAATTCCAAG GGTATCCGGT TCGGCAACTA CGTCCAGGTG CGTGGCATCA TCGACGATGA GTTTGCAGCA TTGCTGGGCG GGAAGAAGAC GGCGAAGGAA GCGGTTGATT CCGTGGTCGC ACGAGGCAAC GAACAGCTTC GCGATTTCCA GTCCGCCAAC TAA
Protein sequence | MNKIAFFLSA GLTSLSLSVP AMAATKIQWW HAMGGENGAK LEQIAKGFNA SQSDYEIVPV FKGTYDETLT GAIAAFRANQ QPAIVQVYEV GTGTMMAAQG AIYPVYQLMK DQGEAWDQSK FIAPVVGYYS DTSGNVLSLP FNSSTPIMYY NKDVFKKAGL DPETPPKTWA DVEAFSRTIM KSGAAKCGFT SAWISWIQTE NLNALHDKPY STKANGFGGL DAEFTFNNDL TIRHWGNLKK WQDEGLFKFG GPGGGDNAPP MFYSQECAMY MNSSAGRAGV INNAKAFKVG FAPLPYYDDV ITQPLNSIIG GATLWTLKGR PEEEYKGVAK FFTYLQKPEV QADWHQFSGY LPITEAAYKL GQDQGYYEKN PGADIGIKQL TRVTPTDNSK GIRFGNYVQV RGIIDDEFAA LLGGKKTAKE AVDSVVARGN EQLRDFQSAN
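Both sequences above are printed in blocks of ten characters separated by spaces, so the spaces need to be removed before any computation. The following is a minimal sketch, not the database's own tooling, of how one might clean a sequence, compute its GC fraction, and translate it. The codon table is written in the usual TCAG ordering. For translation table 11 the amino acid assignments are the same as in the standard code; alternative start codons are not handled here, which does not matter for this gene because it starts with ATG. All function and variable names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGAACAAGA TCGCTTTTTT CCTGTCCGCA"
print(translate(first_blocks))  # MNKIAFFLSA, the first block of the protein
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content field in the record.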