Gene Information |
Locus tag | Rsph17029_0091 |
Symbol | |
ID | 4896170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 104675 |
End bp | 105931 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640110674 |
Product | extracellular solute-binding protein |
Protein accession | YP_001041983 |
Protein GI | 126460869 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
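The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS includes its stop codon; the record does not state either convention, although the listed lengths agree with both. The function names are illustrative and not part of any database API.

```python
# Field values copied from the Gene Information table above.
START_BP = 104675
END_BP = 105931
GENE_LENGTH_BP = 1257
PROTEIN_LENGTH_AA = 418


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```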
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATGA GGACAACCGC CGCCAGCCTC CTTGCGCTGG CCCTCGCCAC TGGCGGGACT GCGGCCGGCG CGCAGACCCT CGAATACTGG GTCTATTCCG ATTTCGCGCA GGGTGAGGCG CTGGCGCTGC AGCAGGAGTT CATCAAGGAA TTCCAGGAGT CTCATCCGGG CGTGACGGTG AACATCGTCG GCAAGGGCGA TGACGATCTG ACCGCCGGTC AGATCGCGGG GGCCGCCAGC GGCAACCTGC CCGACGTGTT CATGAACGCG GTCGGCGTGG GCGCGCAGCT GGTCGATGTC GGCGCTCTGG CGAACATCCA CGACAAGTGG ATGGCCATGC CCGAGGAGTT CCGCGCCCAG TTCAACAAGG GTGCGGTCGA GAATTGCGCG CCCCGCCCCG AAGAGATGTA CTGCATTCCC TACACCGGCT ATGGCACGCT GCTGTTCCGC AACCTGACGG TGCTGGAAGA GGCCGGGATC GACACTTCCG CCCCGCCCGC CGACTGGGCC GACTGGCTGG CGCAGATGGA GAAGGTGAAG GCCGCGGGCA AGTTCGCCAT TCCCGATCAG GCGCTGGTCT TCAACTCGAT CGCCGAGATG TATGGCGTGA CGGGCGATGT CTCGACCTGG GGCATCGACT GGGAGAGCAA GACCACGCGG ATCGACCCGG CGGTGATGAC CTCGGTGCTG GAGAAGTTCG TGGCGATGCA GCCGCTGACC TCGGGCACCA GCCGCAACGA TCAGGCCACG AAGGACCTGT TCGTCACCGA TCAGCTGGCT TTCCACACCA TCGGTCCGTG GGTGAACCCG ACCTATGTCG AGGCGGTCGA GAACTCGGGG CTGAAGTACG ATTTCGTGCT GATGCCCGGC GAGACCGCCG ACAAGCACGG CGGTATCAAG AACTTCGAGA TCGTGGGCGT CGCCCCGGGC GAAAACCTCG ATCTGGCCTT CGAATTCGCG ACCTACATCA CCGCCAAGGA GCAGATGGCC CGCTGGGCCA AGCTGCTGTC GCGCTACAAT TCGAACGATG CCGCGATGGC CGAGGCCGAT GTGGCAGCCC TGCCGCTGGT CGCCCGGTCG GTTGCGGCGG TCGAGGTGAC GATGGATGTG AGCCCGCCCT ATCTGATCCA GCCGGTTCCT GCCTGCTACC AGTCGACGGT GGTCGATTAT GTGTCCGCCA CCGCCGACGG CGAGTTCACG CCCGAAGAGG GCGCGGCGGA GATGATCGCC GAGCTGAACG ACTGCCTCGC CGGCTGA
|
Protein sequence | MTMRTTAASL LALALATGGT AAGAQTLEYW VYSDFAQGEA LALQQEFIKE FQESHPGVTV NIVGKGDDDL TAGQIAGAAS GNLPDVFMNA VGVGAQLVDV GALANIHDKW MAMPEEFRAQ FNKGAVENCA PRPEEMYCIP YTGYGTLLFR NLTVLEEAGI DTSAPPADWA DWLAQMEKVK AAGKFAIPDQ ALVFNSIAEM YGVTGDVSTW GIDWESKTTR IDPAVMTSVL EKFVAMQPLT SGTSRNDQAT KDLFVTDQLA FHTIGPWVNP TYVEAVENSG LKYDFVLMPG ETADKHGGIK NFEIVGVAPG ENLDLAFEFA TYITAKEQMA RWAKLLSRYN SNDAAMAEAD VAALPLVARS VAAVEVTMDV SPPYLIQPVP ACYQSTVVDY VSATADGEFT PEEGAAEMIA ELNDCLAG
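Both sequences are displayed in space-separated groups of ten characters, so the spaces need to be removed before any downstream use. The following is a minimal sketch of that cleanup with two simple checks on the gene sequence; the helper names are hypothetical, and the stop codon set is the standard one used by translation table 11.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(displayed: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(displayed.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


if __name__ == "__main__":
    # First two groups of the gene sequence, as displayed in the record.
    prefix = clean_sequence("ATGACCATGA GGACAACCGC")
    print(prefix)                  # ATGACCATGAGGACAACCGC
    print(prefix.startswith("ATG"))  # True
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.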
|
| |