Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_1201 |
Symbol | |
ID | 5084485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 1243317 |
End bp | 1245035 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640482759 |
Product | extracellular solute-binding protein |
Protein accession | YP_001167407 |
Protein GI | 146277248 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.487559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.842714 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATACG GCACGACAGC CATGGCGCTG GCGCTGATGG CGCTGGGGGC ACCGGCCTTC GCCGACATCG AGGCTGCGCG GCAGTTCCTC GATGCCGAGA TCGGCGACAT GTCCTCGCTC ACGCGCGAGG AGCAGGAAGC CGAGATGCAA TGGTTCATCG ACGCGGCGCA GCCCTTCGCC GGCATGGACA TCAAGGTGGT CTCGGAGACC ATCACCACGC ATGAATATGA ATCCAAGGTG CTGGCGCCCG CCTTCACCGC GATCACGGGC ATCCGGGTCA GCCACGACCT GATCGGCGAA GGCGACGTGG TCGAGAAGCT GCAGACGCAG ATGCAGTCGG GCGAGAACAT CTATGACGCC TACATCAACG ACAGCGACCT GATCGGCACC CACTGGCGCT ACCAGCAGGC CCGCAGCCTG ACCGACTGGA TGGCGAACGA GGGCCAGGAC GTCACCAACC CCGGCCTCGA TCTCGACGAT TACATCGGCC TGAAGTTCAC CACCGCACCG GATGGCGAGC TTTACCAGCT GCCCGACCAG CAGTTCGCCA ACCTCTACTG GTTCCGCGCC GACTGGTTCG ACGATCCAGA GACCAAGGCC GACTTCCAGG AGAAGTACGG CTACGAGCTG GGCGTGCCGC TGAACTGGTC GGCCTACGAG GACATCGCCG AGTTCTTCAC GGGGCGCGAC ATGAGCGCGC TCGGCGGGCC GACGAGCGCC TATGGCAGCA TGGATTACGG CAAGAAGGAC CCGAGCCTCG GCTGGCGCTA CACCGACGCC TGGATGTCGA TGGCGGGCAT GGGCGACAAG GGCGATCCGA ACGGTCTGCC GGTCGATGAA TGGGGCATCC GGGTGGACGA GAACTCGCGC CCCGTGGGCT CCTGCGTGGC GCGCGGCGGC GCGACCAACG ACGCGGCGGC GGTCTATGCG ATCACCAAGG CGATCGAATG GCTGCAGAAA TACGCCCCGC CGCAGGCCGC CGGCATGACC TTCTCGGAAT CCGGGCCGGT GCCCGCGCAG GGCGAGGTCG CCCAGCAGAT CTTCTGGTAC ACCGCTTTCA CCGCCGACAT GGTCAAGGAG GGCCTGCCGG TGATGAACGA GGATGGCACG CCCAAGTGGC GCATGGCCCC CTCGCCGCAT GGCGCCTACT GGACCGAAGG CACCAAGGTC GGCTACCAGG ACGTGGGCTC GTGGACGCTG CTGAAATCCA CCCCCGACGA CCGCGCCAAG GCCGCCTGGC TCTACGCCCA GTTCGTCTCG TCCAAGACCG TGGACGTGAA GAAGAGCCAC GTCGGCCTGA CCTTCGTGCG CGAATCCACC ATCCAGCACC AGAGCTTCAC CGACCGCGCG CCCAATCTTG GCGGTCTGGT CGAGTTCTAC CGCTCGCCCG CCCGCGTCCA GTGGTCGCCC ACGGGGACGA ACGTGCCGGA TTACCCGAAG CTCGCGCAGC TCTGGTGGCA GAACATCGGC GATGCGATGT CGGGCGCCAA GTCGCCGCAG GAGGCTCTGG ACGCGCTCTG CGCCGAGCAG GAGCGGGTGC TGGCGCGGCT GGAACGGGCC GGCGTGCAGG GTGATCTCGG GCCGAAGCTG AACGAGGAGA AGGACCCGCA GGAATGGCTC GACGCGCCCG GCGCGCCGGT GGGCAAGCTC GAGAACGAGA AACCCGCGGG TGAGACGATC CCCTACGACG AACTCATCAA GTCCTGGCAG CAGGGCTGA
|
Protein sequence | MRYGTTAMAL ALMALGAPAF ADIEAARQFL DAEIGDMSSL TREEQEAEMQ WFIDAAQPFA GMDIKVVSET ITTHEYESKV LAPAFTAITG IRVSHDLIGE GDVVEKLQTQ MQSGENIYDA YINDSDLIGT HWRYQQARSL TDWMANEGQD VTNPGLDLDD YIGLKFTTAP DGELYQLPDQ QFANLYWFRA DWFDDPETKA DFQEKYGYEL GVPLNWSAYE DIAEFFTGRD MSALGGPTSA YGSMDYGKKD PSLGWRYTDA WMSMAGMGDK GDPNGLPVDE WGIRVDENSR PVGSCVARGG ATNDAAAVYA ITKAIEWLQK YAPPQAAGMT FSESGPVPAQ GEVAQQIFWY TAFTADMVKE GLPVMNEDGT PKWRMAPSPH GAYWTEGTKV GYQDVGSWTL LKSTPDDRAK AAWLYAQFVS SKTVDVKKSH VGLTFVREST IQHQSFTDRA PNLGGLVEFY RSPARVQWSP TGTNVPDYPK LAQLWWQNIG DAMSGAKSPQ EALDALCAEQ ERVLARLERA GVQGDLGPKL NEEKDPQEWL DAPGAPVGKL ENEKPAGETI PYDELIKSWQ QG
|
| |