Gene Information |
Locus tag | PCC8801_2009 |
Symbol | |
ID | 7104779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 2083797 |
End bp | 2085065 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643475070 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002372202 |
Protein GI | 218246831 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
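The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 2083797
END_BP = 2085065


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1269 422
```

Both results match the Gene Length (1269 bp) and Protein Length (422 aa) rows of the table.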
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAGAT TATTCCAACT AATTTGGCAG CGATCGCGTC TATATTCTCT CTTCCTAGCC AGTTTTCTGA TTCTATTAAC CGGGTGTCAG TTGAATCTGT TTGAGCAGCC ATCTCTTATG CGAGGAAGGC TGTTAATTTA CCATCCCTTT CAAGGAGAAA ATGGTATAAT TTTTGAGAAT TTCCTCGATA ATTTTGAACA ACTTTACCCC GAGGTTCAAC TATTAAGTGA ATATATTAGA GAGGACAGAC TTTCTCAACA GTTTATCTCA AAATCAAGAG CCGGGTTAGG AGCAACAGTC TTGATTGATT TTGCACGACA TATTCCTCAA TTAGTTAAAA GTAATAGTAT TCAACCTCTT GAAGATAAAA ATATAGATAC ATCTAGGTTT TTATCTTCAA ATATCATTCA ATCTCGCTAT CAGGGTAAAA TTTATGGTAT TCCTCTGGTT TCTCAGGTGC GTGTACTTTG CTACAATCTA GCTAAACTTC AACCTAATTC TAATACTCAA GATCCTATCC TTACTCAACC TCCTTTTGGG TTAGAAGGAC TATTAACACG AGCCAAAAAA GGCTACTCTG TGGGGATGGT TTCCAGTTTT GAAGATACGT TTTGGGGGTT AGGCATTTTT GGGGCTAAAT TCTTCGATAA TCAAGGATTC ATTAACCCCC AGTTAGAAGG GTGGGGAAAG TGGTTAGAAT GGCTTAAAAA AGCGGAAACT CAACCTAATT TTATACTCAG TCGCAATCGA GAGATTCTTC ATGAAGCTTT TGCTAAAGGG AAGTTGACTT ACTACGTTTG TAATTCTGAT GAAATTGGAG ATTTAAAAAA TATCTTGAAA GAGAACTTAC AGATAGTTTT TCTCCCTGGA GAACCTGACC ATCCGGCAAC CCCTTTGCTT TATACCATAG TGATGATGGT CAATAATAGT GCTAGTCTCC ATGAAACTGA ATTAGCTTTA CAATGGGCAC AGTTCATGAC TAACCCTGAA CAACAATTAA AAGCATTAAT AGGTTCTTTA AACTTTATTC CTACTAACCA AAAGATCAGT GTTAATCAAC AGTTATTACC CATAGAAGCC ACTTTACATA AACAGTCTAA AATGGCACTC ACTATTCCCA TCGACTCTAT AGAAAAAATT CTTAAAATTT TTCAAGAAGG GGAGATTGTA TATCAAAAAG CTATGGCCGG AGATCTGACT TCATCTCAAG CTGTTCAGGA ACTAACTGAT ATTATTAAAA CACAATTGAA TTTTCAAACA AGGAACTAA
|
Protein sequence | MSRLFQLIWQ RSRLYSLFLA SFLILLTGCQ LNLFEQPSLM RGRLLIYHPF QGENGIIFEN FLDNFEQLYP EVQLLSEYIR EDRLSQQFIS KSRAGLGATV LIDFARHIPQ LVKSNSIQPL EDKNIDTSRF LSSNIIQSRY QGKIYGIPLV SQVRVLCYNL AKLQPNSNTQ DPILTQPPFG LEGLLTRAKK GYSVGMVSSF EDTFWGLGIF GAKFFDNQGF INPQLEGWGK WLEWLKKAET QPNFILSRNR EILHEAFAKG KLTYYVCNSD EIGDLKNILK ENLQIVFLPG EPDHPATPLL YTIVMMVNNS ASLHETELAL QWAQFMTNPE QQLKALIGSL NFIPTNQKIS VNQQLLPIEA TLHKQSKMAL TIPIDSIEKI LKIFQEGEIV YQKAMAGDLT SSQAVQELTD IIKTQLNFQT RN
|
| |
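The sequences above are printed in space-separated groups of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names are hypothetical, and `EXCERPT` holds only the first three groups of the gene sequence, not the full 1269 bp, so the GC value it yields is not comparable to the GC content row in the table.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between 10-character groups and uppercase."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three groups of the gene sequence above (truncated excerpt only).
EXCERPT = "ATGTCTAGAT TATTCCAACT AATTTGGCAG"

seq = clean_sequence(EXCERPT)
print(len(seq))           # number of bases in the excerpt
print(seq[:3])            # start codon of the CDS
print(gc_fraction(seq))   # GC fraction of the excerpt only
```

To reproduce the tabulated GC content, the full gene sequence would be passed through `clean_sequence` in place of the excerpt.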