Gene PCC8801_2009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2009 
Symbol 
ID7104779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2083797 
End bp2085065 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content36% 
IMG OID643475070 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002372202 
Protein GI218246831 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAGAT TATTCCAACT AATTTGGCAG CGATCGCGTC TATATTCTCT CTTCCTAGCC 
AGTTTTCTGA TTCTATTAAC CGGGTGTCAG TTGAATCTGT TTGAGCAGCC ATCTCTTATG
CGAGGAAGGC TGTTAATTTA CCATCCCTTT CAAGGAGAAA ATGGTATAAT TTTTGAGAAT
TTCCTCGATA ATTTTGAACA ACTTTACCCC GAGGTTCAAC TATTAAGTGA ATATATTAGA
GAGGACAGAC TTTCTCAACA GTTTATCTCA AAATCAAGAG CCGGGTTAGG AGCAACAGTC
TTGATTGATT TTGCACGACA TATTCCTCAA TTAGTTAAAA GTAATAGTAT TCAACCTCTT
GAAGATAAAA ATATAGATAC ATCTAGGTTT TTATCTTCAA ATATCATTCA ATCTCGCTAT
CAGGGTAAAA TTTATGGTAT TCCTCTGGTT TCTCAGGTGC GTGTACTTTG CTACAATCTA
GCTAAACTTC AACCTAATTC TAATACTCAA GATCCTATCC TTACTCAACC TCCTTTTGGG
TTAGAAGGAC TATTAACACG AGCCAAAAAA GGCTACTCTG TGGGGATGGT TTCCAGTTTT
GAAGATACGT TTTGGGGGTT AGGCATTTTT GGGGCTAAAT TCTTCGATAA TCAAGGATTC
ATTAACCCCC AGTTAGAAGG GTGGGGAAAG TGGTTAGAAT GGCTTAAAAA AGCGGAAACT
CAACCTAATT TTATACTCAG TCGCAATCGA GAGATTCTTC ATGAAGCTTT TGCTAAAGGG
AAGTTGACTT ACTACGTTTG TAATTCTGAT GAAATTGGAG ATTTAAAAAA TATCTTGAAA
GAGAACTTAC AGATAGTTTT TCTCCCTGGA GAACCTGACC ATCCGGCAAC CCCTTTGCTT
TATACCATAG TGATGATGGT CAATAATAGT GCTAGTCTCC ATGAAACTGA ATTAGCTTTA
CAATGGGCAC AGTTCATGAC TAACCCTGAA CAACAATTAA AAGCATTAAT AGGTTCTTTA
AACTTTATTC CTACTAACCA AAAGATCAGT GTTAATCAAC AGTTATTACC CATAGAAGCC
ACTTTACATA AACAGTCTAA AATGGCACTC ACTATTCCCA TCGACTCTAT AGAAAAAATT
CTTAAAATTT TTCAAGAAGG GGAGATTGTA TATCAAAAAG CTATGGCCGG AGATCTGACT
TCATCTCAAG CTGTTCAGGA ACTAACTGAT ATTATTAAAA CACAATTGAA TTTTCAAACA
AGGAACTAA
 
Protein sequence
MSRLFQLIWQ RSRLYSLFLA SFLILLTGCQ LNLFEQPSLM RGRLLIYHPF QGENGIIFEN 
FLDNFEQLYP EVQLLSEYIR EDRLSQQFIS KSRAGLGATV LIDFARHIPQ LVKSNSIQPL
EDKNIDTSRF LSSNIIQSRY QGKIYGIPLV SQVRVLCYNL AKLQPNSNTQ DPILTQPPFG
LEGLLTRAKK GYSVGMVSSF EDTFWGLGIF GAKFFDNQGF INPQLEGWGK WLEWLKKAET
QPNFILSRNR EILHEAFAKG KLTYYVCNSD EIGDLKNILK ENLQIVFLPG EPDHPATPLL
YTIVMMVNNS ASLHETELAL QWAQFMTNPE QQLKALIGSL NFIPTNQKIS VNQQLLPIEA
TLHKQSKMAL TIPIDSIEKI LKIFQEGEIV YQKAMAGDLT SSQAVQELTD IIKTQLNFQT
RN