Gene Cyan8802_2034 details


Gene Information

Locus tag: Cyan8802_2034
Symbol:
ID: 8391350
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 2052780
End bp: 2054048
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 36%
IMG OID: 644980015
Product: extracellular solute-binding protein family 1
Protein accession: YP_003137760
Protein GI: 257059872
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
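
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The Python sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2052780, 2054048)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the sketch gives 1269 bp and 422 aa, matching the Gene Length and Protein Length fields.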


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.718211
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.822994
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCTAGAT TATTCCAACT AATTTGGCAG CGATCGCGTC TATATTCTCT CTTCCTAGCC 
AGTTTTCTGA TTCTATTAAC CGGGTGTCAG TTGAATCTGT TTGAGCAGCC ATCTCTTATG
CGAGGAAGGC TGTTAATTTA CCATCCCTTT CAAGGAGAAA ATGGTATAAT TTTTGAGAAT
TTCCTCGATA ATTTTGAACA ACTTTACCCC GAGGTTCAAC TATTAAGTGA ATATATTAGA
GAGGACAGAC TTTCTCAACA GTTTATCTCA AAATCAAGAG CCGGGTTAGG AGCAACAGTC
TTGATTGATT TTGCACGACA TATTCCTCAA TTAGTTAAAA GTAATAGTAT TCAACCTCTT
GAAGATAAAA ATATAGATAC ATCTAGGTTT TTATCTTCAA ATATCATTCA ATCTCGCTAT
CAGGGTAAAA TTTATGGTAT TCCTCTGGTT TCTCAGGTGC GTGTACTTTG CTACAATCTA
GCTAAACTTC AACCTAATTC TAATACTCAA GATCCTATCC TTACTCAACC TCCTTTTGGG
TTAGAAGGAC TATTAACACG AGCCAAAAAA GGCTACTCTG TGGGGATGGT TTCCAGTTTT
GAAGATACGT TTTGGGGGTT AGGCATTTTT GGGGCGAAAT TCTTCGATAA TCAAGGATTC
ATTAACCCCC AGTTAGAAGG GTGGGGAAAG TGGTTAGAAT GGCTTAAAAA AGCGGAAACT
CAACCTAATT TTATACTCAG TCGCAATCGA GAGATTCTTC ATGAAGCTTT TGCTAAAGGG
AAGTTGACTT ACTACGTTTG TAATTCTGAT GAAATTGGAG ATTTAAAAAA TATCTTGAAA
GAGAACTTAC AGATAGTTTT TCTCCCTGGA GAACCTGACC ATCCGGCAAC CCCTTTGCTT
TATACCATAG TGATGATGGT CAATAATAGT GCTAGTTCCC ATGAAACTGA ATTAGCTTTA
CAATGGGCAC AGTTCATGAC TAACCCTGAA CAACAATTAA AAGCATTAAT AGGTTCTTTA
AACTTTATTC CTACTAACCA AAAGATCAGT GTTAATCAAC AGTTATTACC CATAGAAGCC
ACTTTACATA AACAGTCTAA AATGGCACTC ACTATTCCCA TCGACTCTAT AGAAAAAATT
CTTAAAATTT TTAAAGAAGG GGAGATTGTA TATCAAAAAG CTATGGCTGG AGATCTGACT
TCATCTCAAG CTGTTCAGGA ACTAACTGAT ATTATTAAAA CACAATTGAA TTTTCAAACA
AGGAACTAA
 
Protein sequence
MSRLFQLIWQ RSRLYSLFLA SFLILLTGCQ LNLFEQPSLM RGRLLIYHPF QGENGIIFEN 
FLDNFEQLYP EVQLLSEYIR EDRLSQQFIS KSRAGLGATV LIDFARHIPQ LVKSNSIQPL
EDKNIDTSRF LSSNIIQSRY QGKIYGIPLV SQVRVLCYNL AKLQPNSNTQ DPILTQPPFG
LEGLLTRAKK GYSVGMVSSF EDTFWGLGIF GAKFFDNQGF INPQLEGWGK WLEWLKKAET
QPNFILSRNR EILHEAFAKG KLTYYVCNSD EIGDLKNILK ENLQIVFLPG EPDHPATPLL
YTIVMMVNNS ASSHETELAL QWAQFMTNPE QQLKALIGSL NFIPTNQKIS VNQQLLPIEA
TLHKQSKMAL TIPIDSIEKI LKIFKEGEIV YQKAMAGDLT SSQAVQELTD IIKTQLNFQT
RN
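
The gene and protein sequences are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The following is a minimal Python sketch, not the database's own tooling, that strips the whitespace, computes the GC fraction, and translates a coding sequence. It assumes the sequence is in frame from its first base and uses the codon assignments of NCBI translation table 11, which are the same as those of the standard code; the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Table 11's alternative start codons are not handled, which does not matter here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, copied with the original spacing.
fragment = "ATGTCTAGAT TATTCCAACT AATTTGGCAG"
print(translate(fragment))  # MSRLFQLIWQ
```

The translated fragment matches the first ten residues of the protein sequence in this record. Applying `translate` to the full gene sequence and `gc_fraction` to the same text would allow the Protein sequence and GC content fields to be cross-checked in the same way.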