Gene Cyan8802_4213 details

Gene Information

Locus tag: Cyan8802_4213
Symbol:
ID: 8393564
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 4349763
End bp: 4351043
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 44%
IMG OID: 644982125
Product: extracellular solute-binding protein family 1
Protein accession: YP_003139837
Protein GI: 257061949
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
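
The coordinates and lengths listed above can be cross-checked with a short Python sketch. It assumes that the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(4349763, 4351043)
print(length)                     # 1281
print(protein_length_aa(length))  # 426
```

Both values agree with the Gene Length and Protein Length fields of this record.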


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.215316
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAGAT TTGTAAAGCT CAAACGACTG GCTATTTGGT CTTTGGTTGG ATTACTCTTA 
AGTTGGTTGA TTAGCTGTAA TGCTGCCCCT CCAACTTCTT CTAGTCCTGA ATTAGAGTTT
TGGACGATGC AGCTTCAGCC GAAATTTACG CCCTATTTCA CAGAGGTCAT TAGGCAATAC
GAATCAGAAA ATCAAGGCAT TAAGCTGCGT TGGGTAGATG TCCCCTGGGA AGCGATGGAA
AGCAAGATTT TAACGGCGGT TTCAGCGAAA ACTGCCCCCG ATGTAGTCAA TCTTAACCCG
AATTTTGCTT CCCAACTGGC CAGTCGCAAC GCTTGGTTAG ACTTAAATAC GCAAATTCCA
CCGGAGGTTA AACAACAATA TCTCCCGAAG ATTTGGGCAG CAACAACGCT AAAAGACGCG
AGTTTTGGCA TTCCTTGGTA CTTAACAACC CGTATTACCC TTTCTAACCA AGATTTACTT
AGCAAAGCGG GAATTAAGGA ACCACCGAAA ACCTTTGAGG AATTAGCCGA TGTGGCTGCT
AAACTTAAGG AGAAAACGGG GAAATATGCC CTATTTGTGA CCTTCGTACC GGGGGACTCT
GGGGAAGTCT TGGAGTCTTT GGTGCAAATG GGAGTCCAGT TAGTGGATGA TCAGGGTAAA
GCAGCGTTTA ATACCCCTGA TGGCATAGCA GGGTTCCGTT ATTGGGTAGA TTTATATCAA
CAAGGACTGT TACCCCCTGA AGTTCTCACC CAAGGACATC GCCATGCGAT AGATTTATAT
CAGTCGGGAG AGATAGCTTT ACTCTCTTCT GGGGCGGAAT TTCTGACCAG TATTGAAACG
AATGCCCCAA CCATTGCGAA AGTAACAGCC ACTTCTCCCC AAATTACCGG AAAAACAGGT
AAAAAGAACG TGGCAGTGAT GAATTTAGTC ATTCCCCGTG ATACGGATAA AGCTGAAGAG
TCGGTAAAAT TTGCGCTTTT TGTCACGAAT ACGGAAAATC AACTCGGGTT TGCTAAGGCG
GCTAATGTCC TTCCTTCGAC GGTAGAGGGA GTTAAACGCT ATATTGAGGA GTTAAAACAG
TCTTCTGATT CTAGCGCGAT CGCTCAAGCG CGTCAAGTTA GTGCGATGCA ACTCAATGAT
GCAGAAGTCC TAGTTCCAGC AATGAAAGAC CTTAATAAGT TGCAACAGAT TATTTACGAA
AATTTACAAG CTGCCATGCT CAAAGAGAAA ACTGTCGAAC AAGCCGTTAA GGATGCTGCT
GATGCTTGGG ATAGTATTTA G
 
Protein sequence
MRRFVKLKRL AIWSLVGLLL SWLISCNAAP PTSSSPELEF WTMQLQPKFT PYFTEVIRQY 
ESENQGIKLR WVDVPWEAME SKILTAVSAK TAPDVVNLNP NFASQLASRN AWLDLNTQIP
PEVKQQYLPK IWAATTLKDA SFGIPWYLTT RITLSNQDLL SKAGIKEPPK TFEELADVAA
KLKEKTGKYA LFVTFVPGDS GEVLESLVQM GVQLVDDQGK AAFNTPDGIA GFRYWVDLYQ
QGLLPPEVLT QGHRHAIDLY QSGEIALLSS GAEFLTSIET NAPTIAKVTA TSPQITGKTG
KKNVAVMNLV IPRDTDKAEE SVKFALFVTN TENQLGFAKA ANVLPSTVEG VKRYIEELKQ
SSDSSAIAQA RQVSAMQLND AEVLVPAMKD LNKLQQIIYE NLQAAMLKEK TVEQAVKDAA
DAWDSI
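
The gene and protein sequences above are printed in blocks of ten characters. A minimal Python sketch for working with them is given below: it strips the block spacing, computes GC content, and translates the coding sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), so a standard codon table is used here. The helper names `clean`, `gc_percent`, and `translate` are illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
head = clean("ATGCGTAGAT TTGTAAAGCT CAAACGACTG")
print(translate(head))  # MRRFVKLKRL
```

Passing the full gene sequence through `clean` and then `gc_percent` or `translate` allows the GC content and Protein sequence fields of this record to be compared against the raw sequence.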