Gene PCC8801_4174 details

Gene Information

Locus tag: PCC8801_4174
Symbol:
ID: 7104575
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4378009
End bp: 4379289
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 44%
IMG OID: 643477161
Product: extracellular solute-binding protein family 1
Protein accession: YP_002374260
Protein GI: 218248889
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
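
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers agree with that reading) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4378009
END_BP = 4379289
GENE_LENGTH_BP = 1281
PROTEIN_LENGTH_AA = 426


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, ASSUMING one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


print(span_length(START_BP, END_BP))   # 1281
print(protein_length(GENE_LENGTH_BP))  # 426
```

Both results match the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.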


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTAGAT TTGTAAAGCT CAAACGACTG GCCATTTGGT CTTTGGTTGG ATTACTCTTA 
AGTTGGTTGA TTAGCTGTAA TGCTGCCCCT CCAACTTCTT CTAGTCCTGA ATTAGAGTTT
TGGACGATGC AGCTTCAGCC AAAATTTACG CCCTATTTCA CAGAGGTCAT TAGGCAATAC
GAATCAGAAA ATCAAGGCAT TAAGCTGCGT TGGGTAGATG TCCCCTGGGA AGCGATGGAA
AGCAAGATTT TAACGGCGGT TTCAGCGAAA ACTGCCCCCG ATGTAGTCAA CCTTAACCCG
AATTTTGCTT CTCAACTGGC CAGTCGCAAC GCTTGGTTAG ACTTAAATAC GCAAATTCCA
CCGGAGGTTA AACAACAATA TCTCCCGAAG ATTTGGGCAG CAACAACGCT AAAAGACGCG
AGTTTTGGCA TTCCTTGGTA CTTAACAACC CGTATTACTC TTTCTAACCA AGATTTACTT
AGCAAAGCGG GAATTAAGGA ACCACCGAAA ACCTTTGAGG AATTAGCCGA TGTGGCTGCT
AAACTTAAGG AGAAAACGGG GAAATATGCC CTATTTGTGA CCTTCGTACC GGGGGACTCT
GGGGAAGTCT TGGAGTCTTT GGTGCAAATG GGAGTCCAGT TAGTGGATGA TCAGGGTAAA
GCAGCGTTTA ATACCCCTGA TGGCATAGCA GGGTTCCGTT ATTGGGTAGA TTTATATCAA
CAAGGACTGT TACCCCCTGA AGTTCTCACC CAAGGACATC GCCATGCGAT AGATTTATAT
CAGTCGGGAG AGATAGCTTT ACTCTCTTCT GGGGCGGAAT TTCTGACCAG TATTGAAACG
AATGCCCCAA CCATTGCGAA AGTGACAGCC ACTTCTCCCC AAATTACCGG AAAAACAGGT
AAAAAGAACG TGGCAGTGAT GAATTTAGTC ATTCCCCGTG ATACGGATAA AGCTGAAGAG
TCGGTAAAAT TTGCGCTTTT TGTCACGAAT ACGGAAAATC AACTCGGGTT TGCTAAGGCG
GCTAATGTCC TTCCTTCGAC GGTAGAGGGA GTTAAACGCT ATATTGAGGA GTTAAAACAG
TCTTCTGATT CTAGCGCGAT CGCTCAAGCG CGTCAAGTTA GTGCGATGCA ACTCAATGAT
GCAGAAGTCC TAGTTCCAGC AATGAAAGAC CTTAATAAGT TGCAACAGAT TATTTACGAA
AATTTACAAG CTGCCATGCT CAAAGAGAAA ACTGTCGAAC AAGCCGTTAA GGATGCTGCT
GATGCTTGGG ATAGTATTTA G
 
Protein sequence
MRRFVKLKRL AIWSLVGLLL SWLISCNAAP PTSSSPELEF WTMQLQPKFT PYFTEVIRQY 
ESENQGIKLR WVDVPWEAME SKILTAVSAK TAPDVVNLNP NFASQLASRN AWLDLNTQIP
PEVKQQYLPK IWAATTLKDA SFGIPWYLTT RITLSNQDLL SKAGIKEPPK TFEELADVAA
KLKEKTGKYA LFVTFVPGDS GEVLESLVQM GVQLVDDQGK AAFNTPDGIA GFRYWVDLYQ
QGLLPPEVLT QGHRHAIDLY QSGEIALLSS GAEFLTSIET NAPTIAKVTA TSPQITGKTG
KKNVAVMNLV IPRDTDKAEE SVKFALFVTN TENQLGFAKA ANVLPSTVEG VKRYIEELKQ
SSDSSAIAQA RQVSAMQLND AEVLVPAMKD LNKLQQIIYE NLQAAMLKEK TVEQAVKDAA
DAWDSI
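
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first and last lines of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position fastest).
# ASSUMPTION: adequate for table 11 here, since the gene starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence in this record.
first_line = "ATGCGTAGAT TTGTAAAGCT CAAACGACTG GCCATTTGGT CTTTGGTTGG ATTACTCTTA"
last_line = "GATGCTTGGG ATAGTATTTA G"

print(translate(first_line))  # MRRFVKLKRLAIWSLVGLLL
print(translate(last_line))   # DAWDSI
```

The two outputs match the first 20 and the last 6 residues of the protein sequence. Because each of the preceding gene-sequence lines holds 60 bases, the last line also begins on a codon boundary. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.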