Gene Information |
Locus tag | Cyan8802_4213 |
Symbol | |
ID | 8393564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 4349763 |
End bp | 4351043 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644982125 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003139837 |
Protein GI | 257061949 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
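The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4349763, 4351043)
print(length_bp)                  # 1281
print(protein_length(length_bp))  # 426
```

Both values match the Gene Length and Protein Length rows of the record.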
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.215316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGTAGAT TTGTAAAGCT CAAACGACTG GCTATTTGGT CTTTGGTTGG ATTACTCTTA AGTTGGTTGA TTAGCTGTAA TGCTGCCCCT CCAACTTCTT CTAGTCCTGA ATTAGAGTTT TGGACGATGC AGCTTCAGCC GAAATTTACG CCCTATTTCA CAGAGGTCAT TAGGCAATAC GAATCAGAAA ATCAAGGCAT TAAGCTGCGT TGGGTAGATG TCCCCTGGGA AGCGATGGAA AGCAAGATTT TAACGGCGGT TTCAGCGAAA ACTGCCCCCG ATGTAGTCAA TCTTAACCCG AATTTTGCTT CCCAACTGGC CAGTCGCAAC GCTTGGTTAG ACTTAAATAC GCAAATTCCA CCGGAGGTTA AACAACAATA TCTCCCGAAG ATTTGGGCAG CAACAACGCT AAAAGACGCG AGTTTTGGCA TTCCTTGGTA CTTAACAACC CGTATTACCC TTTCTAACCA AGATTTACTT AGCAAAGCGG GAATTAAGGA ACCACCGAAA ACCTTTGAGG AATTAGCCGA TGTGGCTGCT AAACTTAAGG AGAAAACGGG GAAATATGCC CTATTTGTGA CCTTCGTACC GGGGGACTCT GGGGAAGTCT TGGAGTCTTT GGTGCAAATG GGAGTCCAGT TAGTGGATGA TCAGGGTAAA GCAGCGTTTA ATACCCCTGA TGGCATAGCA GGGTTCCGTT ATTGGGTAGA TTTATATCAA CAAGGACTGT TACCCCCTGA AGTTCTCACC CAAGGACATC GCCATGCGAT AGATTTATAT CAGTCGGGAG AGATAGCTTT ACTCTCTTCT GGGGCGGAAT TTCTGACCAG TATTGAAACG AATGCCCCAA CCATTGCGAA AGTAACAGCC ACTTCTCCCC AAATTACCGG AAAAACAGGT AAAAAGAACG TGGCAGTGAT GAATTTAGTC ATTCCCCGTG ATACGGATAA AGCTGAAGAG TCGGTAAAAT TTGCGCTTTT TGTCACGAAT ACGGAAAATC AACTCGGGTT TGCTAAGGCG GCTAATGTCC TTCCTTCGAC GGTAGAGGGA GTTAAACGCT ATATTGAGGA GTTAAAACAG TCTTCTGATT CTAGCGCGAT CGCTCAAGCG CGTCAAGTTA GTGCGATGCA ACTCAATGAT GCAGAAGTCC TAGTTCCAGC AATGAAAGAC CTTAATAAGT TGCAACAGAT TATTTACGAA AATTTACAAG CTGCCATGCT CAAAGAGAAA ACTGTCGAAC AAGCCGTTAA GGATGCTGCT GATGCTTGGG ATAGTATTTA G
Protein sequence | MRRFVKLKRL AIWSLVGLLL SWLISCNAAP PTSSSPELEF WTMQLQPKFT PYFTEVIRQY ESENQGIKLR WVDVPWEAME SKILTAVSAK TAPDVVNLNP NFASQLASRN AWLDLNTQIP PEVKQQYLPK IWAATTLKDA SFGIPWYLTT RITLSNQDLL SKAGIKEPPK TFEELADVAA KLKEKTGKYA LFVTFVPGDS GEVLESLVQM GVQLVDDQGK AAFNTPDGIA GFRYWVDLYQ QGLLPPEVLT QGHRHAIDLY QSGEIALLSS GAEFLTSIET NAPTIAKVTA TSPQITGKTG KKNVAVMNLV IPRDTDKAEE SVKFALFVTN TENQLGFAKA ANVLPSTVEG VKRYIEELKQ SSDSSAIAQA RQVSAMQLND AEVLVPAMKD LNKLQQIIYE NLQAAMLKEK TVEQAVKDAA DAWDSI
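The sequences above are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any downstream use. The sketch below shows one way to do that and to compute a GC fraction; the helper names are hypothetical, and the sample is only the first three blocks of the gene sequence, so its GC fraction is not expected to equal the whole-gene value in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between 10-character blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, copied from the record.
sample = "ATGCGTAGAT TTGTAAAGCT CAAACGACTG"
seq = clean_sequence(sample)
print(len(seq))  # 30
print(f"GC fraction of sample: {gc_fraction(seq):.2f}")
```

Running the same two functions on the full gene sequence gives a value that can be compared with the GC content row of the record.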