Gene Information |
Locus tag | PCC8801_3521 |
Symbol | |
ID | 7105688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 3672404 |
End bp | 3673777 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643476533 |
Product | bicarbonate transport system substrate-binding protein |
Protein accession | YP_002373642 |
Protein GI | 218248271 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
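The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_len = span_length(3672404, 3673777)
prot_len = expected_protein_length(gene_len)
print(gene_len, prot_len)  # 1374 457
```

Both results agree with the Gene Length (1374 bp) and Protein Length (457 aa) fields.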
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAACT TTTCCCCATT GTCTCGTCGT AAATTTTTAG TCACTGCTTC TTTATCAGCC GCCAGTGCAG TCTTATTAAA AGGCTGTTTA GGCAACCCTC CCGAACCCAC TGCTACCAGT TCCCCTGGAG CATCTCCCTC TCCGGCTGCT TCTCCCTCAG CCTTAACCCC CGAAACCACT CCTGAAACCC CTAGCGCGAA ATTAGGCTTT ATTCCTATTT TTGAGGCTGC GCCCCTAATT ATTGCGCAAG AAAAGGGATT TTTTGCCAAA TACGGCATGA ATCAAGTGGA TGTGTCGAAA CAAGCCAACT GGGGCTCAGC TAGGGATAAC GTGGAAATTG GGTCGGCTGC CGGAGGGATC GATGGCGGAC AGTGGCAAAT GCCCATGCCT TACCTGATTT CTGAGGGGTT GATTACTAAA AATAACGTCA AAATTCCCAT GTATGTGTTA GCGATGCTGA ATACTCAAGG GAATGGCATT GCGATCGCCA AATCTCAAGA AGGCAAAAAC ATTGGGTTAG ATGTCAGCAA GGCTAAAGAC TATATTACGG GTCTTAAAGC GTCAGGAAAA CCCTTTAGGG CTGCCTATAC GTTCCCTCAA GCCAACCAAG ATTTTTGGAT ACGTTACTGG TTAGCTGCCG GAGGTATTGA CCCGGACAAA GAAGTGAACC TAATGACGGT TCCCGCAGCG CAAACCGTGG CCAACATGAA AACGGGAAGC ATGGAAGGGT TTAGTACAGG TGATCCGTGG CCAGCCCGAA TTGTTGGGGA TGATATTGGC TTTATGGCTG CATTAACCGC ACAAATTTGG CCCTTTCATC CTGAAGAATA TTTTGCCATG CGGGGGGATT GGGTAGATCA AAATCCCAAA GCAACCAAAG CGTTATTAAA AGGAATTATG GAGGCACAAC AATGGTGTGA TGTGGAAGCT AATCGTACAG AAATGGCACA GATTTTATCA GGGGCAAAAT ACTTTAATGT TCCCGTTGAA ATTTTGGAAC CTATGTTAAA AGGAACGTAC ATCATGGGAG ATGGACAACC CGAAATTAAA GACTTCCAAA AAGCAGCAAT GTATTGGAAA TCTCCCCTTG GTAGTGTTTC TTTTCCCTAC AAGAGTCTTG ATCTTTGGTT CTTAACTGAA AGTGTTCGTT GGGGTTTCTT ACCTCCCAAT ACTTTAGATA ATAATGCCAA GGCTTTAATT GATAAAGTGA ACCGTTCAGA TATTTGGAAA GAAGCAGCCA AAGAAGCAGG GATTCCTGAT GCAGATATTC CTACCAGTGA CTCTCGCGGT GTTGAGAAGT TCTTTGATGG CAAAGAGTTT AATCCAGATA ATCCGAAAGC CTATCTACAA AGTCTGACAA TTAAACGAGT TTAA
|
Protein sequence | MSNFSPLSRR KFLVTASLSA ASAVLLKGCL GNPPEPTATS SPGASPSPAA SPSALTPETT PETPSAKLGF IPIFEAAPLI IAQEKGFFAK YGMNQVDVSK QANWGSARDN VEIGSAAGGI DGGQWQMPMP YLISEGLITK NNVKIPMYVL AMLNTQGNGI AIAKSQEGKN IGLDVSKAKD YITGLKASGK PFRAAYTFPQ ANQDFWIRYW LAAGGIDPDK EVNLMTVPAA QTVANMKTGS MEGFSTGDPW PARIVGDDIG FMAALTAQIW PFHPEEYFAM RGDWVDQNPK ATKALLKGIM EAQQWCDVEA NRTEMAQILS GAKYFNVPVE ILEPMLKGTY IMGDGQPEIK DFQKAAMYWK SPLGSVSFPY KSLDLWFLTE SVRWGFLPPN TLDNNAKALI DKVNRSDIWK EAAKEAGIPD ADIPTSDSRG VEKFFDGKEF NPDNPKAYLQ SLTIKRV
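The sequences above are displayed in space-separated groups of 10 characters, so the spaces need to be removed before any analysis. The following is a minimal sketch, assuming the sequence has been copied into a plain string. `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example uses only the first two groups of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Drop the whitespace used for display grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base groups of the gene sequence, as displayed above.
head = clean_sequence("ATGTCCAACT TTTCCCCATT")
print(head)               # ATGTCCAACTTTTCCCCATT
print(len(head))          # 20
print(gc_fraction(head))  # 0.4
```

Applied to the full gene sequence, the cleaned length should match the Gene Length field, and the GC fraction can be compared with the reported GC content.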