Gene PCC8801_3521 details


Gene Information

Locus tag: PCC8801_3521
Symbol:
ID: 7105688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 3672404
End bp: 3673777
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 45%
IMG OID: 643476533
Product: bicarbonate transport system substrate-binding protein
Protein accession: YP_002373642
Protein GI: 218248271
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
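
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship in Python, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. Neither assumption is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(3672404, 3673777)
print(length, "bp")                  # 1374 bp
print(protein_length(length), "aa")  # 457 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.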


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCAACT TTTCCCCATT GTCTCGTCGT AAATTTTTAG TCACTGCTTC TTTATCAGCC 
GCCAGTGCAG TCTTATTAAA AGGCTGTTTA GGCAACCCTC CCGAACCCAC TGCTACCAGT
TCCCCTGGAG CATCTCCCTC TCCGGCTGCT TCTCCCTCAG CCTTAACCCC CGAAACCACT
CCTGAAACCC CTAGCGCGAA ATTAGGCTTT ATTCCTATTT TTGAGGCTGC GCCCCTAATT
ATTGCGCAAG AAAAGGGATT TTTTGCCAAA TACGGCATGA ATCAAGTGGA TGTGTCGAAA
CAAGCCAACT GGGGCTCAGC TAGGGATAAC GTGGAAATTG GGTCGGCTGC CGGAGGGATC
GATGGCGGAC AGTGGCAAAT GCCCATGCCT TACCTGATTT CTGAGGGGTT GATTACTAAA
AATAACGTCA AAATTCCCAT GTATGTGTTA GCGATGCTGA ATACTCAAGG GAATGGCATT
GCGATCGCCA AATCTCAAGA AGGCAAAAAC ATTGGGTTAG ATGTCAGCAA GGCTAAAGAC
TATATTACGG GTCTTAAAGC GTCAGGAAAA CCCTTTAGGG CTGCCTATAC GTTCCCTCAA
GCCAACCAAG ATTTTTGGAT ACGTTACTGG TTAGCTGCCG GAGGTATTGA CCCGGACAAA
GAAGTGAACC TAATGACGGT TCCCGCAGCG CAAACCGTGG CCAACATGAA AACGGGAAGC
ATGGAAGGGT TTAGTACAGG TGATCCGTGG CCAGCCCGAA TTGTTGGGGA TGATATTGGC
TTTATGGCTG CATTAACCGC ACAAATTTGG CCCTTTCATC CTGAAGAATA TTTTGCCATG
CGGGGGGATT GGGTAGATCA AAATCCCAAA GCAACCAAAG CGTTATTAAA AGGAATTATG
GAGGCACAAC AATGGTGTGA TGTGGAAGCT AATCGTACAG AAATGGCACA GATTTTATCA
GGGGCAAAAT ACTTTAATGT TCCCGTTGAA ATTTTGGAAC CTATGTTAAA AGGAACGTAC
ATCATGGGAG ATGGACAACC CGAAATTAAA GACTTCCAAA AAGCAGCAAT GTATTGGAAA
TCTCCCCTTG GTAGTGTTTC TTTTCCCTAC AAGAGTCTTG ATCTTTGGTT CTTAACTGAA
AGTGTTCGTT GGGGTTTCTT ACCTCCCAAT ACTTTAGATA ATAATGCCAA GGCTTTAATT
GATAAAGTGA ACCGTTCAGA TATTTGGAAA GAAGCAGCCA AAGAAGCAGG GATTCCTGAT
GCAGATATTC CTACCAGTGA CTCTCGCGGT GTTGAGAAGT TCTTTGATGG CAAAGAGTTT
AATCCAGATA ATCCGAAAGC CTATCTACAA AGTCTGACAA TTAAACGAGT TTAA
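
The record reports a GC content of 45% for this gene. A minimal Python sketch of how such a figure can be recomputed from the sequence is given here; the function name `gc_content` is hypothetical, and how the database rounds its percentage is not stated in the record. The function ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First two ten-base blocks of the gene sequence above.
print(f"{gc_content('ATGTCCAACT TTTCCCCATT'):.0%}")  # 40%
```

Pasting the full gene sequence into the call gives a value that can be compared with the reported 45%.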
 
Protein sequence
MSNFSPLSRR KFLVTASLSA ASAVLLKGCL GNPPEPTATS SPGASPSPAA SPSALTPETT 
PETPSAKLGF IPIFEAAPLI IAQEKGFFAK YGMNQVDVSK QANWGSARDN VEIGSAAGGI
DGGQWQMPMP YLISEGLITK NNVKIPMYVL AMLNTQGNGI AIAKSQEGKN IGLDVSKAKD
YITGLKASGK PFRAAYTFPQ ANQDFWIRYW LAAGGIDPDK EVNLMTVPAA QTVANMKTGS
MEGFSTGDPW PARIVGDDIG FMAALTAQIW PFHPEEYFAM RGDWVDQNPK ATKALLKGIM
EAQQWCDVEA NRTEMAQILS GAKYFNVPVE ILEPMLKGTY IMGDGQPEIK DFQKAAMYWK
SPLGSVSFPY KSLDLWFLTE SVRWGFLPPN TLDNNAKALI DKVNRSDIWK EAAKEAGIPD
ADIPTSDSRG VEKFFDGKEF NPDNPKAYLQ SLTIKRV
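
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this in Python. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons; since this gene starts with ATG, no special start handling is included. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGTCCAACT TTTCCCCATT GTCTCGTCGT"))  # MSNFSPLSRR
print(translate("ATTAAACGAGTTTAA"))                   # IKRV
```

Both outputs match the corresponding ends of the protein sequence listed above.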