Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_2115 |
Symbol | |
ID | 7104347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 2184414 |
End bp | 2187443 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643475172 |
Product | hypothetical protein |
Protein accession | YP_002372303 |
Protein GI | 218246932 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCATC GACAAGAATG GCAAAATAGC TGCGTTGATG AAGCGTTGAT TAATCTCAAT GTAACCGCAT TAGAAGGTAA TTCTCCCTCA GATTATTTGC TGTATTCTGA CGCACTACCT AGACGTAACG ACGGTAGGGT CAGTGATTCT ATTTTAAGAC GCTACGAACA CACAGAACAA GGAGGTTGGT GGTGCTCAGG GGTTGATGTC TTGACGGGAA ATGAGGATCT ATGGGGGTGT TTTAAACCTA ATTCTCCCCG TATTAGCCAT GATCGCCATA AACCGATTAA ATACGAACAT CCTCCCAACG CGCCAACGGG TATTTTTGCT TTGCGGGTTC CGTTGCATTT ATGGCAAGAT ATCGAGCAAT CTTACCATTG TGATCTTACC ACAGAAGATA TTAATGAGCA ACTACCCGAT TTAGGCTTTT GGCAATGGGT GATTAATCAT CCAAATATTC CCCTTTTTAT CACAGAAGGA GCGAAAAAAG CGGGAGCATT ATTAACAGCA GGATACGTTG CGATCGCCCT TCCTGGGATT AATAATGGTT ATCGAACTCC CCATGATGAA TTTGGCAACC GTATCGGTAA GTCCCGTTTA ATTCCTCAAC TGGAAAAACT GGCTATTTCT GGTAGAAAAA TCTATATTGT TTTTGATCAA GAGAGTAAAC CTAATACCAT TAAAGCAGTC AATACGGCTA TTAGAAATTT AGGCTATTTG TTCACTCAAG CAGGATGTCA AGTTAACGTC ATTACTTGGT TGCTAGAATG GGGTAAAGGG GTTGATGATT TTATTGCTAA TAAGGGACTA GATAAGTTCA AAGAAGTTTA TCAAAAAGCC TTACCGTTAG AAACCTGGAA AGCACAAGGA TTAAGTCAAT TAACCTATCC CTATGATGTA GAAGTTAATC GTCGCTATTT AGGAGAGTTA GCTATTCCTA AAACGGCTCA ATTAATTGGG ATTAAATCAG CTATTGGAAC GGGAAAAACC CAAGGGTTAG AAAAAATTGT TCAAGAGGCG ATCGCCAATA ATCAAAAAGT CTTAGTCATT GGACATCGAA TTAAGTTAGT TGAACAACTT TGTCAACGGT TTCAACTTCC TTATATTACG GAAATTCAGA ATTATGATGT TACCTTGGGA TATGGATTAT GTATTGACTC ACTGCATCCT AATTCTCAAG CGAAGTTTAA TCCTGATGAG TGGGAAAATA GTTTAATTAT TATTGATGAA GTTGAACAAG TTTTATGGCA TGGCTTAAAT TCAGATACTT GTCAAAAAAA TCGGGTTGCT ATCCTTAAAT CTCTCAAAAT ATTACTGCAA ACAGTTTTAG AAACACAAGG AAAAGTGTTT ATTGCCGATG CAGACTTAAG TGATATTTCC TTAGATTATT TAATCTCTTT AACAGGAATT AATCTAAAAC CGTTTATTAT TAATAATACT TGGAAACCCA CTAATAAAGA GTCATGGACA GTTTATAACT ATCCAGAAAC TACTCCTAAA CGCTTAGTTA AAGATTTAGT CCAGCATATT GAGCAAGGAG GAAAACCGTT TATTTGTCTT TCCGCGCAAA AATTAACCAG TAATTGGGGA ACACAAACCC TAGAATCTTA CTTAAAAAAA CAATTTCCTG ATGCTAAAAT ACTACGGATT GATTCTGAGT CATTAACCGA TCCTAATCAT GCTGCTTACC AATGTATTAG CCAGCTTAAT GAGATTTTAT TTAACTATGA TATTGTCCTA GCCAGTCCTT CTATTGAAAC AGGAGTTAGT ATTGATATTA AAGGACATTT TACCTCAGTT TGGGGGTTAG CTCAAGGAGT ACAAATAGCT ACTTCTGTTT GTCAATCGTT AGGACGTATT CGGGAGAATA TTCCCCGTTA TCTTTGGGTT GCTTCCTACG GGTTTAATAA AGTAGGAAAT GGTTCCACTT CCATACCTAA TTTGTTAACC TCTAACCATC GGGTGACTCA ATTAAATGTT CGTTTGTTGC AACAATCTGA TCTAGAAGCA TTAGAGGATA TTGATACAGA ATTTCAAGCA GAATCATTGC TATGTTGGGC AAAAATGGCA GTTCGTGTGA ATGCTTCCAT GATTTATTAT CGAGAGTCTA TTTTACGGAT ACTTGAACAA CAAAATCATC AAGTTTATCC CAATACTAAG GTAATTCAAT CTTCACGAAA TAAAAACAAT CAAAATAACA ATAAAACTGA TCAAACCTCT AACCAATTAA CTGAAGCAAT TGAAATAGTT AGGGAAGAAA ACTATCACGC AGAATGTCAA GCAATTGCCC AAGCAGAAGA ACTAACGAAT CAAGAATATC GTAGTTTAAA CAAACGATTG GTTAAAACAT CCTCAGAACG TCATAGACTG AGAAAATATA ATTTACAACG ACGTTATTGT ATCCCTGTTA CCCCTGAATT AGTCGCTTTA GATAATGAAG GATGGTATCA AAAACTTAGG TTACATTATT TCCTAACAAT AGGACGCTGT TATTTAGCTG ATAGAGATAC TATTGTTGCT CAAAAATTAA TTAACAAAGG ACACGGTAGT TTATTTATTC CCGACTTTAA TGGTTGCCAA TTAGGGGCAA TTATTGGAAC GATGGAAGTT TTAGGATTGC CTGTTTTATT GTCAAATAAT CAACGAAAAT TAAAACCCGT AGATGAAGAT TTACAAACCA TGGCTAAGAT GGCTATTAAA AATCGTTCAG AGATCAAAAC TATTCTGGGA ATTGGTATTG CTAAAAACTC CAGTCCTATT ACAATTATTC GACGATTATT AGATAAAATT GGCTATGGAT TGACTTGTAT TGGTTTAGAA ACAGTCGCTA AAAAGCGGGT TCGAGTTTAT CAAGTTGTTT TGCCTAATGA TCAACGGGAA GAAGTGTTTA AACAATGGTG GTATAGGGAT GAAAATTGTC CAGGGAGTTC TGAACCCTGG TTTGAAGAAT ATACTATCGC TAAATCAAAT CTGAGTCAAA ATCAACAAGA AGGATCTAAA AATTATATTC AATTGAGTTT AGAGTTATAA
|
Protein sequence | MNHRQEWQNS CVDEALINLN VTALEGNSPS DYLLYSDALP RRNDGRVSDS ILRRYEHTEQ GGWWCSGVDV LTGNEDLWGC FKPNSPRISH DRHKPIKYEH PPNAPTGIFA LRVPLHLWQD IEQSYHCDLT TEDINEQLPD LGFWQWVINH PNIPLFITEG AKKAGALLTA GYVAIALPGI NNGYRTPHDE FGNRIGKSRL IPQLEKLAIS GRKIYIVFDQ ESKPNTIKAV NTAIRNLGYL FTQAGCQVNV ITWLLEWGKG VDDFIANKGL DKFKEVYQKA LPLETWKAQG LSQLTYPYDV EVNRRYLGEL AIPKTAQLIG IKSAIGTGKT QGLEKIVQEA IANNQKVLVI GHRIKLVEQL CQRFQLPYIT EIQNYDVTLG YGLCIDSLHP NSQAKFNPDE WENSLIIIDE VEQVLWHGLN SDTCQKNRVA ILKSLKILLQ TVLETQGKVF IADADLSDIS LDYLISLTGI NLKPFIINNT WKPTNKESWT VYNYPETTPK RLVKDLVQHI EQGGKPFICL SAQKLTSNWG TQTLESYLKK QFPDAKILRI DSESLTDPNH AAYQCISQLN EILFNYDIVL ASPSIETGVS IDIKGHFTSV WGLAQGVQIA TSVCQSLGRI RENIPRYLWV ASYGFNKVGN GSTSIPNLLT SNHRVTQLNV RLLQQSDLEA LEDIDTEFQA ESLLCWAKMA VRVNASMIYY RESILRILEQ QNHQVYPNTK VIQSSRNKNN QNNNKTDQTS NQLTEAIEIV REENYHAECQ AIAQAEELTN QEYRSLNKRL VKTSSERHRL RKYNLQRRYC IPVTPELVAL DNEGWYQKLR LHYFLTIGRC YLADRDTIVA QKLINKGHGS LFIPDFNGCQ LGAIIGTMEV LGLPVLLSNN QRKLKPVDED LQTMAKMAIK NRSEIKTILG IGIAKNSSPI TIIRRLLDKI GYGLTCIGLE TVAKKRVRVY QVVLPNDQRE EVFKQWWYRD ENCPGSSEPW FEEYTIAKSN LSQNQQEGSK NYIQLSLEL
|
| |