Gene PCC8801_2115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2115 
Symbol 
ID7104347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2184414 
End bp2187443 
Gene Length3030 bp 
Protein Length1009 aa 
Translation table11 
GC content36% 
IMG OID643475172 
Producthypothetical protein 
Protein accessionYP_002372303 
Protein GI218246932 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCATC GACAAGAATG GCAAAATAGC TGCGTTGATG AAGCGTTGAT TAATCTCAAT 
GTAACCGCAT TAGAAGGTAA TTCTCCCTCA GATTATTTGC TGTATTCTGA CGCACTACCT
AGACGTAACG ACGGTAGGGT CAGTGATTCT ATTTTAAGAC GCTACGAACA CACAGAACAA
GGAGGTTGGT GGTGCTCAGG GGTTGATGTC TTGACGGGAA ATGAGGATCT ATGGGGGTGT
TTTAAACCTA ATTCTCCCCG TATTAGCCAT GATCGCCATA AACCGATTAA ATACGAACAT
CCTCCCAACG CGCCAACGGG TATTTTTGCT TTGCGGGTTC CGTTGCATTT ATGGCAAGAT
ATCGAGCAAT CTTACCATTG TGATCTTACC ACAGAAGATA TTAATGAGCA ACTACCCGAT
TTAGGCTTTT GGCAATGGGT GATTAATCAT CCAAATATTC CCCTTTTTAT CACAGAAGGA
GCGAAAAAAG CGGGAGCATT ATTAACAGCA GGATACGTTG CGATCGCCCT TCCTGGGATT
AATAATGGTT ATCGAACTCC CCATGATGAA TTTGGCAACC GTATCGGTAA GTCCCGTTTA
ATTCCTCAAC TGGAAAAACT GGCTATTTCT GGTAGAAAAA TCTATATTGT TTTTGATCAA
GAGAGTAAAC CTAATACCAT TAAAGCAGTC AATACGGCTA TTAGAAATTT AGGCTATTTG
TTCACTCAAG CAGGATGTCA AGTTAACGTC ATTACTTGGT TGCTAGAATG GGGTAAAGGG
GTTGATGATT TTATTGCTAA TAAGGGACTA GATAAGTTCA AAGAAGTTTA TCAAAAAGCC
TTACCGTTAG AAACCTGGAA AGCACAAGGA TTAAGTCAAT TAACCTATCC CTATGATGTA
GAAGTTAATC GTCGCTATTT AGGAGAGTTA GCTATTCCTA AAACGGCTCA ATTAATTGGG
ATTAAATCAG CTATTGGAAC GGGAAAAACC CAAGGGTTAG AAAAAATTGT TCAAGAGGCG
ATCGCCAATA ATCAAAAAGT CTTAGTCATT GGACATCGAA TTAAGTTAGT TGAACAACTT
TGTCAACGGT TTCAACTTCC TTATATTACG GAAATTCAGA ATTATGATGT TACCTTGGGA
TATGGATTAT GTATTGACTC ACTGCATCCT AATTCTCAAG CGAAGTTTAA TCCTGATGAG
TGGGAAAATA GTTTAATTAT TATTGATGAA GTTGAACAAG TTTTATGGCA TGGCTTAAAT
TCAGATACTT GTCAAAAAAA TCGGGTTGCT ATCCTTAAAT CTCTCAAAAT ATTACTGCAA
ACAGTTTTAG AAACACAAGG AAAAGTGTTT ATTGCCGATG CAGACTTAAG TGATATTTCC
TTAGATTATT TAATCTCTTT AACAGGAATT AATCTAAAAC CGTTTATTAT TAATAATACT
TGGAAACCCA CTAATAAAGA GTCATGGACA GTTTATAACT ATCCAGAAAC TACTCCTAAA
CGCTTAGTTA AAGATTTAGT CCAGCATATT GAGCAAGGAG GAAAACCGTT TATTTGTCTT
TCCGCGCAAA AATTAACCAG TAATTGGGGA ACACAAACCC TAGAATCTTA CTTAAAAAAA
CAATTTCCTG ATGCTAAAAT ACTACGGATT GATTCTGAGT CATTAACCGA TCCTAATCAT
GCTGCTTACC AATGTATTAG CCAGCTTAAT GAGATTTTAT TTAACTATGA TATTGTCCTA
GCCAGTCCTT CTATTGAAAC AGGAGTTAGT ATTGATATTA AAGGACATTT TACCTCAGTT
TGGGGGTTAG CTCAAGGAGT ACAAATAGCT ACTTCTGTTT GTCAATCGTT AGGACGTATT
CGGGAGAATA TTCCCCGTTA TCTTTGGGTT GCTTCCTACG GGTTTAATAA AGTAGGAAAT
GGTTCCACTT CCATACCTAA TTTGTTAACC TCTAACCATC GGGTGACTCA ATTAAATGTT
CGTTTGTTGC AACAATCTGA TCTAGAAGCA TTAGAGGATA TTGATACAGA ATTTCAAGCA
GAATCATTGC TATGTTGGGC AAAAATGGCA GTTCGTGTGA ATGCTTCCAT GATTTATTAT
CGAGAGTCTA TTTTACGGAT ACTTGAACAA CAAAATCATC AAGTTTATCC CAATACTAAG
GTAATTCAAT CTTCACGAAA TAAAAACAAT CAAAATAACA ATAAAACTGA TCAAACCTCT
AACCAATTAA CTGAAGCAAT TGAAATAGTT AGGGAAGAAA ACTATCACGC AGAATGTCAA
GCAATTGCCC AAGCAGAAGA ACTAACGAAT CAAGAATATC GTAGTTTAAA CAAACGATTG
GTTAAAACAT CCTCAGAACG TCATAGACTG AGAAAATATA ATTTACAACG ACGTTATTGT
ATCCCTGTTA CCCCTGAATT AGTCGCTTTA GATAATGAAG GATGGTATCA AAAACTTAGG
TTACATTATT TCCTAACAAT AGGACGCTGT TATTTAGCTG ATAGAGATAC TATTGTTGCT
CAAAAATTAA TTAACAAAGG ACACGGTAGT TTATTTATTC CCGACTTTAA TGGTTGCCAA
TTAGGGGCAA TTATTGGAAC GATGGAAGTT TTAGGATTGC CTGTTTTATT GTCAAATAAT
CAACGAAAAT TAAAACCCGT AGATGAAGAT TTACAAACCA TGGCTAAGAT GGCTATTAAA
AATCGTTCAG AGATCAAAAC TATTCTGGGA ATTGGTATTG CTAAAAACTC CAGTCCTATT
ACAATTATTC GACGATTATT AGATAAAATT GGCTATGGAT TGACTTGTAT TGGTTTAGAA
ACAGTCGCTA AAAAGCGGGT TCGAGTTTAT CAAGTTGTTT TGCCTAATGA TCAACGGGAA
GAAGTGTTTA AACAATGGTG GTATAGGGAT GAAAATTGTC CAGGGAGTTC TGAACCCTGG
TTTGAAGAAT ATACTATCGC TAAATCAAAT CTGAGTCAAA ATCAACAAGA AGGATCTAAA
AATTATATTC AATTGAGTTT AGAGTTATAA
 
Protein sequence
MNHRQEWQNS CVDEALINLN VTALEGNSPS DYLLYSDALP RRNDGRVSDS ILRRYEHTEQ 
GGWWCSGVDV LTGNEDLWGC FKPNSPRISH DRHKPIKYEH PPNAPTGIFA LRVPLHLWQD
IEQSYHCDLT TEDINEQLPD LGFWQWVINH PNIPLFITEG AKKAGALLTA GYVAIALPGI
NNGYRTPHDE FGNRIGKSRL IPQLEKLAIS GRKIYIVFDQ ESKPNTIKAV NTAIRNLGYL
FTQAGCQVNV ITWLLEWGKG VDDFIANKGL DKFKEVYQKA LPLETWKAQG LSQLTYPYDV
EVNRRYLGEL AIPKTAQLIG IKSAIGTGKT QGLEKIVQEA IANNQKVLVI GHRIKLVEQL
CQRFQLPYIT EIQNYDVTLG YGLCIDSLHP NSQAKFNPDE WENSLIIIDE VEQVLWHGLN
SDTCQKNRVA ILKSLKILLQ TVLETQGKVF IADADLSDIS LDYLISLTGI NLKPFIINNT
WKPTNKESWT VYNYPETTPK RLVKDLVQHI EQGGKPFICL SAQKLTSNWG TQTLESYLKK
QFPDAKILRI DSESLTDPNH AAYQCISQLN EILFNYDIVL ASPSIETGVS IDIKGHFTSV
WGLAQGVQIA TSVCQSLGRI RENIPRYLWV ASYGFNKVGN GSTSIPNLLT SNHRVTQLNV
RLLQQSDLEA LEDIDTEFQA ESLLCWAKMA VRVNASMIYY RESILRILEQ QNHQVYPNTK
VIQSSRNKNN QNNNKTDQTS NQLTEAIEIV REENYHAECQ AIAQAEELTN QEYRSLNKRL
VKTSSERHRL RKYNLQRRYC IPVTPELVAL DNEGWYQKLR LHYFLTIGRC YLADRDTIVA
QKLINKGHGS LFIPDFNGCQ LGAIIGTMEV LGLPVLLSNN QRKLKPVDED LQTMAKMAIK
NRSEIKTILG IGIAKNSSPI TIIRRLLDKI GYGLTCIGLE TVAKKRVRVY QVVLPNDQRE
EVFKQWWYRD ENCPGSSEPW FEEYTIAKSN LSQNQQEGSK NYIQLSLEL