Gene Cyan8802_2159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2159 
Symbol 
ID8391476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2163421 
End bp2166450 
Gene Length3030 bp 
Protein Length1009 aa 
Translation table11 
GC content36% 
IMG OID644980137 
Producthypothetical protein 
Protein accessionYP_003137881 
Protein GI257059993 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.135027 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.649366 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCATC GACAAGAATG GCAAAATAGC TGCGTTGATG AAGCGTTGAT TAATCTCAAT 
GTAACCGCAT TAGAAGGTAA TTCTCCCTCA GATTATTTGC TGTATTCTGA CGCACTACCT
AGACGTAACG ACGGTAGGGT CAGTGATTCT ATTTTAAGAC GCTACGAACA CACAGAACAA
GGAGGTTGGT GGTGCTCAGG GGTTGATGTC TTGACGGGAA ATGAGGATCT ATGGGGGTGT
TTTAAACCTA ATTCTCCCCG TATTAGCCAT GATCGCCATA AACCGATTAA ATACGAACAT
CCTCCCAACG CGCCAACGGG TATTTTTGCT TTGCGGGTTC CGTTGCATTT ATGGCAAGAT
ATCGAGCAAT CTTACCATTG TGATCTTACC ACAGAAGATA TTAATGAGCA ACTACCCGAT
TTAGGCTTTT GGCAATGGGT GATTAATCAT CCAAATATTC CCCTTTTTAT CACAGAAGGA
GCGAAAAAAG CGGGAGCATT ATTAACAGCA GGATACGTTG CGATCGCCCT TCCTGGGATT
AATAATGGTT ATCGAACTCC CCATGATGAA TTTGGCAACC GTATCGGGAA GTCCCGTTTA
ATTCCTCAAC TGGAAAAACT GGCTATTTCT GGTAGAAAAA TCTATATTGT TTTTGATCAA
GAGAGTAAAC CGAATACCAT TAAAGCAGTC AATACGGCTA TTAGAAATTT AGGCTATTTG
TTCACTCAAG CAGGATGTCA AGTTAATGTC ATTACTTGGT TAGTAGAATG GGGTAAAGGG
GTTGATGATT TTATTGCTAA TAAGGGACTA GATAAGTTCA AAGAAGTTTA TCAAAAAGCC
TTACCGTTAG AAACCTGGAA AGCACAAGGA TTAAGTCAAT TAACCTATCC CTATGATGTA
GAAGTTAATC GTCGCTATTT AGGAGAATTA GCTATTCCTA AAACGGCTCA ATTAATTGGC
ATTAAATCGG CTATTGGAAC GGGAAAAACC CAAGGGTTAG AAAAAATTGT TCAAGAGGCG
ATCGCCAATA ATCAAAAAGT CTTAGTCATT GGACATCGAA TTAAGTTAGT TGAACAACTT
TGTCAACGGT TTCAACTTCC TTATATTACG GAAATTCAGA ATTATGATGT TACCTTGGGA
TATGGATTAT GTATTGACTC ACTGCATCCT AATTCTCAAG CGAAGTTTAA TCCTGATGAG
TGGGAAAATA GTTTAATTAT TATTGATGAA GTCGAACAAG TTTTATGGCA TGGCTTAAAT
TCAGATACTT GTCAAAAAAA TCGAGTTTCT ATCCTTAAAT CTCTCAAAAT ATTACTACAA
ACGGTTTTAG AAACGCAAGG AAAAGTGTTT ATTGCCGATG CAGACTTAAG TGATATTTCC
TTAGATTATT TAATCTCTTT AACAGGAATT AATCTAAAAC CGTTTATTAT TAATAATACT
TGGAAACCCA CTAATAAAGA GTCATGGACA GTTTATAACT ATCCAGAAAC TACCCCTAAA
CGCTTAGTTA AAGATTTAGT CCAACATATT CAACAAGGAG GAAAACCGTT TATTTGTCTT
TCCGCACAAA AATTAACCAG TAATTGGGGA ACACAAACCC TAGAATCTTA CTTAAAAAAA
CAATTTCCTG ATGCTAAAAT ACTACGGATT GATTCTGAGT CTTTAACCGA TCCTAATCAT
GCTGCTTACC AGTGTATTAA GCAACTTAAT GAGATTTTAT TAGATTATGA TATTGTCCTA
GCCAGTCCTT CTATTGAGAC GGGAGTTAGT ATTGATATTA AAGGACATTT TACCTCAGTT
TGGGGGTTAG CTCAAGGAGT ACAAATAGCT ACCTCGGTTT GTCAATCGTT AGGACGTATT
CGGGATAATA TACCGCGTTA TCTTTGGGTT GCTTCCTATG GGTTTAATAA AATAGGAAAT
GGTTCAACTT CCATACCTAA TTTGTTAACC TCTAACCATC GTGTCACACA ATTAAATGTT
CGTTTGTTGC AACAATCTGA TCTAGAAGCA TTAGAGGATA TTGATACAGA ATTTCAAGCA
GAATCATTGC TGTGTTGGGC AAAAATGGCA GTTCGTGTGA ATGCTTCCAT GATTCATTAT
CGAGAGTCTA TTTTACGGAT ACTTGAACAA CAAAATCATC AAATTTATCC TAATACTAAG
GTAATTCAAT CTTCACGAAA TAAAAACAAT CAAAATAACA ATAAAAGCGA TCAAACGTCT
AACCAATTAA CCGAAGTAAT TGAAATAATT AGAGAAGAAA ACTATCAAGC AGAATGTCAA
GCTATTGCCC AAGCAGAAGA ACTAACGGAT CAAAAATATC GTCATTTAAA CAAACGATTA
GTTAAAACAT CCCTAGAACG TCATCAACTG AGAAAATATA ATTTACAACG ACGTTATTGC
ATTCCTGTTA CCCCTGAATT AGTCGCTTTA GATAATGAAG GATGGTACCA GAAACTTAGG
TTACATTATT TCCTAACAAT AGGACGCTGT TATTTAGCTG ATAGAGATAC TATTGTTGCT
CAAAAATTGA TTAACAAAGG ACACGGTAGT TTATTTATTC CCGACTTTAA TGGTTGCCAA
TTAGGGGCAA TTATTGGAAC GATGGAAGTT TTAGGATTGC CTGTTTTATT GTCAAATAGT
CAACGGAAAT TAAAACCCGT AGATGAAGAT TTACAAACCA TGGCTAAGAT GGCTATTAAA
AATCGTTCAG AGATCAAAAC TATTCTGGGA ATTGGTATTG CTAAAAACTC CAGTCCTATT
ACAATTATTC GACGATTATT AGATAAAATT GGCTATGGAT TGACTTGTAT TGGTTTAGAA
ACAGTCGCTA AAAAGCGGGT TCGAGTTTAT CAAGTTGTTC CTCCTAATGA TCAACGCGAA
GAAGTGTTTA AACAATGGTG GTATAGGGAT GAAAATTGTC CAGGGAGTTC CGAACCCTGG
TTTGAAGAAT ATACTATCGC TAAATCAAAT CTGAGTCAAA ATCAACAAGA AGGATCTAAA
AATTATATTC AATTGAGTTT AGAATTGTAA
 
Protein sequence
MNHRQEWQNS CVDEALINLN VTALEGNSPS DYLLYSDALP RRNDGRVSDS ILRRYEHTEQ 
GGWWCSGVDV LTGNEDLWGC FKPNSPRISH DRHKPIKYEH PPNAPTGIFA LRVPLHLWQD
IEQSYHCDLT TEDINEQLPD LGFWQWVINH PNIPLFITEG AKKAGALLTA GYVAIALPGI
NNGYRTPHDE FGNRIGKSRL IPQLEKLAIS GRKIYIVFDQ ESKPNTIKAV NTAIRNLGYL
FTQAGCQVNV ITWLVEWGKG VDDFIANKGL DKFKEVYQKA LPLETWKAQG LSQLTYPYDV
EVNRRYLGEL AIPKTAQLIG IKSAIGTGKT QGLEKIVQEA IANNQKVLVI GHRIKLVEQL
CQRFQLPYIT EIQNYDVTLG YGLCIDSLHP NSQAKFNPDE WENSLIIIDE VEQVLWHGLN
SDTCQKNRVS ILKSLKILLQ TVLETQGKVF IADADLSDIS LDYLISLTGI NLKPFIINNT
WKPTNKESWT VYNYPETTPK RLVKDLVQHI QQGGKPFICL SAQKLTSNWG TQTLESYLKK
QFPDAKILRI DSESLTDPNH AAYQCIKQLN EILLDYDIVL ASPSIETGVS IDIKGHFTSV
WGLAQGVQIA TSVCQSLGRI RDNIPRYLWV ASYGFNKIGN GSTSIPNLLT SNHRVTQLNV
RLLQQSDLEA LEDIDTEFQA ESLLCWAKMA VRVNASMIHY RESILRILEQ QNHQIYPNTK
VIQSSRNKNN QNNNKSDQTS NQLTEVIEII REENYQAECQ AIAQAEELTD QKYRHLNKRL
VKTSLERHQL RKYNLQRRYC IPVTPELVAL DNEGWYQKLR LHYFLTIGRC YLADRDTIVA
QKLINKGHGS LFIPDFNGCQ LGAIIGTMEV LGLPVLLSNS QRKLKPVDED LQTMAKMAIK
NRSEIKTILG IGIAKNSSPI TIIRRLLDKI GYGLTCIGLE TVAKKRVRVY QVVPPNDQRE
EVFKQWWYRD ENCPGSSEPW FEEYTIAKSN LSQNQQEGSK NYIQLSLEL