Gene Information |
Locus tag | Cyan8802_2159 |
Symbol | |
ID | 8391476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 2163421 |
End bp | 2166450 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 644980137 |
Product | hypothetical protein |
Protein accession | YP_003137881 |
Protein GI | 257059993 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
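The coordinates and lengths in the Gene Information table above are mutually consistent, which a short sketch can confirm. The variable names are illustrative and not part of the record. The sketch assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 2163421
end_bp = 2166450

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp)     # 3030
print(protein_length_aa)  # 1009
```

Both results match the Gene Length (3030 bp) and Protein Length (1009 aa) fields of the record.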
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.135027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.649366 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCATC GACAAGAATG GCAAAATAGC TGCGTTGATG AAGCGTTGAT TAATCTCAAT GTAACCGCAT TAGAAGGTAA TTCTCCCTCA GATTATTTGC TGTATTCTGA CGCACTACCT AGACGTAACG ACGGTAGGGT CAGTGATTCT ATTTTAAGAC GCTACGAACA CACAGAACAA GGAGGTTGGT GGTGCTCAGG GGTTGATGTC TTGACGGGAA ATGAGGATCT ATGGGGGTGT TTTAAACCTA ATTCTCCCCG TATTAGCCAT GATCGCCATA AACCGATTAA ATACGAACAT CCTCCCAACG CGCCAACGGG TATTTTTGCT TTGCGGGTTC CGTTGCATTT ATGGCAAGAT ATCGAGCAAT CTTACCATTG TGATCTTACC ACAGAAGATA TTAATGAGCA ACTACCCGAT TTAGGCTTTT GGCAATGGGT GATTAATCAT CCAAATATTC CCCTTTTTAT CACAGAAGGA GCGAAAAAAG CGGGAGCATT ATTAACAGCA GGATACGTTG CGATCGCCCT TCCTGGGATT AATAATGGTT ATCGAACTCC CCATGATGAA TTTGGCAACC GTATCGGGAA GTCCCGTTTA ATTCCTCAAC TGGAAAAACT GGCTATTTCT GGTAGAAAAA TCTATATTGT TTTTGATCAA GAGAGTAAAC CGAATACCAT TAAAGCAGTC AATACGGCTA TTAGAAATTT AGGCTATTTG TTCACTCAAG CAGGATGTCA AGTTAATGTC ATTACTTGGT TAGTAGAATG GGGTAAAGGG GTTGATGATT TTATTGCTAA TAAGGGACTA GATAAGTTCA AAGAAGTTTA TCAAAAAGCC TTACCGTTAG AAACCTGGAA AGCACAAGGA TTAAGTCAAT TAACCTATCC CTATGATGTA GAAGTTAATC GTCGCTATTT AGGAGAATTA GCTATTCCTA AAACGGCTCA ATTAATTGGC ATTAAATCGG CTATTGGAAC GGGAAAAACC CAAGGGTTAG AAAAAATTGT TCAAGAGGCG ATCGCCAATA ATCAAAAAGT CTTAGTCATT GGACATCGAA TTAAGTTAGT TGAACAACTT TGTCAACGGT TTCAACTTCC TTATATTACG GAAATTCAGA ATTATGATGT TACCTTGGGA TATGGATTAT GTATTGACTC ACTGCATCCT AATTCTCAAG CGAAGTTTAA TCCTGATGAG TGGGAAAATA GTTTAATTAT TATTGATGAA GTCGAACAAG TTTTATGGCA TGGCTTAAAT TCAGATACTT GTCAAAAAAA TCGAGTTTCT ATCCTTAAAT CTCTCAAAAT ATTACTACAA ACGGTTTTAG AAACGCAAGG AAAAGTGTTT ATTGCCGATG CAGACTTAAG TGATATTTCC TTAGATTATT TAATCTCTTT AACAGGAATT AATCTAAAAC CGTTTATTAT TAATAATACT TGGAAACCCA CTAATAAAGA GTCATGGACA GTTTATAACT ATCCAGAAAC TACCCCTAAA CGCTTAGTTA AAGATTTAGT CCAACATATT CAACAAGGAG GAAAACCGTT TATTTGTCTT TCCGCACAAA AATTAACCAG TAATTGGGGA ACACAAACCC TAGAATCTTA CTTAAAAAAA CAATTTCCTG ATGCTAAAAT ACTACGGATT GATTCTGAGT CTTTAACCGA TCCTAATCAT GCTGCTTACC AGTGTATTAA GCAACTTAAT GAGATTTTAT TAGATTATGA TATTGTCCTA GCCAGTCCTT CTATTGAGAC GGGAGTTAGT ATTGATATTA AAGGACATTT TACCTCAGTT 
TGGGGGTTAG CTCAAGGAGT ACAAATAGCT ACCTCGGTTT GTCAATCGTT AGGACGTATT CGGGATAATA TACCGCGTTA TCTTTGGGTT GCTTCCTATG GGTTTAATAA AATAGGAAAT GGTTCAACTT CCATACCTAA TTTGTTAACC TCTAACCATC GTGTCACACA ATTAAATGTT CGTTTGTTGC AACAATCTGA TCTAGAAGCA TTAGAGGATA TTGATACAGA ATTTCAAGCA GAATCATTGC TGTGTTGGGC AAAAATGGCA GTTCGTGTGA ATGCTTCCAT GATTCATTAT CGAGAGTCTA TTTTACGGAT ACTTGAACAA CAAAATCATC AAATTTATCC TAATACTAAG GTAATTCAAT CTTCACGAAA TAAAAACAAT CAAAATAACA ATAAAAGCGA TCAAACGTCT AACCAATTAA CCGAAGTAAT TGAAATAATT AGAGAAGAAA ACTATCAAGC AGAATGTCAA GCTATTGCCC AAGCAGAAGA ACTAACGGAT CAAAAATATC GTCATTTAAA CAAACGATTA GTTAAAACAT CCCTAGAACG TCATCAACTG AGAAAATATA ATTTACAACG ACGTTATTGC ATTCCTGTTA CCCCTGAATT AGTCGCTTTA GATAATGAAG GATGGTACCA GAAACTTAGG TTACATTATT TCCTAACAAT AGGACGCTGT TATTTAGCTG ATAGAGATAC TATTGTTGCT CAAAAATTGA TTAACAAAGG ACACGGTAGT TTATTTATTC CCGACTTTAA TGGTTGCCAA TTAGGGGCAA TTATTGGAAC GATGGAAGTT TTAGGATTGC CTGTTTTATT GTCAAATAGT CAACGGAAAT TAAAACCCGT AGATGAAGAT TTACAAACCA TGGCTAAGAT GGCTATTAAA AATCGTTCAG AGATCAAAAC TATTCTGGGA ATTGGTATTG CTAAAAACTC CAGTCCTATT ACAATTATTC GACGATTATT AGATAAAATT GGCTATGGAT TGACTTGTAT TGGTTTAGAA ACAGTCGCTA AAAAGCGGGT TCGAGTTTAT CAAGTTGTTC CTCCTAATGA TCAACGCGAA GAAGTGTTTA AACAATGGTG GTATAGGGAT GAAAATTGTC CAGGGAGTTC CGAACCCTGG TTTGAAGAAT ATACTATCGC TAAATCAAAT CTGAGTCAAA ATCAACAAGA AGGATCTAAA AATTATATTC AATTGAGTTT AGAATTGTAA
|
Protein sequence | MNHRQEWQNS CVDEALINLN VTALEGNSPS DYLLYSDALP RRNDGRVSDS ILRRYEHTEQ GGWWCSGVDV LTGNEDLWGC FKPNSPRISH DRHKPIKYEH PPNAPTGIFA LRVPLHLWQD IEQSYHCDLT TEDINEQLPD LGFWQWVINH PNIPLFITEG AKKAGALLTA GYVAIALPGI NNGYRTPHDE FGNRIGKSRL IPQLEKLAIS GRKIYIVFDQ ESKPNTIKAV NTAIRNLGYL FTQAGCQVNV ITWLVEWGKG VDDFIANKGL DKFKEVYQKA LPLETWKAQG LSQLTYPYDV EVNRRYLGEL AIPKTAQLIG IKSAIGTGKT QGLEKIVQEA IANNQKVLVI GHRIKLVEQL CQRFQLPYIT EIQNYDVTLG YGLCIDSLHP NSQAKFNPDE WENSLIIIDE VEQVLWHGLN SDTCQKNRVS ILKSLKILLQ TVLETQGKVF IADADLSDIS LDYLISLTGI NLKPFIINNT WKPTNKESWT VYNYPETTPK RLVKDLVQHI QQGGKPFICL SAQKLTSNWG TQTLESYLKK QFPDAKILRI DSESLTDPNH AAYQCIKQLN EILLDYDIVL ASPSIETGVS IDIKGHFTSV WGLAQGVQIA TSVCQSLGRI RDNIPRYLWV ASYGFNKIGN GSTSIPNLLT SNHRVTQLNV RLLQQSDLEA LEDIDTEFQA ESLLCWAKMA VRVNASMIHY RESILRILEQ QNHQIYPNTK VIQSSRNKNN QNNNKSDQTS NQLTEVIEII REENYQAECQ AIAQAEELTD QKYRHLNKRL VKTSLERHQL RKYNLQRRYC IPVTPELVAL DNEGWYQKLR LHYFLTIGRC YLADRDTIVA QKLINKGHGS LFIPDFNGCQ LGAIIGTMEV LGLPVLLSNS QRKLKPVDED LQTMAKMAIK NRSEIKTILG IGIAKNSSPI TIIRRLLDKI GYGLTCIGLE TVAKKRVRVY QVVPPNDQRE EVFKQWWYRD ENCPGSSEPW FEEYTIAKSN LSQNQQEGSK NYIQLSLEL