Gene Information |
Locus tag | Cyan8802_2208 |
Symbol | |
ID | 8391525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 2216990 |
End bp | 2220646 |
Gene Length | 3657 bp |
Protein Length | 1218 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644980180 |
Product | hypothetical protein |
Protein accession | YP_003137924 |
Protein GI | 257060036 |
COG category | [V] Defense mechanisms |
COG ID | [COG1002] Type II restriction enzyme, methylase subunits |
TIGRFAM ID | |
|
|
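The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2216990
END_BP = 2220646


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 3657 1218
```

Both results match the Gene Length (3657 bp) and Protein Length (1218 aa) fields in the record.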
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00220427 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.956264 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATACAT CAAGACTCAA AAAGTTCGCC CAGTATGCCC GTCGTTATCT CAGAGAACAG GTAGCGACTA AGTTAGAAGT GGTCTTAGCT CAAAATAGTG CAGCAAGACG GGAAAATCCT GAAGCTATCC GTAAGTTAAC CGAAAATATC GAAGTTTCAG GGAAAGAACA GGTTATTGAA CAGGTTGCCT ATACTTGGTT TAACCGTTTT TGTGCGTTGC GGTTTATGGA TAGTAACCGT TATAACTCCA TTGGGGTTGT GTCTCCTGCT GAGGGGCAAT TTCAGCCTGA AATCTTGGCT GAGGCGAAAA TGGGGCATAT TGATGAGGAA ATGATAACTA AACACAGAAA GCAGATTTTT GATTTATTAA GCGGTAATGC ACCGAGTTCT GATGCTCAAG GGGAGGCCTA TCGTTTATTA ATTGTCGGGG TATGTAATTA TTATCATAAG GTGATGGATT ATTTGTTTGA GCCAATTGAT AATTATACGG AGTTATTAAT GCCTGATGAT TTGCTGTCGG GTAATTCAAT TTTGGCTTAT ACTCGTGAGG CAATGACTCC TGAAAATTGT GAGAGTGTGG AGGTAATTGG GTGGTTATAT CAGTTTTATA TTTCTGAGAA GAAAGATGAG GTTTTTGAGG CTCTTAAAAA GAATAAAAAA ATCATACCCG AAAATATTCC GGCAGCGACT CAGTTATTTA CTCCTCATTG GATTGTGCGT TATTTAGTAG AAAATTCTTT GGGGCATTTA TGGATGTTAA ATCGTCCTCA GTCTCGTTTA ATTGAGATGA TGGATTATTA TGTGGATAGT GGGAAGTGGG AAGTGGGAAG TGGGGAGTCT AGTTTAGGAG GTGAAGATGA TGACCGTCAG AAATTATCAG GAGTTAATCG TTTGGCAGAA AGCGATGGAC TTAGTGGAGT TAGTTTATCA GGCGACAAAA ATGTTTCCGA AAGAGGAACT TTACGGTCTG ACCAATCAAG TGAGACGGGC AGTTGTTTCG ATTCCGTCAA ACATAGCGGA GGGTCAAACC CGTCAATCGA CGGCGGAATT CAAGCGTTTT CTCTCAATAG CGCAGGGGTC GAGAGCAGAG GTAGAAACTC AAATTATGAT AGCTCAGAGA CTCAAATATC TCAGTCAGGA GGAGATGGAA CAAATTCTCA ATCTGGCCGA GGAAATCAAG CGAATGATTT ATGGACTAAC AGCCAAACTA AAATAACTTC CCACTCTCCA CTTCCTACTT CCTACTTAAA AATAAATTCT CCTGAAGATT TAAAAATTTG TGATCCTGCT TGTGGTTCGG GACATATTTT AGTCTATGCT TTTGAGTTAC TTTATGCTAT TTATGAGGAG GAAGGTTATG CAGTAAATGA GATTCCTAGC AAGATTTTAA CCCATAATCT CTATGGTATG GAAATTGATG AACGGGCAGG AAGTTTGGCA GCTTTTGCAT TAACTATGAA GGCAAGGGAG AAGCAAATAC GGTTTTTTCG TAAGCCGATT CAGCCTCATA TTTGTGTGTT GAAAAAGGTT GAGTTTGAGG AGTGGGAAGT GGGGAGTGGG AAGTGGGAAG TGGATAGTGA GAAGTTGGGG GTAAGTCGTG ATGATTTATT ACATGATTTA CATTTGTTTA AAGAAGCGGA TAATTTTGGG GCTTTGTTAC GTCCAAAAAT GTCTGAAAAC CAAATAGCTA ATTTAAGAGA TTACTTTGCT AATTTATGGA AAAAAATTCC TAATCCTTCT TTGTTTGAAC ATAAAACCCA TGAGAAGGTT ATGGATGTGT TAAAACAGGC TGATTTTTTA 
AGCCCTAAGT ATCACATTGT TGTTGCAAAT CCGCCTTATA TGGGTAATAA GGGTATGAAT AATCGGTTAA AGGCTTTTTT ACAGGATAAT TATAGTAATG TAAAATCTGA TTTATTTTCT GCTTTTATGA TTCGTATTTT AGAAATGACT TTACAAAAAG GGGAAATGGG TTTTGTTACT CCCTATGTTT GGATGTTTAT TTCTTCTTAT GAAAAATTAA GAACTTTGAT TTTAGAGAAA ACGACAATAA CTAGCTTAAT TCAATTAGAA TATAATGCTT TTGCTCCTGC TTGTATTCCA GTTGCTACGT TTACCCTTTC TAATCAAAAT TTACCTAATT TTAAAGGGGG ATATATTAAA TTATCCGATT TTAGAGGTGC AGATAATCAA GCACCGAAGG CATTAGAAGC GATAAAAAAT CCTAATTGTG GTTGGTTTTA TCGGGCTTCT GCAAGTGATT TTAAGAAGAT TCCGGGGAGT GCGATCGCTT ATTGGGTTAG TGATAGTATT CGCATCTCTT TTTCTAAATA TAAATCTATT TCTGAGCAGT ATACAGTCAA GTCGGGAATT ATGACAGGCA ATGATGATAC TTTTCTGAAG TTTTGGTTTG AGGTTAAAAT AAGTCAGATC GGTTTTAATT TGATTTCTTA TGATGAAATG AAAAGTTATT GGTATCCAAT AAGTAAAGGA GGAAATTTTA GGAAATGGTA TGGAAATAAT GAACACATTA TTAACTTAAG AGATGATGCT TATGATATTC GTAATGGAGG AGGAAACTTT AGATTAAGAG AGAAAAATTT ATATTTTAGA CCTTATATAA CTTGGTCACG CATTACTTCA TCACAAGTTG CCTTTAGATT TTCACAAGGT GGTATTCTTT TTTCAGATGC CGGACCAGGA ATTTTTGCAG AAAATGATTG TCAGAAAATA ATAACTTTTC TTAATACAAA ACTGAGTAAT TACTTTTTAG CCTGTATTAA TCCCACCCTT AATTACCAAA CCCGTGATAT TGAATCATTA CCATGTATAG ATATCCAAAA AGATGTTCAG ACTCCTTATT TAATAAAAAC CACAAAATCA GACTGGGATT CTTACGAAAC ATCATGGGAT TTTACCACCC TTCCGCTTTT AAGAGTGGGA AGTGAACAGT GGGAAGTAGA GAGTGGAGAG TGGGAAGTGA ATAGAAAAGA GGGAACAGTG GTGAATAGTA ATGAAACAAT AGATAGTAAG CAAAAAACAG TTGATAGTAA TGAAACAATA GATAGTGAAC CGTTAATAGT TGATAATAGA AATGAAGAAA AGACTCCCCA CTTCCCACTC CCCACTTCTC ACTCAAAACT CCCCACTTCC CACTCCCCAC TTCTCACTCA AAACTCCCCA CTTCCCACTC CCCACTTCTC ACTCAAAACT CCCCACTTCC CACTCCCCAC TTCTCACTCA AAACTCCCCA CTCCCCACTC CCCACTTCCC ACTCCCCACT TCTCACTCAA AACTCCCCAC TCCCCACTCC CCACTTCCCA CTCCCCACTT CTCACTCAAA ACTCCCCACT CCCCACTCCC CACTTCCCAC TCCCCACTTC TCACTCAAAA CTCCCCACTC CCCACTCCCC ACTTCCCACT CCCCACTTCT CACTCAAAAC TCCCCACTCC CCACTCCCCA CTTCCCACTC CCCACTTCTC ACTCAAAACT CCCCACTCCC CACTCCCCAC TTCCCACTCC CCACTTCTCA CTCAAAACTC CCCACTCCCC ACTCCCCACT TCCCACTTCC CACTCCCCTA AAAGAAACCT ATCAAACCCT GCGCCAACAA 
TGGCAAGAAA TGACCTTAGA AATGCAGAAA TTAGAAGAAG AAAATAA
|
Protein sequence | MDTSRLKKFA QYARRYLREQ VATKLEVVLA QNSAARRENP EAIRKLTENI EVSGKEQVIE QVAYTWFNRF CALRFMDSNR YNSIGVVSPA EGQFQPEILA EAKMGHIDEE MITKHRKQIF DLLSGNAPSS DAQGEAYRLL IVGVCNYYHK VMDYLFEPID NYTELLMPDD LLSGNSILAY TREAMTPENC ESVEVIGWLY QFYISEKKDE VFEALKKNKK IIPENIPAAT QLFTPHWIVR YLVENSLGHL WMLNRPQSRL IEMMDYYVDS GKWEVGSGES SLGGEDDDRQ KLSGVNRLAE SDGLSGVSLS GDKNVSERGT LRSDQSSETG SCFDSVKHSG GSNPSIDGGI QAFSLNSAGV ESRGRNSNYD SSETQISQSG GDGTNSQSGR GNQANDLWTN SQTKITSHSP LPTSYLKINS PEDLKICDPA CGSGHILVYA FELLYAIYEE EGYAVNEIPS KILTHNLYGM EIDERAGSLA AFALTMKARE KQIRFFRKPI QPHICVLKKV EFEEWEVGSG KWEVDSEKLG VSRDDLLHDL HLFKEADNFG ALLRPKMSEN QIANLRDYFA NLWKKIPNPS LFEHKTHEKV MDVLKQADFL SPKYHIVVAN PPYMGNKGMN NRLKAFLQDN YSNVKSDLFS AFMIRILEMT LQKGEMGFVT PYVWMFISSY EKLRTLILEK TTITSLIQLE YNAFAPACIP VATFTLSNQN LPNFKGGYIK LSDFRGADNQ APKALEAIKN PNCGWFYRAS ASDFKKIPGS AIAYWVSDSI RISFSKYKSI SEQYTVKSGI MTGNDDTFLK FWFEVKISQI GFNLISYDEM KSYWYPISKG GNFRKWYGNN EHIINLRDDA YDIRNGGGNF RLREKNLYFR PYITWSRITS SQVAFRFSQG GILFSDAGPG IFAENDCQKI ITFLNTKLSN YFLACINPTL NYQTRDIESL PCIDIQKDVQ TPYLIKTTKS DWDSYETSWD FTTLPLLRVG SEQWEVESGE WEVNRKEGTV VNSNETIDSK QKTVDSNETI DSEPLIVDNR NEEKTPHFPL PTSHSKLPTS HSPLLTQNSP LPTPHFSLKT PHFPLPTSHS KLPTPHSPLP TPHFSLKTPH SPLPTSHSPL LTQNSPLPTP HFPLPTSHSK LPTPHSPLPT PHFSLKTPHS PLPTSHSPLL TQNSPLPTPH FPLPTSHSKL PTPHSPLPTS HSPKRNLSNP APTMARNDLR NAEIRRRK
|
| |
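The sequences above are shown in blocks of ten characters separated by spaces. The sketch below is one way to turn such a blocked string into a contiguous sequence, compute its GC fraction, and derive the reverse complement. The function names are hypothetical. The short example string is just the first two blocks of the gene sequence, not the full record. The record lists the strand as "-", and the displayed gene sequence begins with ATG and ends with TAA, so it appears to be given in coding orientation. On that assumption, the reverse complement would correspond to the forward strand of the replicon.

```python
def join_blocks(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in ("G", "C") for base in seq) / len(seq)


# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence made of A, C, G and T."""
    return seq.translate(_COMPLEMENT)[::-1]


# First two blocks of the gene sequence above, used only as a small example.
example = join_blocks("ATGGATACAT CAAGACTCAA")
print(len(example))                 # 20
print(gc_fraction(example))         # 0.35
print(reverse_complement(example))
```

Applied to the full gene sequence, `len(join_blocks(...))` can be compared with the Gene Length field, and `gc_fraction` with the GC content field.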