Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3142 |
Symbol | |
ID | 8392474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 3209615 |
End bp | 3212743 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644981087 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_003138817 |
Protein GI | 257060929 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.175068 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.39911 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACCG ACAAAACCCC TAAACCTATC AAGATGGATG AACGGAACTC TGTTGAAAAG CCGATGCTTG ATCAACTGTA CGGTTTAGGG TGGGAAATAT TAGACTTAGA GAGAACCCAA ACCCCTCAAC AAACCTTTAG GGAAAGCTTT ACCGAAGTGG TGATGCTGCC AATATTAAGG GAACAACTAA AAGTTATTAA CCCTTGGATA GAAGATGATC AAATAGAAGA AACCGTTAAA AACCTCACCG CTAACTTCCC CAGTACCAAC TTACTCGAAA ATAACCGCCA CATTTTTAAA CTCCTACAAG AAAATAGCAG GGTCTCCGAA AACCGCAACA CAGCAGAAAA AAGCCCCACC GTCCGCTTTA TCGACTTTGA AAAACCAGAA AATAACCGCT TTATTGCCGT CTGTCAGTTT AAAGTCCGCA TTCTTGGCAC AGAAAACCAC ATCATCCCCG ATATTGTCCT CTTTTTAAAC GGTTTACCCG TCGTCGTCAT CGAATGTAAG TCACCCAAAG TAAAAGAACC CATCCCCGAA GCCATTGATC AACTATTGCG CTACAGTGAA CAACGAGGGG CAAAAGGAGA AGGAAGTCAA CCCCTATTCT TTTATAACTA TTTTATCGTC GTAACCTGTC GCAACGAGGC AAAATTCGGC ACTATTACCA CCCACACCGA AAAATACTTT TATCGTTGGG CTGATCCCTA TCCTCGTACC TTAGATGACT TAGAACATGG TAGCAGTTCC CCCAACGATC AGCAGCGACT CGTAGCGGGA ATGTTAGCCC CAGAGAACCT ATTAGAAATT ATCCGCAACT TTACCCTGTT TTCGGTTAAC AACAAGGGAA AAACCATCAA AATAGTGGGA CGTTACCAAC AGTTTAGGGC CGTTAAAAAA GCTGTTGAAC GACTTATAGA AGGGAAAACA CCAAGGGAAC GAAGCGGTAT TATTTGGCAT ACCCAAGGCT CAGGTAAATC CCTTACTATG ATGTTTATGG TACGAGCCAT GTACCGCCAT CCTAACCTCT GCCAGTGGAA AATCGTCTTT ATCACCGATC GCACCCAACT AGAACAGCAA CTCGCCAAAA CAGGGCAGAA TGTGGGCTTT ACCGTCAAAC CTGCCGATAG TATCGCCAAA TTAAAGGAAT TACTCCGCAG CGACTCCTCC GATCTCGTCA TGGCGATGAT CCACAAATTC CAAGAACGAG ACTTGCAAGA AACCTTCCCA GAACTAAACC CCAGTAGTAA CATCTTGATC ATGACCGATG AGGCACACCG ATCGCAATAT GCCTTACTAG GGGCCAACTT AGATAAAGCC CTCCCCTACG CCTCTCGTAT TGGCTACACA GGAACCCCCA TTGATAAAAC CGAAAAAGTA TTCGGGGACT ATATCGATAA GTACACCATG CGCCAGTCAA TTGAGGACGG AGTGACCCTA GAAATCGTCT ATACAGGGCG AACTCACAAC GCAGAAGTAC CTGATAAGGA AGGGATGGAT AAAGCCTTTG AGGACGTTTT TAGCGAGTAT AACCTATCAG AACGATTGCA GATTTTAGGC TATGGTTCAC GGGATGCCTA CCTAGAAGCC CATTCCACCA TCGCAGCTAA AGCAAAAGAT ATGGTACAGC ATTACCTGGA TCATATCTTT CCCAATGGAT ATAAAGCCCA AGTAGTGGCT ACTTCCCGCG AGGCTGCGGT ACGGTACAAA CAACACATAG ACGAAGCGTT AGTAGAAGCG ATCGCCCAAC TGCAACAGTC CAACCCCCAC GGGCTAAACC TGGATGTATT GAAGCAAGTT AAAACCGATG TGATCATTTC TGGGGGACAT AACGACTTAC CCCACCTCAA AGCCTACAGT AAGGATAGAC AACACGAAAC CACGATCAAA AGCTTTAAAC TGCCTTTTGA CGCAGAAGAT GAAGGTATTA CGGGAGAGAT GGGTATTGTT ATCGTTAATA ATATGCTGCT GACGGGGTTT GATGCCCCAG TAGAACAGGT GATGTATTTA GATAAGGTAA TTCAGTCCCA TAACCTTCTG CAAGCGATCG CCCGTGTCAA CCGAGTAGGA GGAATAGATA AGGATAAGGG CTTTGTGGTG GATTATGTGG GTATTGGACA CCACCTCAAG GATGCCATTG ATAATTACGA TGAACGGGAG CAGAAAGAAA TTACAGAGGC TTTAAGCTTT CCAGAGGAAG AAATTCGCCA ATTAGAAGAT GATTATCGGA CTATTTTAAC CTTCTTAGCA GAATACGGAC TGACCGACTT GACGGATCAT GATGCCTTTT TTGACCTGTT TTACGACGAA GATATTCGCT TTGAGTTCAT GTTAAAGTTT AAAAACCTAA CCCGTAGCCT TAACCTGGTT TTTCCTGCAA AAGAAGCTTT AACCTACATC AAAGACTACC AAGCACTCAC CGAAGTGAAT GTATTAGCGG GTAAGCACTT CCGCGATGAA CGTCTGAGCA TGAAAGGGAT TCCTGCTAAA TTGCGAACCT TAACCGATGA ATACCTTAAA TCAAGGGGAA TTGAGGAAAA AATTAAACCC ATTTCTATCT TAGACGAACA CTTTGCCAAT GAGGTTAAAA AACACAACCG CGTTAAAACC AAGGCGGCTG AAATAGAACA CGCCCTACGT CACCATATTG ACATTGAGCT AGACGATGAT CCAGAACTGC AAGCCTCCTT CTCAGAAGCC CTTAGAGCAA TTCTAGAGGC GTTTAGGGAC AACTGGCAGA AAATTTATGA GGAACTCGAA AAGCTCCGTC AGAAGGTAAG AGAAGCGCAA AATGAGCCTA CCTACGGACT CCATAAGAAA AAACAAATGC CCTTCTTTCG GATGTTCAAA CGGGAGTGTT TTGCAGACAT GGAGTTAGAT GATCAGGCGA TCGCTCAAAT AGTAGCCTTA ACCCAAGAAG TTTTTACAGT AGTAGAAAGA GAGTTACAAT TAACAGGGTT TTGGGAGAGC ATTCCAGCCA GGAAAAAACT AGAGGCCGAG ATTCAAAAAG TGTTACTTTC CCCCGAATTT ATCAAAATAT CCAATCTAGT TAACAATCGC AAGCCGATCA TCTCACGGGT GATAGAAATA GCCGAGAAAA ATAACGACCG AATTTTGTAT GCTGAATAG
|
Protein sequence | MKTDKTPKPI KMDERNSVEK PMLDQLYGLG WEILDLERTQ TPQQTFRESF TEVVMLPILR EQLKVINPWI EDDQIEETVK NLTANFPSTN LLENNRHIFK LLQENSRVSE NRNTAEKSPT VRFIDFEKPE NNRFIAVCQF KVRILGTENH IIPDIVLFLN GLPVVVIECK SPKVKEPIPE AIDQLLRYSE QRGAKGEGSQ PLFFYNYFIV VTCRNEAKFG TITTHTEKYF YRWADPYPRT LDDLEHGSSS PNDQQRLVAG MLAPENLLEI IRNFTLFSVN NKGKTIKIVG RYQQFRAVKK AVERLIEGKT PRERSGIIWH TQGSGKSLTM MFMVRAMYRH PNLCQWKIVF ITDRTQLEQQ LAKTGQNVGF TVKPADSIAK LKELLRSDSS DLVMAMIHKF QERDLQETFP ELNPSSNILI MTDEAHRSQY ALLGANLDKA LPYASRIGYT GTPIDKTEKV FGDYIDKYTM RQSIEDGVTL EIVYTGRTHN AEVPDKEGMD KAFEDVFSEY NLSERLQILG YGSRDAYLEA HSTIAAKAKD MVQHYLDHIF PNGYKAQVVA TSREAAVRYK QHIDEALVEA IAQLQQSNPH GLNLDVLKQV KTDVIISGGH NDLPHLKAYS KDRQHETTIK SFKLPFDAED EGITGEMGIV IVNNMLLTGF DAPVEQVMYL DKVIQSHNLL QAIARVNRVG GIDKDKGFVV DYVGIGHHLK DAIDNYDERE QKEITEALSF PEEEIRQLED DYRTILTFLA EYGLTDLTDH DAFFDLFYDE DIRFEFMLKF KNLTRSLNLV FPAKEALTYI KDYQALTEVN VLAGKHFRDE RLSMKGIPAK LRTLTDEYLK SRGIEEKIKP ISILDEHFAN EVKKHNRVKT KAAEIEHALR HHIDIELDDD PELQASFSEA LRAILEAFRD NWQKIYEELE KLRQKVREAQ NEPTYGLHKK KQMPFFRMFK RECFADMELD DQAIAQIVAL TQEVFTVVER ELQLTGFWES IPARKKLEAE IQKVLLSPEF IKISNLVNNR KPIISRVIEI AEKNNDRILY AE
|
| |