Gene Cyan8802_3142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3142 
Symbol 
ID8392474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp3209615 
End bp3212743 
Gene Length3129 bp 
Protein Length1042 aa 
Translation table11 
GC content44% 
IMG OID644981087 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_003138817 
Protein GI257060929 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.175068 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.39911 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCG ACAAAACCCC TAAACCTATC AAGATGGATG AACGGAACTC TGTTGAAAAG 
CCGATGCTTG ATCAACTGTA CGGTTTAGGG TGGGAAATAT TAGACTTAGA GAGAACCCAA
ACCCCTCAAC AAACCTTTAG GGAAAGCTTT ACCGAAGTGG TGATGCTGCC AATATTAAGG
GAACAACTAA AAGTTATTAA CCCTTGGATA GAAGATGATC AAATAGAAGA AACCGTTAAA
AACCTCACCG CTAACTTCCC CAGTACCAAC TTACTCGAAA ATAACCGCCA CATTTTTAAA
CTCCTACAAG AAAATAGCAG GGTCTCCGAA AACCGCAACA CAGCAGAAAA AAGCCCCACC
GTCCGCTTTA TCGACTTTGA AAAACCAGAA AATAACCGCT TTATTGCCGT CTGTCAGTTT
AAAGTCCGCA TTCTTGGCAC AGAAAACCAC ATCATCCCCG ATATTGTCCT CTTTTTAAAC
GGTTTACCCG TCGTCGTCAT CGAATGTAAG TCACCCAAAG TAAAAGAACC CATCCCCGAA
GCCATTGATC AACTATTGCG CTACAGTGAA CAACGAGGGG CAAAAGGAGA AGGAAGTCAA
CCCCTATTCT TTTATAACTA TTTTATCGTC GTAACCTGTC GCAACGAGGC AAAATTCGGC
ACTATTACCA CCCACACCGA AAAATACTTT TATCGTTGGG CTGATCCCTA TCCTCGTACC
TTAGATGACT TAGAACATGG TAGCAGTTCC CCCAACGATC AGCAGCGACT CGTAGCGGGA
ATGTTAGCCC CAGAGAACCT ATTAGAAATT ATCCGCAACT TTACCCTGTT TTCGGTTAAC
AACAAGGGAA AAACCATCAA AATAGTGGGA CGTTACCAAC AGTTTAGGGC CGTTAAAAAA
GCTGTTGAAC GACTTATAGA AGGGAAAACA CCAAGGGAAC GAAGCGGTAT TATTTGGCAT
ACCCAAGGCT CAGGTAAATC CCTTACTATG ATGTTTATGG TACGAGCCAT GTACCGCCAT
CCTAACCTCT GCCAGTGGAA AATCGTCTTT ATCACCGATC GCACCCAACT AGAACAGCAA
CTCGCCAAAA CAGGGCAGAA TGTGGGCTTT ACCGTCAAAC CTGCCGATAG TATCGCCAAA
TTAAAGGAAT TACTCCGCAG CGACTCCTCC GATCTCGTCA TGGCGATGAT CCACAAATTC
CAAGAACGAG ACTTGCAAGA AACCTTCCCA GAACTAAACC CCAGTAGTAA CATCTTGATC
ATGACCGATG AGGCACACCG ATCGCAATAT GCCTTACTAG GGGCCAACTT AGATAAAGCC
CTCCCCTACG CCTCTCGTAT TGGCTACACA GGAACCCCCA TTGATAAAAC CGAAAAAGTA
TTCGGGGACT ATATCGATAA GTACACCATG CGCCAGTCAA TTGAGGACGG AGTGACCCTA
GAAATCGTCT ATACAGGGCG AACTCACAAC GCAGAAGTAC CTGATAAGGA AGGGATGGAT
AAAGCCTTTG AGGACGTTTT TAGCGAGTAT AACCTATCAG AACGATTGCA GATTTTAGGC
TATGGTTCAC GGGATGCCTA CCTAGAAGCC CATTCCACCA TCGCAGCTAA AGCAAAAGAT
ATGGTACAGC ATTACCTGGA TCATATCTTT CCCAATGGAT ATAAAGCCCA AGTAGTGGCT
ACTTCCCGCG AGGCTGCGGT ACGGTACAAA CAACACATAG ACGAAGCGTT AGTAGAAGCG
ATCGCCCAAC TGCAACAGTC CAACCCCCAC GGGCTAAACC TGGATGTATT GAAGCAAGTT
AAAACCGATG TGATCATTTC TGGGGGACAT AACGACTTAC CCCACCTCAA AGCCTACAGT
AAGGATAGAC AACACGAAAC CACGATCAAA AGCTTTAAAC TGCCTTTTGA CGCAGAAGAT
GAAGGTATTA CGGGAGAGAT GGGTATTGTT ATCGTTAATA ATATGCTGCT GACGGGGTTT
GATGCCCCAG TAGAACAGGT GATGTATTTA GATAAGGTAA TTCAGTCCCA TAACCTTCTG
CAAGCGATCG CCCGTGTCAA CCGAGTAGGA GGAATAGATA AGGATAAGGG CTTTGTGGTG
GATTATGTGG GTATTGGACA CCACCTCAAG GATGCCATTG ATAATTACGA TGAACGGGAG
CAGAAAGAAA TTACAGAGGC TTTAAGCTTT CCAGAGGAAG AAATTCGCCA ATTAGAAGAT
GATTATCGGA CTATTTTAAC CTTCTTAGCA GAATACGGAC TGACCGACTT GACGGATCAT
GATGCCTTTT TTGACCTGTT TTACGACGAA GATATTCGCT TTGAGTTCAT GTTAAAGTTT
AAAAACCTAA CCCGTAGCCT TAACCTGGTT TTTCCTGCAA AAGAAGCTTT AACCTACATC
AAAGACTACC AAGCACTCAC CGAAGTGAAT GTATTAGCGG GTAAGCACTT CCGCGATGAA
CGTCTGAGCA TGAAAGGGAT TCCTGCTAAA TTGCGAACCT TAACCGATGA ATACCTTAAA
TCAAGGGGAA TTGAGGAAAA AATTAAACCC ATTTCTATCT TAGACGAACA CTTTGCCAAT
GAGGTTAAAA AACACAACCG CGTTAAAACC AAGGCGGCTG AAATAGAACA CGCCCTACGT
CACCATATTG ACATTGAGCT AGACGATGAT CCAGAACTGC AAGCCTCCTT CTCAGAAGCC
CTTAGAGCAA TTCTAGAGGC GTTTAGGGAC AACTGGCAGA AAATTTATGA GGAACTCGAA
AAGCTCCGTC AGAAGGTAAG AGAAGCGCAA AATGAGCCTA CCTACGGACT CCATAAGAAA
AAACAAATGC CCTTCTTTCG GATGTTCAAA CGGGAGTGTT TTGCAGACAT GGAGTTAGAT
GATCAGGCGA TCGCTCAAAT AGTAGCCTTA ACCCAAGAAG TTTTTACAGT AGTAGAAAGA
GAGTTACAAT TAACAGGGTT TTGGGAGAGC ATTCCAGCCA GGAAAAAACT AGAGGCCGAG
ATTCAAAAAG TGTTACTTTC CCCCGAATTT ATCAAAATAT CCAATCTAGT TAACAATCGC
AAGCCGATCA TCTCACGGGT GATAGAAATA GCCGAGAAAA ATAACGACCG AATTTTGTAT
GCTGAATAG
 
Protein sequence
MKTDKTPKPI KMDERNSVEK PMLDQLYGLG WEILDLERTQ TPQQTFRESF TEVVMLPILR 
EQLKVINPWI EDDQIEETVK NLTANFPSTN LLENNRHIFK LLQENSRVSE NRNTAEKSPT
VRFIDFEKPE NNRFIAVCQF KVRILGTENH IIPDIVLFLN GLPVVVIECK SPKVKEPIPE
AIDQLLRYSE QRGAKGEGSQ PLFFYNYFIV VTCRNEAKFG TITTHTEKYF YRWADPYPRT
LDDLEHGSSS PNDQQRLVAG MLAPENLLEI IRNFTLFSVN NKGKTIKIVG RYQQFRAVKK
AVERLIEGKT PRERSGIIWH TQGSGKSLTM MFMVRAMYRH PNLCQWKIVF ITDRTQLEQQ
LAKTGQNVGF TVKPADSIAK LKELLRSDSS DLVMAMIHKF QERDLQETFP ELNPSSNILI
MTDEAHRSQY ALLGANLDKA LPYASRIGYT GTPIDKTEKV FGDYIDKYTM RQSIEDGVTL
EIVYTGRTHN AEVPDKEGMD KAFEDVFSEY NLSERLQILG YGSRDAYLEA HSTIAAKAKD
MVQHYLDHIF PNGYKAQVVA TSREAAVRYK QHIDEALVEA IAQLQQSNPH GLNLDVLKQV
KTDVIISGGH NDLPHLKAYS KDRQHETTIK SFKLPFDAED EGITGEMGIV IVNNMLLTGF
DAPVEQVMYL DKVIQSHNLL QAIARVNRVG GIDKDKGFVV DYVGIGHHLK DAIDNYDERE
QKEITEALSF PEEEIRQLED DYRTILTFLA EYGLTDLTDH DAFFDLFYDE DIRFEFMLKF
KNLTRSLNLV FPAKEALTYI KDYQALTEVN VLAGKHFRDE RLSMKGIPAK LRTLTDEYLK
SRGIEEKIKP ISILDEHFAN EVKKHNRVKT KAAEIEHALR HHIDIELDDD PELQASFSEA
LRAILEAFRD NWQKIYEELE KLRQKVREAQ NEPTYGLHKK KQMPFFRMFK RECFADMELD
DQAIAQIVAL TQEVFTVVER ELQLTGFWES IPARKKLEAE IQKVLLSPEF IKISNLVNNR
KPIISRVIEI AEKNNDRILY AE