Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_2218 |
Symbol | |
ID | 7102462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 2292282 |
End bp | 2295209 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643475273 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_002372402 |
Protein GI | 218247031 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAAAC CAACAGAATC AAAAACCGTC CAAGATCGCA TTTTAACCTA TGCCCAAGAA ATGACCCCAC AATGGCGTTA TGTATCCCGT AGCGAAGCAG AAACCAGAAG GGGATTTAAT AACATTGACA ATAATGATAT TCAAGTTCAA GCCCAACAAG CCTCTTTATA TTTCGATGAT CTACTTTATC AAAAAATTCA ACAATTTAAT CCTACCTACA ACGAAACTCA ACAAGAACTA ATCACAAAAT TAAATAGCCT TCCTACAGAT ATTTATGGAA ATCGTGACTT TCTTAACTAT CTTCGCAATC AAGTTAAATA TTTTTGTCCT GTTGAAAAGC GAGAACTTGA CTTAACCCTA ATAAACTACG AAGATCCCAC TCAAAATGAA TACGAAGTTA CCGAAGAATA TTATACCCAT AATAGAAAAG ATGGCATCAG AGAAGATATT GTCTTTTTAA TTAATGGGAT TCCCATCCTT GTCATTGAAT GTAAAAATGC TGATAAAATT GAAGGGATTG CATTAGGAAT AGACCAAATA AGACGCTATC ACCGAGAAGC CCCTGAGTTA TTTGTCCCTC AAATGTTATT TACAGCTACC GAAGCTATTG GCTTTTCTTA TGGGGTAACA TGGAATTTAG TCAGGAGAAA TATTTTTAAC TGGAAAGATG AACAAATTGG ACAACTCGAA AACAAAATCA AAACCTTTTG CCATCCCTAT ATTCTCCTAA AATTCTTACT AAATTATATT ATCTTTGCTG AAAAAGACGA AACTCTCCAA AAATTTATCC TCAAACAACA TCAAACTATT GCCATTGAAA AAGTTATCCA AAGATGTCAT GATACTGAAA AATCACGGGG ATTAGTTTGG CATACACAGG GAAGCGGTAA AACCTTCACC ATGATTAAAA TTGCCGAGAT GCTATTTAAA GCCCCAGATA GTGAAAAACC CACCATTATT TTAATAATAG ATCGCAATGA ACTCCAAGAT CAACTATTGC GAAATCTCAA TAACTTAGGG GTTAATAATA TTCGTCATGC TAACCGTATT AAAACCTTAA TTGAACTGCT AGAAAATGAC TATCGGGGCA TCATTATCAC CATGATTCAT AAATTCCGAG AAATGCCCAC TGATGTTAAT CTTAGAAATA ATATTTATGT CCTAATTGAT GAGGCACACC GCACCACAGG AGGAGACTTA GGAACATATT TAATGGCAGG TTTACCCCAT GCAACTATTA TCGGCTTTAC AGGCACACCT ATAGACAAAA CCAATCAAGG AAAAGGAACA TTTAAAACCT TTGGCACAGA TGACGAAAAA GGCTATTTAC ATAAATATTC TATCGCTGAA AGTATCGAAG ACGGCACAAC TTTACCCCTT TATTATAATC TTGCTCCCAA TGAAATGTTA GTTCCTGCTG AGATTATGGA TCAAGAATTT CTTGACTTAG TAGAAACAGA AGGCATTAAT GATATTGCAG AATTAAATAA AATTTTAGAT CGCGCTGTCA ACTTAAAAAA CTTCCTGAAA GGCGATCAAA GAGTCGATCA AGTTGCCAAA TATGTCGCCC AACATTACAC CAAAAATGTT GAACCATTAG GATATAAAGC CTTTTTAGTC GCCGTAGATC GTCCTGCCTG TGCTAAATAT AAGCAAGCAT TAGATCGTTA TCTACCCCTG GAATATTCTG CCGTTGTTTA TACAGGGAAT AATAACGATA CAGAAGACCT TAAAACCCAT CATATTGATG ATAAAACCGA AAAACAAATC AGAAAAAACT TTGCTAAATT TGGAGAATAT CCTAAAATCT TAATTGTTAC CGAAAAACTC TTAACAGGAT ACGATGCACC GATTTTATAT GCCATGTATC TTGATAAACC CATGCGAGAT CATACTTTAT TACAAGCGAT CGCTAGAGTC AATCGTCCCT ATGAAAACGA AACAGAAGAA ATGGTTAAAC CCCATGGTTT TGTCTTAGAT TTTGTCGGCA TTTTTGATAA ATTAGAAAAA GCTTTATCCT TTGACAGTCA AGAAGTCAAT GCTATTGTTA AAGATTTAAG TTTACTGAAA AACTTATTTA AAATTAAAAT AGAGCAGTTA ATCAAAAATT ATTTAACACT GATTCAACAT AATTTTAATG ATCAAGATGT CGATCATTTA CTCGAATATT TTCGAGATAA GGAACGACGA AAAGCCTTTA CAAAAGATTA TAAATCCTTA GAAATGCTCT ATGAAGTCAT TTCCCCCGAT GCCTTTTTAC GCCCCTATCT CAATGAGTAC GGAACACTCT CTGGTATTTA TCAAGTGATT CGTAATGCTT ACAGTAAAAG AGTTTATGTA GATCGAGAAG TCAAGCGAAA AACCGACCAA ATTGTTCAAA ATAATATTGC AACAACAGCA ATTCCGACTG TAACCGATTT TATTGAAATT AATGCTCAAA CCATTGAAAC TATTCAAAAT AAAGGAGGTG GTAAAACAAC TAAAGTTATT AACTTAATCA AAAGTATTGA AAAAACAGCA GAGGAAAATA ATGATGATCC TTTCTTAATT GCAATGGTAC AGAGAGCTAA AGCGATTCAA GAACAATTTG AAAATCGCCA AACAGATACC CAAGAAACCC TTGATTTACT GCTTCAAGCC GTTAGAGAAA ACGAACAACG TAAGCAAGAA CAATCTGCCA AAGGATTTGA TAGTTTAAGT TTTTTTGTTT ATCAAGCTCT AGAAAATGCA GGAATTGATA ACCCTGAAGA TATGGCTCAA GAAATTAGAC AACACTTTAT TGAAAATCCG AACTGGAAAA CCAGTGAAGG CGAATTAAGA GAATTAAGAA AAAATGTAAC TTTCTCCATC TATACAGAAA TAGATGAATT AGAAAAAGTC ACAGCCCTTG TTGAGCAACT ATTTACCCTA TTACAACAAA ATCACTAA
|
Protein sequence | MPKPTESKTV QDRILTYAQE MTPQWRYVSR SEAETRRGFN NIDNNDIQVQ AQQASLYFDD LLYQKIQQFN PTYNETQQEL ITKLNSLPTD IYGNRDFLNY LRNQVKYFCP VEKRELDLTL INYEDPTQNE YEVTEEYYTH NRKDGIREDI VFLINGIPIL VIECKNADKI EGIALGIDQI RRYHREAPEL FVPQMLFTAT EAIGFSYGVT WNLVRRNIFN WKDEQIGQLE NKIKTFCHPY ILLKFLLNYI IFAEKDETLQ KFILKQHQTI AIEKVIQRCH DTEKSRGLVW HTQGSGKTFT MIKIAEMLFK APDSEKPTII LIIDRNELQD QLLRNLNNLG VNNIRHANRI KTLIELLEND YRGIIITMIH KFREMPTDVN LRNNIYVLID EAHRTTGGDL GTYLMAGLPH ATIIGFTGTP IDKTNQGKGT FKTFGTDDEK GYLHKYSIAE SIEDGTTLPL YYNLAPNEML VPAEIMDQEF LDLVETEGIN DIAELNKILD RAVNLKNFLK GDQRVDQVAK YVAQHYTKNV EPLGYKAFLV AVDRPACAKY KQALDRYLPL EYSAVVYTGN NNDTEDLKTH HIDDKTEKQI RKNFAKFGEY PKILIVTEKL LTGYDAPILY AMYLDKPMRD HTLLQAIARV NRPYENETEE MVKPHGFVLD FVGIFDKLEK ALSFDSQEVN AIVKDLSLLK NLFKIKIEQL IKNYLTLIQH NFNDQDVDHL LEYFRDKERR KAFTKDYKSL EMLYEVISPD AFLRPYLNEY GTLSGIYQVI RNAYSKRVYV DREVKRKTDQ IVQNNIATTA IPTVTDFIEI NAQTIETIQN KGGGKTTKVI NLIKSIEKTA EENNDDPFLI AMVQRAKAIQ EQFENRQTDT QETLDLLLQA VRENEQRKQE QSAKGFDSLS FFVYQALENA GIDNPEDMAQ EIRQHFIENP NWKTSEGELR ELRKNVTFSI YTEIDELEKV TALVEQLFTL LQQNH
|
| |