Gene PCC8801_2218 details


Gene Information

Locus tag: PCC8801_2218
Symbol:
ID: 7102462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 2292282
End bp: 2295209
Gene Length: 2928 bp
Protein Length: 975 aa
Translation table: 11
GC content: 34%
IMG OID: 643475273
Product: type I site-specific deoxyribonuclease, HsdR family
Protein accession: YP_002372402
Protein GI: 218247031
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
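
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The constant and function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 2292282
END_BP = 2295209
GENE_LENGTH_BP = 2928
PROTEIN_LENGTH_AA = 975


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: 2295209 - 2292282 + 1 = 2928 bp, and 2928 / 3 - 1 = 975 aa.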


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTAAAC CAACAGAATC AAAAACCGTC CAAGATCGCA TTTTAACCTA TGCCCAAGAA 
ATGACCCCAC AATGGCGTTA TGTATCCCGT AGCGAAGCAG AAACCAGAAG GGGATTTAAT
AACATTGACA ATAATGATAT TCAAGTTCAA GCCCAACAAG CCTCTTTATA TTTCGATGAT
CTACTTTATC AAAAAATTCA ACAATTTAAT CCTACCTACA ACGAAACTCA ACAAGAACTA
ATCACAAAAT TAAATAGCCT TCCTACAGAT ATTTATGGAA ATCGTGACTT TCTTAACTAT
CTTCGCAATC AAGTTAAATA TTTTTGTCCT GTTGAAAAGC GAGAACTTGA CTTAACCCTA
ATAAACTACG AAGATCCCAC TCAAAATGAA TACGAAGTTA CCGAAGAATA TTATACCCAT
AATAGAAAAG ATGGCATCAG AGAAGATATT GTCTTTTTAA TTAATGGGAT TCCCATCCTT
GTCATTGAAT GTAAAAATGC TGATAAAATT GAAGGGATTG CATTAGGAAT AGACCAAATA
AGACGCTATC ACCGAGAAGC CCCTGAGTTA TTTGTCCCTC AAATGTTATT TACAGCTACC
GAAGCTATTG GCTTTTCTTA TGGGGTAACA TGGAATTTAG TCAGGAGAAA TATTTTTAAC
TGGAAAGATG AACAAATTGG ACAACTCGAA AACAAAATCA AAACCTTTTG CCATCCCTAT
ATTCTCCTAA AATTCTTACT AAATTATATT ATCTTTGCTG AAAAAGACGA AACTCTCCAA
AAATTTATCC TCAAACAACA TCAAACTATT GCCATTGAAA AAGTTATCCA AAGATGTCAT
GATACTGAAA AATCACGGGG ATTAGTTTGG CATACACAGG GAAGCGGTAA AACCTTCACC
ATGATTAAAA TTGCCGAGAT GCTATTTAAA GCCCCAGATA GTGAAAAACC CACCATTATT
TTAATAATAG ATCGCAATGA ACTCCAAGAT CAACTATTGC GAAATCTCAA TAACTTAGGG
GTTAATAATA TTCGTCATGC TAACCGTATT AAAACCTTAA TTGAACTGCT AGAAAATGAC
TATCGGGGCA TCATTATCAC CATGATTCAT AAATTCCGAG AAATGCCCAC TGATGTTAAT
CTTAGAAATA ATATTTATGT CCTAATTGAT GAGGCACACC GCACCACAGG AGGAGACTTA
GGAACATATT TAATGGCAGG TTTACCCCAT GCAACTATTA TCGGCTTTAC AGGCACACCT
ATAGACAAAA CCAATCAAGG AAAAGGAACA TTTAAAACCT TTGGCACAGA TGACGAAAAA
GGCTATTTAC ATAAATATTC TATCGCTGAA AGTATCGAAG ACGGCACAAC TTTACCCCTT
TATTATAATC TTGCTCCCAA TGAAATGTTA GTTCCTGCTG AGATTATGGA TCAAGAATTT
CTTGACTTAG TAGAAACAGA AGGCATTAAT GATATTGCAG AATTAAATAA AATTTTAGAT
CGCGCTGTCA ACTTAAAAAA CTTCCTGAAA GGCGATCAAA GAGTCGATCA AGTTGCCAAA
TATGTCGCCC AACATTACAC CAAAAATGTT GAACCATTAG GATATAAAGC CTTTTTAGTC
GCCGTAGATC GTCCTGCCTG TGCTAAATAT AAGCAAGCAT TAGATCGTTA TCTACCCCTG
GAATATTCTG CCGTTGTTTA TACAGGGAAT AATAACGATA CAGAAGACCT TAAAACCCAT
CATATTGATG ATAAAACCGA AAAACAAATC AGAAAAAACT TTGCTAAATT TGGAGAATAT
CCTAAAATCT TAATTGTTAC CGAAAAACTC TTAACAGGAT ACGATGCACC GATTTTATAT
GCCATGTATC TTGATAAACC CATGCGAGAT CATACTTTAT TACAAGCGAT CGCTAGAGTC
AATCGTCCCT ATGAAAACGA AACAGAAGAA ATGGTTAAAC CCCATGGTTT TGTCTTAGAT
TTTGTCGGCA TTTTTGATAA ATTAGAAAAA GCTTTATCCT TTGACAGTCA AGAAGTCAAT
GCTATTGTTA AAGATTTAAG TTTACTGAAA AACTTATTTA AAATTAAAAT AGAGCAGTTA
ATCAAAAATT ATTTAACACT GATTCAACAT AATTTTAATG ATCAAGATGT CGATCATTTA
CTCGAATATT TTCGAGATAA GGAACGACGA AAAGCCTTTA CAAAAGATTA TAAATCCTTA
GAAATGCTCT ATGAAGTCAT TTCCCCCGAT GCCTTTTTAC GCCCCTATCT CAATGAGTAC
GGAACACTCT CTGGTATTTA TCAAGTGATT CGTAATGCTT ACAGTAAAAG AGTTTATGTA
GATCGAGAAG TCAAGCGAAA AACCGACCAA ATTGTTCAAA ATAATATTGC AACAACAGCA
ATTCCGACTG TAACCGATTT TATTGAAATT AATGCTCAAA CCATTGAAAC TATTCAAAAT
AAAGGAGGTG GTAAAACAAC TAAAGTTATT AACTTAATCA AAAGTATTGA AAAAACAGCA
GAGGAAAATA ATGATGATCC TTTCTTAATT GCAATGGTAC AGAGAGCTAA AGCGATTCAA
GAACAATTTG AAAATCGCCA AACAGATACC CAAGAAACCC TTGATTTACT GCTTCAAGCC
GTTAGAGAAA ACGAACAACG TAAGCAAGAA CAATCTGCCA AAGGATTTGA TAGTTTAAGT
TTTTTTGTTT ATCAAGCTCT AGAAAATGCA GGAATTGATA ACCCTGAAGA TATGGCTCAA
GAAATTAGAC AACACTTTAT TGAAAATCCG AACTGGAAAA CCAGTGAAGG CGAATTAAGA
GAATTAAGAA AAAATGTAAC TTTCTCCATC TATACAGAAA TAGATGAATT AGAAAAAGTC
ACAGCCCTTG TTGAGCAACT ATTTACCCTA TTACAACAAA ATCACTAA
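
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be stripped of whitespace before it is analysed. The sketch below shows one way to do that and to compute a GC fraction like the one in the GC content field. The helper names are hypothetical, and the example uses only the first line of the sequence, so its result is not expected to match the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10 bases).
first_line = "ATGCCTAAAC CAACAGAATC AAAAACCGTC CAAGATCGCA TTTTAACCTA TGCCCAAGAA"
seq = clean_sequence(first_line)

if __name__ == "__main__":
    print(len(seq), round(gc_content(seq), 2))
```

For this 60-base fragment, 24 bases are G or C, giving a GC fraction of 0.40. Passing the full cleaned sequence to `gc_content` gives the figure to compare with the GC content field.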
 
Protein sequence
MPKPTESKTV QDRILTYAQE MTPQWRYVSR SEAETRRGFN NIDNNDIQVQ AQQASLYFDD 
LLYQKIQQFN PTYNETQQEL ITKLNSLPTD IYGNRDFLNY LRNQVKYFCP VEKRELDLTL
INYEDPTQNE YEVTEEYYTH NRKDGIREDI VFLINGIPIL VIECKNADKI EGIALGIDQI
RRYHREAPEL FVPQMLFTAT EAIGFSYGVT WNLVRRNIFN WKDEQIGQLE NKIKTFCHPY
ILLKFLLNYI IFAEKDETLQ KFILKQHQTI AIEKVIQRCH DTEKSRGLVW HTQGSGKTFT
MIKIAEMLFK APDSEKPTII LIIDRNELQD QLLRNLNNLG VNNIRHANRI KTLIELLEND
YRGIIITMIH KFREMPTDVN LRNNIYVLID EAHRTTGGDL GTYLMAGLPH ATIIGFTGTP
IDKTNQGKGT FKTFGTDDEK GYLHKYSIAE SIEDGTTLPL YYNLAPNEML VPAEIMDQEF
LDLVETEGIN DIAELNKILD RAVNLKNFLK GDQRVDQVAK YVAQHYTKNV EPLGYKAFLV
AVDRPACAKY KQALDRYLPL EYSAVVYTGN NNDTEDLKTH HIDDKTEKQI RKNFAKFGEY
PKILIVTEKL LTGYDAPILY AMYLDKPMRD HTLLQAIARV NRPYENETEE MVKPHGFVLD
FVGIFDKLEK ALSFDSQEVN AIVKDLSLLK NLFKIKIEQL IKNYLTLIQH NFNDQDVDHL
LEYFRDKERR KAFTKDYKSL EMLYEVISPD AFLRPYLNEY GTLSGIYQVI RNAYSKRVYV
DREVKRKTDQ IVQNNIATTA IPTVTDFIEI NAQTIETIQN KGGGKTTKVI NLIKSIEKTA
EENNDDPFLI AMVQRAKAIQ EQFENRQTDT QETLDLLLQA VRENEQRKQE QSAKGFDSLS
FFVYQALENA GIDNPEDMAQ EIRQHFIENP NWKTSEGELR ELRKNVTFSI YTEIDELEKV
TALVEQLFTL LQQNH
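
The protein sequence follows from the gene sequence by codon-wise translation. The record lists translation table 11 (the bacterial, archaeal and plant plastid code), whose codon assignments are the same as the standard code; the two differ in which codons may act as start codons. This gene begins with ATG, so a plain standard-code lookup is enough here. The following is a minimal sketch with hypothetical names, not the pipeline that produced this record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G), shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence.
    print(translate("ATGCCTAAAC CAACAGAATC AAAAACCGTC"))
```

The first 30 bases translate to MPKPTESKTV, which matches the first block of the protein sequence. The final codons CAA AAT CAC TAA translate to QNH followed by a stop, which matches the end of the protein.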