Gene Cyan8802_2281 details

Gene Information

Locus tag: Cyan8802_2281
Symbol:
ID: 8391601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 2297106
End bp: 2300066
Gene Length: 2961 bp
Protein Length: 986 aa
Translation table: 11
GC content: 35%
IMG OID: 644980253
Product: type I site-specific deoxyribonuclease, HsdR family
Protein accession: YP_003137995
Protein GI: 257060107
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
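
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2297106, 2300066)
print(length, protein_length(length))
```

Under these assumptions the coordinates give 2961 bp and 986 aa, which agrees with the Gene Length and Protein Length fields.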


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000865916
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCTAAAC CAACAGAATC AAAAACCGTC CAAGATCGCA TTTTAACCTA TGCCCAAGAA 
ATGACCCCAC AATGGCGTTA TGTATCCCGT AGCGAAGCAG AAACCAGAAG GGGATTTAAT
AACTTGCCCC CTAACCCCCA AACGGGGGGA ATTGAAGGGG GCGATTACTC CCCAAGATTG
GGGTCGGGGG TTGCCTCCTT ATATTTCGAT GATTTACTTT ACCAAAAAGT TAGACAATTT
AATCCTACCT ACAATGAAAC TCAACAAGAA CTAATCACAA AATTAAATAG CCTTCCTACA
GATATTTATG GAAATCGTGA CTTTCTTAAC TATCTTCGCA ATCAAGTTAA ATATTTTTGT
CCAGTTGAAA AGCGAGAACT TGACTTAACC CTAATAAACT ACGAAGATCC CACTCAAAAT
GAATACGAAG TTACCGAAGA ATATTATACC CATAATGGAA AAGATGGCAT CAGAGAAGAT
ATTGTCTTTT TAATTAATGG GATTCCCATC CTTGTCATTG AATGTAAAAA TGCTGATAAA
ATTGAAGGGA TTGCATTAGG AATAGACCAA ATAAGACGCT ATCACCGAGA AGCCCCTGAG
TTATTTGTCC CTCAAATGTT ATTTACAGCT ACCGAAGCTA TTGGCTTTTC TTATGGGGTA
ACATGGAATT TAGTCAGGAG AAATATTTTT AACTGGAAAG ATGAACAAAT TGGACAACTC
GAAAACAAAA TCAAAACCTT TTGCCATCCC TATATTCTCC TAAAATTCTT ACTAAATTAT
ATTATCTTTG CTGAAAAAGA CGAAACTCTC CAAAAATTTA TCCTCAAACA ACATCAAACT
ATTGCCATTG AAAAAGTTAT CCAAAGATGT CATGATACTG AAAAATCACG GGGATTAGTT
TGGCATACAC AGGGAAGCGG TAAAACCTTC ACCATGATTA AAATTGCCGA GATGCTATTT
AAAGCCCCAG ATAGTGAAAA ACCCACCATT ATTTTAATAA TAGATCGCAA TGAACTCCAA
GATCAACTAT TGCGAAATCT CAATAACTTA GGGGTTAATA ATATTCGTCA TGCTAACCGT
ATTAAAACCT TAATTGAACT GCTAGAAAAT GACTATCGGG GCATCATTAT CACCATGATT
CATAAATTCC GAGAAATGCC CACTGATGTT AATCTTAGAA ATAATATTTA TGTCCTAATT
GATGAGGCAC ACCGCACCAC AGGAGGAGAC TTAGGAACAT ATTTAATGGC AGGTTTACCC
CATGCAACTA TTATCGGCTT TACAGGCACA CCTATAGACA AAACCAATCA AGGAAAAGGA
ACATTTAAAA CCTTTGGCAC AGATGACGAA AAAGGCTATT TACATAAATA TTCTATCGCT
GAAAGTATCG AAGACGGCAC AACCTTACCC CTTTATTATA ATCTTGCTCC CAATGAAATG
TTAGTTCCTG CTGAGATTAT GGATCAAGAA TTTCTTGACT TAGTAGAAAC AGAAGGCATT
AATGATATTG CAGAATTAAA TAAAATTTTA GATCGCGCTG TCAACTTAAA AAACTTCCTC
AAAGGCGATC AAAGAGTCGA TCAAGTTGCC AAATATGTCG CCCAACATTA CACCAAAAAT
GTTGAACCAT TAGGATATAA AGCCTTTTTA GTCGCCGTAG ATCGTCCTGC CTGTGCTAAA
TATAAGCAAG CATTAGATCG TTATCTACCC CTGGAATATT CTGCCGTTGT TTATACAGGG
AATAATAACG ATACAGAAGA CCTTAAAACC CATCATATTG ATGATAAAAC CGAAAAACAA
ATTAGAAAAA ACTTTGCCAA ATTTGGAGAA TATCCTAAAA TCTTAATTGT TACCGAAAAA
CTCTTAACAG GATACGATGC ACCTATTTTA TATGCAATGT ATCTTGATAA ACCCATGCGA
GATCATACTT TATTACAAGC GATCGCTAGA GTCAATCGTC CCTATGAAAA CGAAACAGAA
GAAATGGTTA AACCCCATGG TTTTGTCTTA GATTTTGTCG GCATTTTTGA TAAATTAGAA
AAAGCTTTAT CCTTTGACAG TCAAGAAGTC AATGCTATTG TTAAAGATTT AAGTTTACTG
AAAAACTTAT TTAAAATTAA AATAGAGCAG TTAATCAAAA ATTATTTAAC ACTGATTCAA
CATAATTTTA ATGATCAAGA TGTCGATCAT TTACTCGAAT ATTTTCGAGA TAAGGAACGA
CGAAAAGCCT TTACAAAAGA TTATAAATCC TTAGAAATGC TCTATGAAGT CATTTCCCCC
GATGCCTTTT TACGCCCCTA TCTCAATGAG TACGGAACAC TCTCTGGTAT TTATCAAGTG
ATTCGTAATG CTTACAGTAA AAGAGTTTAT GTAGATCGAG AAGTCAAGCG AAAAACCGAC
AAAATTGTTC AAAATAATAT TGCAACAACA GCAATTCCGA CTGTAACCGA TTTTATTGAA
ATTAATGCTC AAACCATTGA AACTATTCAA AATAAAGGAG GTGGTAAAAC AACTAAAGTT
ATTAACTTAA TCAAAAGTAT TGAAAAAACA GCAGAGGAAA ATAATGATGA TCCTTTCTTA
ATTGCAATGG TACAGAGAGC TAAAGCGATT CAAGAACGAT TTGAAAATCG CCAAACAGAT
ACCCAAGAAA CCCTTGATTT ACTGCTTCAA GCCGTTAGAG AAAACGAACA ACGTAAGCAA
GAACAATCTG CCAAGGGATT TGATAGTTTA AGTTTTTTTG TTTATCAAGC TCTAGAAAAT
GCAGGAATTG ATAACCCTGA AGATATGGCT CAAGAAATTA GACAACACTT TATTGAAAAT
CCGAACTGGA AAACCAGTGA AGGCGAATTA AGAGAATTAA GAAAAAATGT AACTTTCTCC
ATCTATACAG AAATAGATGA ATTAGAAAAA GTCACAGCCC TTGTTGAGCA ACTATTTACC
CTATTACAAC AAAATCACTA A
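
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence is measured or analyzed. The following is a rough Python sketch of that cleanup together with a GC-percentage calculation. The helper names are hypothetical, and how the database rounds its reported GC content is not stated in this record.

```python
def clean_sequence(block: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence; paste the full block to
# compare against the GC content field of the record.
first_line = "ATGCCTAAAC CAACAGAATC AAAAACCGTC CAAGATCGCA TTTTAACCTA TGCCCAAGAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```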
 
Protein sequence
MPKPTESKTV QDRILTYAQE MTPQWRYVSR SEAETRRGFN NLPPNPQTGG IEGGDYSPRL 
GSGVASLYFD DLLYQKVRQF NPTYNETQQE LITKLNSLPT DIYGNRDFLN YLRNQVKYFC
PVEKRELDLT LINYEDPTQN EYEVTEEYYT HNGKDGIRED IVFLINGIPI LVIECKNADK
IEGIALGIDQ IRRYHREAPE LFVPQMLFTA TEAIGFSYGV TWNLVRRNIF NWKDEQIGQL
ENKIKTFCHP YILLKFLLNY IIFAEKDETL QKFILKQHQT IAIEKVIQRC HDTEKSRGLV
WHTQGSGKTF TMIKIAEMLF KAPDSEKPTI ILIIDRNELQ DQLLRNLNNL GVNNIRHANR
IKTLIELLEN DYRGIIITMI HKFREMPTDV NLRNNIYVLI DEAHRTTGGD LGTYLMAGLP
HATIIGFTGT PIDKTNQGKG TFKTFGTDDE KGYLHKYSIA ESIEDGTTLP LYYNLAPNEM
LVPAEIMDQE FLDLVETEGI NDIAELNKIL DRAVNLKNFL KGDQRVDQVA KYVAQHYTKN
VEPLGYKAFL VAVDRPACAK YKQALDRYLP LEYSAVVYTG NNNDTEDLKT HHIDDKTEKQ
IRKNFAKFGE YPKILIVTEK LLTGYDAPIL YAMYLDKPMR DHTLLQAIAR VNRPYENETE
EMVKPHGFVL DFVGIFDKLE KALSFDSQEV NAIVKDLSLL KNLFKIKIEQ LIKNYLTLIQ
HNFNDQDVDH LLEYFRDKER RKAFTKDYKS LEMLYEVISP DAFLRPYLNE YGTLSGIYQV
IRNAYSKRVY VDREVKRKTD KIVQNNIATT AIPTVTDFIE INAQTIETIQ NKGGGKTTKV
INLIKSIEKT AEENNDDPFL IAMVQRAKAI QERFENRQTD TQETLDLLLQ AVRENEQRKQ
EQSAKGFDSL SFFVYQALEN AGIDNPEDMA QEIRQHFIEN PNWKTSEGEL RELRKNVTFS
IYTEIDELEK VTALVEQLFT LLQQNH
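
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal Python sketch of that translation, written without external libraries. It assumes the amino acid assignments of table 11 are the same as those of the standard code (the two tables differ in which start codons are permitted), reads from the first base, and stops at the first stop codon. The names used are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(block: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGCCTAAAC CAACAGAATC AAAAACCGTC"))  # MPKPTESKTV
```

The output matches the first 10 residues of the protein sequence in this record. The sketch does not handle ambiguous bases such as N, which would raise a KeyError.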