Gene Information |
Locus tag | Cyan8802_2281 |
Symbol | |
ID | 8391601 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 2297106 |
End bp | 2300066 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 644980253 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_003137995 |
Protein GI | 257060107 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
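
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(2297106, 2300066)
print(length)                     # 2961
print(protein_length_aa(length))  # 986
```

Under those assumptions the computed values agree with the Gene Length (2961 bp) and Protein Length (986 aa) rows.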
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000865916 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTAAAC CAACAGAATC AAAAACCGTC CAAGATCGCA TTTTAACCTA TGCCCAAGAA ATGACCCCAC AATGGCGTTA TGTATCCCGT AGCGAAGCAG AAACCAGAAG GGGATTTAAT AACTTGCCCC CTAACCCCCA AACGGGGGGA ATTGAAGGGG GCGATTACTC CCCAAGATTG GGGTCGGGGG TTGCCTCCTT ATATTTCGAT GATTTACTTT ACCAAAAAGT TAGACAATTT AATCCTACCT ACAATGAAAC TCAACAAGAA CTAATCACAA AATTAAATAG CCTTCCTACA GATATTTATG GAAATCGTGA CTTTCTTAAC TATCTTCGCA ATCAAGTTAA ATATTTTTGT CCAGTTGAAA AGCGAGAACT TGACTTAACC CTAATAAACT ACGAAGATCC CACTCAAAAT GAATACGAAG TTACCGAAGA ATATTATACC CATAATGGAA AAGATGGCAT CAGAGAAGAT ATTGTCTTTT TAATTAATGG GATTCCCATC CTTGTCATTG AATGTAAAAA TGCTGATAAA ATTGAAGGGA TTGCATTAGG AATAGACCAA ATAAGACGCT ATCACCGAGA AGCCCCTGAG TTATTTGTCC CTCAAATGTT ATTTACAGCT ACCGAAGCTA TTGGCTTTTC TTATGGGGTA ACATGGAATT TAGTCAGGAG AAATATTTTT AACTGGAAAG ATGAACAAAT TGGACAACTC GAAAACAAAA TCAAAACCTT TTGCCATCCC TATATTCTCC TAAAATTCTT ACTAAATTAT ATTATCTTTG CTGAAAAAGA CGAAACTCTC CAAAAATTTA TCCTCAAACA ACATCAAACT ATTGCCATTG AAAAAGTTAT CCAAAGATGT CATGATACTG AAAAATCACG GGGATTAGTT TGGCATACAC AGGGAAGCGG TAAAACCTTC ACCATGATTA AAATTGCCGA GATGCTATTT AAAGCCCCAG ATAGTGAAAA ACCCACCATT ATTTTAATAA TAGATCGCAA TGAACTCCAA GATCAACTAT TGCGAAATCT CAATAACTTA GGGGTTAATA ATATTCGTCA TGCTAACCGT ATTAAAACCT TAATTGAACT GCTAGAAAAT GACTATCGGG GCATCATTAT CACCATGATT CATAAATTCC GAGAAATGCC CACTGATGTT AATCTTAGAA ATAATATTTA TGTCCTAATT GATGAGGCAC ACCGCACCAC AGGAGGAGAC TTAGGAACAT ATTTAATGGC AGGTTTACCC CATGCAACTA TTATCGGCTT TACAGGCACA CCTATAGACA AAACCAATCA AGGAAAAGGA ACATTTAAAA CCTTTGGCAC AGATGACGAA AAAGGCTATT TACATAAATA TTCTATCGCT GAAAGTATCG AAGACGGCAC AACCTTACCC CTTTATTATA ATCTTGCTCC CAATGAAATG TTAGTTCCTG CTGAGATTAT GGATCAAGAA TTTCTTGACT TAGTAGAAAC AGAAGGCATT AATGATATTG CAGAATTAAA TAAAATTTTA GATCGCGCTG TCAACTTAAA AAACTTCCTC AAAGGCGATC AAAGAGTCGA TCAAGTTGCC AAATATGTCG CCCAACATTA CACCAAAAAT GTTGAACCAT TAGGATATAA AGCCTTTTTA GTCGCCGTAG ATCGTCCTGC CTGTGCTAAA TATAAGCAAG CATTAGATCG TTATCTACCC CTGGAATATT CTGCCGTTGT TTATACAGGG AATAATAACG ATACAGAAGA CCTTAAAACC CATCATATTG ATGATAAAAC CGAAAAACAA 
ATTAGAAAAA ACTTTGCCAA ATTTGGAGAA TATCCTAAAA TCTTAATTGT TACCGAAAAA CTCTTAACAG GATACGATGC ACCTATTTTA TATGCAATGT ATCTTGATAA ACCCATGCGA GATCATACTT TATTACAAGC GATCGCTAGA GTCAATCGTC CCTATGAAAA CGAAACAGAA GAAATGGTTA AACCCCATGG TTTTGTCTTA GATTTTGTCG GCATTTTTGA TAAATTAGAA AAAGCTTTAT CCTTTGACAG TCAAGAAGTC AATGCTATTG TTAAAGATTT AAGTTTACTG AAAAACTTAT TTAAAATTAA AATAGAGCAG TTAATCAAAA ATTATTTAAC ACTGATTCAA CATAATTTTA ATGATCAAGA TGTCGATCAT TTACTCGAAT ATTTTCGAGA TAAGGAACGA CGAAAAGCCT TTACAAAAGA TTATAAATCC TTAGAAATGC TCTATGAAGT CATTTCCCCC GATGCCTTTT TACGCCCCTA TCTCAATGAG TACGGAACAC TCTCTGGTAT TTATCAAGTG ATTCGTAATG CTTACAGTAA AAGAGTTTAT GTAGATCGAG AAGTCAAGCG AAAAACCGAC AAAATTGTTC AAAATAATAT TGCAACAACA GCAATTCCGA CTGTAACCGA TTTTATTGAA ATTAATGCTC AAACCATTGA AACTATTCAA AATAAAGGAG GTGGTAAAAC AACTAAAGTT ATTAACTTAA TCAAAAGTAT TGAAAAAACA GCAGAGGAAA ATAATGATGA TCCTTTCTTA ATTGCAATGG TACAGAGAGC TAAAGCGATT CAAGAACGAT TTGAAAATCG CCAAACAGAT ACCCAAGAAA CCCTTGATTT ACTGCTTCAA GCCGTTAGAG AAAACGAACA ACGTAAGCAA GAACAATCTG CCAAGGGATT TGATAGTTTA AGTTTTTTTG TTTATCAAGC TCTAGAAAAT GCAGGAATTG ATAACCCTGA AGATATGGCT CAAGAAATTA GACAACACTT TATTGAAAAT CCGAACTGGA AAACCAGTGA AGGCGAATTA AGAGAATTAA GAAAAAATGT AACTTTCTCC ATCTATACAG AAATAGATGA ATTAGAAAAA GTCACAGCCC TTGTTGAGCA ACTATTTACC CTATTACAAC AAAATCACTA A
Protein sequence | MPKPTESKTV QDRILTYAQE MTPQWRYVSR SEAETRRGFN NLPPNPQTGG IEGGDYSPRL GSGVASLYFD DLLYQKVRQF NPTYNETQQE LITKLNSLPT DIYGNRDFLN YLRNQVKYFC PVEKRELDLT LINYEDPTQN EYEVTEEYYT HNGKDGIRED IVFLINGIPI LVIECKNADK IEGIALGIDQ IRRYHREAPE LFVPQMLFTA TEAIGFSYGV TWNLVRRNIF NWKDEQIGQL ENKIKTFCHP YILLKFLLNY IIFAEKDETL QKFILKQHQT IAIEKVIQRC HDTEKSRGLV WHTQGSGKTF TMIKIAEMLF KAPDSEKPTI ILIIDRNELQ DQLLRNLNNL GVNNIRHANR IKTLIELLEN DYRGIIITMI HKFREMPTDV NLRNNIYVLI DEAHRTTGGD LGTYLMAGLP HATIIGFTGT PIDKTNQGKG TFKTFGTDDE KGYLHKYSIA ESIEDGTTLP LYYNLAPNEM LVPAEIMDQE FLDLVETEGI NDIAELNKIL DRAVNLKNFL KGDQRVDQVA KYVAQHYTKN VEPLGYKAFL VAVDRPACAK YKQALDRYLP LEYSAVVYTG NNNDTEDLKT HHIDDKTEKQ IRKNFAKFGE YPKILIVTEK LLTGYDAPIL YAMYLDKPMR DHTLLQAIAR VNRPYENETE EMVKPHGFVL DFVGIFDKLE KALSFDSQEV NAIVKDLSLL KNLFKIKIEQ LIKNYLTLIQ HNFNDQDVDH LLEYFRDKER RKAFTKDYKS LEMLYEVISP DAFLRPYLNE YGTLSGIYQV IRNAYSKRVY VDREVKRKTD KIVQNNIATT AIPTVTDFIE INAQTIETIQ NKGGGKTTKV INLIKSIEKT AEENNDDPFL IAMVQRAKAI QERFENRQTD TQETLDLLLQ AVRENEQRKQ EQSAKGFDSL SFFVYQALEN AGIDNPEDMA QEIRQHFIEN PNWKTSEGEL RELRKNVTFS IYTEIDELEK VTALVEQLFT LLQQNH
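
The relationship between the two sequence rows can be illustrated with a small translation sketch. It assumes the codon assignments of NCBI translation table 11 (the table listed in this record) and strips the spaces that group the sequence into 10-letter blocks. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Table 11's alternative start codons are not handled, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Amino acids for NCBI translation table 11, with codons enumerated in T, C, A, G order.
AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), AAS)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(dna.count(b) for b in "GC") / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCCTAAAC CAACAGAATC AAAAACCGTC"))  # MPKPTESKTV
```

The printed residues match the first block of the protein sequence row. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content row.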