Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3992 |
Symbol | |
ID | 8393342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 4105115 |
End bp | 4108069 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644981916 |
Product | type III restriction protein res subunit |
Protein accession | YP_003139630 |
Protein GI | 257061742 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.208178 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACTT CAGAAAAAAA CTTTGAAAGC GACATCCAAG CCTATTTACT CGAAAATGGC TACCATTCCC GTACATCTAA CGATTATGAC AAAAAACTGT GTCTAATTCC CCAAGATGTC CTTAACTTCA TTAACGCGAC TCAACCCCAA GAATGGCAAA AATATCAAAC CCAATATGGT GACGACGCGC AAACAAAACT CCTACAACGC CTCGCCCAAC AAATCAAAAA ACGGGGTACA GTCGAGATAT TAAGAAAAGG ATTAAAAGCC AACGGCTGTA AATTTAAACT CGCTTATTTT CGCCCCAGTA CAGCCTTAAA CGACGAAACT CAACGCCTTT ATCAAGGTAA TTTCTTTAGT CTGATTCGCC AATTTTACTA CAGCGAGAAA AATCAAAATA GTATTGATTT AGCTATCTTT CTCAACGGAC TTCCCCTCTT CACCTGTGAA CTAAAAAACT CCTTTAAAGG GCAAACCGTC GAAAATGCGA TTAAACAGTA CAGAAATGAC CGTGAACCCC GTGAAACCGT TCTCAGCTTC GGTGTCTGTC TCTCTCACTT CGCCGTAGAC CCCAATTTAG TCTATATGAC TACCCACCTA CAGGGTAAAA AGACTAAGTT TCTTCCCTTC AACCAAGGAC GAGATAACGG ACATGGCAAC CCCCCATCAG CCTTAAGCTA TCCTACTGCT TACCTTTGGC AACAAATTTG GCAAAAAGAC AGTATCTTAA ACCTGATTCA AAACTTTATC ACCCAATACG AAGAAGAAGA CGATAAAGGC AACAAAACAG GAGAGAAAAA ACTCATCTTC CCTCGCTATC ATCAACTAGA TACCATTAAT CGCCTGATAA ACCATGCAAA AGACCATAAA ACCGGTCAAA AATACCTAAT TCAACATAGC GCAGGAAGCG GAAAAAGTAA TACTATTGCT TGGTTGGCAC ATCAACTCGT CAGCCTTCAT GAAAGGGAAG ATAACCGAGT TTTTGATAGT ATTCTGGTGA TAACCGATAG AAAAGCCCTA GATAAACAAC TACAACGTAA TTTAAAGCAA TTTGAAACCA CTTCCGGTGT AGTGGAAAAT ATTGATAAAA CCTCTCGACA ACTCAAAGAA GCCTTAGAAA ACGGCAAAAA TATTATTGTC ACCACCCTAC AGAAATTCCC CGGCGTTATT GACCAGATAA ACAGCTTAAA AGGTCAAAAA TTCGCTATTA TCATCGATGA AGCCCACTCA TCCCAAACAG GGGAAAATAG TCGCCAACTG AAAACCGTCT TAAGTACCCA AACCCTAGAA GAAGCAGAAA CACAAGAACA GGACATAGAA GACTATATAG AAGATAGAAT CGAAGAAGCC GCCCGAACCA GAGGAAATTT ACCCAATTTA AGTTATTTTG CCTTTACCGC TACCCCCAAA CCCAAAACCT TAGAATTATT CGGCATAAAA CAACCTGATG GCACATTTAA ACCCTGTAGC CTGTATTCTA TGCGCCAAGC GATAGAAGAA GGGTTTATAC TCGATGTTTT GCAAAATTAC ACCACCTATC AAACCTATTT TAGTCTGCTG AAAACTGTTG AAAACGACCC CCACTACGAC AGAAACAAAG CCGGAAGACT CCTCAGAAAC TTTGTTGACC TTCATCCCCA CAATATTAAC GCAAAAGTCG CTATTATCGC GGAGCATTTC CATAATAACG TTGCTCATCA AATTAATAAT CAAGCTAAGG CGATGATAGT CACCCGTTCT CGTCTTCACG CTGTTAGATA TAAACTCGCT TTAGATAACT ATTTACGAGA AAATGGCTAC CCGTATCAAT CCTTAGTCGC CTTTACAGGT ACAGTCAAGG ACGGGGGAAG AGACTTCACC GAAACAGGGA TGAATACCGC TTCATCTGGG GTTTCTATCC CAGAAAAAGC CACCGCAGAC ACTTTTAATC AGAATCTCTA TAAATTTCTG ATTGTGGCGA ATAAATTCCA AACCGGATTT AACCAACCCT TATTAACAGC GATGTATGTT GATAAAAAAT TGGGGGGTGT GAATGCCGTC CAAACCTTAT CCCGTCTTAA TCGTACCTAT TCCCAGAAAG AAAGTACCGT GATTTTAGAT TTCGCCAATG AGATTGACGT TATACAATCT GCCTTTGAGA ATTATTACGA TAGAACCGTA TTAAGCCAAG AAACGGATGT TAACCTTGTC TATGATATTC AGCAACAGCT AGACGATTAT GACTTCTATA CAGCATCCGA TATAACCGAT TTTGCTCAAA TTTACTTTAA TCCCAAAGCT ACACAAGACC GACTACATAG TATTTTAATG CCAGTCATTG ACCGCTATCA AGAAGCAACA GAAGCCGAAC AATTTAGCTT TAGAAATAAG CTAAAAGACT TTATCAGACT CTATCGCTTC ATAGGGCAAC TTATTGGCTG TCCTGACTCA GAATTAGAAC AATTTTATGA ATTTGCTCGT CATTTAGCCC CTAAATTACC CTTTGCACAG CAACAATTAC CCCTAGAAGT TCAACAAAAT ATCGAACTGT CTCAATATCG TATCCAACGG ACTTATACGG GACAAATTGA CCTAAAACGA GGAGAAAGAC AACTTGACCC CATTATCGCC GCCGGGACAG GAAATCCTCC AGTAGAAGAC AGAGAACCCT TATCCGTAAT TATTGAACAG CTTAATCAAC AATTTGGTAC AAATTTCACC GAAGATGAAC AAGTTTTCAT CGAACAGCTA GAACATAAAT TAGATAACAG CGACTCCTTA CAAGCCAGTT TAAAGATCAA TTCCCTAGAA AATGTACGAT TAACTTTTAA TAATCTGACT AATGAATTTA TGCAGGAAAT GATAGAATCT AATTTCAATT TTTATAAGCA TTTTAACGAT GATAGTGAGT TTGCCAATCT GTTATTAAAT TGGCTGTTTC AACGCTTCTT AGAGAGACAG CAAAGTAATA GTTAA
|
Protein sequence | MNTSEKNFES DIQAYLLENG YHSRTSNDYD KKLCLIPQDV LNFINATQPQ EWQKYQTQYG DDAQTKLLQR LAQQIKKRGT VEILRKGLKA NGCKFKLAYF RPSTALNDET QRLYQGNFFS LIRQFYYSEK NQNSIDLAIF LNGLPLFTCE LKNSFKGQTV ENAIKQYRND REPRETVLSF GVCLSHFAVD PNLVYMTTHL QGKKTKFLPF NQGRDNGHGN PPSALSYPTA YLWQQIWQKD SILNLIQNFI TQYEEEDDKG NKTGEKKLIF PRYHQLDTIN RLINHAKDHK TGQKYLIQHS AGSGKSNTIA WLAHQLVSLH EREDNRVFDS ILVITDRKAL DKQLQRNLKQ FETTSGVVEN IDKTSRQLKE ALENGKNIIV TTLQKFPGVI DQINSLKGQK FAIIIDEAHS SQTGENSRQL KTVLSTQTLE EAETQEQDIE DYIEDRIEEA ARTRGNLPNL SYFAFTATPK PKTLELFGIK QPDGTFKPCS LYSMRQAIEE GFILDVLQNY TTYQTYFSLL KTVENDPHYD RNKAGRLLRN FVDLHPHNIN AKVAIIAEHF HNNVAHQINN QAKAMIVTRS RLHAVRYKLA LDNYLRENGY PYQSLVAFTG TVKDGGRDFT ETGMNTASSG VSIPEKATAD TFNQNLYKFL IVANKFQTGF NQPLLTAMYV DKKLGGVNAV QTLSRLNRTY SQKESTVILD FANEIDVIQS AFENYYDRTV LSQETDVNLV YDIQQQLDDY DFYTASDITD FAQIYFNPKA TQDRLHSILM PVIDRYQEAT EAEQFSFRNK LKDFIRLYRF IGQLIGCPDS ELEQFYEFAR HLAPKLPFAQ QQLPLEVQQN IELSQYRIQR TYTGQIDLKR GERQLDPIIA AGTGNPPVED REPLSVIIEQ LNQQFGTNFT EDEQVFIEQL EHKLDNSDSL QASLKINSLE NVRLTFNNLT NEFMQEMIES NFNFYKHFND DSEFANLLLN WLFQRFLERQ QSNS
|
| |