Gene Cyan8802_3992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3992 
Symbol 
ID8393342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4105115 
End bp4108069 
Gene Length2955 bp 
Protein Length984 aa 
Translation table11 
GC content39% 
IMG OID644981916 
Producttype III restriction protein res subunit 
Protein accessionYP_003139630 
Protein GI257061742 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.208178 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACTT CAGAAAAAAA CTTTGAAAGC GACATCCAAG CCTATTTACT CGAAAATGGC 
TACCATTCCC GTACATCTAA CGATTATGAC AAAAAACTGT GTCTAATTCC CCAAGATGTC
CTTAACTTCA TTAACGCGAC TCAACCCCAA GAATGGCAAA AATATCAAAC CCAATATGGT
GACGACGCGC AAACAAAACT CCTACAACGC CTCGCCCAAC AAATCAAAAA ACGGGGTACA
GTCGAGATAT TAAGAAAAGG ATTAAAAGCC AACGGCTGTA AATTTAAACT CGCTTATTTT
CGCCCCAGTA CAGCCTTAAA CGACGAAACT CAACGCCTTT ATCAAGGTAA TTTCTTTAGT
CTGATTCGCC AATTTTACTA CAGCGAGAAA AATCAAAATA GTATTGATTT AGCTATCTTT
CTCAACGGAC TTCCCCTCTT CACCTGTGAA CTAAAAAACT CCTTTAAAGG GCAAACCGTC
GAAAATGCGA TTAAACAGTA CAGAAATGAC CGTGAACCCC GTGAAACCGT TCTCAGCTTC
GGTGTCTGTC TCTCTCACTT CGCCGTAGAC CCCAATTTAG TCTATATGAC TACCCACCTA
CAGGGTAAAA AGACTAAGTT TCTTCCCTTC AACCAAGGAC GAGATAACGG ACATGGCAAC
CCCCCATCAG CCTTAAGCTA TCCTACTGCT TACCTTTGGC AACAAATTTG GCAAAAAGAC
AGTATCTTAA ACCTGATTCA AAACTTTATC ACCCAATACG AAGAAGAAGA CGATAAAGGC
AACAAAACAG GAGAGAAAAA ACTCATCTTC CCTCGCTATC ATCAACTAGA TACCATTAAT
CGCCTGATAA ACCATGCAAA AGACCATAAA ACCGGTCAAA AATACCTAAT TCAACATAGC
GCAGGAAGCG GAAAAAGTAA TACTATTGCT TGGTTGGCAC ATCAACTCGT CAGCCTTCAT
GAAAGGGAAG ATAACCGAGT TTTTGATAGT ATTCTGGTGA TAACCGATAG AAAAGCCCTA
GATAAACAAC TACAACGTAA TTTAAAGCAA TTTGAAACCA CTTCCGGTGT AGTGGAAAAT
ATTGATAAAA CCTCTCGACA ACTCAAAGAA GCCTTAGAAA ACGGCAAAAA TATTATTGTC
ACCACCCTAC AGAAATTCCC CGGCGTTATT GACCAGATAA ACAGCTTAAA AGGTCAAAAA
TTCGCTATTA TCATCGATGA AGCCCACTCA TCCCAAACAG GGGAAAATAG TCGCCAACTG
AAAACCGTCT TAAGTACCCA AACCCTAGAA GAAGCAGAAA CACAAGAACA GGACATAGAA
GACTATATAG AAGATAGAAT CGAAGAAGCC GCCCGAACCA GAGGAAATTT ACCCAATTTA
AGTTATTTTG CCTTTACCGC TACCCCCAAA CCCAAAACCT TAGAATTATT CGGCATAAAA
CAACCTGATG GCACATTTAA ACCCTGTAGC CTGTATTCTA TGCGCCAAGC GATAGAAGAA
GGGTTTATAC TCGATGTTTT GCAAAATTAC ACCACCTATC AAACCTATTT TAGTCTGCTG
AAAACTGTTG AAAACGACCC CCACTACGAC AGAAACAAAG CCGGAAGACT CCTCAGAAAC
TTTGTTGACC TTCATCCCCA CAATATTAAC GCAAAAGTCG CTATTATCGC GGAGCATTTC
CATAATAACG TTGCTCATCA AATTAATAAT CAAGCTAAGG CGATGATAGT CACCCGTTCT
CGTCTTCACG CTGTTAGATA TAAACTCGCT TTAGATAACT ATTTACGAGA AAATGGCTAC
CCGTATCAAT CCTTAGTCGC CTTTACAGGT ACAGTCAAGG ACGGGGGAAG AGACTTCACC
GAAACAGGGA TGAATACCGC TTCATCTGGG GTTTCTATCC CAGAAAAAGC CACCGCAGAC
ACTTTTAATC AGAATCTCTA TAAATTTCTG ATTGTGGCGA ATAAATTCCA AACCGGATTT
AACCAACCCT TATTAACAGC GATGTATGTT GATAAAAAAT TGGGGGGTGT GAATGCCGTC
CAAACCTTAT CCCGTCTTAA TCGTACCTAT TCCCAGAAAG AAAGTACCGT GATTTTAGAT
TTCGCCAATG AGATTGACGT TATACAATCT GCCTTTGAGA ATTATTACGA TAGAACCGTA
TTAAGCCAAG AAACGGATGT TAACCTTGTC TATGATATTC AGCAACAGCT AGACGATTAT
GACTTCTATA CAGCATCCGA TATAACCGAT TTTGCTCAAA TTTACTTTAA TCCCAAAGCT
ACACAAGACC GACTACATAG TATTTTAATG CCAGTCATTG ACCGCTATCA AGAAGCAACA
GAAGCCGAAC AATTTAGCTT TAGAAATAAG CTAAAAGACT TTATCAGACT CTATCGCTTC
ATAGGGCAAC TTATTGGCTG TCCTGACTCA GAATTAGAAC AATTTTATGA ATTTGCTCGT
CATTTAGCCC CTAAATTACC CTTTGCACAG CAACAATTAC CCCTAGAAGT TCAACAAAAT
ATCGAACTGT CTCAATATCG TATCCAACGG ACTTATACGG GACAAATTGA CCTAAAACGA
GGAGAAAGAC AACTTGACCC CATTATCGCC GCCGGGACAG GAAATCCTCC AGTAGAAGAC
AGAGAACCCT TATCCGTAAT TATTGAACAG CTTAATCAAC AATTTGGTAC AAATTTCACC
GAAGATGAAC AAGTTTTCAT CGAACAGCTA GAACATAAAT TAGATAACAG CGACTCCTTA
CAAGCCAGTT TAAAGATCAA TTCCCTAGAA AATGTACGAT TAACTTTTAA TAATCTGACT
AATGAATTTA TGCAGGAAAT GATAGAATCT AATTTCAATT TTTATAAGCA TTTTAACGAT
GATAGTGAGT TTGCCAATCT GTTATTAAAT TGGCTGTTTC AACGCTTCTT AGAGAGACAG
CAAAGTAATA GTTAA
 
Protein sequence
MNTSEKNFES DIQAYLLENG YHSRTSNDYD KKLCLIPQDV LNFINATQPQ EWQKYQTQYG 
DDAQTKLLQR LAQQIKKRGT VEILRKGLKA NGCKFKLAYF RPSTALNDET QRLYQGNFFS
LIRQFYYSEK NQNSIDLAIF LNGLPLFTCE LKNSFKGQTV ENAIKQYRND REPRETVLSF
GVCLSHFAVD PNLVYMTTHL QGKKTKFLPF NQGRDNGHGN PPSALSYPTA YLWQQIWQKD
SILNLIQNFI TQYEEEDDKG NKTGEKKLIF PRYHQLDTIN RLINHAKDHK TGQKYLIQHS
AGSGKSNTIA WLAHQLVSLH EREDNRVFDS ILVITDRKAL DKQLQRNLKQ FETTSGVVEN
IDKTSRQLKE ALENGKNIIV TTLQKFPGVI DQINSLKGQK FAIIIDEAHS SQTGENSRQL
KTVLSTQTLE EAETQEQDIE DYIEDRIEEA ARTRGNLPNL SYFAFTATPK PKTLELFGIK
QPDGTFKPCS LYSMRQAIEE GFILDVLQNY TTYQTYFSLL KTVENDPHYD RNKAGRLLRN
FVDLHPHNIN AKVAIIAEHF HNNVAHQINN QAKAMIVTRS RLHAVRYKLA LDNYLRENGY
PYQSLVAFTG TVKDGGRDFT ETGMNTASSG VSIPEKATAD TFNQNLYKFL IVANKFQTGF
NQPLLTAMYV DKKLGGVNAV QTLSRLNRTY SQKESTVILD FANEIDVIQS AFENYYDRTV
LSQETDVNLV YDIQQQLDDY DFYTASDITD FAQIYFNPKA TQDRLHSILM PVIDRYQEAT
EAEQFSFRNK LKDFIRLYRF IGQLIGCPDS ELEQFYEFAR HLAPKLPFAQ QQLPLEVQQN
IELSQYRIQR TYTGQIDLKR GERQLDPIIA AGTGNPPVED REPLSVIIEQ LNQQFGTNFT
EDEQVFIEQL EHKLDNSDSL QASLKINSLE NVRLTFNNLT NEFMQEMIES NFNFYKHFND
DSEFANLLLN WLFQRFLERQ QSNS