Gene Cyan8802_2208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2208 
Symbol 
ID8391525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2216990 
End bp2220646 
Gene Length3657 bp 
Protein Length1218 aa 
Translation table11 
GC content39% 
IMG OID644980180 
Producthypothetical protein 
Protein accessionYP_003137924 
Protein GI257060036 
COG category[V] Defense mechanisms 
COG ID[COG1002] Type II restriction enzyme, methylase subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00220427 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.956264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACAT CAAGACTCAA AAAGTTCGCC CAGTATGCCC GTCGTTATCT CAGAGAACAG 
GTAGCGACTA AGTTAGAAGT GGTCTTAGCT CAAAATAGTG CAGCAAGACG GGAAAATCCT
GAAGCTATCC GTAAGTTAAC CGAAAATATC GAAGTTTCAG GGAAAGAACA GGTTATTGAA
CAGGTTGCCT ATACTTGGTT TAACCGTTTT TGTGCGTTGC GGTTTATGGA TAGTAACCGT
TATAACTCCA TTGGGGTTGT GTCTCCTGCT GAGGGGCAAT TTCAGCCTGA AATCTTGGCT
GAGGCGAAAA TGGGGCATAT TGATGAGGAA ATGATAACTA AACACAGAAA GCAGATTTTT
GATTTATTAA GCGGTAATGC ACCGAGTTCT GATGCTCAAG GGGAGGCCTA TCGTTTATTA
ATTGTCGGGG TATGTAATTA TTATCATAAG GTGATGGATT ATTTGTTTGA GCCAATTGAT
AATTATACGG AGTTATTAAT GCCTGATGAT TTGCTGTCGG GTAATTCAAT TTTGGCTTAT
ACTCGTGAGG CAATGACTCC TGAAAATTGT GAGAGTGTGG AGGTAATTGG GTGGTTATAT
CAGTTTTATA TTTCTGAGAA GAAAGATGAG GTTTTTGAGG CTCTTAAAAA GAATAAAAAA
ATCATACCCG AAAATATTCC GGCAGCGACT CAGTTATTTA CTCCTCATTG GATTGTGCGT
TATTTAGTAG AAAATTCTTT GGGGCATTTA TGGATGTTAA ATCGTCCTCA GTCTCGTTTA
ATTGAGATGA TGGATTATTA TGTGGATAGT GGGAAGTGGG AAGTGGGAAG TGGGGAGTCT
AGTTTAGGAG GTGAAGATGA TGACCGTCAG AAATTATCAG GAGTTAATCG TTTGGCAGAA
AGCGATGGAC TTAGTGGAGT TAGTTTATCA GGCGACAAAA ATGTTTCCGA AAGAGGAACT
TTACGGTCTG ACCAATCAAG TGAGACGGGC AGTTGTTTCG ATTCCGTCAA ACATAGCGGA
GGGTCAAACC CGTCAATCGA CGGCGGAATT CAAGCGTTTT CTCTCAATAG CGCAGGGGTC
GAGAGCAGAG GTAGAAACTC AAATTATGAT AGCTCAGAGA CTCAAATATC TCAGTCAGGA
GGAGATGGAA CAAATTCTCA ATCTGGCCGA GGAAATCAAG CGAATGATTT ATGGACTAAC
AGCCAAACTA AAATAACTTC CCACTCTCCA CTTCCTACTT CCTACTTAAA AATAAATTCT
CCTGAAGATT TAAAAATTTG TGATCCTGCT TGTGGTTCGG GACATATTTT AGTCTATGCT
TTTGAGTTAC TTTATGCTAT TTATGAGGAG GAAGGTTATG CAGTAAATGA GATTCCTAGC
AAGATTTTAA CCCATAATCT CTATGGTATG GAAATTGATG AACGGGCAGG AAGTTTGGCA
GCTTTTGCAT TAACTATGAA GGCAAGGGAG AAGCAAATAC GGTTTTTTCG TAAGCCGATT
CAGCCTCATA TTTGTGTGTT GAAAAAGGTT GAGTTTGAGG AGTGGGAAGT GGGGAGTGGG
AAGTGGGAAG TGGATAGTGA GAAGTTGGGG GTAAGTCGTG ATGATTTATT ACATGATTTA
CATTTGTTTA AAGAAGCGGA TAATTTTGGG GCTTTGTTAC GTCCAAAAAT GTCTGAAAAC
CAAATAGCTA ATTTAAGAGA TTACTTTGCT AATTTATGGA AAAAAATTCC TAATCCTTCT
TTGTTTGAAC ATAAAACCCA TGAGAAGGTT ATGGATGTGT TAAAACAGGC TGATTTTTTA
AGCCCTAAGT ATCACATTGT TGTTGCAAAT CCGCCTTATA TGGGTAATAA GGGTATGAAT
AATCGGTTAA AGGCTTTTTT ACAGGATAAT TATAGTAATG TAAAATCTGA TTTATTTTCT
GCTTTTATGA TTCGTATTTT AGAAATGACT TTACAAAAAG GGGAAATGGG TTTTGTTACT
CCCTATGTTT GGATGTTTAT TTCTTCTTAT GAAAAATTAA GAACTTTGAT TTTAGAGAAA
ACGACAATAA CTAGCTTAAT TCAATTAGAA TATAATGCTT TTGCTCCTGC TTGTATTCCA
GTTGCTACGT TTACCCTTTC TAATCAAAAT TTACCTAATT TTAAAGGGGG ATATATTAAA
TTATCCGATT TTAGAGGTGC AGATAATCAA GCACCGAAGG CATTAGAAGC GATAAAAAAT
CCTAATTGTG GTTGGTTTTA TCGGGCTTCT GCAAGTGATT TTAAGAAGAT TCCGGGGAGT
GCGATCGCTT ATTGGGTTAG TGATAGTATT CGCATCTCTT TTTCTAAATA TAAATCTATT
TCTGAGCAGT ATACAGTCAA GTCGGGAATT ATGACAGGCA ATGATGATAC TTTTCTGAAG
TTTTGGTTTG AGGTTAAAAT AAGTCAGATC GGTTTTAATT TGATTTCTTA TGATGAAATG
AAAAGTTATT GGTATCCAAT AAGTAAAGGA GGAAATTTTA GGAAATGGTA TGGAAATAAT
GAACACATTA TTAACTTAAG AGATGATGCT TATGATATTC GTAATGGAGG AGGAAACTTT
AGATTAAGAG AGAAAAATTT ATATTTTAGA CCTTATATAA CTTGGTCACG CATTACTTCA
TCACAAGTTG CCTTTAGATT TTCACAAGGT GGTATTCTTT TTTCAGATGC CGGACCAGGA
ATTTTTGCAG AAAATGATTG TCAGAAAATA ATAACTTTTC TTAATACAAA ACTGAGTAAT
TACTTTTTAG CCTGTATTAA TCCCACCCTT AATTACCAAA CCCGTGATAT TGAATCATTA
CCATGTATAG ATATCCAAAA AGATGTTCAG ACTCCTTATT TAATAAAAAC CACAAAATCA
GACTGGGATT CTTACGAAAC ATCATGGGAT TTTACCACCC TTCCGCTTTT AAGAGTGGGA
AGTGAACAGT GGGAAGTAGA GAGTGGAGAG TGGGAAGTGA ATAGAAAAGA GGGAACAGTG
GTGAATAGTA ATGAAACAAT AGATAGTAAG CAAAAAACAG TTGATAGTAA TGAAACAATA
GATAGTGAAC CGTTAATAGT TGATAATAGA AATGAAGAAA AGACTCCCCA CTTCCCACTC
CCCACTTCTC ACTCAAAACT CCCCACTTCC CACTCCCCAC TTCTCACTCA AAACTCCCCA
CTTCCCACTC CCCACTTCTC ACTCAAAACT CCCCACTTCC CACTCCCCAC TTCTCACTCA
AAACTCCCCA CTCCCCACTC CCCACTTCCC ACTCCCCACT TCTCACTCAA AACTCCCCAC
TCCCCACTCC CCACTTCCCA CTCCCCACTT CTCACTCAAA ACTCCCCACT CCCCACTCCC
CACTTCCCAC TCCCCACTTC TCACTCAAAA CTCCCCACTC CCCACTCCCC ACTTCCCACT
CCCCACTTCT CACTCAAAAC TCCCCACTCC CCACTCCCCA CTTCCCACTC CCCACTTCTC
ACTCAAAACT CCCCACTCCC CACTCCCCAC TTCCCACTCC CCACTTCTCA CTCAAAACTC
CCCACTCCCC ACTCCCCACT TCCCACTTCC CACTCCCCTA AAAGAAACCT ATCAAACCCT
GCGCCAACAA TGGCAAGAAA TGACCTTAGA AATGCAGAAA TTAGAAGAAG AAAATAA
 
Protein sequence
MDTSRLKKFA QYARRYLREQ VATKLEVVLA QNSAARRENP EAIRKLTENI EVSGKEQVIE 
QVAYTWFNRF CALRFMDSNR YNSIGVVSPA EGQFQPEILA EAKMGHIDEE MITKHRKQIF
DLLSGNAPSS DAQGEAYRLL IVGVCNYYHK VMDYLFEPID NYTELLMPDD LLSGNSILAY
TREAMTPENC ESVEVIGWLY QFYISEKKDE VFEALKKNKK IIPENIPAAT QLFTPHWIVR
YLVENSLGHL WMLNRPQSRL IEMMDYYVDS GKWEVGSGES SLGGEDDDRQ KLSGVNRLAE
SDGLSGVSLS GDKNVSERGT LRSDQSSETG SCFDSVKHSG GSNPSIDGGI QAFSLNSAGV
ESRGRNSNYD SSETQISQSG GDGTNSQSGR GNQANDLWTN SQTKITSHSP LPTSYLKINS
PEDLKICDPA CGSGHILVYA FELLYAIYEE EGYAVNEIPS KILTHNLYGM EIDERAGSLA
AFALTMKARE KQIRFFRKPI QPHICVLKKV EFEEWEVGSG KWEVDSEKLG VSRDDLLHDL
HLFKEADNFG ALLRPKMSEN QIANLRDYFA NLWKKIPNPS LFEHKTHEKV MDVLKQADFL
SPKYHIVVAN PPYMGNKGMN NRLKAFLQDN YSNVKSDLFS AFMIRILEMT LQKGEMGFVT
PYVWMFISSY EKLRTLILEK TTITSLIQLE YNAFAPACIP VATFTLSNQN LPNFKGGYIK
LSDFRGADNQ APKALEAIKN PNCGWFYRAS ASDFKKIPGS AIAYWVSDSI RISFSKYKSI
SEQYTVKSGI MTGNDDTFLK FWFEVKISQI GFNLISYDEM KSYWYPISKG GNFRKWYGNN
EHIINLRDDA YDIRNGGGNF RLREKNLYFR PYITWSRITS SQVAFRFSQG GILFSDAGPG
IFAENDCQKI ITFLNTKLSN YFLACINPTL NYQTRDIESL PCIDIQKDVQ TPYLIKTTKS
DWDSYETSWD FTTLPLLRVG SEQWEVESGE WEVNRKEGTV VNSNETIDSK QKTVDSNETI
DSEPLIVDNR NEEKTPHFPL PTSHSKLPTS HSPLLTQNSP LPTPHFSLKT PHFPLPTSHS
KLPTPHSPLP TPHFSLKTPH SPLPTSHSPL LTQNSPLPTP HFPLPTSHSK LPTPHSPLPT
PHFSLKTPHS PLPTSHSPLL TQNSPLPTPH FPLPTSHSKL PTPHSPLPTS HSPKRNLSNP
APTMARNDLR NAEIRRRK