Gene Cyan8802_3011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3011 
Symbol 
ID8392339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp3044909 
End bp3047827 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content35% 
IMG OID644980958 
ProductDNA polymerase I 
Protein accessionYP_003138692 
Protein GI257060804 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.664853 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACGC AGCAATCCTT ACCGCTTTTA ATCTTAATTG ATGGGCATTC CTTAGCCTTT 
CGCTCTTATT ATGCCTTTGC TAAAGCGAGA AAAGGTCCTT TGCGTACTTC TAAAGGTATT
CCGACGAGTG TTTGTTTTGG GTTCCTTAAT TCATTGATGC AAGTGATTGA GTCTCAGAAA
CCTGAATATT TAGCCATTGC TTTTGATCGT TCTGAACCAA CTTTTCGTCA TGAAGCTGAT
GAAAATTATA AAGCTGATAG GGAGGAAACC CCTGAAGATT TTATTATCGA TATGGCTAAT
CTTCAAGAGT TATTAGAAGC GTTAAATTTA ACCATTGTTA CTTTCCCCGG ATATGAAGCA
GATGATGTAT TAGGAACCCT TGCTAACCAA GGAAGTGAGT CAGGATATCA AGTAAAAATT
GTCAGTGGCG ATCGCGATCT GTTTCAATTA GTTGATGAAG CTAAAAATAT TAGTGTTTTA
TATTTAGAAA GAAATGCGGT TAAAGGTTCA TCGGAAGGCT ATACAGAATT TAACTCCGAG
GCAGTGGAAG CAAAATTAGG TATTAAACCC GTTCAAGTTG TTGATTATAA AGCACTTTGT
GGCGATAAGT CTGATAGTAT TCCAGGGATT GATGGAATTG GTGAAAAAAC CGCAGTGAAA
TTATTAAAAG AATACGGTAC TATTCAAGAA GTTTATAACA ATTTAGACAA AATCAAAGGA
ACACTAAACA AGAAATTAAA AGACGGAGTA AAAGAAGCAG AACATTCTCA ATATTTGGCT
AAAATTGTCG TTGATGTTCC TGTGGATATT GACTTTAAAA ACTGTCAATT AAAAGGATTT
AATTTATCTA TACTCGAACC GCTTTTACAA AAGCTAGAAT TAAATAAGTT TTTAAAACAA
ATTAATCACT TACAACAAAA ATTTGGGGGA GAAGTTAAAC CCGTTGATCA AGCAACTTCT
CAACCGTTAG ATCACCCTCA GCAACTGTCT CTATTTCCAG AGCAAAATTT ACCCGCTCAT
TCTGTTGAAC CCTTAGTGAT TAAAACTATC CCAGAAGTCT CTCCTATAAC TCCTAAAATT
ATTACGACGT TAGATGACTT AAATCACTTA ATTAACAAGC TAGAACAATA TACTGATGTT
GAATATCCTG TTGCATGGGA TACGGAGACA ACATCCTTAG AACCGAGGAA TGCAGAGTTA
GTCGGAATTG GCTGTTGTTG GGGAGAGAAA TCCACTGATA TTGCCTATAT TCCCACGGGA
CATAACAAAG GGAATCAACT CAAAAAAGAA GCCGTATTAC AAGCATTAAA ACCTATTTTA
GAAAGCGATC GCTATCCCAA AGTATTCCAG AATACTAAGT TTGATCGCTT GATTTTTTAT
CATCAAGGAA TTAACCTAAA AGGAGTCGTC TTTGACACTT TATTAGCGAG TTATGTTTTG
CATCCTGAAA TGAGTCATAA TCTCAGTGAT TTATGTTATC GCTATTTATC AGGAATTACC
TCGCAAAGCT ACAAAGAATT AGCTATCCCA AAAGGTAAAA CTATTGCCAG TTTAGACATT
GAAACTGTTG CTAATTACTG CGGACTCGAT GCTTATGCGA CGTTTTTATT AGTCAGTAAG
TTAAAACAAG AATTAAAATC CTTTCCTAGT TTAGAAAAAT TGCTGCTAGA AGTAGAACAA
CCCCTAGAAC CTGTACTAGC AGCAATGGAA GATATCGGGA TTCGTATCGA TACCGACTAT
TTACAACAAT TATCTCAACA ATTAGAACAA GATTTACATA TTATTGAACA ACAAGCGTAT
CAAGAAGCAG GAGAAATCTT TAACCTAGGT TCTCCCAAAC AACTCGGAGA AATCTTATTT
GAAAAACTCG AATTGAACCG TAAAAAGTCT CGCAAAACAA AAACCGGGTA TTCTACTGAT
CACGCTACCC TAGAAAAGTT ACAAGGAGAT CATCCAATTA TTGATCATAT TTTAGACTAT
CGTACTTTAT CGAAATTAAA ATCAACTTAT GTCGATGCTT TACCTGCCTT AGTTCGTCCT
GATACTCAAC GAGTTCACAC CAATTTTAAT CAAACAGTAA CCGCAACCGG AAGATTATCT
TCTTCTAACC CTAACCTACA AAATATCCCG ATTCGGACAG AATTTTCCCG TAAAATTCGG
CAAGCATTTA TTCCCCAAGA AAATTGGTTA TTGGTGTCAG CAGACTATTC TCAAATTGAA
CTCAGAATTT TAGCGCATTT AAGTCAAGAA GTTGTCTTAT TAGAAGCCTA TCGAAATAAT
CAAGATGTCC ATAGTGTGAC AGCTAAATTA TTATTTGATA AAGAAACAAT TACTCCACAA
GAAAGAAATT TAGGTAAAAC GATTAACTTT GGTGTTATCT ATGGAATGGG TGCTCAAAGA
TTTGCCAGAG AAGCAGGAGT AAGCGCAGCA GAAGGAAAGA CATTTATTAA TAAATATCGT
CAGCGATATG CTAAAGTATT TGACTATTTA GAAAGGATGA AAAAAGAAGC GATCGCTGAT
GGATTCGTCA CAACGATTTT AGGAAGAAGA CGCTATTTTA ATTTTATCAC CGAGAGTTTA
CAAAAGCTGA AAGGATCTGA TCCTGAAAAG ATTAACTTAG AAGTCTTAAA TATTAATTAT
GCCGATGCTC AATTATTACG CGCTGCTGCT AATGCACCCA TTCAAGGATC AAGTGCTGAT
ATTATTAAAA TAGCTATGAT TAAAATGCAA GAAATTTTAA GTCATTATCA AGCTAGATTA
CTGCTACAAG TTCATGATGA ATTAGTGTTT GAAATTCCTC TCAATGAATG GGAAGAATTA
CAAACAAAAA TCAAAGAAAC TATGGAGAAT GCTGTTACCT TAACCGTTCC TTTAGTGGTT
GAAATTCATT CAGGTAACAA TTGGATGGAA GCTAAATAA
 
Protein sequence
MQTQQSLPLL ILIDGHSLAF RSYYAFAKAR KGPLRTSKGI PTSVCFGFLN SLMQVIESQK 
PEYLAIAFDR SEPTFRHEAD ENYKADREET PEDFIIDMAN LQELLEALNL TIVTFPGYEA
DDVLGTLANQ GSESGYQVKI VSGDRDLFQL VDEAKNISVL YLERNAVKGS SEGYTEFNSE
AVEAKLGIKP VQVVDYKALC GDKSDSIPGI DGIGEKTAVK LLKEYGTIQE VYNNLDKIKG
TLNKKLKDGV KEAEHSQYLA KIVVDVPVDI DFKNCQLKGF NLSILEPLLQ KLELNKFLKQ
INHLQQKFGG EVKPVDQATS QPLDHPQQLS LFPEQNLPAH SVEPLVIKTI PEVSPITPKI
ITTLDDLNHL INKLEQYTDV EYPVAWDTET TSLEPRNAEL VGIGCCWGEK STDIAYIPTG
HNKGNQLKKE AVLQALKPIL ESDRYPKVFQ NTKFDRLIFY HQGINLKGVV FDTLLASYVL
HPEMSHNLSD LCYRYLSGIT SQSYKELAIP KGKTIASLDI ETVANYCGLD AYATFLLVSK
LKQELKSFPS LEKLLLEVEQ PLEPVLAAME DIGIRIDTDY LQQLSQQLEQ DLHIIEQQAY
QEAGEIFNLG SPKQLGEILF EKLELNRKKS RKTKTGYSTD HATLEKLQGD HPIIDHILDY
RTLSKLKSTY VDALPALVRP DTQRVHTNFN QTVTATGRLS SSNPNLQNIP IRTEFSRKIR
QAFIPQENWL LVSADYSQIE LRILAHLSQE VVLLEAYRNN QDVHSVTAKL LFDKETITPQ
ERNLGKTINF GVIYGMGAQR FAREAGVSAA EGKTFINKYR QRYAKVFDYL ERMKKEAIAD
GFVTTILGRR RYFNFITESL QKLKGSDPEK INLEVLNINY ADAQLLRAAA NAPIQGSSAD
IIKIAMIKMQ EILSHYQARL LLQVHDELVF EIPLNEWEEL QTKIKETMEN AVTLTVPLVV
EIHSGNNWME AK