Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3011 |
Symbol | |
ID | 8392339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 3044909 |
End bp | 3047827 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 644980958 |
Product | DNA polymerase I |
Protein accession | YP_003138692 |
Protein GI | 257060804 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.664853 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACGC AGCAATCCTT ACCGCTTTTA ATCTTAATTG ATGGGCATTC CTTAGCCTTT CGCTCTTATT ATGCCTTTGC TAAAGCGAGA AAAGGTCCTT TGCGTACTTC TAAAGGTATT CCGACGAGTG TTTGTTTTGG GTTCCTTAAT TCATTGATGC AAGTGATTGA GTCTCAGAAA CCTGAATATT TAGCCATTGC TTTTGATCGT TCTGAACCAA CTTTTCGTCA TGAAGCTGAT GAAAATTATA AAGCTGATAG GGAGGAAACC CCTGAAGATT TTATTATCGA TATGGCTAAT CTTCAAGAGT TATTAGAAGC GTTAAATTTA ACCATTGTTA CTTTCCCCGG ATATGAAGCA GATGATGTAT TAGGAACCCT TGCTAACCAA GGAAGTGAGT CAGGATATCA AGTAAAAATT GTCAGTGGCG ATCGCGATCT GTTTCAATTA GTTGATGAAG CTAAAAATAT TAGTGTTTTA TATTTAGAAA GAAATGCGGT TAAAGGTTCA TCGGAAGGCT ATACAGAATT TAACTCCGAG GCAGTGGAAG CAAAATTAGG TATTAAACCC GTTCAAGTTG TTGATTATAA AGCACTTTGT GGCGATAAGT CTGATAGTAT TCCAGGGATT GATGGAATTG GTGAAAAAAC CGCAGTGAAA TTATTAAAAG AATACGGTAC TATTCAAGAA GTTTATAACA ATTTAGACAA AATCAAAGGA ACACTAAACA AGAAATTAAA AGACGGAGTA AAAGAAGCAG AACATTCTCA ATATTTGGCT AAAATTGTCG TTGATGTTCC TGTGGATATT GACTTTAAAA ACTGTCAATT AAAAGGATTT AATTTATCTA TACTCGAACC GCTTTTACAA AAGCTAGAAT TAAATAAGTT TTTAAAACAA ATTAATCACT TACAACAAAA ATTTGGGGGA GAAGTTAAAC CCGTTGATCA AGCAACTTCT CAACCGTTAG ATCACCCTCA GCAACTGTCT CTATTTCCAG AGCAAAATTT ACCCGCTCAT TCTGTTGAAC CCTTAGTGAT TAAAACTATC CCAGAAGTCT CTCCTATAAC TCCTAAAATT ATTACGACGT TAGATGACTT AAATCACTTA ATTAACAAGC TAGAACAATA TACTGATGTT GAATATCCTG TTGCATGGGA TACGGAGACA ACATCCTTAG AACCGAGGAA TGCAGAGTTA GTCGGAATTG GCTGTTGTTG GGGAGAGAAA TCCACTGATA TTGCCTATAT TCCCACGGGA CATAACAAAG GGAATCAACT CAAAAAAGAA GCCGTATTAC AAGCATTAAA ACCTATTTTA GAAAGCGATC GCTATCCCAA AGTATTCCAG AATACTAAGT TTGATCGCTT GATTTTTTAT CATCAAGGAA TTAACCTAAA AGGAGTCGTC TTTGACACTT TATTAGCGAG TTATGTTTTG CATCCTGAAA TGAGTCATAA TCTCAGTGAT TTATGTTATC GCTATTTATC AGGAATTACC TCGCAAAGCT ACAAAGAATT AGCTATCCCA AAAGGTAAAA CTATTGCCAG TTTAGACATT GAAACTGTTG CTAATTACTG CGGACTCGAT GCTTATGCGA CGTTTTTATT AGTCAGTAAG TTAAAACAAG AATTAAAATC CTTTCCTAGT TTAGAAAAAT TGCTGCTAGA AGTAGAACAA CCCCTAGAAC CTGTACTAGC AGCAATGGAA GATATCGGGA TTCGTATCGA TACCGACTAT TTACAACAAT TATCTCAACA ATTAGAACAA GATTTACATA TTATTGAACA ACAAGCGTAT CAAGAAGCAG GAGAAATCTT TAACCTAGGT TCTCCCAAAC AACTCGGAGA AATCTTATTT GAAAAACTCG AATTGAACCG TAAAAAGTCT CGCAAAACAA AAACCGGGTA TTCTACTGAT CACGCTACCC TAGAAAAGTT ACAAGGAGAT CATCCAATTA TTGATCATAT TTTAGACTAT CGTACTTTAT CGAAATTAAA ATCAACTTAT GTCGATGCTT TACCTGCCTT AGTTCGTCCT GATACTCAAC GAGTTCACAC CAATTTTAAT CAAACAGTAA CCGCAACCGG AAGATTATCT TCTTCTAACC CTAACCTACA AAATATCCCG ATTCGGACAG AATTTTCCCG TAAAATTCGG CAAGCATTTA TTCCCCAAGA AAATTGGTTA TTGGTGTCAG CAGACTATTC TCAAATTGAA CTCAGAATTT TAGCGCATTT AAGTCAAGAA GTTGTCTTAT TAGAAGCCTA TCGAAATAAT CAAGATGTCC ATAGTGTGAC AGCTAAATTA TTATTTGATA AAGAAACAAT TACTCCACAA GAAAGAAATT TAGGTAAAAC GATTAACTTT GGTGTTATCT ATGGAATGGG TGCTCAAAGA TTTGCCAGAG AAGCAGGAGT AAGCGCAGCA GAAGGAAAGA CATTTATTAA TAAATATCGT CAGCGATATG CTAAAGTATT TGACTATTTA GAAAGGATGA AAAAAGAAGC GATCGCTGAT GGATTCGTCA CAACGATTTT AGGAAGAAGA CGCTATTTTA ATTTTATCAC CGAGAGTTTA CAAAAGCTGA AAGGATCTGA TCCTGAAAAG ATTAACTTAG AAGTCTTAAA TATTAATTAT GCCGATGCTC AATTATTACG CGCTGCTGCT AATGCACCCA TTCAAGGATC AAGTGCTGAT ATTATTAAAA TAGCTATGAT TAAAATGCAA GAAATTTTAA GTCATTATCA AGCTAGATTA CTGCTACAAG TTCATGATGA ATTAGTGTTT GAAATTCCTC TCAATGAATG GGAAGAATTA CAAACAAAAA TCAAAGAAAC TATGGAGAAT GCTGTTACCT TAACCGTTCC TTTAGTGGTT GAAATTCATT CAGGTAACAA TTGGATGGAA GCTAAATAA
|
Protein sequence | MQTQQSLPLL ILIDGHSLAF RSYYAFAKAR KGPLRTSKGI PTSVCFGFLN SLMQVIESQK PEYLAIAFDR SEPTFRHEAD ENYKADREET PEDFIIDMAN LQELLEALNL TIVTFPGYEA DDVLGTLANQ GSESGYQVKI VSGDRDLFQL VDEAKNISVL YLERNAVKGS SEGYTEFNSE AVEAKLGIKP VQVVDYKALC GDKSDSIPGI DGIGEKTAVK LLKEYGTIQE VYNNLDKIKG TLNKKLKDGV KEAEHSQYLA KIVVDVPVDI DFKNCQLKGF NLSILEPLLQ KLELNKFLKQ INHLQQKFGG EVKPVDQATS QPLDHPQQLS LFPEQNLPAH SVEPLVIKTI PEVSPITPKI ITTLDDLNHL INKLEQYTDV EYPVAWDTET TSLEPRNAEL VGIGCCWGEK STDIAYIPTG HNKGNQLKKE AVLQALKPIL ESDRYPKVFQ NTKFDRLIFY HQGINLKGVV FDTLLASYVL HPEMSHNLSD LCYRYLSGIT SQSYKELAIP KGKTIASLDI ETVANYCGLD AYATFLLVSK LKQELKSFPS LEKLLLEVEQ PLEPVLAAME DIGIRIDTDY LQQLSQQLEQ DLHIIEQQAY QEAGEIFNLG SPKQLGEILF EKLELNRKKS RKTKTGYSTD HATLEKLQGD HPIIDHILDY RTLSKLKSTY VDALPALVRP DTQRVHTNFN QTVTATGRLS SSNPNLQNIP IRTEFSRKIR QAFIPQENWL LVSADYSQIE LRILAHLSQE VVLLEAYRNN QDVHSVTAKL LFDKETITPQ ERNLGKTINF GVIYGMGAQR FAREAGVSAA EGKTFINKYR QRYAKVFDYL ERMKKEAIAD GFVTTILGRR RYFNFITESL QKLKGSDPEK INLEVLNINY ADAQLLRAAA NAPIQGSSAD IIKIAMIKMQ EILSHYQARL LLQVHDELVF EIPLNEWEEL QTKIKETMEN AVTLTVPLVV EIHSGNNWME AK
|
| |