Gene Information |
Locus tag | P9515_13041 |
Symbol | polA |
ID | 4718824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 1144210 |
End bp | 1147140 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640080988 |
Product | DNA polymerase I |
Protein accession | YP_001011618 |
Protein GI | 123966537 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
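The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, not something the record states. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1144210
END_BP = 1147140
GENE_LENGTH_BP = 2931
PROTEIN_LENGTH_AA = 976


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codons in the gene minus one, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks print True for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1147140 - 1144210 + 1 = 2931 bp, and 2931 / 3 - 1 = 976 aa, which agrees with the listed Gene Length and Protein Length.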
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTGA AATCACAAGA CTCAAAAAAA CCAGTTTTAC TTTTAGTTGA TGGACATTCT CTCGCTTTTA GAAGTTTCTA TGCGTTTAGC AAAGGAGTTG ATGGAGGATT AACCACTAAA GAAGGATTCC CAACAAGTGT CACTTATGGA TTTTTAAAAA GCCTTCTAGA TAATTGCAAA AAAGTCAGTG CTGAAGGAGT TTGTATTACT TTTGATACAG AAAAACCAAC TTTCAGACAT GAATTAGATC CAAATTACAA AGCCAATAGG GATGTAGCTC CAGATGTTTT TTTTCAAGAT ATTGAACAAC TAGAAATCAT CTTAAAAGAA AGCCTTAATT TACCTATTTT CAAATCCCCA GGATTCGAAG CTGATGATCT ATTAGGAACA ATCGCTAATG ATGCATCCTC AAAAGGTTGG TGTGTAAATA TTCTTTCTGG GGATCGAGAT TTATTTCAAT TAGTTGATGA TCAAAAAGAT ATTTATGTTC TCTATATGGG CGGAGGTCCT TATGCTAAAA GTGGGAATCC GACTTTAATG AATGAAAACG GAGTCAAAGA AAAACTAGGA GTTTATCCAG ATAGAGTTGT CGATCTTAAG GCCTTGACCG GCGACAGTTC TGATAATATT CCAGGAATAA AAGGTGTAGG ACCAAAAACT GCAATTAATT TATTAAAAGA GAATGATACC TTAGACGGTA TTTATAAAAC CTTGGATATT ATTCAAAAAA ATGAGGATAA AAAATATCAG GGATTTATAA AAGGAGCCGT TATGGAAAAG CTAAAAAATG ATAAATTCAA TGCTTATCTT TCAAGAAATT TAGCAAAGAT AGATGTTGAA GTTCCCTTGG TATTGAACAA TGGATATGAA CTAAATGAAA TAAATCAAGA CTCTCTATCG GAATCCCTTA AAAAACTTGA ATTATCAACT TTACTGAGGC AAGTTGATAT TTTCAATTCA GCCTTTAGTA AAGGTGGTTT TAATAAAAAT AATGAGGAAA AACATAAGGA CAAAGAATCA AAAGATTCAC CGTCAACAGA TTTAGAAGAA ATAGAAAATA AACTCCCAAA AATCAAAGTA AATATAGTTA ATGATTCAGA ATCACTAGAT CAACTCGTTA AAAGATTAGA GATTACTAAG GAGATTGTTG CCTTAGACAC TGAGACAAAT AGTTTAAATC CTCTAGATGC AGAACTTGTT GGTATAGGTT TTTGTCTTGG CGAAGAGAGT AATGATCTGT TTTATATACC CCTTGGTCAT CAATCGCAAA AAGATGAGAT GAATCAATTA GCAATTGAAG ATGTATTCCT TAATTTAAGA ACTTGGATTG AAAGTCCAGA GAAAGAAAAA ACTCTTCAAA ATTGTAAATT TGATAGACAA ATATTTTATA ATCATGGGTT AAATCTAAGC GGAGTTACTT TTGATACCCT ACTAGCAGAC TACATTCTCA ATAATCAGGA AAAGCATGGA TTAAGTGAAA TTTGTTTTAG AGAATTTGGA TTTAAGCCAC CATCTTTTAA AGAAACAGTT GGAAAAAATA AAGACTTTTC ATTTGTCGAC ATTAATCATG CAAGTATTTA CTGCGGTTAT GATGTATTTC TAACTTACAA AATTGCGAAA ATTTTTAAAG AAAGATTTAG AAAAGAAAAA AAAGAATTAA CAAAATTATT TAAAGAAATC GAATTACCTT TAGAGCAAGT TTTATCAGGA ATGGAGATGA GTGGCATCAG CATAGATATC ACTTATTTAA ATGAACTTTC TAGAGAATTA AAAAGTACCT TAGCAAATAT TGAGGAAAAT 
GTTTTTGAAA TAGCAAAACA AGAATTTAAT TTATCTTCTC CAAAACAACT TGGCGAAATA TTGTTTGATA AGTTAAATCT TGATAAAAAA AAATCAAGGA AAACAAAAAC AGGTTGGAGT ACTGATGCAG TTGTTCTCGA AAGATTAGTT GAAGAACATG AGATAATTCC CTATCTAATA AAACATAGAA CTCTTAGTAA ACTTTTAAGT ACATATATTG ATGCTCTTCC CAACCTAATA AATAAAAAAA CTGGAAGAGT TCATACTAAT TTTAATCAAG CAGCCACCGC CACTGGGCGA TTAAGCAGTA GCAATCCAAA TCTTCAGAAT ATCCCTGTCA GAACCGAATT CAGTAGAAGA ATAAGAAAAG CCTTCTTGCC TGAAAAAGGT TGGAAATTAT TATCAGCAGA TTATTCTCAA ATTGAATTAA GAATATTAGC TCACTTAGCC AATGAAGAAA TTCTTATAAC TGCGTTCCAC CAAAATGATG ATATTCACTC TTTAACAGCG CGATTAATTT TTGAAAAGAA AGATATAAAT TCTGACGAAA GAAGAGTTGG TAAAACTATT AATTTTGGAG TTATTTATGG AATGGGAATA AAAAAATTCG CACGTTCTAC AGGTGTAAGT ACGACAGAGG CAAAAGAATT TTTAATTAAA TACAAAAAAA GATATGCCAA AATTTTTAAA TTTCTAGAAC TTCAAGAAAG GCTTGCTCTC TCAAAAGGTT ATGTAGAAAC TATTTTCGGA AGGAAAAGGG AATTTAAATT TGATAAAAAT GGTCTTGGTA GATTAATCGG AAAAGAACCG TATGAAATAG ATTTACAAAC TGCTAGGAAA GCTGGAATGG AAGCTCAATC ATTAAGAGCC GCTGCAAATG CTCCTATTCA GGGTTCTAGT GCTGACATTA TAAAAATAGC TATGGTTCAA TTAAACAAAA AATTATTAGA GATGAATATT CCAGTTAAAA TGCTTTTACA AGTTCACGAT GAATTATTAT TCGAGGTGAA ACCAGATTTC TTAGAAATAA CAAAGCAATT AGTTAAAGAA ACTATGGAAG ATTGTGTAAA ATTAAAAGTA CCACTTCTAG TGGATATTGG AGTTGGAAAT AACTGGATGG AAACCAAATA A
|
Protein sequence | MSLKSQDSKK PVLLLVDGHS LAFRSFYAFS KGVDGGLTTK EGFPTSVTYG FLKSLLDNCK KVSAEGVCIT FDTEKPTFRH ELDPNYKANR DVAPDVFFQD IEQLEIILKE SLNLPIFKSP GFEADDLLGT IANDASSKGW CVNILSGDRD LFQLVDDQKD IYVLYMGGGP YAKSGNPTLM NENGVKEKLG VYPDRVVDLK ALTGDSSDNI PGIKGVGPKT AINLLKENDT LDGIYKTLDI IQKNEDKKYQ GFIKGAVMEK LKNDKFNAYL SRNLAKIDVE VPLVLNNGYE LNEINQDSLS ESLKKLELST LLRQVDIFNS AFSKGGFNKN NEEKHKDKES KDSPSTDLEE IENKLPKIKV NIVNDSESLD QLVKRLEITK EIVALDTETN SLNPLDAELV GIGFCLGEES NDLFYIPLGH QSQKDEMNQL AIEDVFLNLR TWIESPEKEK TLQNCKFDRQ IFYNHGLNLS GVTFDTLLAD YILNNQEKHG LSEICFREFG FKPPSFKETV GKNKDFSFVD INHASIYCGY DVFLTYKIAK IFKERFRKEK KELTKLFKEI ELPLEQVLSG MEMSGISIDI TYLNELSREL KSTLANIEEN VFEIAKQEFN LSSPKQLGEI LFDKLNLDKK KSRKTKTGWS TDAVVLERLV EEHEIIPYLI KHRTLSKLLS TYIDALPNLI NKKTGRVHTN FNQAATATGR LSSSNPNLQN IPVRTEFSRR IRKAFLPEKG WKLLSADYSQ IELRILAHLA NEEILITAFH QNDDIHSLTA RLIFEKKDIN SDERRVGKTI NFGVIYGMGI KKFARSTGVS TTEAKEFLIK YKKRYAKIFK FLELQERLAL SKGYVETIFG RKREFKFDKN GLGRLIGKEP YEIDLQTARK AGMEAQSLRA AANAPIQGSS ADIIKIAMVQ LNKKLLEMNI PVKMLLQVHD ELLFEVKPDF LEITKQLVKE TMEDCVKLKV PLLVDIGVGN NWMETK
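The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the database's own pipeline. It assumes the sequences are pasted as shown here, in space-separated blocks of ten, and that only unambiguous A/C/G/T bases occur. The amino-acid assignments of translation table 11 are written out in the usual TCAG order. The table's alternative start codons are not handled, which does not matter for this gene because it begins with ATG. The helper names are hypothetical.

```python
from itertools import product

# Amino-acid assignments for translation table 11, listed in TCAG order
# (first base varies slowest, third base fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks between blocks; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed in this record.
    first_blocks = "ATGAGTTTGA AATCACAAGA CTCAAAAAAA"
    print(translate(first_blocks))  # MSLKSQDSKK
```

The ten residues produced from the first three blocks match the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.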