Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_15881 |
Symbol | polA |
ID | 4779359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1295257 |
End bp | 1298217 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640084870 |
Product | DNA polymerase I |
Protein accession | YP_001015410 |
Protein GI | 124026294 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.65374 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGAAA TAAAAAAGCC AACTTTATTA TTGGTTGATG GCCATTCACT AGCCTTTAGA AGTTTTTATG CTTTTAGCAA AGGAGGTGAA GGAGGACTTA CCACCAAAGA TGGATTCCCT ACGAGTGTTA CCTATGGTTT TCTAAAAAGC CTTTTAGACA ATTGCAAATC TATCGAACCC AAAGGGGTCA CAATTGCTTT TGACACTGCT GAGCCAACAT TTCGCCACAA GGAAGATCCA AACTACAAAG CCAATCGAGA TGTAGCACCA GATATATTTT TTCAAGATTT AGATCAACTT GAAGAGATTC TCAAAGAAAG TTTGAATCTT TCAATCTGCA AGGCCCCAGG ATATGAGGCA GATGATGTCT TGGGAACACT CGCAAATGAT GCGGCTGAAA AAGGATGGAG TGTCAGAATC CTTTCTGGAG ATAGAGACCT ATTCCAATTA GTGGATGATG AAAGAGACAT AGCTGTCTTG TACATGGGTG GAGGTCCTTA TGCAAAAAGT GGAAGTCCTA AACTGATTAA TGAAAAAGGC GTAAGGGAAA AACTCGGAGT AAATCCAAAC AAAGTTATTG ATCTGAAAGC CTTAACTGGT GATAGCTCAG ACAATATTCC AGGTGTTAAA GGAGTTGGTC CTAAAACAGC AATAAATCTT TTAAACGAGA ACCTTGATCT TGATGGAGTC TATAAATCAC TTCAAGAGTT AGAAAAAGAA GGCGAGAAAG CTAAACGAGG AGCGATAAAA GGAGCAGTTA GATTAAAATT AAAAGCAGAT AAAGACAATG CCTATCTCTC AAAAAAACTT GCAGAGATAT TAATTAAAAT TCCCATTGAT CCAAAAGTAA ACTATAATTT AGAAGGAATT AACGAATCAA AGCTAGCTGA AAATCTTGAG AGGCTTGAGT TACATAGTTT ATCAAAACAA GTTTCAACTT TTAAAGCTAT ATTTTCCAAA GATGGATCAT CAAAAAAAGA TTTAAACCCC TCATCAAAAG AAATTAATAT CGCTAATAAT AAGACTAAAA ATAGTGAATT TAGTACTCTT AATGAAACGA AAGAGATCCC TAGGATAGAA CCTAAGATAA TAAACAATCT GGAAGAACTT AATAACTTTG TTGCACAAAT AATTAAACAT ACTGATGCGA AAAAACCAAT AGCGATTGAT ACAGAAACAA CGAGTCTGAA TCCCTTTAAA GCTGAACTTG TTGGTTTAGG ATTTTGTTTT GGTGAATCGT TAAAAGATAT AGTTTATATA CCAATAGGTC ATAAAAACAA AGAGGACGAT TTAATAGAAA TTAATCAAAT AAATCAATTA AAGATTGAAG AAGTTATCTT TGCACTTCAG GATTGGTTCT CTAGCAATGA AAATCCTAAA ACCCTACAAA ATGCAAAATA CGACCGACTG ATTTTGCTGA GACACGGAAT TATATTAAAC GGAGTTGTGA TGGATACTTT ACTTGCAGAT TATATATGTG ATGCAACCCT TAAACATAGT TTAGATGAAA TTGCTTATAG AGAATTTGGA TTTAAGCCCA AAAGTTTTTC TGATATTGTC AAAAAAGGAG AAGACTTCTC TTATGTAGAC ATTAAGTCTG CGAGTATGTA CTGCGGGATG GACGTTTATT TAACAAGAAA ATTAGCAATT ATTTATATTA ATAGATTAAA AGAAACAAGT ATAAAATTAA TAAACTTACT CAAAGAAGTT GAACAACCAC TTGAGCAAGT ATTAGCGGAA ATTGAATCGA CCGGTATCAT TATTGATACT CCTTATCTAA AAGATTTATC TTTAGAGTTA ACAAAAAGAT TAAATACTAT TGAGAAAGAA GTTTATAATA TTGCAGGAAG TGAGTTCAAC CTTTCATCAC CTAAACAGTT AGGTGAATTA CTTTTTGAGA AACTTGATTT AGATAGGAAG AAATCAAGAA AAACAAAAAC AGGATGGAGT ACAGATGTAG CTGTATTGGA AAAGCTGGAA TCAGACCATC CAATAGTAAA AATAATCATT GAACATCGCA CTATAAGCAA GTTGCTTAAT ACTTATGTAG ATGCTTTGCC TAAGCTTATT GAAAAAGAAA CAGGAAGAGT ACACACAGAT TTCAATCAAG CCGTAACCGC TACTGGAAGA TTAAGTAGCA GCAATCCAAA TCTGCAAAAT ATCCCTGTCA GAACTGAATT TAGTAGACGA ATAAGAAAAG CCTTTCTTCC GCAAAAAGAT TGGAAACTTC TAAGCGCAGA CTATTCACAA ATTGAACTTC GTATACTCAC ACATCTTTCA GGTGAAGAAG TACTAAAAAA TGCTTATTTA AAAAATGAAG ATGTCCACTC TTTAACAGCA AAAATTTTGT TTGAAAAAGA TGCTATTGAT GCCGATGAAA GAAGAATAGG AAAAACAATA AATTTTGGGG TTATTTATGG TATGGGAGCT CAAAGGTTTG CAAGATCAAC GGGTGTTTCA TTAATAGAAG CAAAATATTT TTTAAGTAAA TTCAAAGAAC GTTATCCAGC CGTTTTTAAA TTTTTAGAAT ATCAAGAAAG ACTTGCCTTA AGCCAAGGGT TTGTTGAAAC ATTGCTAGGG AGAAGACGAT ATTTTCATTT CAATAAAAAT GGACTTGGAA GACTTCTGGG AACTCCACCA AATGAAATTG ATTTAACCAC TGCAAGAAGA GCAGGGATGG AAGCACAACA ATTAAGAGCC GCAGCAAACG CACCTATTCA AGGCTCAAGT GCTGACATAA TTAAGCTAGC AATGATTCAG TTGCATTCAG CTTTAAGAGA GACTGGATTG GCCGCGAAAA TTCTACTTCA AGTGCATGAT GAACTCGTGC TAGAAGTTAA TCCCAAAGAT TTGGAGGAAA CAAAACTTCT TGTCCAAAAT ACTATGGAAA ATGCTGTAAA ACTTAGTGTC CCTCTTATCG TTGAAACTGG CGTTGGAGTA AATTGGATGG AGGCAAAATA G
|
Protein sequence | MTEIKKPTLL LVDGHSLAFR SFYAFSKGGE GGLTTKDGFP TSVTYGFLKS LLDNCKSIEP KGVTIAFDTA EPTFRHKEDP NYKANRDVAP DIFFQDLDQL EEILKESLNL SICKAPGYEA DDVLGTLAND AAEKGWSVRI LSGDRDLFQL VDDERDIAVL YMGGGPYAKS GSPKLINEKG VREKLGVNPN KVIDLKALTG DSSDNIPGVK GVGPKTAINL LNENLDLDGV YKSLQELEKE GEKAKRGAIK GAVRLKLKAD KDNAYLSKKL AEILIKIPID PKVNYNLEGI NESKLAENLE RLELHSLSKQ VSTFKAIFSK DGSSKKDLNP SSKEINIANN KTKNSEFSTL NETKEIPRIE PKIINNLEEL NNFVAQIIKH TDAKKPIAID TETTSLNPFK AELVGLGFCF GESLKDIVYI PIGHKNKEDD LIEINQINQL KIEEVIFALQ DWFSSNENPK TLQNAKYDRL ILLRHGIILN GVVMDTLLAD YICDATLKHS LDEIAYREFG FKPKSFSDIV KKGEDFSYVD IKSASMYCGM DVYLTRKLAI IYINRLKETS IKLINLLKEV EQPLEQVLAE IESTGIIIDT PYLKDLSLEL TKRLNTIEKE VYNIAGSEFN LSSPKQLGEL LFEKLDLDRK KSRKTKTGWS TDVAVLEKLE SDHPIVKIII EHRTISKLLN TYVDALPKLI EKETGRVHTD FNQAVTATGR LSSSNPNLQN IPVRTEFSRR IRKAFLPQKD WKLLSADYSQ IELRILTHLS GEEVLKNAYL KNEDVHSLTA KILFEKDAID ADERRIGKTI NFGVIYGMGA QRFARSTGVS LIEAKYFLSK FKERYPAVFK FLEYQERLAL SQGFVETLLG RRRYFHFNKN GLGRLLGTPP NEIDLTTARR AGMEAQQLRA AANAPIQGSS ADIIKLAMIQ LHSALRETGL AAKILLQVHD ELVLEVNPKD LEETKLLVQN TMENAVKLSV PLIVETGVGV NWMEAK
|
| |