Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMN2A_0748 |
Symbol | |
ID | 3606126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | + |
Start bp | 1256907 |
End bp | 1259867 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637687611 |
Product | DNA polymerase I |
Protein accession | YP_291942 |
Protein GI | 72382587 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.100174 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGAAA TAAAAAAGCC AACTTTATTA TTGGTTGATG GCCATTCATT AGCTTTTAGA AGTTTTTATG CTTTTAGTAA AGGTGGAGAA GGAGGTCTTA CCACCAAAGA TGGATTTCCC ACGAGTGTTA CCTATGGTTT TCTAAAAAGC CTTTTAGATA ATTGCAAATC TATCGAACCC AAAGGGGTCA CAATTGCTTT TGACACTGCT GAGCCAACAT TTCGCCACAA GGAAGATCCA AACTACAAAG CCAATCGAGA TGTAGCACCA GATATATTTT TTCAAGATTT AGATCAACTT GAAGAGATTC TCAAAGAAAG TTTGAATCTT TCAATCTGCA AAGCCCCGGG ATATGAGGCA GATGATGTCT TGGGAACACT CGCAAATGAT GCAGCTGAAA AAGGATGGAG TGTCAGAATC CTTTCTGGAG ATAGAGACCT ATTCCAATTA GTGGATGATG AAAGAGACAT AGCTGTCTTG TACATGGGTG GAGGTCCTTA TGCAAAAAGT GGAAGTCCTA AACTGATTAA TGAAAAAGGC GTAAGGGAGA AACTCGGAGT CAATCCAAAC AAAGTTATTG ATCTGAAAGC CTTAACTGGT GATAGCTCAG ACAATATTCC AGGTGTTAAA GGAGTTGGTC CTAAAACAGC AATAAATCTT TTAAACGAGA ACCTTGATCT TGATGGAGTC TATAAATCAC TTCAAGAGTT AGAAAAAGAA GGCGAGAAAG CAAAACGAGG AGCGATAAAA GGAGCAGTTA GATTAAAATT AAAAGCAGAT AAAGACAATG CCTATCTCTC AAAAAAACTT GCAGAGATAT TAATTAAAAT TCCCATTGAT CCAAAAGTAA ACTATAACTT AGAAGGAATT AACGAATCAA AGCTAGCTGA AAACCTTGAG AGGCTTGAGT TACATAGTTT ATCAAAACAA GTTTCAACTT TTAAAGCTAT ATTTTCTAAA GATGGATTAT CAAAAAAAGA TTTAAATCCC TCATCAAAAG AAATTAATAT CGCTAACGAT AAGACTAAAA ATAGTGAATT GAGTACTCTT AATGAAACGA AAGAGATCCC CAAGATAGAA CCCAAGATAA TAGACAATCT GGAAGAACTT AATAACTTTG TTGCACAAAT AATGAAACAT ACTGATTCGA CAAAACCAAT AGCCATTGAT ACAGAAACAA CGAGCTTGAA TCCATTTAAA GCTGAACTTG TTGGATTAGG ATTTTGTTTT GGCGAATCGT TAAAAGATAT AGTTTATATA CCAATAGGAC ATCAAAATAA AGAGGGCGAT TTAATAAAAA TTAATCAAAT AAATCAATTA AAGATTGAAG AAGTTATCTT TGCACTTCAG GATTGGTTCT CTAGCAATGA AAATCATAAA ACCCTACAGA ATGCAAAATA CGACCGACTG ATTTTATTAA GACACGGAAT TATACTAAAC GGAGTTGTAA TTGATACTTT ACTTGCAGAT TATATATTTG ATGCAACCCT TAAACATAGT TTAGATGAAA TTGCTTATAG AGAATTTGGA TTTAAACCCA AAAGTTTTTC TGATGTTGTC AAAAAAGGAG AAGACTTCTC TTATGTGGAC ATTAAGTCTG CGAGTATGTA CTGCGGGATG GACGTTTATT TAACAAGAAA ATTAGCAATT ATTTATATTA ATAGATTAAA AGAAACAAGT ACAAAATTAA TAAACTTACT CAAAGAAGTT GAACAACCAC TCGAGCAAGT ATTAGCAGAA ATTGAATCGA CCGGTATCAT TATTGATACT CCTTATCTAA AAGATTTATC TTTAGAGTTA ACAAAAAAAT TAAATACTAT CGAGAAAGAA GTTTATAATA TTGCAGGAAG TGAGTTCAAC CTTTCATCAC CTAAACAGTT AGGTGAATTA CTTTTTGAGA AACTTGATTT AGATAGGAAA AAATCAAGAA AAACAAAAAC AGGATGGAGT ACAGATGTAG CTGTATTGGA AAAGCTGGAA TCAGACCATC CCATAGTAAA AATGATCATT GAACATCGCA CTATAAGCAA GTTGCTTAAT ACTTATGTAG ATGCTTTGCC TAAGCTTATT GAAAAAGAAA CAGGAAGAGT ACATACAGAT TTCAATCAAG CCGTAACCGC TACTGGAAGA CTAAGTAGCA GCAATCCAAA TCTGCAAAAT ATCCCTGTCA GAACTGAATT TAGTAGACGA ATAAGAAAAG CCTTTCTTCC GCAAAAAGAT TGGAAACTTC TAAGTGCAGA CTATTCACAA ATTGAACTTC GTATACTCAC ACATCTTTCA GGTGAAGAAG TACTAAAAAA TGCTTATTTA AAAAATGAAG ATGTCCACTC TTTAACAGCA AAACTTTTGT TTGAAAAAGA TGCTATTGAC GCCGATGAAA GAAGAATAGG AAAAACAATA AATTTTGGGG TTATTTATGG TATGGGGGCT CAAAGGTTTG CAAGATCAAC GGGCGTTTCA TTAATAGAAG CAAAATATTT TTTAAGTAAA TTCAAAGAAC GTTATCCAGC CGTTTTTAAT TTTTTAGAAT ATCAAGAAAG ACTTGCCTTA AGCCAAGGGT TTGTTGAAAC ATTGCTAGGG AGAAGACGAT ATTTTCATTT CAATAAAAAT GGGCTTGGAA GACTTCTGGG AACTCCACCA AATGAAATTG ATTTAACTAC TGCCAGAAGA GCAGGGATGG AAGCACAACA ATTAAGAGCC GCAGCAAACG CACCTATTCA AGGCTCAAGT GCAGACATAA TTAAGCTAGC AATGATTCAG TTGCATTCAG CTTTAAGAGA GACTGGATTG GCAGCGAAAA TTCTACTTCA AGTGCATGAT GAACTCGTGC TAGAAGTTAA TCCCAAAGAT TTGGAGGAAA CAAAACTTCT TGTCAAAAAC ACTATGGAAA ATGCTGTAAA ACTTAGTGTC CCTCTTATCG TTGAAACTGG CGTTGGAGTA AATTGGATGG AAGCAAAATA G
|
Protein sequence | MTEIKKPTLL LVDGHSLAFR SFYAFSKGGE GGLTTKDGFP TSVTYGFLKS LLDNCKSIEP KGVTIAFDTA EPTFRHKEDP NYKANRDVAP DIFFQDLDQL EEILKESLNL SICKAPGYEA DDVLGTLAND AAEKGWSVRI LSGDRDLFQL VDDERDIAVL YMGGGPYAKS GSPKLINEKG VREKLGVNPN KVIDLKALTG DSSDNIPGVK GVGPKTAINL LNENLDLDGV YKSLQELEKE GEKAKRGAIK GAVRLKLKAD KDNAYLSKKL AEILIKIPID PKVNYNLEGI NESKLAENLE RLELHSLSKQ VSTFKAIFSK DGLSKKDLNP SSKEINIAND KTKNSELSTL NETKEIPKIE PKIIDNLEEL NNFVAQIMKH TDSTKPIAID TETTSLNPFK AELVGLGFCF GESLKDIVYI PIGHQNKEGD LIKINQINQL KIEEVIFALQ DWFSSNENHK TLQNAKYDRL ILLRHGIILN GVVIDTLLAD YIFDATLKHS LDEIAYREFG FKPKSFSDVV KKGEDFSYVD IKSASMYCGM DVYLTRKLAI IYINRLKETS TKLINLLKEV EQPLEQVLAE IESTGIIIDT PYLKDLSLEL TKKLNTIEKE VYNIAGSEFN LSSPKQLGEL LFEKLDLDRK KSRKTKTGWS TDVAVLEKLE SDHPIVKMII EHRTISKLLN TYVDALPKLI EKETGRVHTD FNQAVTATGR LSSSNPNLQN IPVRTEFSRR IRKAFLPQKD WKLLSADYSQ IELRILTHLS GEEVLKNAYL KNEDVHSLTA KLLFEKDAID ADERRIGKTI NFGVIYGMGA QRFARSTGVS LIEAKYFLSK FKERYPAVFN FLEYQERLAL SQGFVETLLG RRRYFHFNKN GLGRLLGTPP NEIDLTTARR AGMEAQQLRA AANAPIQGSS ADIIKLAMIQ LHSALRETGL AAKILLQVHD ELVLEVNPKD LEETKLLVKN TMENAVKLSV PLIVETGVGV NWMEAK
|
| |