Gene PMN2A_0748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_0748 
Symbol 
ID3606126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp1256907 
End bp1259867 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content34% 
IMG OID637687611 
ProductDNA polymerase I 
Protein accessionYP_291942 
Protein GI72382587 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.100174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAAA TAAAAAAGCC AACTTTATTA TTGGTTGATG GCCATTCATT AGCTTTTAGA 
AGTTTTTATG CTTTTAGTAA AGGTGGAGAA GGAGGTCTTA CCACCAAAGA TGGATTTCCC
ACGAGTGTTA CCTATGGTTT TCTAAAAAGC CTTTTAGATA ATTGCAAATC TATCGAACCC
AAAGGGGTCA CAATTGCTTT TGACACTGCT GAGCCAACAT TTCGCCACAA GGAAGATCCA
AACTACAAAG CCAATCGAGA TGTAGCACCA GATATATTTT TTCAAGATTT AGATCAACTT
GAAGAGATTC TCAAAGAAAG TTTGAATCTT TCAATCTGCA AAGCCCCGGG ATATGAGGCA
GATGATGTCT TGGGAACACT CGCAAATGAT GCAGCTGAAA AAGGATGGAG TGTCAGAATC
CTTTCTGGAG ATAGAGACCT ATTCCAATTA GTGGATGATG AAAGAGACAT AGCTGTCTTG
TACATGGGTG GAGGTCCTTA TGCAAAAAGT GGAAGTCCTA AACTGATTAA TGAAAAAGGC
GTAAGGGAGA AACTCGGAGT CAATCCAAAC AAAGTTATTG ATCTGAAAGC CTTAACTGGT
GATAGCTCAG ACAATATTCC AGGTGTTAAA GGAGTTGGTC CTAAAACAGC AATAAATCTT
TTAAACGAGA ACCTTGATCT TGATGGAGTC TATAAATCAC TTCAAGAGTT AGAAAAAGAA
GGCGAGAAAG CAAAACGAGG AGCGATAAAA GGAGCAGTTA GATTAAAATT AAAAGCAGAT
AAAGACAATG CCTATCTCTC AAAAAAACTT GCAGAGATAT TAATTAAAAT TCCCATTGAT
CCAAAAGTAA ACTATAACTT AGAAGGAATT AACGAATCAA AGCTAGCTGA AAACCTTGAG
AGGCTTGAGT TACATAGTTT ATCAAAACAA GTTTCAACTT TTAAAGCTAT ATTTTCTAAA
GATGGATTAT CAAAAAAAGA TTTAAATCCC TCATCAAAAG AAATTAATAT CGCTAACGAT
AAGACTAAAA ATAGTGAATT GAGTACTCTT AATGAAACGA AAGAGATCCC CAAGATAGAA
CCCAAGATAA TAGACAATCT GGAAGAACTT AATAACTTTG TTGCACAAAT AATGAAACAT
ACTGATTCGA CAAAACCAAT AGCCATTGAT ACAGAAACAA CGAGCTTGAA TCCATTTAAA
GCTGAACTTG TTGGATTAGG ATTTTGTTTT GGCGAATCGT TAAAAGATAT AGTTTATATA
CCAATAGGAC ATCAAAATAA AGAGGGCGAT TTAATAAAAA TTAATCAAAT AAATCAATTA
AAGATTGAAG AAGTTATCTT TGCACTTCAG GATTGGTTCT CTAGCAATGA AAATCATAAA
ACCCTACAGA ATGCAAAATA CGACCGACTG ATTTTATTAA GACACGGAAT TATACTAAAC
GGAGTTGTAA TTGATACTTT ACTTGCAGAT TATATATTTG ATGCAACCCT TAAACATAGT
TTAGATGAAA TTGCTTATAG AGAATTTGGA TTTAAACCCA AAAGTTTTTC TGATGTTGTC
AAAAAAGGAG AAGACTTCTC TTATGTGGAC ATTAAGTCTG CGAGTATGTA CTGCGGGATG
GACGTTTATT TAACAAGAAA ATTAGCAATT ATTTATATTA ATAGATTAAA AGAAACAAGT
ACAAAATTAA TAAACTTACT CAAAGAAGTT GAACAACCAC TCGAGCAAGT ATTAGCAGAA
ATTGAATCGA CCGGTATCAT TATTGATACT CCTTATCTAA AAGATTTATC TTTAGAGTTA
ACAAAAAAAT TAAATACTAT CGAGAAAGAA GTTTATAATA TTGCAGGAAG TGAGTTCAAC
CTTTCATCAC CTAAACAGTT AGGTGAATTA CTTTTTGAGA AACTTGATTT AGATAGGAAA
AAATCAAGAA AAACAAAAAC AGGATGGAGT ACAGATGTAG CTGTATTGGA AAAGCTGGAA
TCAGACCATC CCATAGTAAA AATGATCATT GAACATCGCA CTATAAGCAA GTTGCTTAAT
ACTTATGTAG ATGCTTTGCC TAAGCTTATT GAAAAAGAAA CAGGAAGAGT ACATACAGAT
TTCAATCAAG CCGTAACCGC TACTGGAAGA CTAAGTAGCA GCAATCCAAA TCTGCAAAAT
ATCCCTGTCA GAACTGAATT TAGTAGACGA ATAAGAAAAG CCTTTCTTCC GCAAAAAGAT
TGGAAACTTC TAAGTGCAGA CTATTCACAA ATTGAACTTC GTATACTCAC ACATCTTTCA
GGTGAAGAAG TACTAAAAAA TGCTTATTTA AAAAATGAAG ATGTCCACTC TTTAACAGCA
AAACTTTTGT TTGAAAAAGA TGCTATTGAC GCCGATGAAA GAAGAATAGG AAAAACAATA
AATTTTGGGG TTATTTATGG TATGGGGGCT CAAAGGTTTG CAAGATCAAC GGGCGTTTCA
TTAATAGAAG CAAAATATTT TTTAAGTAAA TTCAAAGAAC GTTATCCAGC CGTTTTTAAT
TTTTTAGAAT ATCAAGAAAG ACTTGCCTTA AGCCAAGGGT TTGTTGAAAC ATTGCTAGGG
AGAAGACGAT ATTTTCATTT CAATAAAAAT GGGCTTGGAA GACTTCTGGG AACTCCACCA
AATGAAATTG ATTTAACTAC TGCCAGAAGA GCAGGGATGG AAGCACAACA ATTAAGAGCC
GCAGCAAACG CACCTATTCA AGGCTCAAGT GCAGACATAA TTAAGCTAGC AATGATTCAG
TTGCATTCAG CTTTAAGAGA GACTGGATTG GCAGCGAAAA TTCTACTTCA AGTGCATGAT
GAACTCGTGC TAGAAGTTAA TCCCAAAGAT TTGGAGGAAA CAAAACTTCT TGTCAAAAAC
ACTATGGAAA ATGCTGTAAA ACTTAGTGTC CCTCTTATCG TTGAAACTGG CGTTGGAGTA
AATTGGATGG AAGCAAAATA G
 
Protein sequence
MTEIKKPTLL LVDGHSLAFR SFYAFSKGGE GGLTTKDGFP TSVTYGFLKS LLDNCKSIEP 
KGVTIAFDTA EPTFRHKEDP NYKANRDVAP DIFFQDLDQL EEILKESLNL SICKAPGYEA
DDVLGTLAND AAEKGWSVRI LSGDRDLFQL VDDERDIAVL YMGGGPYAKS GSPKLINEKG
VREKLGVNPN KVIDLKALTG DSSDNIPGVK GVGPKTAINL LNENLDLDGV YKSLQELEKE
GEKAKRGAIK GAVRLKLKAD KDNAYLSKKL AEILIKIPID PKVNYNLEGI NESKLAENLE
RLELHSLSKQ VSTFKAIFSK DGLSKKDLNP SSKEINIAND KTKNSELSTL NETKEIPKIE
PKIIDNLEEL NNFVAQIMKH TDSTKPIAID TETTSLNPFK AELVGLGFCF GESLKDIVYI
PIGHQNKEGD LIKINQINQL KIEEVIFALQ DWFSSNENHK TLQNAKYDRL ILLRHGIILN
GVVIDTLLAD YIFDATLKHS LDEIAYREFG FKPKSFSDVV KKGEDFSYVD IKSASMYCGM
DVYLTRKLAI IYINRLKETS TKLINLLKEV EQPLEQVLAE IESTGIIIDT PYLKDLSLEL
TKKLNTIEKE VYNIAGSEFN LSSPKQLGEL LFEKLDLDRK KSRKTKTGWS TDVAVLEKLE
SDHPIVKMII EHRTISKLLN TYVDALPKLI EKETGRVHTD FNQAVTATGR LSSSNPNLQN
IPVRTEFSRR IRKAFLPQKD WKLLSADYSQ IELRILTHLS GEEVLKNAYL KNEDVHSLTA
KLLFEKDAID ADERRIGKTI NFGVIYGMGA QRFARSTGVS LIEAKYFLSK FKERYPAVFN
FLEYQERLAL SQGFVETLLG RRRYFHFNKN GLGRLLGTPP NEIDLTTARR AGMEAQQLRA
AANAPIQGSS ADIIKLAMIQ LHSALRETGL AAKILLQVHD ELVLEVNPKD LEETKLLVKN
TMENAVKLSV PLIVETGVGV NWMEAK