Gene NATL1_15881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15881 
SymbolpolA 
ID4779359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1295257 
End bp1298217 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content34% 
IMG OID640084870 
ProductDNA polymerase I 
Protein accessionYP_001015410 
Protein GI124026294 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.65374 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAAA TAAAAAAGCC AACTTTATTA TTGGTTGATG GCCATTCACT AGCCTTTAGA 
AGTTTTTATG CTTTTAGCAA AGGAGGTGAA GGAGGACTTA CCACCAAAGA TGGATTCCCT
ACGAGTGTTA CCTATGGTTT TCTAAAAAGC CTTTTAGACA ATTGCAAATC TATCGAACCC
AAAGGGGTCA CAATTGCTTT TGACACTGCT GAGCCAACAT TTCGCCACAA GGAAGATCCA
AACTACAAAG CCAATCGAGA TGTAGCACCA GATATATTTT TTCAAGATTT AGATCAACTT
GAAGAGATTC TCAAAGAAAG TTTGAATCTT TCAATCTGCA AGGCCCCAGG ATATGAGGCA
GATGATGTCT TGGGAACACT CGCAAATGAT GCGGCTGAAA AAGGATGGAG TGTCAGAATC
CTTTCTGGAG ATAGAGACCT ATTCCAATTA GTGGATGATG AAAGAGACAT AGCTGTCTTG
TACATGGGTG GAGGTCCTTA TGCAAAAAGT GGAAGTCCTA AACTGATTAA TGAAAAAGGC
GTAAGGGAAA AACTCGGAGT AAATCCAAAC AAAGTTATTG ATCTGAAAGC CTTAACTGGT
GATAGCTCAG ACAATATTCC AGGTGTTAAA GGAGTTGGTC CTAAAACAGC AATAAATCTT
TTAAACGAGA ACCTTGATCT TGATGGAGTC TATAAATCAC TTCAAGAGTT AGAAAAAGAA
GGCGAGAAAG CTAAACGAGG AGCGATAAAA GGAGCAGTTA GATTAAAATT AAAAGCAGAT
AAAGACAATG CCTATCTCTC AAAAAAACTT GCAGAGATAT TAATTAAAAT TCCCATTGAT
CCAAAAGTAA ACTATAATTT AGAAGGAATT AACGAATCAA AGCTAGCTGA AAATCTTGAG
AGGCTTGAGT TACATAGTTT ATCAAAACAA GTTTCAACTT TTAAAGCTAT ATTTTCCAAA
GATGGATCAT CAAAAAAAGA TTTAAACCCC TCATCAAAAG AAATTAATAT CGCTAATAAT
AAGACTAAAA ATAGTGAATT TAGTACTCTT AATGAAACGA AAGAGATCCC TAGGATAGAA
CCTAAGATAA TAAACAATCT GGAAGAACTT AATAACTTTG TTGCACAAAT AATTAAACAT
ACTGATGCGA AAAAACCAAT AGCGATTGAT ACAGAAACAA CGAGTCTGAA TCCCTTTAAA
GCTGAACTTG TTGGTTTAGG ATTTTGTTTT GGTGAATCGT TAAAAGATAT AGTTTATATA
CCAATAGGTC ATAAAAACAA AGAGGACGAT TTAATAGAAA TTAATCAAAT AAATCAATTA
AAGATTGAAG AAGTTATCTT TGCACTTCAG GATTGGTTCT CTAGCAATGA AAATCCTAAA
ACCCTACAAA ATGCAAAATA CGACCGACTG ATTTTGCTGA GACACGGAAT TATATTAAAC
GGAGTTGTGA TGGATACTTT ACTTGCAGAT TATATATGTG ATGCAACCCT TAAACATAGT
TTAGATGAAA TTGCTTATAG AGAATTTGGA TTTAAGCCCA AAAGTTTTTC TGATATTGTC
AAAAAAGGAG AAGACTTCTC TTATGTAGAC ATTAAGTCTG CGAGTATGTA CTGCGGGATG
GACGTTTATT TAACAAGAAA ATTAGCAATT ATTTATATTA ATAGATTAAA AGAAACAAGT
ATAAAATTAA TAAACTTACT CAAAGAAGTT GAACAACCAC TTGAGCAAGT ATTAGCGGAA
ATTGAATCGA CCGGTATCAT TATTGATACT CCTTATCTAA AAGATTTATC TTTAGAGTTA
ACAAAAAGAT TAAATACTAT TGAGAAAGAA GTTTATAATA TTGCAGGAAG TGAGTTCAAC
CTTTCATCAC CTAAACAGTT AGGTGAATTA CTTTTTGAGA AACTTGATTT AGATAGGAAG
AAATCAAGAA AAACAAAAAC AGGATGGAGT ACAGATGTAG CTGTATTGGA AAAGCTGGAA
TCAGACCATC CAATAGTAAA AATAATCATT GAACATCGCA CTATAAGCAA GTTGCTTAAT
ACTTATGTAG ATGCTTTGCC TAAGCTTATT GAAAAAGAAA CAGGAAGAGT ACACACAGAT
TTCAATCAAG CCGTAACCGC TACTGGAAGA TTAAGTAGCA GCAATCCAAA TCTGCAAAAT
ATCCCTGTCA GAACTGAATT TAGTAGACGA ATAAGAAAAG CCTTTCTTCC GCAAAAAGAT
TGGAAACTTC TAAGCGCAGA CTATTCACAA ATTGAACTTC GTATACTCAC ACATCTTTCA
GGTGAAGAAG TACTAAAAAA TGCTTATTTA AAAAATGAAG ATGTCCACTC TTTAACAGCA
AAAATTTTGT TTGAAAAAGA TGCTATTGAT GCCGATGAAA GAAGAATAGG AAAAACAATA
AATTTTGGGG TTATTTATGG TATGGGAGCT CAAAGGTTTG CAAGATCAAC GGGTGTTTCA
TTAATAGAAG CAAAATATTT TTTAAGTAAA TTCAAAGAAC GTTATCCAGC CGTTTTTAAA
TTTTTAGAAT ATCAAGAAAG ACTTGCCTTA AGCCAAGGGT TTGTTGAAAC ATTGCTAGGG
AGAAGACGAT ATTTTCATTT CAATAAAAAT GGACTTGGAA GACTTCTGGG AACTCCACCA
AATGAAATTG ATTTAACCAC TGCAAGAAGA GCAGGGATGG AAGCACAACA ATTAAGAGCC
GCAGCAAACG CACCTATTCA AGGCTCAAGT GCTGACATAA TTAAGCTAGC AATGATTCAG
TTGCATTCAG CTTTAAGAGA GACTGGATTG GCCGCGAAAA TTCTACTTCA AGTGCATGAT
GAACTCGTGC TAGAAGTTAA TCCCAAAGAT TTGGAGGAAA CAAAACTTCT TGTCCAAAAT
ACTATGGAAA ATGCTGTAAA ACTTAGTGTC CCTCTTATCG TTGAAACTGG CGTTGGAGTA
AATTGGATGG AGGCAAAATA G
 
Protein sequence
MTEIKKPTLL LVDGHSLAFR SFYAFSKGGE GGLTTKDGFP TSVTYGFLKS LLDNCKSIEP 
KGVTIAFDTA EPTFRHKEDP NYKANRDVAP DIFFQDLDQL EEILKESLNL SICKAPGYEA
DDVLGTLAND AAEKGWSVRI LSGDRDLFQL VDDERDIAVL YMGGGPYAKS GSPKLINEKG
VREKLGVNPN KVIDLKALTG DSSDNIPGVK GVGPKTAINL LNENLDLDGV YKSLQELEKE
GEKAKRGAIK GAVRLKLKAD KDNAYLSKKL AEILIKIPID PKVNYNLEGI NESKLAENLE
RLELHSLSKQ VSTFKAIFSK DGSSKKDLNP SSKEINIANN KTKNSEFSTL NETKEIPRIE
PKIINNLEEL NNFVAQIIKH TDAKKPIAID TETTSLNPFK AELVGLGFCF GESLKDIVYI
PIGHKNKEDD LIEINQINQL KIEEVIFALQ DWFSSNENPK TLQNAKYDRL ILLRHGIILN
GVVMDTLLAD YICDATLKHS LDEIAYREFG FKPKSFSDIV KKGEDFSYVD IKSASMYCGM
DVYLTRKLAI IYINRLKETS IKLINLLKEV EQPLEQVLAE IESTGIIIDT PYLKDLSLEL
TKRLNTIEKE VYNIAGSEFN LSSPKQLGEL LFEKLDLDRK KSRKTKTGWS TDVAVLEKLE
SDHPIVKIII EHRTISKLLN TYVDALPKLI EKETGRVHTD FNQAVTATGR LSSSNPNLQN
IPVRTEFSRR IRKAFLPQKD WKLLSADYSQ IELRILTHLS GEEVLKNAYL KNEDVHSLTA
KILFEKDAID ADERRIGKTI NFGVIYGMGA QRFARSTGVS LIEAKYFLSK FKERYPAVFK
FLEYQERLAL SQGFVETLLG RRRYFHFNKN GLGRLLGTPP NEIDLTTARR AGMEAQQLRA
AANAPIQGSS ADIIKLAMIQ LHSALRETGL AAKILLQVHD ELVLEVNPKD LEETKLLVQN
TMENAVKLSV PLIVETGVGV NWMEAK