Gene P9301_13291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_13291 
SymbolpolA 
ID4910918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1108559 
End bp1111489 
Gene Length2931 bp 
Protein Length976 aa 
Translation table11 
GC content32% 
IMG OID640160918 
ProductDNA polymerase I 
Protein accessionYP_001091553 
Protein GI126696667 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTAA CATCTGAAAA CTCTAAAAAA CCAATTTTAC TTTTAGTCGA TGGTCATTCA 
CTTGCTTTTA GAAGCTTCTA TGCATTTAGC AAAGGGATTG ATGGAGGTTT AACAACCAAA
GATGGATTCC CAACAAGTGT GACTTATGGA TTTCTAAAAA GCCTTCTTGA TAATTGCAAA
AATATTTGTC CTGAGGGCGT TTGTATTACT TTTGATACCG AAAAACCTAC TTTCAGGCAT
GAATTAGATC CAAATTATAA GGCCAATAGA GATGTAGCAC CAGATGTTTT TTTTCAGGAT
ATTGAACAAC TAGAAATCAT TTTAGAAGAA AGTCTTAATT TGCCAATTTT TAAATCTCCT
GGTTATGAAG CAGATGATCT CCTAGGCACA ATTGCAAATG ATGCTTCATC TAAAGGATGG
TGCGTGAATA TTCTTTCTGG GGATCGGGAC TTATTTCAAT TAGTAGATGA TCAAAAAGAT
ATTTATGTAC TTTATATGGG AGGTGGTCCA TATGCAAAAA GTGGGAATCC AACTCTTATG
AATGAAAATG GAGTAAAAGA AAAATTAGGT GTCGCTCCAG AAAGAGTAGT TGATCTTAAA
GCTCTAACTG GTGATAGTTC TGATAATATT CCTGGTATTA AAGGAGTAGG TCCAAAAACT
GCAATTAATC TTCTAAAAGA AAACGATACA CTTGATGGAA TTTATCAGGC TTTGGAGAAG
ATTCAGCAGA ATAATGATAA AAAATATAAA GGATTCATCA AAGGTTCGGT TATAGAAAAG
CTCAGAAACG ATAAGCATAA TGCTTTTCTC TCCAGAGATT TAGCAAAAAT AAATACTGAA
GTGCCTTTGA TATTAAGTGA TGGTTATGAA TTAAAAAATA TAAATCAAGA ACTACTTTCA
GAGTCACTGC AAAAACTTGA ATTATCAACA CTACTTCGGC AAATTGATAT TTTCAATTCA
ACTTTCAGCA AAGGTGGTTT TGGAAAAAAT AATTTGGTTA GCAAGGAGGA GAAGGTTCCA
AAGATCTTAA GTAACAATGA ATTAGAAAAT AGTGAAAATA AAATCCCCAA AATTAAGGTA
ATTATTGTAA ATGATTTCGA ATTACTTGAT AAATTAATTA AAAGATTAAA CAAGACAAAT
CAAATAGTTT CTTTAGATAC AGAGACTAAT AGTTTGAATC CAATCGATGC AGAACTTGTT
GGGATAGGGT TATGTCTTGG AGAAGAAAAT GATGATTTAT TTTATATACC TCTTGGTCAT
CAAATAAAAA AAGAGACCCC CAATCAATTA TCGATTGAAG ATGTTTTCTC AAAACTAAGA
ACTTGGATAG AAGATCCGAA AAAAGAAAAG GCACTCCAAA ATTCTAAATT TGATAGGCAA
ATATTTTTTA ATCATGGACT TGATCTTAAA GGCGTAACCT TTGACACCTT GCTAGCAGAC
TACCTTCTTA ATAACCAGGA GAAGCATGGA TTAAGTGAAA TTAGTTTTAG ATTATTTGGA
TTTAAGCCCC CTTCATTTAA AGAAACAGTT GGGAAAAATA AAGACTTTTC ATTTGTTGAT
ATTAATGAAG CAAGTATTTA CTGCGGTTAT GATGTTTTTC TAACTTTTAA GATTGTCAAA
ATTTTTAAAG AAAGATTTTC AAAGGAAAAA GATGAATTAA TCAAATTGTT CAAAGAAATC
GAGCTACCCT TAGAGCCGGT ATTGTCTCAA ATGGAAATGA ATGGCATAAC TATCGATATA
CCTTATTTGG ATAAACTCTC AAAGGAACTA AAAAGTACCT TAGAAGATAT TGAAAATAAA
GTTTTTGAAT TAGCAGATGA AAATTTTAAT TTATCTTCAC CAAAACAACT CGGTGAGATC
TTATTTGAAA AATTAAATTT GGATAAAAAG AAATCACGCA AAACAAAAAC AGGATGGAGC
ACAGATGCTT TAGTTCTTGA AAGATTAGTC GACGAACATG AAATAATCCA ACATTTAATA
AAGCACAGAA CTATTAGCAA ATTACTTAGC ACCTATATTG ATGCTCTTCC AAATCTTATA
AACGAAAAGA CAGGAAGAGT TCATACAAAC TTTAATCAAG CTGCTACAGC GACTGGGAGA
CTAAGTAGTA GTAATCCTAA TCTTCAAAAT ATTCCGGTTA GGACTGAATT TAGTAGGAGG
ATCAGAAAAG CATTCTTGCC TGAAAAAAAT TGGAAACTTT TATCAGCTGA TTATTCTCAG
ATCGAATTAA GAATACTTGC TCACTTAGCG GATGAAGAAA TATTAATTAA TGCATTCCAT
AAAAATGACG ACATTCATTC TCTGACTGCA AGATTAATTT TTGAGAAAGA AGAAATTTCT
TCTGATGAGA GGAGAGTTGG GAAAACAATA AATTTCGGAG TTATCTATGG TATGGGAATT
AAAAAGTTTG CTCGTTCTAC AGGAGTAAGT ACTCCTGAAG CAAAAGAATT CCTAATAAAA
TACAAAGAAA GATATTCAAA AATTTTCAAA TTTCTTGAAC TTCAAGAAAG GCTTGCCTTA
TCAAAAGGTT ATGTAAAAAC AATTTTTGGT AGAAAGAGAG AATTTAAGTT TGATAAAAAT
GGACTTGGAA GACTAATAGG AAAAGATCCT TACGAAATTG ACTTGCAATC CGCAAGAAGA
GCTGGCATGG AAGCACAGTC ACTAAGAGCC GCAGCCAATG CCCCAATTCA GGGTTCAAGT
GCAGATATTA TTAAAATTGC AATGGTTCAA CTAAATAAGA AATTCATAGA AATGAACTTT
CCAGCAAAAA TGCTTTTACA AGTACATGAT GAATTATTGT TTGAAGTTGA ACCAGATTCT
TTGGAAATTA CGACGAAATT AGTGAAGAAG ACTATGGAAA ATTGTGTAAA ATTAAATGTA
CCTCTTTTGG TTGATATTGG AATTGGAGAT AATTGGATGG AGACAAAATA A
 
Protein sequence
MSLTSENSKK PILLLVDGHS LAFRSFYAFS KGIDGGLTTK DGFPTSVTYG FLKSLLDNCK 
NICPEGVCIT FDTEKPTFRH ELDPNYKANR DVAPDVFFQD IEQLEIILEE SLNLPIFKSP
GYEADDLLGT IANDASSKGW CVNILSGDRD LFQLVDDQKD IYVLYMGGGP YAKSGNPTLM
NENGVKEKLG VAPERVVDLK ALTGDSSDNI PGIKGVGPKT AINLLKENDT LDGIYQALEK
IQQNNDKKYK GFIKGSVIEK LRNDKHNAFL SRDLAKINTE VPLILSDGYE LKNINQELLS
ESLQKLELST LLRQIDIFNS TFSKGGFGKN NLVSKEEKVP KILSNNELEN SENKIPKIKV
IIVNDFELLD KLIKRLNKTN QIVSLDTETN SLNPIDAELV GIGLCLGEEN DDLFYIPLGH
QIKKETPNQL SIEDVFSKLR TWIEDPKKEK ALQNSKFDRQ IFFNHGLDLK GVTFDTLLAD
YLLNNQEKHG LSEISFRLFG FKPPSFKETV GKNKDFSFVD INEASIYCGY DVFLTFKIVK
IFKERFSKEK DELIKLFKEI ELPLEPVLSQ MEMNGITIDI PYLDKLSKEL KSTLEDIENK
VFELADENFN LSSPKQLGEI LFEKLNLDKK KSRKTKTGWS TDALVLERLV DEHEIIQHLI
KHRTISKLLS TYIDALPNLI NEKTGRVHTN FNQAATATGR LSSSNPNLQN IPVRTEFSRR
IRKAFLPEKN WKLLSADYSQ IELRILAHLA DEEILINAFH KNDDIHSLTA RLIFEKEEIS
SDERRVGKTI NFGVIYGMGI KKFARSTGVS TPEAKEFLIK YKERYSKIFK FLELQERLAL
SKGYVKTIFG RKREFKFDKN GLGRLIGKDP YEIDLQSARR AGMEAQSLRA AANAPIQGSS
ADIIKIAMVQ LNKKFIEMNF PAKMLLQVHD ELLFEVEPDS LEITTKLVKK TMENCVKLNV
PLLVDIGIGD NWMETK