Gene A9601_13151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_13151 
SymbolpolA 
ID4718034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1091354 
End bp1094284 
Gene Length2931 bp 
Protein Length976 aa 
Translation table11 
GC content33% 
IMG OID640079034 
ProductDNA polymerase I 
Protein accessionYP_001009706 
Protein GI123968848 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTAA AATCTGAAAA CTCTAAAAAA CCAATTTTAC TTTTAGTCGA TGGCCACTCA 
CTTGCTTTTA GAAGCTTCTA TGCATTTAGC AAAGGGATTG ATGGAGGTTT AACTACCAAA
GAGGGATTTC CAACAAGTGT CACTTATGGA TTTCTAAAGA GCCTTCTGGA TAATTGTAAA
AATATTAGTC CTGAGGGTGT TTGTATTACG TTTGATACCG AAAAACCTAC TTTCAGACAT
GAATTAGATC CAAATTATAA GGCCAATAGA GATGTAGCAC CAGATGTTTT TTTTCAGGAT
ATTGAACAAC TAGAAATCAT TTTAGAAGAA AGCCTTAATT TACCAATTTT CAAATCTCCA
GGATACGAAG CAGATGATCT CCTAGGCACA ATTGCAAATG ATGCTTCTTC TAAAGGATGG
TGCGTGAATA TTCTTTCTGG AGATAGGGAC TTATTTCAAT TAGTAGATGA TCAAAAAGAT
ATTTATGTGC TTTATATGGG TGGTGGTCCA TATGCGAAAA GTGGAAATCC AACTCTTATG
AATGAAAATG GAGTAAAAGA AAAATTAGGT GTTGCGCCAG AAAGAGTAGT TGATCTCAAA
GCCCTAACTG GTGATAGTTC TGATAATATT CCAGGTATTA AAGGAGTAGG GCCAAAAACT
GCAATTAATC TACTAAAAGA GAACGATACG CTTGATGGAA TCTATCAGGC TTTGGACAAG
ATTCAGCAGA ACAAAGATAA GAAATATAAA GGATTCATCA AAGGTTCAGT TATAGAAAAG
CTCAGAAACG ATAAACATAA TGCTTTTCTT TCCAGGGATT TAGCAAAAAT AAATACTGAG
GTGCCTTTAA TTTTAAGTAA CGGTTATGAA TTAAAAAATA TAAATCAAGA ACTACTTTCA
GAGTCACTGA AAAAATTAGA ACTATCAACA CTACTTAGAC AAATTGATAT TTTCAATTCA
ACTTTCAGCA AAGGTGGTTT TGACAAAAAT AATGTAGCTA AAGAGGAGGA GAAGGCACCA
AAAGTCGCAG GCAATAATGA ATTAGAAAAT AGTGAAAATA AAATCCCTAA AATCAACGTA
ACTGTTGTAA ATGATTTCGA ATTACTTGAT AAATTAATTC AAAGATTAGA CAAGACTAAT
CAAATAGTTT CTTTAGATAC AGAGACCAAT AGTTTGAATC CAATCGATGC GGAACTTGTT
GGGATAGGGT TATGTCTTGG AGAAGAAAAT GATGATTTAT TTTATATACC CCTTGGTCAT
CAAACAAAAA AGGAGACCCC CGATCAATTA TCAATTGAAG ATGTTTTCTC AAAGCTAAGA
AATTGGATAG AAGATCCAAA AAAAGAAAAG GCACTCCAAA ATTCTAAATT TGATAGGCAA
ATATTTTTTA ATCATGGACT TGATCTTAAA GGCGTAACCT TTGACACCTT ATTAGCAGAC
TACCTTCTTA ATAATCAGGA GAAACATGGG TTAAGTGAAA TTAGTTTTAG ATTATTTGGA
TTTAAGCCTC CTTCATTTAA GGAGACAGTT GGAAAAAATA AAGACTTTTC ATTTGTTGAT
ATTGATGAAG CAAGTATTTA CTGCGGTTAT GATGTTTTTC TAACTTTTAA GATTGTCAAA
ATTTTTAAAG AAAGTTTTTC AAAGGAAAAA GATGAATTAA TCAAATTGTT CGAAGAAATC
GAGCTACCTT TAGAGCCGGT ATTGTCCCAA ATGGAGATGA ATGGCATTAC GATCGACATC
CCTTATTTGG ATAAACTCTC AAAAGAATTA AAAAGTACCT TAGAAGATAT TGAAAGTAAA
GTTTATGAGT TAGCAGATGA AAGTTTCAAT CTATCTTCAC CAAAACAACT TGGTGAGATC
TTGTTTGAAA AATTAAATTT GGATAAGAAA AAATCACGGA AAACAAAAAC AGGATGGAGC
ACAGATGCAG TAGTTCTGGA AAGATTAGTC GACGAACATG AAATAATCCA ACATTTAATA
AAACATAGAA CTCTTAGCAA ATTACTTAGC ACCTATATTG ATGCTCTTCC AAATCTTATT
AACGAAAAGA CAGGAAGAGT TCATACAAAC TTTAATCAAG CTGCTACAGC GACTGGGAGA
CTAAGTAGTA GCAATCCTAA TCTTCAAAAT ATCCCTGTTA GGACTGAATT TAGTAGGAGA
ATCAGAAAAG CATTCTTGCC TGAAAAAAAT TGGAAACTTT TATCAGCTGA TTATTCTCAG
ATCGAATTAA GAATACTCGC TCACTTAGCG GATGAAGAAA TACTAATAAA TGCATTTCAT
AAAAATGATG ACATTCATTC TTTGACTGCA AGATTAATTT TTGAGAAAGA AGAAATTTCT
TCTGATGAGA GGAGAGTTGG GAAAACAATA AATTTCGGAG TTATCTATGG TATGGGAATT
AAAAAGTTTG CACGTTCTAC AGGAGTAAGT ACTCCAGAAG CAAAAGAATT CCTAATAAAA
TACAAAGAAA GATATTCAAA AATTTTCAAA TTTCTTGAAC TTCAAGAAAG GCTTGCCTTA
TCAAAAGGTT ATGTAAAAAC AATTTTTGGT AGAAAGAGAG AATTTAAGTT TGATAAAAAT
GGACTTGGAA GATTACTAGG AAAAGATCCT TACGAAATTG ACTTGCAAGC CGCAAGAAGA
GCTGGCATGG AAGCACAGTC ACTAAGAGCC GCAGCCAATG CCCCAATACA GGGTTCAAGT
GCAGATATTA TTAAAATTGC AATGGTTCAA CTAAATAAAA AATTCACAGA AATGAATGTT
CCAGCAAAAA TGCTTTTACA AGTACATGAT GAATTATTGT TTGAAGTCGA ACCAGATTCT
TTGGAAATTA CGACGAAATT AGTAAAGAAG ACTATGGAAG ATTGTGTAAA ATTAAATGTG
CCTCTTTTAG TTGATGTTGG AATTGGAGAC AATTGGATGG AGACAAAATA A
 
Protein sequence
MSLKSENSKK PILLLVDGHS LAFRSFYAFS KGIDGGLTTK EGFPTSVTYG FLKSLLDNCK 
NISPEGVCIT FDTEKPTFRH ELDPNYKANR DVAPDVFFQD IEQLEIILEE SLNLPIFKSP
GYEADDLLGT IANDASSKGW CVNILSGDRD LFQLVDDQKD IYVLYMGGGP YAKSGNPTLM
NENGVKEKLG VAPERVVDLK ALTGDSSDNI PGIKGVGPKT AINLLKENDT LDGIYQALDK
IQQNKDKKYK GFIKGSVIEK LRNDKHNAFL SRDLAKINTE VPLILSNGYE LKNINQELLS
ESLKKLELST LLRQIDIFNS TFSKGGFDKN NVAKEEEKAP KVAGNNELEN SENKIPKINV
TVVNDFELLD KLIQRLDKTN QIVSLDTETN SLNPIDAELV GIGLCLGEEN DDLFYIPLGH
QTKKETPDQL SIEDVFSKLR NWIEDPKKEK ALQNSKFDRQ IFFNHGLDLK GVTFDTLLAD
YLLNNQEKHG LSEISFRLFG FKPPSFKETV GKNKDFSFVD IDEASIYCGY DVFLTFKIVK
IFKESFSKEK DELIKLFEEI ELPLEPVLSQ MEMNGITIDI PYLDKLSKEL KSTLEDIESK
VYELADESFN LSSPKQLGEI LFEKLNLDKK KSRKTKTGWS TDAVVLERLV DEHEIIQHLI
KHRTLSKLLS TYIDALPNLI NEKTGRVHTN FNQAATATGR LSSSNPNLQN IPVRTEFSRR
IRKAFLPEKN WKLLSADYSQ IELRILAHLA DEEILINAFH KNDDIHSLTA RLIFEKEEIS
SDERRVGKTI NFGVIYGMGI KKFARSTGVS TPEAKEFLIK YKERYSKIFK FLELQERLAL
SKGYVKTIFG RKREFKFDKN GLGRLLGKDP YEIDLQAARR AGMEAQSLRA AANAPIQGSS
ADIIKIAMVQ LNKKFTEMNV PAKMLLQVHD ELLFEVEPDS LEITTKLVKK TMEDCVKLNV
PLLVDVGIGD NWMETK