Gene P9303_08691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_08691 
SymbolpolA 
ID4776935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp787413 
End bp790373 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content56% 
IMG OID640086378 
ProductDNA polymerase I 
Protein accessionYP_001016885 
Protein GI124022578 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAGG CTGCTAACAA GCCACTGTTG CTGCTGGTGG ATGGCCACTC GCTGGCCTTC 
CGCAGCTTTT ATGCCTTCAG CAAAGGTGGT GAAGGGGGAT TGGCCACAAA AGACGGTACC
CCTACCAGCG TTACCTATGG CTTCCTAAAA GCCCTCCTGG ACAACTGCAA GGGTCTTAGC
CCGCAAGGGA TTGTGATCGC CTTTGATACA GCTGAGCCAA CCTTCCGCCA TCAAGCAGAT
GTCAACTACA AGGCCAATCG TGATGTTGCG CCAGACATCT TTTTTCAGGA CCTCAAACAA
CTGCAAGAGA TCCTTGAAAA CAACCTGCAA CTTCCCCTTT GCCTTGCCCC TGGCTATGAA
GCCGATGATG TGCTCGGCAC CCTCGCCAAC CAAGCAAGCA GAGATGGTTG GGGGGTCCGC
ATCCTCTCCG GCGACCGTGA CCTCTTTCAG CTGGTTGATG ATCAGCGCGA CATTGCCGTG
CTCTATATGG GTGGTGGACC CTACGCCAAA AGTAGTGGTC CCAGCCTGAT CGATGAAGCT
GGGGTGCTCA GCAAACTTGG TGTCATCCCC AACAAGGTGG TTGAGCTCAA GGCCCTCACT
GGCGACAGCT CTGACAACAT CCCCGGGGTC AAAGGTGTGG GTCCTAAAAC AGCCATCACT
CTTCTTAAGG AGAATGGCGA TCTCGATGGC ATCTACAACG CGCTGGCTGA AGTGGAAGCA
GAAGGCGAGA AAGCCAGCCG CGGTGCCATT AAAGGAGCAC TCAAAAGCAA GCTCAGCAAC
GATCGTGACA ACGCCTATCT ATCGCTCCAT CTCGCAGAGA TCCTGGTCGA TATTCCCCTA
CCGAAAGCGC CACGTCTAGA GCTAGGCAGC GTTGACAACG ATGGACTCAC TGACCGCCTG
AGCTCCCTTG AACTCAACAG CCTGGTGCGC CAAGTGCCAA GTTTCGTGGC CACCTTCTCC
AGCGGGGGAT TTAAAGCCAA TCGCCACGAA CTTGAGCCAT CCAAATCGGC GACAAGCCCA
GTAGAGCCTG AATCTTCAAC AGAAACAAAG GTTTCAAACG ACAACGAACA ACCGGCGCTA
GAGCCACAAC TGATTACCAA TCCAGAAGAG CTGCAAGAGC TTGTCAAACG GCTGATGGGC
TACCGAGATC GCCTCAAGCC AGTGGCCCTC GACACTGAAA CAACCGCCCT CAATCCCTTT
TGCGCCGAAC TGGTGGGCTT AGGCGTCTGT TGGGGTGAGG GGCTGCAAGA CCTGGCCTAC
ATCCCGATCG GACACCATCC GCCTGCCGAA CTGCTGGAGG CAGAGGCTGC CTGCCAACTC
CCCCTAGAGG CTGTGCTCAA AGCGATAGCC CCTTGGTTGG CGAGTAACGA CCACCCCAAA
GCACTGCAGA ACGCCAAATA CGACAGGCTC ATCCTGTTGC GCCATGGACT CGCCCTCGAG
GGGGTCGTGA TGGACACGTT GCTGGCCGAC TACCTCCGTG ATGCAGCAGC CAAACACGGC
CTGGAAGTGA TGGCAGAGCG TGAATTCAAG ATCACGCCAA CCGGCTTCAG CGAGCTGGTT
GGCAAGGGCC AAACCTTTGC TGATGTCGCG ATCCCAACGG CCAGCCTCTA CTGCGGCATG
GACGTGCATC TCACCCGGCG ACTAGCCCTA CGACTCAGAG CACAACTTGA GGACATGGGA
GCCAAGCTCC TCCCCCTACT CGAGCAAGTT GAGCAACCCC TAGAACCGGT TTTGGCTCTA
ATGGAAGCCA CCGGGATCCG CATCGACCTG CCCTATTTAC AGACACTTTC CGTTGAACTG
GGTGAGACCT TGGAGCGTCT GGAAGAACAA GCCCGAGAAG CAGCCGGAGT GGACTTCAAT
CTGGCCTCCC CCAAGCAATT GGGAGAACTG CTATTCGAAA CACTTGGGCT GAATCGCAAA
AAATCTAGGC GCACCAAAAC GGGCTGGAGC ACCGATGCCA ACGTGCTGGA AAAACTCGAA
GCTGATCATC CCGTGGTGCC GTTGGTGCTG GAGCACAGAG TGCTAAGCAA ACTGCGCAGT
ACCTACGTGG ATGCCTTGCC GCAGCTGGTG GAATCAGAGA CGGGCAGGGT GCACACAGAT
TTCAACCAAG CGGTTACGGC AACAGGCCGG CTGAGCAGTA GCAACCCCAA CCTGCAAAAC
ATCCCCATCC GCACCGAATT CAGCAGACGA ATCCGCAAGG CTTTTCTTCC CCAGGAGAAC
TGGCAACTAC TAAGTGCTGA CTACTCCCAG ATCGAATTGC GCATCCTCAC CCACCTCTCT
GGAGAGGAGG TTTTGCAAGA GGCCTTCCGC AATGGCGATG ACGTGCACGC TCTCACCGCA
AGGCTGCTGC TGGACAAAGA TGAAGTCAGC TCTGATGAAC GTCGACTCGG CAAAACAATC
AACTTCGGCG TGATTTATGG CATGGGCGCC CAACGCTTTG CACGCTCAAC CGGCGTCAGC
CAAGCAGAGG CCAAAGACTT CCTTAGCCGT TACAAACAGC GCTACGCGAA GGTGTTTACC
TTCCTGGAAC TACAAGAGCG GCTTGCACTC AGCGAAGGCT ATGTAGAAAC CCTGTTAGGG
CGGCGGCGGC CCTTCCATTT CGATCGCAAC GGATTAGGCC GATTGCTGGG CAAAGATCCA
ATGGACATTG ATCTCGACGT AGCCCGCCGC GGTGGCATGG AAGCCCAACA GCTCAGGGCG
GCAGCCAATG CACCGATCCA GGGTTCCAGT GCCGACATCA TCAAAGTGGC GATGGTGCAG
TTACAAGAAA AACTGACGGC AGCAGATCTA CCGGCACGCC TGCTGCTGCA AGTGCACGAC
GAACTGGTGT TGGAAGTAGA CCCCACCGCA CTTGAAGATG TTCAGCAGTT GGTGGTGCAA
ACGATGGAAA AGGCTGTTGA ACTGAGCGTG CCACTCGTGG TAGAAACAGG CATTGGCGAT
AACTGGATGG AGGCGAAATA A
 
Protein sequence
MPEAANKPLL LLVDGHSLAF RSFYAFSKGG EGGLATKDGT PTSVTYGFLK ALLDNCKGLS 
PQGIVIAFDT AEPTFRHQAD VNYKANRDVA PDIFFQDLKQ LQEILENNLQ LPLCLAPGYE
ADDVLGTLAN QASRDGWGVR ILSGDRDLFQ LVDDQRDIAV LYMGGGPYAK SSGPSLIDEA
GVLSKLGVIP NKVVELKALT GDSSDNIPGV KGVGPKTAIT LLKENGDLDG IYNALAEVEA
EGEKASRGAI KGALKSKLSN DRDNAYLSLH LAEILVDIPL PKAPRLELGS VDNDGLTDRL
SSLELNSLVR QVPSFVATFS SGGFKANRHE LEPSKSATSP VEPESSTETK VSNDNEQPAL
EPQLITNPEE LQELVKRLMG YRDRLKPVAL DTETTALNPF CAELVGLGVC WGEGLQDLAY
IPIGHHPPAE LLEAEAACQL PLEAVLKAIA PWLASNDHPK ALQNAKYDRL ILLRHGLALE
GVVMDTLLAD YLRDAAAKHG LEVMAEREFK ITPTGFSELV GKGQTFADVA IPTASLYCGM
DVHLTRRLAL RLRAQLEDMG AKLLPLLEQV EQPLEPVLAL MEATGIRIDL PYLQTLSVEL
GETLERLEEQ AREAAGVDFN LASPKQLGEL LFETLGLNRK KSRRTKTGWS TDANVLEKLE
ADHPVVPLVL EHRVLSKLRS TYVDALPQLV ESETGRVHTD FNQAVTATGR LSSSNPNLQN
IPIRTEFSRR IRKAFLPQEN WQLLSADYSQ IELRILTHLS GEEVLQEAFR NGDDVHALTA
RLLLDKDEVS SDERRLGKTI NFGVIYGMGA QRFARSTGVS QAEAKDFLSR YKQRYAKVFT
FLELQERLAL SEGYVETLLG RRRPFHFDRN GLGRLLGKDP MDIDLDVARR GGMEAQQLRA
AANAPIQGSS ADIIKVAMVQ LQEKLTAADL PARLLLQVHD ELVLEVDPTA LEDVQQLVVQ
TMEKAVELSV PLVVETGIGD NWMEAK