Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_08691 |
Symbol | polA |
ID | 4776935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 787413 |
End bp | 790373 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640086378 |
Product | DNA polymerase I |
Protein accession | YP_001016885 |
Protein GI | 124022578 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGG CTGCTAACAA GCCACTGTTG CTGCTGGTGG ATGGCCACTC GCTGGCCTTC CGCAGCTTTT ATGCCTTCAG CAAAGGTGGT GAAGGGGGAT TGGCCACAAA AGACGGTACC CCTACCAGCG TTACCTATGG CTTCCTAAAA GCCCTCCTGG ACAACTGCAA GGGTCTTAGC CCGCAAGGGA TTGTGATCGC CTTTGATACA GCTGAGCCAA CCTTCCGCCA TCAAGCAGAT GTCAACTACA AGGCCAATCG TGATGTTGCG CCAGACATCT TTTTTCAGGA CCTCAAACAA CTGCAAGAGA TCCTTGAAAA CAACCTGCAA CTTCCCCTTT GCCTTGCCCC TGGCTATGAA GCCGATGATG TGCTCGGCAC CCTCGCCAAC CAAGCAAGCA GAGATGGTTG GGGGGTCCGC ATCCTCTCCG GCGACCGTGA CCTCTTTCAG CTGGTTGATG ATCAGCGCGA CATTGCCGTG CTCTATATGG GTGGTGGACC CTACGCCAAA AGTAGTGGTC CCAGCCTGAT CGATGAAGCT GGGGTGCTCA GCAAACTTGG TGTCATCCCC AACAAGGTGG TTGAGCTCAA GGCCCTCACT GGCGACAGCT CTGACAACAT CCCCGGGGTC AAAGGTGTGG GTCCTAAAAC AGCCATCACT CTTCTTAAGG AGAATGGCGA TCTCGATGGC ATCTACAACG CGCTGGCTGA AGTGGAAGCA GAAGGCGAGA AAGCCAGCCG CGGTGCCATT AAAGGAGCAC TCAAAAGCAA GCTCAGCAAC GATCGTGACA ACGCCTATCT ATCGCTCCAT CTCGCAGAGA TCCTGGTCGA TATTCCCCTA CCGAAAGCGC CACGTCTAGA GCTAGGCAGC GTTGACAACG ATGGACTCAC TGACCGCCTG AGCTCCCTTG AACTCAACAG CCTGGTGCGC CAAGTGCCAA GTTTCGTGGC CACCTTCTCC AGCGGGGGAT TTAAAGCCAA TCGCCACGAA CTTGAGCCAT CCAAATCGGC GACAAGCCCA GTAGAGCCTG AATCTTCAAC AGAAACAAAG GTTTCAAACG ACAACGAACA ACCGGCGCTA GAGCCACAAC TGATTACCAA TCCAGAAGAG CTGCAAGAGC TTGTCAAACG GCTGATGGGC TACCGAGATC GCCTCAAGCC AGTGGCCCTC GACACTGAAA CAACCGCCCT CAATCCCTTT TGCGCCGAAC TGGTGGGCTT AGGCGTCTGT TGGGGTGAGG GGCTGCAAGA CCTGGCCTAC ATCCCGATCG GACACCATCC GCCTGCCGAA CTGCTGGAGG CAGAGGCTGC CTGCCAACTC CCCCTAGAGG CTGTGCTCAA AGCGATAGCC CCTTGGTTGG CGAGTAACGA CCACCCCAAA GCACTGCAGA ACGCCAAATA CGACAGGCTC ATCCTGTTGC GCCATGGACT CGCCCTCGAG GGGGTCGTGA TGGACACGTT GCTGGCCGAC TACCTCCGTG ATGCAGCAGC CAAACACGGC CTGGAAGTGA TGGCAGAGCG TGAATTCAAG ATCACGCCAA CCGGCTTCAG CGAGCTGGTT GGCAAGGGCC AAACCTTTGC TGATGTCGCG ATCCCAACGG CCAGCCTCTA CTGCGGCATG GACGTGCATC TCACCCGGCG ACTAGCCCTA CGACTCAGAG CACAACTTGA GGACATGGGA GCCAAGCTCC TCCCCCTACT CGAGCAAGTT GAGCAACCCC TAGAACCGGT TTTGGCTCTA ATGGAAGCCA CCGGGATCCG CATCGACCTG CCCTATTTAC AGACACTTTC CGTTGAACTG GGTGAGACCT TGGAGCGTCT GGAAGAACAA GCCCGAGAAG CAGCCGGAGT GGACTTCAAT CTGGCCTCCC CCAAGCAATT GGGAGAACTG CTATTCGAAA CACTTGGGCT GAATCGCAAA AAATCTAGGC GCACCAAAAC GGGCTGGAGC ACCGATGCCA ACGTGCTGGA AAAACTCGAA GCTGATCATC CCGTGGTGCC GTTGGTGCTG GAGCACAGAG TGCTAAGCAA ACTGCGCAGT ACCTACGTGG ATGCCTTGCC GCAGCTGGTG GAATCAGAGA CGGGCAGGGT GCACACAGAT TTCAACCAAG CGGTTACGGC AACAGGCCGG CTGAGCAGTA GCAACCCCAA CCTGCAAAAC ATCCCCATCC GCACCGAATT CAGCAGACGA ATCCGCAAGG CTTTTCTTCC CCAGGAGAAC TGGCAACTAC TAAGTGCTGA CTACTCCCAG ATCGAATTGC GCATCCTCAC CCACCTCTCT GGAGAGGAGG TTTTGCAAGA GGCCTTCCGC AATGGCGATG ACGTGCACGC TCTCACCGCA AGGCTGCTGC TGGACAAAGA TGAAGTCAGC TCTGATGAAC GTCGACTCGG CAAAACAATC AACTTCGGCG TGATTTATGG CATGGGCGCC CAACGCTTTG CACGCTCAAC CGGCGTCAGC CAAGCAGAGG CCAAAGACTT CCTTAGCCGT TACAAACAGC GCTACGCGAA GGTGTTTACC TTCCTGGAAC TACAAGAGCG GCTTGCACTC AGCGAAGGCT ATGTAGAAAC CCTGTTAGGG CGGCGGCGGC CCTTCCATTT CGATCGCAAC GGATTAGGCC GATTGCTGGG CAAAGATCCA ATGGACATTG ATCTCGACGT AGCCCGCCGC GGTGGCATGG AAGCCCAACA GCTCAGGGCG GCAGCCAATG CACCGATCCA GGGTTCCAGT GCCGACATCA TCAAAGTGGC GATGGTGCAG TTACAAGAAA AACTGACGGC AGCAGATCTA CCGGCACGCC TGCTGCTGCA AGTGCACGAC GAACTGGTGT TGGAAGTAGA CCCCACCGCA CTTGAAGATG TTCAGCAGTT GGTGGTGCAA ACGATGGAAA AGGCTGTTGA ACTGAGCGTG CCACTCGTGG TAGAAACAGG CATTGGCGAT AACTGGATGG AGGCGAAATA A
|
Protein sequence | MPEAANKPLL LLVDGHSLAF RSFYAFSKGG EGGLATKDGT PTSVTYGFLK ALLDNCKGLS PQGIVIAFDT AEPTFRHQAD VNYKANRDVA PDIFFQDLKQ LQEILENNLQ LPLCLAPGYE ADDVLGTLAN QASRDGWGVR ILSGDRDLFQ LVDDQRDIAV LYMGGGPYAK SSGPSLIDEA GVLSKLGVIP NKVVELKALT GDSSDNIPGV KGVGPKTAIT LLKENGDLDG IYNALAEVEA EGEKASRGAI KGALKSKLSN DRDNAYLSLH LAEILVDIPL PKAPRLELGS VDNDGLTDRL SSLELNSLVR QVPSFVATFS SGGFKANRHE LEPSKSATSP VEPESSTETK VSNDNEQPAL EPQLITNPEE LQELVKRLMG YRDRLKPVAL DTETTALNPF CAELVGLGVC WGEGLQDLAY IPIGHHPPAE LLEAEAACQL PLEAVLKAIA PWLASNDHPK ALQNAKYDRL ILLRHGLALE GVVMDTLLAD YLRDAAAKHG LEVMAEREFK ITPTGFSELV GKGQTFADVA IPTASLYCGM DVHLTRRLAL RLRAQLEDMG AKLLPLLEQV EQPLEPVLAL MEATGIRIDL PYLQTLSVEL GETLERLEEQ AREAAGVDFN LASPKQLGEL LFETLGLNRK KSRRTKTGWS TDANVLEKLE ADHPVVPLVL EHRVLSKLRS TYVDALPQLV ESETGRVHTD FNQAVTATGR LSSSNPNLQN IPIRTEFSRR IRKAFLPQEN WQLLSADYSQ IELRILTHLS GEEVLQEAFR NGDDVHALTA RLLLDKDEVS SDERRLGKTI NFGVIYGMGA QRFARSTGVS QAEAKDFLSR YKQRYAKVFT FLELQERLAL SEGYVETLLG RRRPFHFDRN GLGRLLGKDP MDIDLDVARR GGMEAQQLRA AANAPIQGSS ADIIKVAMVQ LQEKLTAADL PARLLLQVHD ELVLEVDPTA LEDVQQLVVQ TMEKAVELSV PLVVETGIGD NWMEAK
|
| |