Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_13151 |
Symbol | polA |
ID | 4718034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1091354 |
End bp | 1094284 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640079034 |
Product | DNA polymerase I |
Protein accession | YP_001009706 |
Protein GI | 123968848 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTAA AATCTGAAAA CTCTAAAAAA CCAATTTTAC TTTTAGTCGA TGGCCACTCA CTTGCTTTTA GAAGCTTCTA TGCATTTAGC AAAGGGATTG ATGGAGGTTT AACTACCAAA GAGGGATTTC CAACAAGTGT CACTTATGGA TTTCTAAAGA GCCTTCTGGA TAATTGTAAA AATATTAGTC CTGAGGGTGT TTGTATTACG TTTGATACCG AAAAACCTAC TTTCAGACAT GAATTAGATC CAAATTATAA GGCCAATAGA GATGTAGCAC CAGATGTTTT TTTTCAGGAT ATTGAACAAC TAGAAATCAT TTTAGAAGAA AGCCTTAATT TACCAATTTT CAAATCTCCA GGATACGAAG CAGATGATCT CCTAGGCACA ATTGCAAATG ATGCTTCTTC TAAAGGATGG TGCGTGAATA TTCTTTCTGG AGATAGGGAC TTATTTCAAT TAGTAGATGA TCAAAAAGAT ATTTATGTGC TTTATATGGG TGGTGGTCCA TATGCGAAAA GTGGAAATCC AACTCTTATG AATGAAAATG GAGTAAAAGA AAAATTAGGT GTTGCGCCAG AAAGAGTAGT TGATCTCAAA GCCCTAACTG GTGATAGTTC TGATAATATT CCAGGTATTA AAGGAGTAGG GCCAAAAACT GCAATTAATC TACTAAAAGA GAACGATACG CTTGATGGAA TCTATCAGGC TTTGGACAAG ATTCAGCAGA ACAAAGATAA GAAATATAAA GGATTCATCA AAGGTTCAGT TATAGAAAAG CTCAGAAACG ATAAACATAA TGCTTTTCTT TCCAGGGATT TAGCAAAAAT AAATACTGAG GTGCCTTTAA TTTTAAGTAA CGGTTATGAA TTAAAAAATA TAAATCAAGA ACTACTTTCA GAGTCACTGA AAAAATTAGA ACTATCAACA CTACTTAGAC AAATTGATAT TTTCAATTCA ACTTTCAGCA AAGGTGGTTT TGACAAAAAT AATGTAGCTA AAGAGGAGGA GAAGGCACCA AAAGTCGCAG GCAATAATGA ATTAGAAAAT AGTGAAAATA AAATCCCTAA AATCAACGTA ACTGTTGTAA ATGATTTCGA ATTACTTGAT AAATTAATTC AAAGATTAGA CAAGACTAAT CAAATAGTTT CTTTAGATAC AGAGACCAAT AGTTTGAATC CAATCGATGC GGAACTTGTT GGGATAGGGT TATGTCTTGG AGAAGAAAAT GATGATTTAT TTTATATACC CCTTGGTCAT CAAACAAAAA AGGAGACCCC CGATCAATTA TCAATTGAAG ATGTTTTCTC AAAGCTAAGA AATTGGATAG AAGATCCAAA AAAAGAAAAG GCACTCCAAA ATTCTAAATT TGATAGGCAA ATATTTTTTA ATCATGGACT TGATCTTAAA GGCGTAACCT TTGACACCTT ATTAGCAGAC TACCTTCTTA ATAATCAGGA GAAACATGGG TTAAGTGAAA TTAGTTTTAG ATTATTTGGA TTTAAGCCTC CTTCATTTAA GGAGACAGTT GGAAAAAATA AAGACTTTTC ATTTGTTGAT ATTGATGAAG CAAGTATTTA CTGCGGTTAT GATGTTTTTC TAACTTTTAA GATTGTCAAA ATTTTTAAAG AAAGTTTTTC AAAGGAAAAA GATGAATTAA TCAAATTGTT CGAAGAAATC GAGCTACCTT TAGAGCCGGT ATTGTCCCAA ATGGAGATGA ATGGCATTAC GATCGACATC CCTTATTTGG ATAAACTCTC AAAAGAATTA AAAAGTACCT TAGAAGATAT TGAAAGTAAA GTTTATGAGT TAGCAGATGA AAGTTTCAAT CTATCTTCAC CAAAACAACT TGGTGAGATC TTGTTTGAAA AATTAAATTT GGATAAGAAA AAATCACGGA AAACAAAAAC AGGATGGAGC ACAGATGCAG TAGTTCTGGA AAGATTAGTC GACGAACATG AAATAATCCA ACATTTAATA AAACATAGAA CTCTTAGCAA ATTACTTAGC ACCTATATTG ATGCTCTTCC AAATCTTATT AACGAAAAGA CAGGAAGAGT TCATACAAAC TTTAATCAAG CTGCTACAGC GACTGGGAGA CTAAGTAGTA GCAATCCTAA TCTTCAAAAT ATCCCTGTTA GGACTGAATT TAGTAGGAGA ATCAGAAAAG CATTCTTGCC TGAAAAAAAT TGGAAACTTT TATCAGCTGA TTATTCTCAG ATCGAATTAA GAATACTCGC TCACTTAGCG GATGAAGAAA TACTAATAAA TGCATTTCAT AAAAATGATG ACATTCATTC TTTGACTGCA AGATTAATTT TTGAGAAAGA AGAAATTTCT TCTGATGAGA GGAGAGTTGG GAAAACAATA AATTTCGGAG TTATCTATGG TATGGGAATT AAAAAGTTTG CACGTTCTAC AGGAGTAAGT ACTCCAGAAG CAAAAGAATT CCTAATAAAA TACAAAGAAA GATATTCAAA AATTTTCAAA TTTCTTGAAC TTCAAGAAAG GCTTGCCTTA TCAAAAGGTT ATGTAAAAAC AATTTTTGGT AGAAAGAGAG AATTTAAGTT TGATAAAAAT GGACTTGGAA GATTACTAGG AAAAGATCCT TACGAAATTG ACTTGCAAGC CGCAAGAAGA GCTGGCATGG AAGCACAGTC ACTAAGAGCC GCAGCCAATG CCCCAATACA GGGTTCAAGT GCAGATATTA TTAAAATTGC AATGGTTCAA CTAAATAAAA AATTCACAGA AATGAATGTT CCAGCAAAAA TGCTTTTACA AGTACATGAT GAATTATTGT TTGAAGTCGA ACCAGATTCT TTGGAAATTA CGACGAAATT AGTAAAGAAG ACTATGGAAG ATTGTGTAAA ATTAAATGTG CCTCTTTTAG TTGATGTTGG AATTGGAGAC AATTGGATGG AGACAAAATA A
|
Protein sequence | MSLKSENSKK PILLLVDGHS LAFRSFYAFS KGIDGGLTTK EGFPTSVTYG FLKSLLDNCK NISPEGVCIT FDTEKPTFRH ELDPNYKANR DVAPDVFFQD IEQLEIILEE SLNLPIFKSP GYEADDLLGT IANDASSKGW CVNILSGDRD LFQLVDDQKD IYVLYMGGGP YAKSGNPTLM NENGVKEKLG VAPERVVDLK ALTGDSSDNI PGIKGVGPKT AINLLKENDT LDGIYQALDK IQQNKDKKYK GFIKGSVIEK LRNDKHNAFL SRDLAKINTE VPLILSNGYE LKNINQELLS ESLKKLELST LLRQIDIFNS TFSKGGFDKN NVAKEEEKAP KVAGNNELEN SENKIPKINV TVVNDFELLD KLIQRLDKTN QIVSLDTETN SLNPIDAELV GIGLCLGEEN DDLFYIPLGH QTKKETPDQL SIEDVFSKLR NWIEDPKKEK ALQNSKFDRQ IFFNHGLDLK GVTFDTLLAD YLLNNQEKHG LSEISFRLFG FKPPSFKETV GKNKDFSFVD IDEASIYCGY DVFLTFKIVK IFKESFSKEK DELIKLFEEI ELPLEPVLSQ MEMNGITIDI PYLDKLSKEL KSTLEDIESK VYELADESFN LSSPKQLGEI LFEKLNLDKK KSRKTKTGWS TDAVVLERLV DEHEIIQHLI KHRTLSKLLS TYIDALPNLI NEKTGRVHTN FNQAATATGR LSSSNPNLQN IPVRTEFSRR IRKAFLPEKN WKLLSADYSQ IELRILAHLA DEEILINAFH KNDDIHSLTA RLIFEKEEIS SDERRVGKTI NFGVIYGMGI KKFARSTGVS TPEAKEFLIK YKERYSKIFK FLELQERLAL SKGYVKTIFG RKREFKFDKN GLGRLLGKDP YEIDLQAARR AGMEAQSLRA AANAPIQGSS ADIIKIAMVQ LNKKFTEMNV PAKMLLQVHD ELLFEVEPDS LEITTKLVKK TMEDCVKLNV PLLVDVGIGD NWMETK
|
| |