Gene Information |
Locus tag | PMT9312_1236 |
Symbol | |
ID | 3766043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9312 |
Kingdom | Bacteria |
Replicon accession | NC_007577 |
Strand | + |
Start bp | 1149408 |
End bp | 1152338 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 637797764 |
Product | DNA polymerase I |
Protein accession | YP_397731 |
Protein GI | 78779619 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
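The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1149408
END_BP = 1152338


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2931
print(protein_length(length_bp))  # 976
```

Under these assumptions the computed values agree with the reported Gene Length (2931 bp) and Protein Length (976 aa).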
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.206272 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGTTTAA AATCAGAAAA CTCTAAAAAA CCAATTTTAC TTTTAGTCGA TGGTCACTCA CTTGCTTTTA GAAGCTTCTA TGCATTTAGC AAAGGGATTG ATGGAGGTTT AACTACCAAA GAGGGTTTCC CAACAAGTGT CACATATGGA TTTCTAAAAA GCCTTCTGGA TAATTGCAAA AATATAAGTC CTGAGGGTGT TTGTATTACT TTTGACACAG AAAAACCTAC TTTCAGGCAT GAATTAGATC CAAATTATAA GGCAAATAGA GATGTAGCAC CTGATGTTTT TTTTCAGGAT ATTGAACAAC TAGAAATCAT TTTAGAAGAA AGTCTTAATT TACCAATTTT CAAATCTCCA GGTTACGAAG CAGATGATCT CCTAGGCACA ATTGCAAATG ACGCTTCTTC TAAAGGATGG TGCGTGAATA TTCTTTCTGG AGATAGGGAT TTATTTCAAT TAGTTGATGA TCAAAAAGGT ATTTATGTCC TTTATATGGG TGGTGGTCCA TATGCAAAAA GTGGTAATCC AACCCTAATT AATGAAAATG GAGTAAAAGA AAAATTAGGT GTAGTTCCAG AAAGAGTGGT TGATCTTAAA GCTCTAACTG GTGATAGTTC TGATAATATT CCTGGAATTA AAGGTGTAGG TCCAAAAACC GCAATTAATC TACTAAAAGA AAACGATACA CTTGATGGAA TCTATCAGGC TTTGGACAAA ATTCTGCAGA ACAATGATAA AAAATATAAA GGATTCATCA AAGGTTCAGT TATAGAAAAG CTCAAAAATG ATAAATATAA TGCTTATCTT TCTAGAGATT TAGCAAAAAT TAATACTGAA GTACCTTTGA TATTAAGTGA TGGTTATGAA CTAAAAAATA TAAATCAAGA ACTTCTTTCA GAGTCACTGC AAAAACTAGA ATTATCAACA CTACTTCGGC AAATTGATAT TTTCAATTCA ACTTTTAGTA AAGGTGGTTT TGACAAAAAT AATGTAGCTA AAGTGGACGA GAAGGCACCA AAAGTCTCTA GTAAAAATGA ATTGGACAAT CGTGAGAATA AAATCCCTAA AATCAGCGTA ACTATTGTAA ATAATTTTGA ATTACTTGAT AAATTAATTC AAAGATTAGA AAAGACTAAT GAGATAGTTT CTTTAGATAC AGAGACTAAT AGTTTAAATC CAATCGATGC CGAACTTGTT GGGATAGGAT TATGTCTTGG AGAAGAAACT GATGATTTAT TTTATATACC TCTTGGTCAT CAAACAAAAA AAGAAACTAC CAATCAATTA TCAATCGAAG AAGTTTTTTC AAAAATAAGA ACTTGGATTG AAGATCCAAA AAAAGAAAAG GCACTCCAAA ATTCTAAATT TGACAGGCAA ATATTTTTTA ATCACGGACT TGATCTTAAA GGAGTAACCT TTGACACCTT GTTAGCAGAC TATCTTCTAA ATAATCAGGA AAAGCACGGG TTAAGTGAAA TTAGTTTTAG ATTATTTGGA TTTAAGCCCC CTTCATTTAA GGAAACAGTT GGAAAAAATA AAGACTTTTC ATTTGTTGAT ATTGATGAAG CAAGTGTTTA CTGCGGTTAT GATGTATTCC TAACTTTTAA AATCGTCAAA ATTTTTAAAG AAAGATTCTC AATGGAAAAA GATGAATTAA TCAAACTTTT CGAAGAAATT GAGCTACCTT TAGAGCCGGT ATTATCCCAA ATGGAAATGA ATGGCATCAC TATAGACATC CCTTATTTGG ATAAACTCTC AAAAGAACTA AAAAGTACCT TAGAAGATAT TGAAAGTAAA 
GTTTATGAAT TAGCGGAAGG AAATTTTAAT CTATCTTCAC CAAAACAACT TGGTGAGATC TTGTTTGAAA AATTAAATTT GGATAAAAAG AAATCAAGGA AAACAAAAAC AGGATGGAGT ACGGATGCAG TAGTTCTAGA AAGATTAGTC GACGAACATG AAATAATTCA ATATTTAATA AAACATAGAA CTCTTAGCAA ATTACTAAGC ACCTATATTG ATGCTCTTCC AAATCTTATT AACGAAAAGA CAGGAAGGGT TCATACAAAC TTTAATCAAG CTGCTACAGC GACTGGGAGG CTAAGTAGTA GCAATCCTAA TCTTCAAAAT ATCCCGGTTA GGACTGAATT TAGTAGGAGA ATCAGAAAAG CATTCTTGCC TGAAAAAAAT TGGAAACTTT TATCAGCTGA TTATTCTCAG ATCGAATTAA GAATACTTGC TCACTTAGCG GATGAAGAAA TATTAATTAA TGCATTTCAT AAAAATGATG ATATTCATTC TTTGACTGCA AGATTAATTT TCGAGAAAGA AGAAATATCA TCCGACGAAA GGAGAGTTGG AAAAACAATA AATTTTGGAG TTATCTATGG TATGGGGATC AAAAAGTTTG CCCGTTCAAC AGGAGTAAGT ACTCCAGAGG CAAAAGAATT CCTAATAAAA TACAAAGAAA GATATTCAAA AATTTTCAAA TTTCTTGAAC TCCAAGAAAG GCTTGCCTTA TCAAAAGGAT ATGTAAAAAC AATTTTTGGT CGAAAGAGAG AATTTAAGTT TGATAAAAAT GGACTTGGAA GATTAATAGG GAAAGATCCT TACGAAATTG ACTTGCAATC CGCAAGAAGG GCTGGCATGG AAGCACAGTC ATTAAGAGCC GCAGCTAATG CACCAATTCA GGGTTCAAGT GCAGACATTA TTAAAATTGC AATGGTTCAA ATAAATAAGA AATTCATAGA AATGAATGTT CCAGCAAAAA TGCTTTTACA AGTACATGAT GAATTATTGT TTGAAGTTGA ACCAGATTCG TTGGAAATTA CGACGAAATT AGTGAAGAAG ACTATGGAAG ATTGTGTAAA ATTGAATGTG CCTCTTTTAG TTGATATTGG CATTGGAGAC AATTGGATGG AGACAAAATA A
Protein sequence | MSLKSENSKK PILLLVDGHS LAFRSFYAFS KGIDGGLTTK EGFPTSVTYG FLKSLLDNCK NISPEGVCIT FDTEKPTFRH ELDPNYKANR DVAPDVFFQD IEQLEIILEE SLNLPIFKSP GYEADDLLGT IANDASSKGW CVNILSGDRD LFQLVDDQKG IYVLYMGGGP YAKSGNPTLI NENGVKEKLG VVPERVVDLK ALTGDSSDNI PGIKGVGPKT AINLLKENDT LDGIYQALDK ILQNNDKKYK GFIKGSVIEK LKNDKYNAYL SRDLAKINTE VPLILSDGYE LKNINQELLS ESLQKLELST LLRQIDIFNS TFSKGGFDKN NVAKVDEKAP KVSSKNELDN RENKIPKISV TIVNNFELLD KLIQRLEKTN EIVSLDTETN SLNPIDAELV GIGLCLGEET DDLFYIPLGH QTKKETTNQL SIEEVFSKIR TWIEDPKKEK ALQNSKFDRQ IFFNHGLDLK GVTFDTLLAD YLLNNQEKHG LSEISFRLFG FKPPSFKETV GKNKDFSFVD IDEASVYCGY DVFLTFKIVK IFKERFSMEK DELIKLFEEI ELPLEPVLSQ MEMNGITIDI PYLDKLSKEL KSTLEDIESK VYELAEGNFN LSSPKQLGEI LFEKLNLDKK KSRKTKTGWS TDAVVLERLV DEHEIIQYLI KHRTLSKLLS TYIDALPNLI NEKTGRVHTN FNQAATATGR LSSSNPNLQN IPVRTEFSRR IRKAFLPEKN WKLLSADYSQ IELRILAHLA DEEILINAFH KNDDIHSLTA RLIFEKEEIS SDERRVGKTI NFGVIYGMGI KKFARSTGVS TPEAKEFLIK YKERYSKIFK FLELQERLAL SKGYVKTIFG RKREFKFDKN GLGRLIGKDP YEIDLQSARR AGMEAQSLRA AANAPIQGSS ADIIKIAMVQ INKKFIEMNV PAKMLLQVHD ELLFEVEPDS LEITTKLVKK TMEDCVKLNV PLLVDIGIGD NWMETK
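The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal pure-Python sketch, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the whitespace stripping assumes the 10-base grouping shown in this record.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 and last 21 bases of the gene sequence in this record.
head = "ATGAGTTTAA AATCAGAAAA CTCTAAAAAA"
tail = "AATTGGATGG AGACAAAATA A"
print(translate(head))  # MSLKSENSKK
print(translate(tail))  # NWMETK
```

The two fragments translate to the first ten and last six residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.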