Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmwyl1_0280 |
Symbol | |
ID | 5365921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | - |
Start bp | 328942 |
End bp | 331692 |
Gene Length | 2751 bp |
Protein Length | 916 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640802621 |
Product | DNA polymerase I |
Protein accession | YP_001339156 |
Protein GI | 152994321 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTTTGG TAGACGGTTC GTCTTACCTT TACCGAGCAT TCCACGCGAT TCCACCGATG CACACCAGCG ATGGCCAACC GACCAACGCC ACGCGCGGGG TGATCAGCAT GATCCGCAGC CTTATCAAAA CCTACCCTGA TTCCCCCATG GTCATCATCT TCGACGCCAA GGGCAAAACC TTCCGCGACG AGATCTACAG CGAATACAAA GCGCACCGCC CGCCTATGCC GGACGATCTT CGCCCACAAA TCGAGCCAAT TCACCGTGTG GTAGAAGCCA TGGGCTTGCC TTTGGTCATT GTTGATGGCG TGGAAGCCGA CGACGTGATC GGCACCATCG CCAAACAAGT GGGCGAACAA GGCCGTGAAG TGGTGGTTTC CACGGGCGAT AAAGACATGG CTCAGTTGGT TACCGACAAA GTCACCCTTG TGAACACCAT GAACAACACG GTGATGGACA TCCAAGGCGT GAAAGACAAG TTCGGCATTC CGCCTGAACT TATTATCGAT TACCTCGCAC TGATGGGCGA CAAAGTCGAT AACATCCCAG GCGTACCAGG CGTGGGTGAA AAAACCGCTC TTGCCTTATT GCAAGGCTTG GGCAGCATCA AAGACATCTA CACCCGCTTA GACGAAATCG CTGATTTGGG CTTCCGTGGT TCTAAAACCA TGAAGCAAAA AATGGAAGAC AACAAAGAAA TGGCGGAATT GTCTTACACC TTGGCGACCA TCAAATGCGA CGTGGAACTG CCTTTCGAAC CACAAGAATT GAAAAACGGC GAAGCGGACA AAGAAAAACT GCGCGAGTGG TTCACCAAAC TCGAATTCAA AACCTGGCTG TCTGATCTAG ATAAAGCACC GAGCGTCGAA ACCGCCAGCG ACACCGTTGG CAATGCTTCC TCAGAGCAAA ATGGCCAAGT AGAAATACCT GTTAAATCCG ATCTAGAAGC CAACTACGAA ACCGTCTTAG ACAAAGCAAG CTTCAACAAA TGGCTAGAGA AAATCCAAGC CGCCGAGCTG GTTGCCTTCG ATACCGAAAC CACCTCGCTG AACTACATGG CGGCGGAATT GGTTGGCTTA TCCTTCTCGG TAGAAGCAGG CGAAGCCGCT TACGTTCCCG TCGCCCATGA TTACGAAGGT GCGCCAGAAC AACTGGATCG CGATTGGGTA CTAGAACAGC TCAAACCTTG GTTAGAAGAC GACAGCAAAG CCAAAGTCGG CCAGCATCTC AAATACGATG CCAACGTACT AAACAACTAC CAGATCACCT TGCGTGGCAT TGCTTACGAC ACCATGTTGG AATCTTATGT GTACAACTCC GTGAGCTCTC GCCACGACAT GAACACATTA GCAAGCAAAT TCCTTGGCCA CACCTGCGTC AGCTTTGAAG ACATCGCAGG TAAAGGCGCA AAACAAAAAA CCTTCAACCA GATCGATTTA GAAGTCGCCG CTTTCTACGC CGCCGAAGAC GCCGACATCA CCTTGCGTTT GCACCAAGCG ATCTGGCCGA AAATCGAAAC CACGCCAGAA TTAGTCAGCA TTTTTAAAGA CATCGAATGC CCATTGATTC CCGTCTTGGC GAAAATGGAA CAAACTGGCG CCTTGATTGA TCCAGAACTG CTTCATGCAC AAAGCAGCGA GATCGCCGCC AAACTGCAAG AACTAGAAAT CAAAGCCCAC GAAGAAGCAG GCGAAAGCTT CAACCTAAGC TCGCCAAAAC AACTGCAAGT CATCTTGTTC GAAAAACAAG GCTTGCCGGT TATCAAGAAA ACACCAAAAG GCCAACCATC CACGGCAGAG CCGATCTTGC AAGAACTGGC ACAAGACTAT GAACTACCGC GTTTGATCAT GGAACACCGC AGCCTGTCTA AGTTAAAATC CACTTACACC GACAAACTGC CAGAGATGAT CCAGAAAACC GGACGGATTC ACACCTCTTA CCACCAAGCC ATTACCGCGA CAGGCCGTTT GTCGTCGACC GATCCGAACT TGCAGAACAT CCCAATTCGT TCTGCCGAAG GTCGTCGTAT TCGCCAAGCC TTCATCGCGC CTAAAGGCTA CAAGTTAGTG GCGGCCGACT ACTCGCAAGT CGAATTACGC ATCATGGCGC ATTTGTCCCA AGATTCGGGC TTATTAGACG CGTTTACTAA AGACGCCGAC GTGCACAAAG CCACCGCGGC GGAAGTCTTC GAAGTCAGCC TAGACGAGGT CACCACAGAA CAACGCCGCC GCGCCAAAGC CATCAACTTC GGTTTGATCT ACGGCATGTC CGCCTTTGGT CTAGCTAAAC AACTTGGCAT TGGTCGCCCA GAAGCAGGCA AATACATCAA ACGCTACTTC GAACGTTACC CAGGCGTACA GCAATACATG GAAAACACCC GCGAAGGCGC GAAAGAAAAA GGCTATGTAG AAACCATCTA CGGCCGCCGC TTGTACCTGC CAGACATCAA AGCGAAAAAC GCCATGATGC GCCAAGCCGC CGAACGCACC GCGATCAACG CCCCGATGCA AGGCTCCGCC GCCGACATCA TCAAACGCGC CATGATCAAA ATGCACGACT GGCTACAAGG CACCGACCTA GACGTGAAAA TGATCATGCA AGTACACGAT GAACTCATCT TCGAAGTGGC CGAAAAAGAC CTAGAGGCTG CACAAAAGAA AATCGTCGAC ATCATGCAAA ACAGCAGCAA AATCGACGTG CCTTTACTTG TGGAAGCAGG TGTTGGGGAT AATTGGGACG AGGCGCATTG A
|
Protein sequence | MVLVDGSSYL YRAFHAIPPM HTSDGQPTNA TRGVISMIRS LIKTYPDSPM VIIFDAKGKT FRDEIYSEYK AHRPPMPDDL RPQIEPIHRV VEAMGLPLVI VDGVEADDVI GTIAKQVGEQ GREVVVSTGD KDMAQLVTDK VTLVNTMNNT VMDIQGVKDK FGIPPELIID YLALMGDKVD NIPGVPGVGE KTALALLQGL GSIKDIYTRL DEIADLGFRG SKTMKQKMED NKEMAELSYT LATIKCDVEL PFEPQELKNG EADKEKLREW FTKLEFKTWL SDLDKAPSVE TASDTVGNAS SEQNGQVEIP VKSDLEANYE TVLDKASFNK WLEKIQAAEL VAFDTETTSL NYMAAELVGL SFSVEAGEAA YVPVAHDYEG APEQLDRDWV LEQLKPWLED DSKAKVGQHL KYDANVLNNY QITLRGIAYD TMLESYVYNS VSSRHDMNTL ASKFLGHTCV SFEDIAGKGA KQKTFNQIDL EVAAFYAAED ADITLRLHQA IWPKIETTPE LVSIFKDIEC PLIPVLAKME QTGALIDPEL LHAQSSEIAA KLQELEIKAH EEAGESFNLS SPKQLQVILF EKQGLPVIKK TPKGQPSTAE PILQELAQDY ELPRLIMEHR SLSKLKSTYT DKLPEMIQKT GRIHTSYHQA ITATGRLSST DPNLQNIPIR SAEGRRIRQA FIAPKGYKLV AADYSQVELR IMAHLSQDSG LLDAFTKDAD VHKATAAEVF EVSLDEVTTE QRRRAKAINF GLIYGMSAFG LAKQLGIGRP EAGKYIKRYF ERYPGVQQYM ENTREGAKEK GYVETIYGRR LYLPDIKAKN AMMRQAAERT AINAPMQGSA ADIIKRAMIK MHDWLQGTDL DVKMIMQVHD ELIFEVAEKD LEAAQKKIVD IMQNSSKIDV PLLVEAGVGD NWDEAH
|
| |