Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PG1794 |
Symbol | polA |
ID | 2552230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | + |
Start bp | 1886118 |
End bp | 1888898 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637150404 |
Product | DNA polymerase type I |
Protein accession | NP_905895 |
Protein GI | 34541416 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGC GACTTTTCCT TCTTGATGCT TATGCTCTCA TCTTTCGTGC CTATTATGCT TTCATCCGTA GTCCGCGGAT CGACTCTACC GGTCGTGACA CCGGAGCTGT ATTCGGCTTT GCCCTGACCT TATTGGATAT ACTGGAGAAG GAGTCGCCGG AGCATATCGC CGTGGTATTC GATCCTCCGG GAGGTAGTTT TCGTCATCGC GAGTATGCAG AGTACAAGGC TCAACGAGAG GAAACGCCGG AAGGTATTCG CATCGCCGTT CCGTTGATTA AAGAGATATT GGCAGCTTTC CGTATTCCTG CTGTAGAAGT GCCGGACTTT GAAGCAGACG ACACGATAGG TACTTTGGCC AAACAAGCCG AAGAGCAAGG GCTTGCCGTC AGAATGGTAA CGCCCGATAA GGATTTCGGC CAACTGGTGT CGGAGCGAAT CAAGATCTAT CGACCCAAGT CGGGTGGTGG CTATGAAACA TGGGGGCCGG CAGAAGTCTG CGAAAAGTTC GGACTTTCCA TCCCCGGGCA AATGATAGAT TACCTCGGCC TCGTGGGAGA CAGCTCGGAC AATATACCCG GATGTAAAGG CATTGGAGCG AAAACAGCTG AGAAGCTACT GGCCGAGTAT GGAAGCATAG ACGGCATATA CGCCCACCAA GATGAATTGA AAGGAGCTGT GGCCAAGAAA ATTCAGGAGG GGGAAGAGCA AACGCGCTTC TCGCGCTACT TGGCCACTAT CCGCACGGAT GCTCCTATTG TCTTCGATTC CGAAGCCTAT CGACGTACTT CGCCCGATAT GGCAGCCGTT CGTGAATGCT TTGCTGCACT GGAATTTCGC ACTTTGCTGA AACGGTTGGA GAGTACACCT ACCGATGCCC CTGCGACAGA CCTGTTTGCC GGCATGGTAC AGGCACAAGA GCCTCCGACA GATTTGTTCG GAGAAGGCAC CGATGCTACA GGACTTCCGC TAAAGAAACT GACAGACGTA CCACACGAAT ATACGATTCT CAAAACCGAG GAAGAGATAG CGGATTGCAT CCGAATGTTC TCTGCCACGC CTTGTTTCTC ATTCGACACA GAGACCGACT CGAAAGATGC ACTTCGGGCC AATATCGTTG CCATCACGCT ATGTGCCGAG TCGGGACGGG CTTTCTTCAT ACCGCTGCCG GAAGACGAAG AAATCGGAAA ACGCAGATTA GATCTCTTGC GTCCGCTTTT TGCCGATACA GCTATCGGCA AAGTCGGTCA GAATATGAAG TATGATATCC AAGTACTCTC CCGATATGGT ATAGAAGTAC GCGGACAGCT ATTCGATACG ATGATAGCAC ACTACCTCCT CTTTCCCGAT CTCCGCCACA ATATGGATGA GATGGCGGAG ACGTTGCTGG GCTATTGCAC CGTCCACTAC TCGGATCTTG TCGGAAGCGA CAAACAGGAG GTGCACATCC GTCAGGTACC ATTGCAGAAT CTGGCAGACT ATGCCATGGA AGACGCCGAT ATTACTTGGC AGCTATATGA ACGCCTCAAT GCTATGCTCT CCGAGGCCGG GATGACCTCC TTATTCGAAA GTATAGAGAT GCCACTCGTG CCGGTGCTTG CCAATATGGA ACGCTCCGGC GTAAAGCTGG ACACGGAGGT GCTCCGACGC ACAGCTTCCG GACTTGGCGA AGAGATGCAG CGAATCGAAG ATGAAATCTA CCGTTTGGCC GGACACTCAT TCAATATCAA CAGTCCCTCT CAAGTGGGAA CCGTGCTCTT CGAGGAACTG CAAATTACCG AAAAGCCCAA AAAGACGAAG TCCGGCAGCT ACTCTACAAA CGAAGAAATT CTGGTCAAGC TACAGGAAAA ACATCCCATC GTGCGTCTCA TACTGGACTA TAGAGGAATC AAAAAACTAC TCAGTACCTA TGTAGAAGCT CTGCCGGAGA TGCGCTACCC CGATGGTAAG CTGCATACCT CGTTCAATCA GACCGTCGCT ACCACGGGAC GTCTCTCCAG CAGCAATCCT AATCTGCAAA ACATTCCGAT CCGGACCGAA GTCGGGAGGG GGCTGAGGGC TGCTTTCGTA CCGGACAATG ACGAATGTAT TTTCATGAGC GCAGACTATT CTCAGATCGA GCTGCGGCTG ATGGCGCATC TGAGTGAAGA CGAGAGCTTG ATACAGGCTT TTCTCCATGG GGAAGACATC CACCGAGCCA CAGCCTCCAA AATATACAGG CTGCCTCTTG TAGAAGTCAC GGATGACATG CGCCGCCGAG CTAAAACGGC CAACTTCGGG ATCATCTACG GCATCTCTGC TTTCGGGCTT AGCGAACGGC TCAATATCTC CCGTACAGAA GCCAAGGCCT TGATCGAAGG TTACTTTGCC TCTTACCCGG GGGTTAAAGC GTATATGGAT CGGAGTATTG CCGAAGCCAA GCGACAGGGC TACGTCACCA CACTGTTCGG TCGCAAGCGA TTCCTCCGGG ATATAAACAG TGCCAATGCC GTCGTCCGCG GCTATGCAGA GCGGAATGCC ATCAATGCCC CTATACAAGG CTCTGCTGCC GACCTGATCA AGCTGGCGAT GATCAGAATA CACGAGGAAA TCACTGAGCG TAAGCTGCGG AGCCGAATGA TCCTGCAAGT GCACGACGAA CTCAACTTCA ACGTTCTTCG TCCTGAAGCT GCAGAAGTTC GCGAGCTTGT ACGCTCGTGT ATGGAGGGGG TGATGCCCTC TTTGCGTGTT CCTCTGATAG CCGAGATCGG CGAAGGAGCC AATTGGCTGG AGGCGCACTG A
|
Protein sequence | MTERLFLLDA YALIFRAYYA FIRSPRIDST GRDTGAVFGF ALTLLDILEK ESPEHIAVVF DPPGGSFRHR EYAEYKAQRE ETPEGIRIAV PLIKEILAAF RIPAVEVPDF EADDTIGTLA KQAEEQGLAV RMVTPDKDFG QLVSERIKIY RPKSGGGYET WGPAEVCEKF GLSIPGQMID YLGLVGDSSD NIPGCKGIGA KTAEKLLAEY GSIDGIYAHQ DELKGAVAKK IQEGEEQTRF SRYLATIRTD APIVFDSEAY RRTSPDMAAV RECFAALEFR TLLKRLESTP TDAPATDLFA GMVQAQEPPT DLFGEGTDAT GLPLKKLTDV PHEYTILKTE EEIADCIRMF SATPCFSFDT ETDSKDALRA NIVAITLCAE SGRAFFIPLP EDEEIGKRRL DLLRPLFADT AIGKVGQNMK YDIQVLSRYG IEVRGQLFDT MIAHYLLFPD LRHNMDEMAE TLLGYCTVHY SDLVGSDKQE VHIRQVPLQN LADYAMEDAD ITWQLYERLN AMLSEAGMTS LFESIEMPLV PVLANMERSG VKLDTEVLRR TASGLGEEMQ RIEDEIYRLA GHSFNINSPS QVGTVLFEEL QITEKPKKTK SGSYSTNEEI LVKLQEKHPI VRLILDYRGI KKLLSTYVEA LPEMRYPDGK LHTSFNQTVA TTGRLSSSNP NLQNIPIRTE VGRGLRAAFV PDNDECIFMS ADYSQIELRL MAHLSEDESL IQAFLHGEDI HRATASKIYR LPLVEVTDDM RRRAKTANFG IIYGISAFGL SERLNISRTE AKALIEGYFA SYPGVKAYMD RSIAEAKRQG YVTTLFGRKR FLRDINSANA VVRGYAERNA INAPIQGSAA DLIKLAMIRI HEEITERKLR SRMILQVHDE LNFNVLRPEA AEVRELVRSC MEGVMPSLRV PLIAEIGEGA NWLEAH
|
| |