Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_0021 |
Symbol | polA |
ID | 5386980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 22666 |
End bp | 25464 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640862979 |
Product | DNA polymerase I |
Protein accession | YP_001399022 |
Protein GI | 153948148 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00121039 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCAGA TTGCAGAAAA CCCATTGATC CTTGTTGACG GTTCCTCTTA CCTCTACCGT GCTTACCATG CTTTCCCACC ACTGACTAAT GGTAGCGGGG AGCCAACCGG TGCGATGTAT GGCGTTCTGA ACATGCTACG CAGTCTGCTG CTGCAATATC GACCAAGTCA TGTTGCAGTA GTTTTTGATG CAAAAGGTAA AACGTTCCGT GATGAACTTT TTGCTGAATA TAAATCCCAC CGACCGCCAA TGCCGGATGA TTTACGGGTG CAAATTGAAC CTCTTCATCA GATGGTTAAA GCGATGGGAT TGCCTTTATT GGTCGTTTCT GGCGTCGAAG CTGATGATGT TATTGGCACC CTAGCTCGGG AAGCCGAAAA AGCAGGCCAT TCGGTATTAA TCAGTACTGG TGATAAAGAC ATGGCGCAAT TAGTTACGCC GAATATCACT TTGATCAATA CCATGAATAA CACTATTTTA GGGCCGCAGG ATGTTTGCGA TAAATATGGT GTCCCTCCCG AACTGATCAT TGATTTTCTC GCTCTGATGG GTGACTCCTC CGATAATATT CCGGGAGTAC CTGGTGTTGG CGAAAAAACA GCACAGGCTT TATTGCAAGG TTTGGGTGGG TTGGATACGT TATTCAGTAA TCTGGATAAA ATATCAACCT TGACGTTCCG TGGTGCGAAA ACAATGTCGG CCAAGCTAGA GCAAAATAAA GACGTTGCTT ATCTCTCCTA TAAATTGGCT ACTATCAAGA CAGACGTTGA GTTAGATGTA ACCTGTGATG AGCTAACCGT TTCTCCTCCC GATGATAAAC AGTTACATCA GTTATTCAGT CGTTATGAAT TTAAGCGTTG GCTGGCTGAT GTCGAAGCCG GCAAATGGTT GGACAGTAAA AAAGATCGGC CAACAGGGCA AACAAGCAGT CAATCTTTTG TTGCCGCAGA TACGGCCCCT ACTGCTGAAG TCACCGCAGT GCTTTCACAA GAGAATTACC AGACTATTTT AGATGAGAAA GCATTGTCTG ATTGGATTGA GCGCCTTAAA GCCGCTGAAG TTTTTGCTTT TGATACTGAA ACTGATGGCC TTGATACCCT TAGCTGTAAC TTAATTGGGA TGTCTTTTGC TGTCGCTCCA GGTGAAGCGG CTTATCTGCC TCTGGCTCAT GATTACCTGG ATGCCCCGCC CCAACTTGAC CGTGACTGGG TTCTGGCGAC CCTGAAACCA CTTCTGGAAG ACGATAAGGC GCTTAAAGTT GGGCAGAACC TCAAGTTCGA TAAAAGTATG CTGGCTCGTT ATGGTATCGA TCTAAAAGGT ATTGCTTTCG ATACCATGCT GGAGTCTTAT GTTTTGGATA GTGTTGCGGG CCGTCATGAT ATGGACAGCT TGGCGGAGCG CTACCTCAAT CATAAAACGA TTACGTTTGA AGAGATTGCC GGTAAAGGTA AAAATCAGCT AACGTTTAAT CAGATTGCGT TGGAGCAAGC TGGCCCGTAT GCCGCAGAGG ATGCCGATGT TACCCTTCAA TTACATTTGG TCTTATGGCC AAAATTACAG CAAAGTGAAG GCCTCAAGCG GGTATTCCAA GAAATTGAGA TGCCGTTATT GCCGATCTTG TCTCGTATTG AGCGGACTGG CGTATTGATT GACCAAAATA TATTAGCGGC ACACTCAAAA GAGCTCACCA TCCGTTTGGA TGAGCTGGAA AAGCAGGCCC ATGAATTGGC TGAAGAGCCA TTCAACCTGG CATCACCTAA ACAGCTACAG GCTATTCTTT ATGAAAAGCA AAAATTGCCT ATCTTGAAGA AAACACCTGG AGGCGCGGCG TCAACGAATG AGGAAGTGCT GGCTGAGTTG GCTCTGGATT ATCCTTTGCC GAAGGTGATT CTGGAATATC GTGGTCTGGC GAAACTAAAA AGCACTTACA CCGACAAATT GCCGCTAATG ATTAACCCCG TCTCCGGTCG GGTACACACT TCCTATCATC AGGCAGTGAC AGCAACCGGG CGCTTGTCTT CCCGCGATCC TAACCTACAA AATATCCCTG TACGTAATGA AGAGGGGCGA CGTATTCGCC AGGCCTTTAT TGCACCGAAG GGCTACTGCA TTATGGCGGC CGACTATTCG CAAATTGAAC TGCGTATTAT GGCGCATTTG TCGCAGGATA ACGGGCTGTT AGCTGCATTT GCTGCTGGGC AGGATATTCA CCGGGCAACC GCCGCGGAAG TATTTGGTTC GCCATTGGAA AAAGTGACGA CGGAGCAGCG TCGTAGCGCA AAAGCGATTA ATTTTGGTTT GATTTATGGC ATGAGTGCTT TTGGCTTGGC ACGCCAGTTG GGGATCCCTC GTGGAGAGGC TCAACGTTAT ATGGATCTCT ATTTTGAGCG TTATCCGGGG GTGTTGGAGT ATATGGAGCG TACTCGTAAA CAGGCTGCTG AGCAGGGCTA TGTCACGACA CTGGATGGTC GCCGCCTCTA TCTGCCGGAT ATTCACTCAC GGAATGCAAA CCGTCGAAAA GCGGCTGAAC GTGAGGCGAT TAATGCCCCT ATGCAAGGTA CGGCTGCGGA TATTATCAAG CGGGCGATGA TTGCGGTGGA TGGTTGGTTA CAGCAAGAGC CAGAACCGTT GGTGCGTGTC ATCATGCAAG TACACGATGA ATTGGTCTTT GAAGTGCATG AAAGTGTTTT GCAAAGTGCT GAGCAGAAAA TCCGTGAGTT GATGGAGCAA AGTATGCAAC TGGCTGTGCC ATTGAAGGTG GATGTCGGTG TTGGCGCTAA CTGGGATCAA GCTCATTAG
|
Protein sequence | MAQIAENPLI LVDGSSYLYR AYHAFPPLTN GSGEPTGAMY GVLNMLRSLL LQYRPSHVAV VFDAKGKTFR DELFAEYKSH RPPMPDDLRV QIEPLHQMVK AMGLPLLVVS GVEADDVIGT LAREAEKAGH SVLISTGDKD MAQLVTPNIT LINTMNNTIL GPQDVCDKYG VPPELIIDFL ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLFSNLDK ISTLTFRGAK TMSAKLEQNK DVAYLSYKLA TIKTDVELDV TCDELTVSPP DDKQLHQLFS RYEFKRWLAD VEAGKWLDSK KDRPTGQTSS QSFVAADTAP TAEVTAVLSQ ENYQTILDEK ALSDWIERLK AAEVFAFDTE TDGLDTLSCN LIGMSFAVAP GEAAYLPLAH DYLDAPPQLD RDWVLATLKP LLEDDKALKV GQNLKFDKSM LARYGIDLKG IAFDTMLESY VLDSVAGRHD MDSLAERYLN HKTITFEEIA GKGKNQLTFN QIALEQAGPY AAEDADVTLQ LHLVLWPKLQ QSEGLKRVFQ EIEMPLLPIL SRIERTGVLI DQNILAAHSK ELTIRLDELE KQAHELAEEP FNLASPKQLQ AILYEKQKLP ILKKTPGGAA STNEEVLAEL ALDYPLPKVI LEYRGLAKLK STYTDKLPLM INPVSGRVHT SYHQAVTATG RLSSRDPNLQ NIPVRNEEGR RIRQAFIAPK GYCIMAADYS QIELRIMAHL SQDNGLLAAF AAGQDIHRAT AAEVFGSPLE KVTTEQRRSA KAINFGLIYG MSAFGLARQL GIPRGEAQRY MDLYFERYPG VLEYMERTRK QAAEQGYVTT LDGRRLYLPD IHSRNANRRK AAEREAINAP MQGTAADIIK RAMIAVDGWL QQEPEPLVRV IMQVHDELVF EVHESVLQSA EQKIRELMEQ SMQLAVPLKV DVGVGANWDQ AH
|
| |