Gene YPK_4195 details

Gene Information

Locus tag: YPK_4195
Symbol:
ID: 6089111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 4629710
End bp: 4632508
Gene Length: 2799 bp
Protein Length: 932 aa
Translation table: 11
GC content: 47%
IMG OID: 641599294
Product: DNA polymerase I
Protein accession: YP_001722907
Protein GI: 170026402
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
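
The start and end coordinates, gene length, and protein length listed above are mutually consistent, and a few lines of Python can confirm this. The sketch below assumes 1-based inclusive coordinates and a complete CDS ending in a single stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(4629710, 4632508)
print(length)                  # 2799
print(protein_length(length))  # 932
```

Both values match the Gene Length and Protein Length fields of this record.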


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00580719
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTCAGA TTGCAGAAAA CCCATTGATC CTTGTTGACG GTTCCTCTTA CCTCTACCGT 
GCTTACCATG CTTTCCCACC ACTGACTAAT GGTAGCGGGG AGCCAACCGG TGCGATGTAT
GGCGTTCTGA ACATGCTACG CAGTCTGCTG CTGCAATATC GACCAAGTCA TGTTGCAGTA
GTTTTTGATG CAAAAGGTAA AACGTTCCGT GATGAACTTT TTGCTGAATA TAAATCCCAC
CGACCGCCAA TGCCGGATGA TTTACGGGTG CAAATTGAAC CTCTTCATCA GATGGTTAAA
GCGATGGGAT TGCCTTTATT GGTCGTTTCT GGCGTCGAAG CTGATGATGT TATTGGCACC
CTAGCTCGGG AAGCCGAAAA AGCAGGCCAT TCGGTATTAA TCAGTACTGG TGATAAAGAC
ATGGCGCAAT TAGTTACGCC GAATATCACT TTGATCAATA CCATGAATAA CACTATTTTA
GGGCCGCAGG ATGTTTGCGA TAAATATGGT GTCCCTCCCG AACTGATCAT TGATTTTCTC
GCTCTGATGG GTGACTCCTC CGATAATATT CCGGGAGTAC CTGGTGTTGG CGAAAAAACA
GCACAGGCTT TATTGCAAGG TTTGGGTGGG TTGGATACGT TATTCAGTAA TCTGGATAAA
ATATCAACCT TGACGTTCCG TGGTGCGAAA ACAATGTCGG CCAAGCTAGA GCAAAATAAA
GACGTTGCTT ATCTCTCCTA TAAATTGGCT ACTATCAAGA CAGACGTTGA GTTAGATGTA
ACCTGTGATG AGCTAACCGT TTCTCCTCCC GATGATAAAC AGTTACATCA GTTATTCAGT
CGTTATGAAT TTAAGCGTTG GCTGGCTGAT GTCGAAGCCG GCAAATGGTT GGACAGTAAA
AAAGATCGGC CAACAGGGCA AACAAGCAGT CAATCTTTTG TTGCCGCAGA TACGGCCCCT
ACTGCTGAAG TCACCGCAGT GCTTTCACAA GAGAATTACC AGACTATTTT AGATGAGAAA
GCATTGGCTG ATTGGATTGA GCGCCTTAAA GCCGCTGAAG TTTTTGCTTT TGATACTGAA
ACTGATGGCC TTGATACCCT TAGCTGTAAC TTAATTGGGA TGTCTTTTGC TGTCGCTCCA
GGTGAAGCGG CTTATCTGCC TCTGGCTCAT GATTACCTGG ATGCCCCGCC CCAACTTGAC
CGTGACTGGG TTCTGGCGAC CCTGAAACCA CTTCTGGAAG ACGATAAGGC GCTTAAAGTT
GGGCAGAACC TCAAGTTCGA TAAAAGTATG CTGGCTCGTT ATGGTATCGA TCTAAAAGGT
ATTGCTTTCG ATACCATGCT GGAGTCTTAT GTTTTGGATA GTGTTGCGGG CCGTCATGAT
ATGGACAGCT TGGCGGAGCG CTACCTCAAT CATAAAACGA TTACGTTTGA AGAGATTGCC
GGTAAAGGTA AAAATCAGCT AACGTTTAAT CAGATTGCGT TGGAGCAAGC TGGCCCGTAT
GCCGCAGAGG ATGCCGATGT TACCCTTCAA TTACATTTGG TCTTATGGCC AAAATTACAG
CAAAGTGAAG GCCTCAAGCG GGTATTCCAA GAAATTGAGA TGCCGTTATT GCCGATCTTG
TCTCGTATTG AGCGAACTGG CGTATTGATT GACCAAAATA TATTAGCGGC ACACTCAAAA
GAGCTCACCA TCCGTTTGGA TGAGCTGGAA AAGCAGGCCC ATGAATTGGC TGAAGAGCCA
TTCAACCTGG CATCACCTAA ACAGCTACAG GCTATTCTTT ATGAAAAGCA AAAATTGCCT
ATCTTGAAGA AAACACCTGG AGGCGCGGCG TCAACGAATG AGGAAGTGCT GGCTGAGTTG
GCTCTGGATT ATCCTTTGCC GAAGGTGATT CTGGAATATC GTGGTCTGGC GAAACTAAAA
AGCACTTACA CCGACAAATT GCCGCTAATG ATTAACCCCG TCTCCGGTCG GGTACACACT
TCCTATCATC AGGCAGTGAC AGCAACCGGG CGCTTGTCTT CCCGCGATCC TAACCTACAA
AATATCCCTG TACGTAATGA AGAGGGGCGA CGTATTCGTC AGGCCTTTAT TGCACCGAAG
GGCTACTGCA TTATGGCGGC CGACTACTCG CAAATTGAAC TGCGTATTAT GGCGCATTTG
TCGCAGGATA ACGGGCTGTT AGCTGCATTT GCTGCTGGGC AGGATATCCA CCGGGCAACC
GCCGCGGAAG TATTTGGTTC GCCATTGGAA AAAGTGACGA CGGAGCAGCG TCGTAGCGCA
AAAGCGATTA ATTTTGGTTT GATTTATGGC ATGAGTGCTT TTGGCTTGGC ACGCCAGTTG
GGGATCCCTC GTGGAGAGGC TCAACGTTAT ATGGATCTCT ATTTTGAGCG TTATCCGGGG
GTGTTGGAGT ATATGGAGCG TACTCGTAAA CAGGCTGCTG AGCAGGGCTA TGTCACGACA
CTGGATGGTC GCCGCCTCTA TCTGCCGGAT ATTCACTCAC GGAATGCAAA CCGTCGAAAA
GCGGCTGAAC GTGAGGCGAT TAATGCCCCT ATGCAAGGTA CGGCTGCGGA TATTATCAAG
CGGGCGATGA TTGCGGTGGA TGGTTGGTTA CAGCAAGAGC CAGAACCGTT GGTGCGTGTC
ATCATGCAAG TACACGATGA ATTGGTCTTT GAAGTGCATG AAAGTGTTTT GCAAAGTGCT
GAGCAGAAAA TCCGTGAGTT GATGGAGCAA AGTATGCAAC TGGCTGTGCC ATTGAAGGTG
GATGTCGGTG TTGGCGCTAA CTGGGATCAA GCTCATTAG
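
The gene sequence is laid out in blocks of 10 bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such text. The helper names are hypothetical, and how the database rounds its reported 47% is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and newlines of the 10-base block layout; upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as a short demonstration.
first_line = "ATGGCTCAGA TTGCAGAAAA CCCATTGATC CTTGTTGACG GTTCCTCTTA CCTCTACCGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(gc_percent(seq))
```

Passing the full gene sequence text to `clean_sequence` and then `gc_percent` gives the value for the whole gene.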
 
Protein sequence
MAQIAENPLI LVDGSSYLYR AYHAFPPLTN GSGEPTGAMY GVLNMLRSLL LQYRPSHVAV 
VFDAKGKTFR DELFAEYKSH RPPMPDDLRV QIEPLHQMVK AMGLPLLVVS GVEADDVIGT
LAREAEKAGH SVLISTGDKD MAQLVTPNIT LINTMNNTIL GPQDVCDKYG VPPELIIDFL
ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLFSNLDK ISTLTFRGAK TMSAKLEQNK
DVAYLSYKLA TIKTDVELDV TCDELTVSPP DDKQLHQLFS RYEFKRWLAD VEAGKWLDSK
KDRPTGQTSS QSFVAADTAP TAEVTAVLSQ ENYQTILDEK ALADWIERLK AAEVFAFDTE
TDGLDTLSCN LIGMSFAVAP GEAAYLPLAH DYLDAPPQLD RDWVLATLKP LLEDDKALKV
GQNLKFDKSM LARYGIDLKG IAFDTMLESY VLDSVAGRHD MDSLAERYLN HKTITFEEIA
GKGKNQLTFN QIALEQAGPY AAEDADVTLQ LHLVLWPKLQ QSEGLKRVFQ EIEMPLLPIL
SRIERTGVLI DQNILAAHSK ELTIRLDELE KQAHELAEEP FNLASPKQLQ AILYEKQKLP
ILKKTPGGAA STNEEVLAEL ALDYPLPKVI LEYRGLAKLK STYTDKLPLM INPVSGRVHT
SYHQAVTATG RLSSRDPNLQ NIPVRNEEGR RIRQAFIAPK GYCIMAADYS QIELRIMAHL
SQDNGLLAAF AAGQDIHRAT AAEVFGSPLE KVTTEQRRSA KAINFGLIYG MSAFGLARQL
GIPRGEAQRY MDLYFERYPG VLEYMERTRK QAAEQGYVTT LDGRRLYLPD IHSRNANRRK
AAEREAINAP MQGTAADIIK RAMIAVDGWL QQEPEPLVRV IMQVHDELVF EVHESVLQSA
EQKIRELMEQ SMQLAVPLKV DVGVGANWDQ AH
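
The protein sequence can be checked against the gene sequence by translating it. The sketch below builds the codon table from the NCBI base ordering (T, C, A, G). Translation table 11 uses the same amino-acid assignments as the standard code and differs in its permitted start codons. The sketch assumes an ATG start, as in this gene, and does not handle alternative start codons such as GTG or TTG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str, to_stop: bool = True) -> str:
    """Translate a nucleotide sequence in frame 1; unknown codons become 'X'."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGGCTCAGA TTGCAGAAAA CCCATTGATC"))  # MAQIAENPLI
```

Applied to the full gene sequence, the result can be compared with the protein sequence above after removing the spaces between its 10-residue blocks.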