Gene YpsIP31758_0021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_0021 
SymbolpolA 
ID5386980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp22666 
End bp25464 
Gene Length2799 bp 
Protein Length932 aa 
Translation table11 
GC content47% 
IMG OID640862979 
ProductDNA polymerase I 
Protein accessionYP_001399022 
Protein GI153948148 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00121039 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCAGA TTGCAGAAAA CCCATTGATC CTTGTTGACG GTTCCTCTTA CCTCTACCGT 
GCTTACCATG CTTTCCCACC ACTGACTAAT GGTAGCGGGG AGCCAACCGG TGCGATGTAT
GGCGTTCTGA ACATGCTACG CAGTCTGCTG CTGCAATATC GACCAAGTCA TGTTGCAGTA
GTTTTTGATG CAAAAGGTAA AACGTTCCGT GATGAACTTT TTGCTGAATA TAAATCCCAC
CGACCGCCAA TGCCGGATGA TTTACGGGTG CAAATTGAAC CTCTTCATCA GATGGTTAAA
GCGATGGGAT TGCCTTTATT GGTCGTTTCT GGCGTCGAAG CTGATGATGT TATTGGCACC
CTAGCTCGGG AAGCCGAAAA AGCAGGCCAT TCGGTATTAA TCAGTACTGG TGATAAAGAC
ATGGCGCAAT TAGTTACGCC GAATATCACT TTGATCAATA CCATGAATAA CACTATTTTA
GGGCCGCAGG ATGTTTGCGA TAAATATGGT GTCCCTCCCG AACTGATCAT TGATTTTCTC
GCTCTGATGG GTGACTCCTC CGATAATATT CCGGGAGTAC CTGGTGTTGG CGAAAAAACA
GCACAGGCTT TATTGCAAGG TTTGGGTGGG TTGGATACGT TATTCAGTAA TCTGGATAAA
ATATCAACCT TGACGTTCCG TGGTGCGAAA ACAATGTCGG CCAAGCTAGA GCAAAATAAA
GACGTTGCTT ATCTCTCCTA TAAATTGGCT ACTATCAAGA CAGACGTTGA GTTAGATGTA
ACCTGTGATG AGCTAACCGT TTCTCCTCCC GATGATAAAC AGTTACATCA GTTATTCAGT
CGTTATGAAT TTAAGCGTTG GCTGGCTGAT GTCGAAGCCG GCAAATGGTT GGACAGTAAA
AAAGATCGGC CAACAGGGCA AACAAGCAGT CAATCTTTTG TTGCCGCAGA TACGGCCCCT
ACTGCTGAAG TCACCGCAGT GCTTTCACAA GAGAATTACC AGACTATTTT AGATGAGAAA
GCATTGTCTG ATTGGATTGA GCGCCTTAAA GCCGCTGAAG TTTTTGCTTT TGATACTGAA
ACTGATGGCC TTGATACCCT TAGCTGTAAC TTAATTGGGA TGTCTTTTGC TGTCGCTCCA
GGTGAAGCGG CTTATCTGCC TCTGGCTCAT GATTACCTGG ATGCCCCGCC CCAACTTGAC
CGTGACTGGG TTCTGGCGAC CCTGAAACCA CTTCTGGAAG ACGATAAGGC GCTTAAAGTT
GGGCAGAACC TCAAGTTCGA TAAAAGTATG CTGGCTCGTT ATGGTATCGA TCTAAAAGGT
ATTGCTTTCG ATACCATGCT GGAGTCTTAT GTTTTGGATA GTGTTGCGGG CCGTCATGAT
ATGGACAGCT TGGCGGAGCG CTACCTCAAT CATAAAACGA TTACGTTTGA AGAGATTGCC
GGTAAAGGTA AAAATCAGCT AACGTTTAAT CAGATTGCGT TGGAGCAAGC TGGCCCGTAT
GCCGCAGAGG ATGCCGATGT TACCCTTCAA TTACATTTGG TCTTATGGCC AAAATTACAG
CAAAGTGAAG GCCTCAAGCG GGTATTCCAA GAAATTGAGA TGCCGTTATT GCCGATCTTG
TCTCGTATTG AGCGGACTGG CGTATTGATT GACCAAAATA TATTAGCGGC ACACTCAAAA
GAGCTCACCA TCCGTTTGGA TGAGCTGGAA AAGCAGGCCC ATGAATTGGC TGAAGAGCCA
TTCAACCTGG CATCACCTAA ACAGCTACAG GCTATTCTTT ATGAAAAGCA AAAATTGCCT
ATCTTGAAGA AAACACCTGG AGGCGCGGCG TCAACGAATG AGGAAGTGCT GGCTGAGTTG
GCTCTGGATT ATCCTTTGCC GAAGGTGATT CTGGAATATC GTGGTCTGGC GAAACTAAAA
AGCACTTACA CCGACAAATT GCCGCTAATG ATTAACCCCG TCTCCGGTCG GGTACACACT
TCCTATCATC AGGCAGTGAC AGCAACCGGG CGCTTGTCTT CCCGCGATCC TAACCTACAA
AATATCCCTG TACGTAATGA AGAGGGGCGA CGTATTCGCC AGGCCTTTAT TGCACCGAAG
GGCTACTGCA TTATGGCGGC CGACTATTCG CAAATTGAAC TGCGTATTAT GGCGCATTTG
TCGCAGGATA ACGGGCTGTT AGCTGCATTT GCTGCTGGGC AGGATATTCA CCGGGCAACC
GCCGCGGAAG TATTTGGTTC GCCATTGGAA AAAGTGACGA CGGAGCAGCG TCGTAGCGCA
AAAGCGATTA ATTTTGGTTT GATTTATGGC ATGAGTGCTT TTGGCTTGGC ACGCCAGTTG
GGGATCCCTC GTGGAGAGGC TCAACGTTAT ATGGATCTCT ATTTTGAGCG TTATCCGGGG
GTGTTGGAGT ATATGGAGCG TACTCGTAAA CAGGCTGCTG AGCAGGGCTA TGTCACGACA
CTGGATGGTC GCCGCCTCTA TCTGCCGGAT ATTCACTCAC GGAATGCAAA CCGTCGAAAA
GCGGCTGAAC GTGAGGCGAT TAATGCCCCT ATGCAAGGTA CGGCTGCGGA TATTATCAAG
CGGGCGATGA TTGCGGTGGA TGGTTGGTTA CAGCAAGAGC CAGAACCGTT GGTGCGTGTC
ATCATGCAAG TACACGATGA ATTGGTCTTT GAAGTGCATG AAAGTGTTTT GCAAAGTGCT
GAGCAGAAAA TCCGTGAGTT GATGGAGCAA AGTATGCAAC TGGCTGTGCC ATTGAAGGTG
GATGTCGGTG TTGGCGCTAA CTGGGATCAA GCTCATTAG
 
Protein sequence
MAQIAENPLI LVDGSSYLYR AYHAFPPLTN GSGEPTGAMY GVLNMLRSLL LQYRPSHVAV 
VFDAKGKTFR DELFAEYKSH RPPMPDDLRV QIEPLHQMVK AMGLPLLVVS GVEADDVIGT
LAREAEKAGH SVLISTGDKD MAQLVTPNIT LINTMNNTIL GPQDVCDKYG VPPELIIDFL
ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLFSNLDK ISTLTFRGAK TMSAKLEQNK
DVAYLSYKLA TIKTDVELDV TCDELTVSPP DDKQLHQLFS RYEFKRWLAD VEAGKWLDSK
KDRPTGQTSS QSFVAADTAP TAEVTAVLSQ ENYQTILDEK ALSDWIERLK AAEVFAFDTE
TDGLDTLSCN LIGMSFAVAP GEAAYLPLAH DYLDAPPQLD RDWVLATLKP LLEDDKALKV
GQNLKFDKSM LARYGIDLKG IAFDTMLESY VLDSVAGRHD MDSLAERYLN HKTITFEEIA
GKGKNQLTFN QIALEQAGPY AAEDADVTLQ LHLVLWPKLQ QSEGLKRVFQ EIEMPLLPIL
SRIERTGVLI DQNILAAHSK ELTIRLDELE KQAHELAEEP FNLASPKQLQ AILYEKQKLP
ILKKTPGGAA STNEEVLAEL ALDYPLPKVI LEYRGLAKLK STYTDKLPLM INPVSGRVHT
SYHQAVTATG RLSSRDPNLQ NIPVRNEEGR RIRQAFIAPK GYCIMAADYS QIELRIMAHL
SQDNGLLAAF AAGQDIHRAT AAEVFGSPLE KVTTEQRRSA KAINFGLIYG MSAFGLARQL
GIPRGEAQRY MDLYFERYPG VLEYMERTRK QAAEQGYVTT LDGRRLYLPD IHSRNANRRK
AAEREAINAP MQGTAADIIK RAMIAVDGWL QQEPEPLVRV IMQVHDELVF EVHESVLQSA
EQKIRELMEQ SMQLAVPLKV DVGVGANWDQ AH