Gene ECH74115_5306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5306 
SymbolpolA 
ID6966611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4946033 
End bp4948819 
Gene Length2787 bp 
Protein Length928 aa 
Translation table11 
GC content52% 
IMG OID643388969 
ProductDNA polymerase I 
Protein accessionYP_002273378 
Protein GI209400301 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000330989 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCAGA TCCCCCAAAA TCCACTTATC CTTGTAGATG GTTCATCTTA TCTTTATCGC 
GCATATCACG CGTTTCCCCC GCTGACTAAC AGCGCAGGCG AGCCGACCGG TGCGATGTAT
GGTGTCCTCA ACATGCTGCG CAGTCTGATC ATGCAATATA AACCGACGCA TGCAGCGGTG
GTCTTTGACG CCAAGGGAAA AACCTTTCGT GATGAACTGT TTGAACATTA CAAATCACAT
CGCCCGCCAA TGCCGGACGA TCTGCGAGCA CAAATCGAAC CCTTGCACGC GATGGTTAAA
GCGATGGGAC TGCCGCTGCT GGCGGTTTCT GGCGTAGAAG CGGACGACGT TATCGGTACT
CTGGCGCGCG AAGCCGAAAA AGCCGGGCGT CCGGTGCTGA TCAGCACTGG CGATAAAGAT
ATGGCGCAGC TGGTGACGCC AAATATTACG CTTATCAACA CCATGACGAA TACCATCCTC
GGACCGGAAG AGGTGGTGAA TAAGTACGGC GTGCCGCCAG AACTGATCAT CGATTTCCTG
GCGCTGATGG GTGACTCCTC TGATAACATT CCTGGCGTAC CGGGCGTCGG TGAAAAAACC
GCGCAGGCAT TGCTGCAAGG TCTTGGCGGG CTGGATACGC TGTATGCCGA GCCAGAAAAA
ATTGCTGGGT TGAGCTTCCG TGGCGCGAAA ACAATGGCAG CGAAGCTCGA GCAAAACAAA
GAAGTTGCTT ATCTCTCATA CCAGCTGGCG ACGATTAAAA CCGACGTTGA ACTGGAGCTG
ACCTGTGAAC AACTGGAAGT GCAGCAACCG GCAGCGGAAG AGTTGTTGGG GCTGTTCAAA
AAGTATGAGT TCAAACGCTG GACTGCTGAT GTCGAAGCGG GCAAATGGTT ACAGGCCAAA
GGGGCAAAAC CAGCCGCGAA GCCACAGGAA ACCAGTGTTG CAGACGAAGC ACCAGAAGTG
ACGGCAACGG TGATTTCTTA TGACAACTAC GTCACCATCC TTGATGAAGA AACACTGAAA
GCGTGGATTG CGAAGCTGGA AAAAGCGCCG GTATTTGCAT TTGATACCGA AACCGACAGC
CTTGATAACA TCTCAGCTAA CCTGGTCGGG CTTTCTTTTG CTATCGAGCC AGGCGTAGCG
GCATATATTC CGGTTGCTCA TGATTATCTT GATGCGCCCG ATCAAATCTC TCGCGAGCGT
GCACTCGAGT TGCTAAAACC GCTGCTGGAA GATGAAAAGG CGCTGAAGGT CGGGCAAAAC
CTGAAATACG ATCGCGGTAT TCTGGCGAAT TACGGCATTG AGCTGCGTGG GATTGCGTTT
GATACCATGC TGGAGTCCTA CATTCTCAAT AGCGTTGCCG GGCGTCACGA TATGGACAGC
CTCGCGGAAC GTTGGTTGAA GCACAAAACC ATCACTTTTG AAGAGATTGC GGGTAAAGGC
AAAAATCAAC TGACTTTTAA CCAGATTGCC CTCGAAGAGG CCGGACGTTA CGCCGCCGAA
GATGCTGATG TCACCTTGCA GTTGCATCTG AAAATGTGGC CGGATCTGCA AAAACACAAA
GGGCCGTTGA ACGTCTTCGA GAATATCGAA ATGCCGCTGG TGCCAGTACT TTCACGCATT
GAACGTAACG GTGTGAAGAT CGATCCGAAA GTGCTGCACA ATCATTCTGA AGAGCTCACC
CTTCGTCTGG CTGAGCTGGA AAAGAAAGCG CATGAAATTG CAGGTGAGGA ATTTAACCTT
TCTTCCACCA AGCAGTTACA AACCATTCTG TTTGAAAAAC AGGGCATTAA ACCGCTGAAG
AAAACGCCGG GTGGCGCGCC GTCAACGTCG GAAGAGGTAC TGGAAGAACT GGCGCTGGAC
TATCCGTTGC CAAAAGTGAT TCTGGAGTAT CGTGGTCTGG CGAAGCTGAA ATCGACTTAC
ACCGACAAGC TGCCGCTGAT GATCAACCCG AAAACCGGGC GTGTGCATAC CTCTTATCAC
CAGGCTGTAA CTGCAACGGG ACGTTTATCG TCAACCGATC CTAACCTGCA AAACATTCCG
GTGCGTAACG AAGAAGGTCG TCGTATCCGC CAGGCGTTTA TTGCGCCAGA GGATTATGTG
ATTGTCTCAG CGGACTACTC GCAGATTGAA CTGCGCATTA TGGCGCATCT CTCGCGTGAC
AAAGGCTTGC TGACCGCATT CGCGGAAGGA AAAGATATCC ACCGGGCAAC GGCGGCAGAA
GTGTTTGGTT TGCCACTGGA AACCGTCACC AGCGAGCAAC GCCGTAGCGC GAAAGCGATC
AACTTTGGTC TGATTTATGG CATGAGTGCT TTCGGTCTGG CGCGGCAATT GAACATTCCA
CGTAAAGAAG CGCAGAAGTA CATGGACCTT TACTTCGAAC GCTACCCTGG CGTGCTGGAG
TATATGGAAC GCACCCGTGC TCAGGCGAAA GAGCAGGGCT ACGTTGAAAC GCTGGACGGA
CGCCGTCTGT ATCTGCCGGA TATCAAATCC AGCAATGGTG CTCGTCGTGC AGCGGCTGAA
CGTGCAGCCA TTAACGCGCC AATGCAGGGA ACCGCCGCCG ACATTATCAA ACGGGCGATG
ATTGCCGTTG ATGCGTGGTT ACAGGCTGAG CAACCGCGTG TACGTATGAT CATGCAGGTA
CACGATGAAC TGGTATTTGA AGTTCATAGA GATGATGTCG ATGCCGTTGC GAAGCAGATT
CATCAACTGA TGGAAAACTG TACCCGTCTG GATGTGCCGT TGCTGGTGGA AGTGGGGAGT
GGCGAAAACT GGGATCAGGC GCACTAA
 
Protein sequence
MVQIPQNPLI LVDGSSYLYR AYHAFPPLTN SAGEPTGAMY GVLNMLRSLI MQYKPTHAAV 
VFDAKGKTFR DELFEHYKSH RPPMPDDLRA QIEPLHAMVK AMGLPLLAVS GVEADDVIGT
LAREAEKAGR PVLISTGDKD MAQLVTPNIT LINTMTNTIL GPEEVVNKYG VPPELIIDFL
ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLYAEPEK IAGLSFRGAK TMAAKLEQNK
EVAYLSYQLA TIKTDVELEL TCEQLEVQQP AAEELLGLFK KYEFKRWTAD VEAGKWLQAK
GAKPAAKPQE TSVADEAPEV TATVISYDNY VTILDEETLK AWIAKLEKAP VFAFDTETDS
LDNISANLVG LSFAIEPGVA AYIPVAHDYL DAPDQISRER ALELLKPLLE DEKALKVGQN
LKYDRGILAN YGIELRGIAF DTMLESYILN SVAGRHDMDS LAERWLKHKT ITFEEIAGKG
KNQLTFNQIA LEEAGRYAAE DADVTLQLHL KMWPDLQKHK GPLNVFENIE MPLVPVLSRI
ERNGVKIDPK VLHNHSEELT LRLAELEKKA HEIAGEEFNL SSTKQLQTIL FEKQGIKPLK
KTPGGAPSTS EEVLEELALD YPLPKVILEY RGLAKLKSTY TDKLPLMINP KTGRVHTSYH
QAVTATGRLS STDPNLQNIP VRNEEGRRIR QAFIAPEDYV IVSADYSQIE LRIMAHLSRD
KGLLTAFAEG KDIHRATAAE VFGLPLETVT SEQRRSAKAI NFGLIYGMSA FGLARQLNIP
RKEAQKYMDL YFERYPGVLE YMERTRAQAK EQGYVETLDG RRLYLPDIKS SNGARRAAAE
RAAINAPMQG TAADIIKRAM IAVDAWLQAE QPRVRMIMQV HDELVFEVHR DDVDAVAKQI
HQLMENCTRL DVPLLVEVGS GENWDQAH