Gene EcSMS35_4246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4246 
SymbolpolA 
ID6144695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4341748 
End bp4344534 
Gene Length2787 bp 
Protein Length928 aa 
Translation table11 
GC content52% 
IMG OID641619067 
ProductDNA polymerase I 
Protein accessionYP_001746191 
Protein GI170683498 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000118244 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.166094 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCAGA TCCCCCAAAA TCCACTTATC CTTGTAGATG GTTCATCTTA TCTTTATCGC 
GCATATCACG CGTTTCCCCC GCTGACTAAC AGCGCAGGCG AGCCGACTGG TGCGATGTAT
GGTGTCCTCA ACATGCTGCG CAGTCTGATC ATGCAATATA AACCGACGCA TGCAGCGGTG
GTCTTTGACG CCAAGGGAAA AACCTTTCGT GATGAACTGT TTGAACATTA CAAATCACAT
CGCCCGCCAA TGCCGGACGA TCTGCGAGCA CAAATCGAAC CTCTGCACGC GATGGTTAAA
GCGATGGGAC TGCCGCTGCT GGCGGTTTCT GGCGTAGAAG CGGACGACGT TATCGGTACT
CTGGCGCGCG AAGCCGAAAA AGCCGGGCGT CCGGTGCTTA TCAGCACTGG CGATAAAGAT
ATGGCGCAGC TGGTGACGCC AAATATTACG CTTATCAATA CCATGACGAA TACCATCCTC
GGCCCGGAAG AGGTGGTGAA TAAGTACGGC GTGCCGCCAG AACTGATCAT CGATTTCCTG
GCGCTGATGG GTGACTCCTC TGATAACATT CCTGGCGTAC CGGGCGTCGG TGAAAAAACC
GCGCAGGCAT TGCTGCAAGG TCTTGGCGGA CTGGATACGC TGTATGCCGA GCCAGAAAAA
ATTGCTGGGT TGAGCTTCCG TGGCGCGAAA ACAATGGCAG CGAAGCTCGA GCAAAACAAA
GAAGTTGCTT ATCTCTCATA CCAGCTGGCG ACGATTAAAA CCGACGTTGA ACTGGAGCTG
ACCTGTGAAC AACTGGAAGT GCAGCCACCG GCGGCGGAAG AGTTGTTGGG GCTGTTCAAA
AAGTATGAGT TCAAACGCTG GACTGCTGAT GTCGAAGCGG GCAAATGGTT ACAGGCTAAA
GGGGCGAAAC CCGCAGCCAG GCCGCAGGAA ACCAGTGTTG CAGACGAAGC GCCAGAAGTG
ACGGCAACGG TGATTTCTTA TGACAACTAC GTCACCATCC TTGATGAAGA AACACTGAAA
ACGTGGATTG CGAAGCTGGA AAAAGCGCCG GTATTTGCAT TTGATACCGA AACCGACAGC
CTTGATAACA TCTCTGCTAA CCTGGTCGGG CTTTCTTTTG CTATCGAGCC AGGCGTAGCG
GCATATATTC CGGTTGCTCA TGATTATCTT GATGCGCCCG ATCAAATCTC TCGCGAGCGT
GCACTCGAGT TGCTAAAACC GCTGCTGGAA AATGAAAAGG CGCTGAAGGT CGGGCAAAAC
CTGAAATACG ACCGCGGTAT TCTGGCGAAT TACGGAATTG AGCTGCGTGG GATTGCGTTT
GATACCATGC TGGAGTCCTA CATTCTCAAT AGCGTTGCCG GGCGTCACGA TATGGACAGC
CTCGCGGAAC GTTGGTTGAA GCACAAAACC ATCACTTTTG AAGAGATTGC GGGTAAAGGC
AAAAATCAAC TGACCTTTAA CCAGATTGCC CTCGAAGAAG CCGGACGTTA CGCCGCCGAA
GATGCAGATG TCACCTTGCA GTTGCATCTG AAAATGTGGC CGGATCTGCA AAAACACAAA
GGGCCGTTGA ACGTCTTCGA GAATATCGAA ATGCCGCTGG TGCCGGTGCT TTCACGCATT
GAACGTAACG GTGTGAAGAT CGATCCGAAA GTGCTGCACA ATCACTCTGA AGAGCTCACC
TTGCGTCTGG CTGAGCTGGA AAAGAAAGCG CATGAAATTG CAGGTGAAGA ATTTAACCTT
TCTTCCACCA AGCAGTTACA AACCATTCTG TTTGAAAAAC AGGGCATTAA ACCGCTGAAG
AAAACGCCGG GTGGCGCGCC GTCAACGTCG GAAGAGGTAC TGGAAGAACT GGCGCTGGAC
TATCCGTTGC CAAAAGTGAT TCTGGAGTAT CGTGGTCTGG CGAAGCTGAA ATCGACCTAC
ACCGACAAGC TGCCGTTGAT GATCAACCCG AAAACCGGGC GTGTACATAC CTCTTATCAC
CAGGCAGTAA CCGCAACGGG ACGTTTATCG TCAACCGATC CTAACCTGCA AAACATTCCG
GTGCGTAATG AAGAAGGTCG TCGTATCCGC CAGGCGTTTA TTGCGCCAGA GGATTATGTG
ATTGTCTCGG CGGACTACTC GCAGATTGAA CTGCGCATTA TGGCGCATCT CTCGCGTGAC
AAAGGCTTGC TGACCGCATT CGCGGAAGGA AAAGATATCC ACCGTGCAAC GGCGGCAGAA
GTGTTTGGTT TGCCACTGGA AACCGTCACC AGCGAGCAAC GCCGTAGCGC GAAAGCGATC
AACTTTGGTC TGATTTATGG CATGAGTGCT TTCGGCCTGG CGCGGCAATT GAACATTCCA
CGTAAAGAAG CGCAGAAGTA CATGGACCTT TACTTCGAAC GCTACCCTGG CGTGCTGGAG
TATATGGAAC GCACCCGTGC TCAGGCGAAA GAGCAGGGCT ACGTTGAAAC GCTGGACGGA
CGCCGTCTGT ATCTGCCGGA TATCAAATCC AGCAATGGTG CTCGTCGTGC AGCGGCTGAA
CGTGCAGCCA TTAACGCGCC AATGCAGGGA ACCGCCGCCG ACATTATCAA ACGGGCGATG
ATTGCCGTTG ATGCGTGGCT ACAGGCTGAG CAACCGCGTG TACGTATGAT CATGCAGGTA
CACGATGAAC TGGTATTTGA AGTTCATAAA GATGATGTCG ATGCCGTCGC GAAGCAGATT
CATCAACTGA TGGAAAACTG TACCCGTCTG GATGTGCCGT TGCTGGTGGA AGTGGGGAGT
GGCGAAAACT GGGATCAGGC GCACTAA
 
Protein sequence
MVQIPQNPLI LVDGSSYLYR AYHAFPPLTN SAGEPTGAMY GVLNMLRSLI MQYKPTHAAV 
VFDAKGKTFR DELFEHYKSH RPPMPDDLRA QIEPLHAMVK AMGLPLLAVS GVEADDVIGT
LAREAEKAGR PVLISTGDKD MAQLVTPNIT LINTMTNTIL GPEEVVNKYG VPPELIIDFL
ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLYAEPEK IAGLSFRGAK TMAAKLEQNK
EVAYLSYQLA TIKTDVELEL TCEQLEVQPP AAEELLGLFK KYEFKRWTAD VEAGKWLQAK
GAKPAARPQE TSVADEAPEV TATVISYDNY VTILDEETLK TWIAKLEKAP VFAFDTETDS
LDNISANLVG LSFAIEPGVA AYIPVAHDYL DAPDQISRER ALELLKPLLE NEKALKVGQN
LKYDRGILAN YGIELRGIAF DTMLESYILN SVAGRHDMDS LAERWLKHKT ITFEEIAGKG
KNQLTFNQIA LEEAGRYAAE DADVTLQLHL KMWPDLQKHK GPLNVFENIE MPLVPVLSRI
ERNGVKIDPK VLHNHSEELT LRLAELEKKA HEIAGEEFNL SSTKQLQTIL FEKQGIKPLK
KTPGGAPSTS EEVLEELALD YPLPKVILEY RGLAKLKSTY TDKLPLMINP KTGRVHTSYH
QAVTATGRLS STDPNLQNIP VRNEEGRRIR QAFIAPEDYV IVSADYSQIE LRIMAHLSRD
KGLLTAFAEG KDIHRATAAE VFGLPLETVT SEQRRSAKAI NFGLIYGMSA FGLARQLNIP
RKEAQKYMDL YFERYPGVLE YMERTRAQAK EQGYVETLDG RRLYLPDIKS SNGARRAAAE
RAAINAPMQG TAADIIKRAM IAVDAWLQAE QPRVRMIMQV HDELVFEVHK DDVDAVAKQI
HQLMENCTRL DVPLLVEVGS GENWDQAH