Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4246 |
Symbol | polA |
ID | 6144695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4341748 |
End bp | 4344534 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619067 |
Product | DNA polymerase I |
Protein accession | YP_001746191 |
Protein GI | 170683498 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000118244 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.166094 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCAGA TCCCCCAAAA TCCACTTATC CTTGTAGATG GTTCATCTTA TCTTTATCGC GCATATCACG CGTTTCCCCC GCTGACTAAC AGCGCAGGCG AGCCGACTGG TGCGATGTAT GGTGTCCTCA ACATGCTGCG CAGTCTGATC ATGCAATATA AACCGACGCA TGCAGCGGTG GTCTTTGACG CCAAGGGAAA AACCTTTCGT GATGAACTGT TTGAACATTA CAAATCACAT CGCCCGCCAA TGCCGGACGA TCTGCGAGCA CAAATCGAAC CTCTGCACGC GATGGTTAAA GCGATGGGAC TGCCGCTGCT GGCGGTTTCT GGCGTAGAAG CGGACGACGT TATCGGTACT CTGGCGCGCG AAGCCGAAAA AGCCGGGCGT CCGGTGCTTA TCAGCACTGG CGATAAAGAT ATGGCGCAGC TGGTGACGCC AAATATTACG CTTATCAATA CCATGACGAA TACCATCCTC GGCCCGGAAG AGGTGGTGAA TAAGTACGGC GTGCCGCCAG AACTGATCAT CGATTTCCTG GCGCTGATGG GTGACTCCTC TGATAACATT CCTGGCGTAC CGGGCGTCGG TGAAAAAACC GCGCAGGCAT TGCTGCAAGG TCTTGGCGGA CTGGATACGC TGTATGCCGA GCCAGAAAAA ATTGCTGGGT TGAGCTTCCG TGGCGCGAAA ACAATGGCAG CGAAGCTCGA GCAAAACAAA GAAGTTGCTT ATCTCTCATA CCAGCTGGCG ACGATTAAAA CCGACGTTGA ACTGGAGCTG ACCTGTGAAC AACTGGAAGT GCAGCCACCG GCGGCGGAAG AGTTGTTGGG GCTGTTCAAA AAGTATGAGT TCAAACGCTG GACTGCTGAT GTCGAAGCGG GCAAATGGTT ACAGGCTAAA GGGGCGAAAC CCGCAGCCAG GCCGCAGGAA ACCAGTGTTG CAGACGAAGC GCCAGAAGTG ACGGCAACGG TGATTTCTTA TGACAACTAC GTCACCATCC TTGATGAAGA AACACTGAAA ACGTGGATTG CGAAGCTGGA AAAAGCGCCG GTATTTGCAT TTGATACCGA AACCGACAGC CTTGATAACA TCTCTGCTAA CCTGGTCGGG CTTTCTTTTG CTATCGAGCC AGGCGTAGCG GCATATATTC CGGTTGCTCA TGATTATCTT GATGCGCCCG ATCAAATCTC TCGCGAGCGT GCACTCGAGT TGCTAAAACC GCTGCTGGAA AATGAAAAGG CGCTGAAGGT CGGGCAAAAC CTGAAATACG ACCGCGGTAT TCTGGCGAAT TACGGAATTG AGCTGCGTGG GATTGCGTTT GATACCATGC TGGAGTCCTA CATTCTCAAT AGCGTTGCCG GGCGTCACGA TATGGACAGC CTCGCGGAAC GTTGGTTGAA GCACAAAACC ATCACTTTTG AAGAGATTGC GGGTAAAGGC AAAAATCAAC TGACCTTTAA CCAGATTGCC CTCGAAGAAG CCGGACGTTA CGCCGCCGAA GATGCAGATG TCACCTTGCA GTTGCATCTG AAAATGTGGC CGGATCTGCA AAAACACAAA GGGCCGTTGA ACGTCTTCGA GAATATCGAA ATGCCGCTGG TGCCGGTGCT TTCACGCATT GAACGTAACG GTGTGAAGAT CGATCCGAAA GTGCTGCACA ATCACTCTGA AGAGCTCACC TTGCGTCTGG CTGAGCTGGA AAAGAAAGCG CATGAAATTG CAGGTGAAGA ATTTAACCTT TCTTCCACCA AGCAGTTACA AACCATTCTG TTTGAAAAAC AGGGCATTAA ACCGCTGAAG AAAACGCCGG GTGGCGCGCC GTCAACGTCG GAAGAGGTAC TGGAAGAACT GGCGCTGGAC TATCCGTTGC CAAAAGTGAT TCTGGAGTAT CGTGGTCTGG CGAAGCTGAA ATCGACCTAC ACCGACAAGC TGCCGTTGAT GATCAACCCG AAAACCGGGC GTGTACATAC CTCTTATCAC CAGGCAGTAA CCGCAACGGG ACGTTTATCG TCAACCGATC CTAACCTGCA AAACATTCCG GTGCGTAATG AAGAAGGTCG TCGTATCCGC CAGGCGTTTA TTGCGCCAGA GGATTATGTG ATTGTCTCGG CGGACTACTC GCAGATTGAA CTGCGCATTA TGGCGCATCT CTCGCGTGAC AAAGGCTTGC TGACCGCATT CGCGGAAGGA AAAGATATCC ACCGTGCAAC GGCGGCAGAA GTGTTTGGTT TGCCACTGGA AACCGTCACC AGCGAGCAAC GCCGTAGCGC GAAAGCGATC AACTTTGGTC TGATTTATGG CATGAGTGCT TTCGGCCTGG CGCGGCAATT GAACATTCCA CGTAAAGAAG CGCAGAAGTA CATGGACCTT TACTTCGAAC GCTACCCTGG CGTGCTGGAG TATATGGAAC GCACCCGTGC TCAGGCGAAA GAGCAGGGCT ACGTTGAAAC GCTGGACGGA CGCCGTCTGT ATCTGCCGGA TATCAAATCC AGCAATGGTG CTCGTCGTGC AGCGGCTGAA CGTGCAGCCA TTAACGCGCC AATGCAGGGA ACCGCCGCCG ACATTATCAA ACGGGCGATG ATTGCCGTTG ATGCGTGGCT ACAGGCTGAG CAACCGCGTG TACGTATGAT CATGCAGGTA CACGATGAAC TGGTATTTGA AGTTCATAAA GATGATGTCG ATGCCGTCGC GAAGCAGATT CATCAACTGA TGGAAAACTG TACCCGTCTG GATGTGCCGT TGCTGGTGGA AGTGGGGAGT GGCGAAAACT GGGATCAGGC GCACTAA
|
Protein sequence | MVQIPQNPLI LVDGSSYLYR AYHAFPPLTN SAGEPTGAMY GVLNMLRSLI MQYKPTHAAV VFDAKGKTFR DELFEHYKSH RPPMPDDLRA QIEPLHAMVK AMGLPLLAVS GVEADDVIGT LAREAEKAGR PVLISTGDKD MAQLVTPNIT LINTMTNTIL GPEEVVNKYG VPPELIIDFL ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLYAEPEK IAGLSFRGAK TMAAKLEQNK EVAYLSYQLA TIKTDVELEL TCEQLEVQPP AAEELLGLFK KYEFKRWTAD VEAGKWLQAK GAKPAARPQE TSVADEAPEV TATVISYDNY VTILDEETLK TWIAKLEKAP VFAFDTETDS LDNISANLVG LSFAIEPGVA AYIPVAHDYL DAPDQISRER ALELLKPLLE NEKALKVGQN LKYDRGILAN YGIELRGIAF DTMLESYILN SVAGRHDMDS LAERWLKHKT ITFEEIAGKG KNQLTFNQIA LEEAGRYAAE DADVTLQLHL KMWPDLQKHK GPLNVFENIE MPLVPVLSRI ERNGVKIDPK VLHNHSEELT LRLAELEKKA HEIAGEEFNL SSTKQLQTIL FEKQGIKPLK KTPGGAPSTS EEVLEELALD YPLPKVILEY RGLAKLKSTY TDKLPLMINP KTGRVHTSYH QAVTATGRLS STDPNLQNIP VRNEEGRRIR QAFIAPEDYV IVSADYSQIE LRIMAHLSRD KGLLTAFAEG KDIHRATAAE VFGLPLETVT SEQRRSAKAI NFGLIYGMSA FGLARQLNIP RKEAQKYMDL YFERYPGVLE YMERTRAQAK EQGYVETLDG RRLYLPDIKS SNGARRAAAE RAAINAPMQG TAADIIKRAM IAVDAWLQAE QPRVRMIMQV HDELVFEVHK DDVDAVAKQI HQLMENCTRL DVPLLVEVGS GENWDQAH
|
| |