Gene Information |
Locus tag | EcSMS35_0195 |
Symbol | dnaE |
ID | 6145063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 214308 |
End bp | 217790 |
Gene Length | 3483 bp |
Protein Length | 1160 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615096 |
Product | DNA polymerase III subunit alpha |
Protein accession | YP_001742312 |
Protein GI | 170682325 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
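
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon (neither is stated in the record; the function names are illustrative only).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    n_bp = gene_length(214308, 217790)  # Start bp and End bp from the record
    print(n_bp, protein_length(n_bp))
```

Under these assumptions the computed values agree with the 3483 bp gene length and 1160 aa protein length listed in the record.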
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.220929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAC CACGTTTCGT ACACCTGCGG GTGCACAGCG ACTACTCGAT GATCGATGGC CTGGCCAAAA CCGCACCGCT GGTAAAAAAG GCGGCGGCGT TGGGTATGCC AGCACTGGCG ATCACCGATT TCACCAACCT TTGCGGTCTG GTGAAGTTCT ACGGAGCGGG ACATGGCGCA GGGATTAAGC CCATCGTCGG GGCGGATTTT AACGTCCAGT GCGACCTGCT GGGTGATGAG TTAACCCACC TGACGGTGCT GGCGGCGAAC AATACCGGCT ATCAGAATCT GACGTTGCTG ATCTCAAAAG CGTATCAGCG CGGGTACGGT GCCGCCGGGC CGATCATCGA TCGCGACTGG CTTATCGAAT TAAATGAAGG GTTGATCCTT CTTTCCGGCG GGCGCATGGG CGACGTTGGA CGCAGTCTTT TGCGTGGTAA CAGCGCGCTG GTAGATGAGT GTGTCGCGTT TTATGAAGAA CACTTCCCGG ATCGCTATTT TCTCGAGCTG ATCCGCACCG GCAGGCCGGA CGAAGAAAGC TATCTGCACG CTGCGGTGGA ACTGGCGGAA TCACGCGGTT TGCCCGTTGT GGCGACCAAC GACGTGCGCT TTATCGACAG CAGCGACTTT GACGCACACG AAATCCGCGT CGCGATCCAC GACGGCTTTA CCCTCGACGA TCCTAAACGC CCGCGTAACT ATTCGCCGCA GCAATATATG CGTAGCGAAG AGGAGATGTG CGAGCTTTTT GCCGACATTC CCGAAGCCCT TGCCAACACC GTTGAGATAG CCAAACGTTG TAACGTAACT GTGCGTCTGG GTGAATACTT CTTGCCGCAA TTCCCGACCG GGGACATGAG CACCGAAGAT TATCTGGTCA AGCGTGCAAA AGAAGGCCTG GAAGAGCGTC TGGCCTTTTT ATTCCCTGAC GAGGAAGAAC GTCTGAAGCG CCGCCCGGAA TATGACGAGC GTCTGGAGAC TGAACTTCAG GTTATCAACC AGATGGGCTT CCCGGGTTAC TTCCTCATCG TTATGGAATT TATCCAGTGG TCGAAAGATA ACGGCGTACC GGTAGGGCCA GGCCGTGGCT CCGGTGCGGG TTCACTGGTG GCCTACGCGC TGAAAATCAC CGACCTCGAT CCGCTGGAAT TTGACCTGCT GTTCGAACGT TTCCTTAACC CGGAACGTGT CTCCATGCCT GACTTCGACG TTGACTTCTG TATGGAGAAA CGCGACCAGG TTATCGAACA TGTGGCGGAC ATGTACGGTC GCGATGCGGT ATCGCAGATT ATCACCTTCG GTACGATGGC GGCGAAAGCG GTGATCCGCG ACGTAGGCCG CGTGCTGGGC CATCCGTACG GCTTTGTCGA TCGTATCTCG AAACTGATCC CGCCCGATCC GGGGATGACG CTGGCGAAAG CGTTTGAAGC CGAGCCGCAG CTGCCGGAAA TCTACGAAGC GGATGAAGAA GTTAAGGCGC TGATCGACAT GGCGCGCAAA CTGGAAGGGG TCACCCGTAA CGCCGGTAAG CACGCCGGTG GGGTGGTTAT CGCACCGACC AAAATTACCG ATTTTGCGCC GCTTTACTGC GATGAAGAGG GCAAACATCC GGTCACCCAG TTTGATAAAA GCGACGTTGA ATACGCCGGG CTGGTGAAGT TCGACTTCCT CGGTTTGCGT ACGCTCACCA TCATCAACTG GGCGCTGGAG ATGATCAACA AGCGGCGGGC GAAGAATGGC GAGCCGCCGC TGGATATCGC CGCGATCCCG CTGGATGACA AGAAAAGCTT CGACATGCTG 
CAACGCTCGG AAACCACGGC GGTATTCCAG CTTGAATCGC GCGGCATGAA GGACCTGATC AAGCGTCTGC AACCTGACTG CTTCGAAGAT ATGATCGCAC TGGTGGCACT GTTCCGCCCT GGTCCGTTGC AGTCAGGGAT GGTGGATAAC TTTATCGACC GTAAACATGG TCGCGAAGAG ATCTCCTATC CGGACGTGCA GTGGCAGCAT GAAAGCCTGA AACCGGTACT GGAGCCAACC TACGGCATCA TCCTGTATCA GGAACAGGTC ATGCAGATTG CCCAGGTGCT TTCTGGTTAT ACCCTCGGTG GCGCGGATAT GCTGCGTCGT GCGATGGGTA AGAAAAAGCC GGAAGAGATG GCTAAGCAGC GTTCTGTATT TGCTGAAGGT GCAGAAAAGA ACGGAATTAA CGCCGAACTG GCGATGAAAA TCTTCGACCT GGTGGAGAAA TTCGCGGGTT ACGGATTTAA CAAATCGCAC TCTGCGGCCT ATGCTTTGGT GTCATATCAA ACGTTATGGC TGAAAGCGCA CTATCCGGCG GAGTTTATGG CGGCGGTAAT GACCGCCGAT ATGGACAACA CCGAGAAGGT GGTGGGCCTG GTGGATGAGT GCTGGCGGAT GGGGCTGAAA ATCCTGCCAC CAGATATAAA CTCCGGTCTT TACCATTTCC ACGTCAACGA CGACGGCGAA ATCGTGTATG GTATTGGCGC GATCAAAGGG GTCGGTGAAG GTCCGATTGA GGCCATCATC GAAGCCCGTA ATAAAGGCGG CTACTTCCGC GAACTGTTTG ATCTCTGCGC CCGAACCGAC ACCAAAAAGT TAAACCGGCG AGTGCTGGAA AAATTGATCA TGTCCGGGGC GTTTGACCGT CTTGGGCCAC ACCGCGCGGC GTTGATGAAC TCGCTGGGCG ATGCGTTAAA AGCGGCTGAT CAACACGCAA AAGCGGAAGC CATCGGTCAG GCCGATATGT TCGGCGTGCT GGCAGAAGAG CCGGAACAAA TTGAACAATC CTACGCCAGC TGCCAACCGT GGCCGGAGCA GGTGGTATTA GATGGGGAAC GTGAAACGTT AGGTCTGTAC CTGACGGGAC ACCCTATCAA CCAGTATTTA AAAGAGATTG AGCGTTATGT CGGAGGCATA AGGCTGAAAG ACATGCACCC GACAGAACGT GGTAAAGTCA TCACGGCTGC GGGGCTCGTT GTTGCTGCGC GGGTTATGGT CACCAAGCGC GGCAATCGTA TCGGTATCTG CACGCTGGAT GACCGTTCCG GGCGGCTGGA AGTGATGTTG TTTACTGACG CCCTGGATAA ATACCAGCAA TTGCTGGAAA AAGACCGCAT ACTTATCGTC AGCGGACAGG TCAGCTTTGA TGACTTCAGC GGTGGGCTTA AAATGACCGC TCGCGAAGTG ATGGATATTG ACGAAGCCCG GGAAAAATAT GCTCGCGGGC TTGCTATCTC GCTGACGGAC AGGCAAATTG ATGACCAGCT TTTAAACCGA CTCCGTCAGT CTCTGGAACC CCACCGCTCT GGGACAATTC CAGTACATCT CTACTATCAG AGGGCGGATG CACGCGCGCG GTTGCGTTTT GGCGCGACGT GGCGTGTCTC TCCGAGCGAT CGTTTATTAA ACGATCTCCG TGGCCTCATT GGTTCGGAGC AGGTGGAACT GGAGTTTGAC TAA
|
Protein sequence | MSEPRFVHLR VHSDYSMIDG LAKTAPLVKK AAALGMPALA ITDFTNLCGL VKFYGAGHGA GIKPIVGADF NVQCDLLGDE LTHLTVLAAN NTGYQNLTLL ISKAYQRGYG AAGPIIDRDW LIELNEGLIL LSGGRMGDVG RSLLRGNSAL VDECVAFYEE HFPDRYFLEL IRTGRPDEES YLHAAVELAE SRGLPVVATN DVRFIDSSDF DAHEIRVAIH DGFTLDDPKR PRNYSPQQYM RSEEEMCELF ADIPEALANT VEIAKRCNVT VRLGEYFLPQ FPTGDMSTED YLVKRAKEGL EERLAFLFPD EEERLKRRPE YDERLETELQ VINQMGFPGY FLIVMEFIQW SKDNGVPVGP GRGSGAGSLV AYALKITDLD PLEFDLLFER FLNPERVSMP DFDVDFCMEK RDQVIEHVAD MYGRDAVSQI ITFGTMAAKA VIRDVGRVLG HPYGFVDRIS KLIPPDPGMT LAKAFEAEPQ LPEIYEADEE VKALIDMARK LEGVTRNAGK HAGGVVIAPT KITDFAPLYC DEEGKHPVTQ FDKSDVEYAG LVKFDFLGLR TLTIINWALE MINKRRAKNG EPPLDIAAIP LDDKKSFDML QRSETTAVFQ LESRGMKDLI KRLQPDCFED MIALVALFRP GPLQSGMVDN FIDRKHGREE ISYPDVQWQH ESLKPVLEPT YGIILYQEQV MQIAQVLSGY TLGGADMLRR AMGKKKPEEM AKQRSVFAEG AEKNGINAEL AMKIFDLVEK FAGYGFNKSH SAAYALVSYQ TLWLKAHYPA EFMAAVMTAD MDNTEKVVGL VDECWRMGLK ILPPDINSGL YHFHVNDDGE IVYGIGAIKG VGEGPIEAII EARNKGGYFR ELFDLCARTD TKKLNRRVLE KLIMSGAFDR LGPHRAALMN SLGDALKAAD QHAKAEAIGQ ADMFGVLAEE PEQIEQSYAS CQPWPEQVVL DGERETLGLY LTGHPINQYL KEIERYVGGI RLKDMHPTER GKVITAAGLV VAARVMVTKR GNRIGICTLD DRSGRLEVML FTDALDKYQQ LLEKDRILIV SGQVSFDDFS GGLKMTAREV MDIDEAREKY ARGLAISLTD RQIDDQLLNR LRQSLEPHRS GTIPVHLYYQ RADARARLRF GATWRVSPSD RLLNDLRGLI GSEQVELEFD
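
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below is one possible way to do that, to compute a GC fraction, and to translate a coding sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as starts). All function names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional TCAG order; the amino-acid assignments
# are the same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    prefix = clean_sequence("ATGTCTGAAC CACGTTTCGT ACACCTGCGG")
    print(translate(prefix))  # prints MSEPRFVHLR, the first protein block
```

Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare against the GC content field; the sketch does not assert that result.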