Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_3147 |
Symbol | dnaE |
ID | 4610982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | - |
Start bp | 3292459 |
End bp | 3296034 |
Gene Length | 3576 bp |
Protein Length | 1191 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639792818 |
Product | DNA polymerase III subunit alpha |
Protein accession | YP_939131 |
Protein GI | 119869179 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0480917 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.366201 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGGTT CAGACGGTCG GTCCTCGGGG TCTTTTGTTC ACCTGCACAA CCACACCGAG TATTCGATGC TGGACGGCGC CGCCAAGGTC AAGCCGATGC TCGCGGAGGC CCAACGGCTG GAGATGCCCG CGATCGGGAT GACCGACCAC GGAAACATGT TCGGCGCCAG CGAGTTCTAC AACGCGGCCA CCGACGCCGG CATCAAACCG ATCATCGGGA TCGAGGCCTA CATCGCACCG GCCTCGCGGT TCGAGACCAA GCGTGTCCTG TGGGGTGATC CGAGCCAGAA ATCCGACGAC GTCTCCGGCA GCGGGTCCTA CACCCACATG ACGATGGTCG CCGAGAACGC GACCGGTCTG CGCAACCTGT TCAAGCTCTC CTCACTCGCC TCGTTCGAGG GTCAGCTCGG CAAGTGGTCA CGGATGGACG CCGAGATCAT CGCCGAACAC GCCGAGGGCA TCATCGCCAC CACGGGCTGC CCGTCCGGCG AAGTGCAGAC CCGGCTGCGG CTGGGGCACC AGCGTGAGGC GCTGGAGGCG GCGGCCAAGT GGCGCGAGAT CTTCGGGCCG CAGAACTTCT TCCTCGAGTT GATGGACCAC GGGCTCGACA TCGAACGCCG CGTCCGCGAA GGGCTGCTCG AGATCGGTCA GAAGCTCGGT ATCCCGCCGC TGGCCACCAA CGACTGCCAC TACGTCACCC GAGACGCCTC GCAGAACCAC GAGGCGCTGC TGTGCATCCA GACCGGTAAG ACACTCTCGG ATCCGACCCG CTTCAAATTC GACGGCGACG GCTACTACCT GAAGTCGGCA GCCGAGATGC GCGCGCTGTG GGACTCCCAG GTGCCCGGCG CGTGTGACTC GACACTGCTG ATCGCCGAAC GCGTGCAGTC CTACGCCGAC GTGTGGGCGC CGCGCGACCG GATGCCGATC TTCCCGGTCC CCGAGGGGCA CGATCAGGCC TCCTGGCTGC ACCACGAGGT GATGGCCGGG CTCAAGCGCC GCTTCAGTGC GGTCTCCGGC GGGGTCGTGC CGAATGACTA CATCGAGCGC GCCGAGTACG AGATCAAGGT CATCTGCGAC AAGGGCTTCC CGTCCTACTT CCTCATCGTC GCCGACCTGA TCAACTACGC GAAGTCGGTC GACATCCGCG TCGGGCCGGG CCGCGGGTCG GCCGCCGGCT CGCTGGTGGC CTACGCGCTG GGCATCACCA ACATCGACCC GATCCCGCAC GGTCTGCTGT TCGAGCGCTT CCTCAACCCG GAGCGGCCGT CGGCGCCCGA TATCGACATC GACTTCGACG ACCGTCGCCG CGGCGAGATG CTGCGCTATG CGGCCAACAA GTGGGGCAGT GACCGCGTCG CCCAGGTCAT CACGTTCGGC ACCATCAAAA CCAAGGCCGC GCTGAAGGAT TCGGCCCGGG TGCACTACGG CCAGCCGGGT TTCGCGATCG CCGACCGGAT CACCAAGGCA CTGCCGCCGC CGATCATGGC CAAGGACATC CCGGTGTCGG GCATCACCGA CCCCACCCAC GAGCGGTACA AGGAGGCCGC CGAGGTCCGC GCCCTGATCG ACACCGACCC GGACGTCCGC ACCATCTACG AGACCGCTCG CGGCCTCGAG GGTCTGGTCC GCAACGCCGG CGTGCACGCG TGCGCGGTCA TCATGAGCTC CGAACCGCTG ATCGACGCGA TCCCGTTGTG GCGCCGCCCG CAGGACGGTG CGGTGATCAC CGGCTGGGAC TATCCGTCAT GTGAGGCCAT CGGCCTGCTG AAGATGGACT TCCTCGGGCT GCGGAACCTG ACGATCATCG GCGACTGCAT CGAGAACATC AAGGCCAACC GCGGTGTCGA CCTGGACCTC GAATCGCTGG CGCTCGACGA TCCCAAGGCC TACGAACTGC TCGGCCGCGG CGACACGCTC GGGGTGTTCC AGCTCGACGG CGGGCCGATG CGCGATCTGC TGCGCCGCAT GCAGCCCACC GAGTTCAACG ACATCGTCGC CGTGCTGGCG CTCTACCGGC CCGGCCCGAT GGGCATGAAC GCCCACAACG ACTACGCCGA CCGCAAGAAC GGCCGCCAGC CGATCAAGCC GATCCACCCC GAACTCGAAG AGCCGCTCAA GGAGATCCTC GCCGAGACCT ACGGCCTGAT CGTCTACCAA GAGCAGATCA TGTTCATCGC CCAGAAGGTC GCCTCCTACA CGATGGGTAA GGCCGACGCC CTGCGCAAGG CCATGGGCAA GAAGAAGCTC GAGGTGCTCG AGGCCGAGTA CCAGGGGTTC CGCGAGGGCA TGACCGCCAA CGGGTTCTCC GAGGCGGCGG TGAAAGCCCT GTGGGACACC ATCCTTCCGT TCGCCGGCTA CGCGTTCAAC AAATCGCATG CGGCGGGCTA CGGGCTGGTG TCGTACTGGA CGGCGTACCT GAAGGCCAAC TATCCGGCCG AGTACATGGC GGGTCTGCTC ACCTCGGTCG GGGACGACAA GGACAAGGCC GCGGTGTACC TGGCGGACTG CCGGCGGTTG GGTATCACCG TGCTGCCGCC CGACGTCAAC GAGTCGGTGC AGAACTTCGC CTCCGTCGGT GACGACATCC GGTTCGGTCT CGGTGCGGTG CGCAACGTCG GCGCGAATGT GGTTGCGTCC CTGGTGAACA CCCGCGCCGA GAAGGGTAAG TACTCCGACT TCTCGGACTA CCTGAACAAG ATCGACATCG CCGCCTGCAA CAAGAAGGTG ACGGAGTCGC TGATCAAGGC CGGCGCATTC GATTCGCTCG GCCATCCCCG TAAGGGTCTG TTCCTCATCC ACACCGATGC CGTCGACTCG GTGCTGGGCA CCAAGAAGGC CGAGGCGATG GGTCAGTTCG ACCTGTTCGG CAGCGGGGAC GGTTCCGGGG CGGACGCCGG AGACTCAGCG TTCAGCATCA AGGTGCCCGA CGAGGAGTGG GAGGACAAAC ACAAGCTCGC CCTCGAACGG GAGATGCTCG GTCTGTACGT GTCCGGACAC CCGCTCAACG GGGTGGCGCA CCTGCTCGCG AACCAGGTCG ACACCCAGAT CCCCGCGATC CTCGACGGTG ACGTCGCCAA CGATGCGCAG GTGCTGGTCG GGGGCATCCT CGCCTCGGTC AACCGCCGGG TGAACAAGAA CGGGTTGCCC TGGGCCTCAG CACAATTGGA GGATCTGACC GGCGGGATCG AGGTGCTGTT CTTCCCGCAG ACCTACTCGG TGTTCGGCGC GGAGATCGCC GACGACGTGG TGGTGCTGGT GAAGGCCAAG GTGGCCGCTC GCGACGACCG CATCGCGCTG ATCGCCCACG AACTCGTCGT GCCCGACTTC TCCAGCGCGC AGGCCGACCG GCCCCTTGCG GTCAGCCTGC CCACCCGGCA GTGCACGGTC GACAAGGTCA CCGCGCTGAA GCAGGTGCTG GCCAACCATC CCGGCACCTC CCAGGTGCAC CTGCGGTTGA TCAGCGGTGA GCGGATCACG ACGCTCGAAC TCGACCAGTC ACTGCGGGTG ACGCCCTCGT CGGCGCTGAT GGGGGATCTC AAGGCGCTGC TCGGCCCTGG CTGTCTCGGG GGTTGA
|
Protein sequence | MSGSDGRSSG SFVHLHNHTE YSMLDGAAKV KPMLAEAQRL EMPAIGMTDH GNMFGASEFY NAATDAGIKP IIGIEAYIAP ASRFETKRVL WGDPSQKSDD VSGSGSYTHM TMVAENATGL RNLFKLSSLA SFEGQLGKWS RMDAEIIAEH AEGIIATTGC PSGEVQTRLR LGHQREALEA AAKWREIFGP QNFFLELMDH GLDIERRVRE GLLEIGQKLG IPPLATNDCH YVTRDASQNH EALLCIQTGK TLSDPTRFKF DGDGYYLKSA AEMRALWDSQ VPGACDSTLL IAERVQSYAD VWAPRDRMPI FPVPEGHDQA SWLHHEVMAG LKRRFSAVSG GVVPNDYIER AEYEIKVICD KGFPSYFLIV ADLINYAKSV DIRVGPGRGS AAGSLVAYAL GITNIDPIPH GLLFERFLNP ERPSAPDIDI DFDDRRRGEM LRYAANKWGS DRVAQVITFG TIKTKAALKD SARVHYGQPG FAIADRITKA LPPPIMAKDI PVSGITDPTH ERYKEAAEVR ALIDTDPDVR TIYETARGLE GLVRNAGVHA CAVIMSSEPL IDAIPLWRRP QDGAVITGWD YPSCEAIGLL KMDFLGLRNL TIIGDCIENI KANRGVDLDL ESLALDDPKA YELLGRGDTL GVFQLDGGPM RDLLRRMQPT EFNDIVAVLA LYRPGPMGMN AHNDYADRKN GRQPIKPIHP ELEEPLKEIL AETYGLIVYQ EQIMFIAQKV ASYTMGKADA LRKAMGKKKL EVLEAEYQGF REGMTANGFS EAAVKALWDT ILPFAGYAFN KSHAAGYGLV SYWTAYLKAN YPAEYMAGLL TSVGDDKDKA AVYLADCRRL GITVLPPDVN ESVQNFASVG DDIRFGLGAV RNVGANVVAS LVNTRAEKGK YSDFSDYLNK IDIAACNKKV TESLIKAGAF DSLGHPRKGL FLIHTDAVDS VLGTKKAEAM GQFDLFGSGD GSGADAGDSA FSIKVPDEEW EDKHKLALER EMLGLYVSGH PLNGVAHLLA NQVDTQIPAI LDGDVANDAQ VLVGGILASV NRRVNKNGLP WASAQLEDLT GGIEVLFFPQ TYSVFGAEIA DDVVVLVKAK VAARDDRIAL IAHELVVPDF SSAQADRPLA VSLPTRQCTV DKVTALKQVL ANHPGTSQVH LRLISGERIT TLELDQSLRV TPSSALMGDL KALLGPGCLG G
|
| |