Gene Mkms_3147 details

Gene Information

Locus tag: Mkms_3147
Symbol: dnaE
ID: 4610982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 3292459
End bp: 3296034
Gene Length: 3576 bp
Protein Length: 1191 aa
Translation table: 11
GC content: 66%
IMG OID: 639792818
Product: DNA polymerase III subunit alpha
Protein accession: YP_939131
Protein GI: 119869179
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
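
The gene length and protein length above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3292459, 3296034)
print(length)                  # 3576
print(protein_length(length))  # 1191
```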


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0480917
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.366201
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGTGGTT CAGACGGTCG GTCCTCGGGG TCTTTTGTTC ACCTGCACAA CCACACCGAG 
TATTCGATGC TGGACGGCGC CGCCAAGGTC AAGCCGATGC TCGCGGAGGC CCAACGGCTG
GAGATGCCCG CGATCGGGAT GACCGACCAC GGAAACATGT TCGGCGCCAG CGAGTTCTAC
AACGCGGCCA CCGACGCCGG CATCAAACCG ATCATCGGGA TCGAGGCCTA CATCGCACCG
GCCTCGCGGT TCGAGACCAA GCGTGTCCTG TGGGGTGATC CGAGCCAGAA ATCCGACGAC
GTCTCCGGCA GCGGGTCCTA CACCCACATG ACGATGGTCG CCGAGAACGC GACCGGTCTG
CGCAACCTGT TCAAGCTCTC CTCACTCGCC TCGTTCGAGG GTCAGCTCGG CAAGTGGTCA
CGGATGGACG CCGAGATCAT CGCCGAACAC GCCGAGGGCA TCATCGCCAC CACGGGCTGC
CCGTCCGGCG AAGTGCAGAC CCGGCTGCGG CTGGGGCACC AGCGTGAGGC GCTGGAGGCG
GCGGCCAAGT GGCGCGAGAT CTTCGGGCCG CAGAACTTCT TCCTCGAGTT GATGGACCAC
GGGCTCGACA TCGAACGCCG CGTCCGCGAA GGGCTGCTCG AGATCGGTCA GAAGCTCGGT
ATCCCGCCGC TGGCCACCAA CGACTGCCAC TACGTCACCC GAGACGCCTC GCAGAACCAC
GAGGCGCTGC TGTGCATCCA GACCGGTAAG ACACTCTCGG ATCCGACCCG CTTCAAATTC
GACGGCGACG GCTACTACCT GAAGTCGGCA GCCGAGATGC GCGCGCTGTG GGACTCCCAG
GTGCCCGGCG CGTGTGACTC GACACTGCTG ATCGCCGAAC GCGTGCAGTC CTACGCCGAC
GTGTGGGCGC CGCGCGACCG GATGCCGATC TTCCCGGTCC CCGAGGGGCA CGATCAGGCC
TCCTGGCTGC ACCACGAGGT GATGGCCGGG CTCAAGCGCC GCTTCAGTGC GGTCTCCGGC
GGGGTCGTGC CGAATGACTA CATCGAGCGC GCCGAGTACG AGATCAAGGT CATCTGCGAC
AAGGGCTTCC CGTCCTACTT CCTCATCGTC GCCGACCTGA TCAACTACGC GAAGTCGGTC
GACATCCGCG TCGGGCCGGG CCGCGGGTCG GCCGCCGGCT CGCTGGTGGC CTACGCGCTG
GGCATCACCA ACATCGACCC GATCCCGCAC GGTCTGCTGT TCGAGCGCTT CCTCAACCCG
GAGCGGCCGT CGGCGCCCGA TATCGACATC GACTTCGACG ACCGTCGCCG CGGCGAGATG
CTGCGCTATG CGGCCAACAA GTGGGGCAGT GACCGCGTCG CCCAGGTCAT CACGTTCGGC
ACCATCAAAA CCAAGGCCGC GCTGAAGGAT TCGGCCCGGG TGCACTACGG CCAGCCGGGT
TTCGCGATCG CCGACCGGAT CACCAAGGCA CTGCCGCCGC CGATCATGGC CAAGGACATC
CCGGTGTCGG GCATCACCGA CCCCACCCAC GAGCGGTACA AGGAGGCCGC CGAGGTCCGC
GCCCTGATCG ACACCGACCC GGACGTCCGC ACCATCTACG AGACCGCTCG CGGCCTCGAG
GGTCTGGTCC GCAACGCCGG CGTGCACGCG TGCGCGGTCA TCATGAGCTC CGAACCGCTG
ATCGACGCGA TCCCGTTGTG GCGCCGCCCG CAGGACGGTG CGGTGATCAC CGGCTGGGAC
TATCCGTCAT GTGAGGCCAT CGGCCTGCTG AAGATGGACT TCCTCGGGCT GCGGAACCTG
ACGATCATCG GCGACTGCAT CGAGAACATC AAGGCCAACC GCGGTGTCGA CCTGGACCTC
GAATCGCTGG CGCTCGACGA TCCCAAGGCC TACGAACTGC TCGGCCGCGG CGACACGCTC
GGGGTGTTCC AGCTCGACGG CGGGCCGATG CGCGATCTGC TGCGCCGCAT GCAGCCCACC
GAGTTCAACG ACATCGTCGC CGTGCTGGCG CTCTACCGGC CCGGCCCGAT GGGCATGAAC
GCCCACAACG ACTACGCCGA CCGCAAGAAC GGCCGCCAGC CGATCAAGCC GATCCACCCC
GAACTCGAAG AGCCGCTCAA GGAGATCCTC GCCGAGACCT ACGGCCTGAT CGTCTACCAA
GAGCAGATCA TGTTCATCGC CCAGAAGGTC GCCTCCTACA CGATGGGTAA GGCCGACGCC
CTGCGCAAGG CCATGGGCAA GAAGAAGCTC GAGGTGCTCG AGGCCGAGTA CCAGGGGTTC
CGCGAGGGCA TGACCGCCAA CGGGTTCTCC GAGGCGGCGG TGAAAGCCCT GTGGGACACC
ATCCTTCCGT TCGCCGGCTA CGCGTTCAAC AAATCGCATG CGGCGGGCTA CGGGCTGGTG
TCGTACTGGA CGGCGTACCT GAAGGCCAAC TATCCGGCCG AGTACATGGC GGGTCTGCTC
ACCTCGGTCG GGGACGACAA GGACAAGGCC GCGGTGTACC TGGCGGACTG CCGGCGGTTG
GGTATCACCG TGCTGCCGCC CGACGTCAAC GAGTCGGTGC AGAACTTCGC CTCCGTCGGT
GACGACATCC GGTTCGGTCT CGGTGCGGTG CGCAACGTCG GCGCGAATGT GGTTGCGTCC
CTGGTGAACA CCCGCGCCGA GAAGGGTAAG TACTCCGACT TCTCGGACTA CCTGAACAAG
ATCGACATCG CCGCCTGCAA CAAGAAGGTG ACGGAGTCGC TGATCAAGGC CGGCGCATTC
GATTCGCTCG GCCATCCCCG TAAGGGTCTG TTCCTCATCC ACACCGATGC CGTCGACTCG
GTGCTGGGCA CCAAGAAGGC CGAGGCGATG GGTCAGTTCG ACCTGTTCGG CAGCGGGGAC
GGTTCCGGGG CGGACGCCGG AGACTCAGCG TTCAGCATCA AGGTGCCCGA CGAGGAGTGG
GAGGACAAAC ACAAGCTCGC CCTCGAACGG GAGATGCTCG GTCTGTACGT GTCCGGACAC
CCGCTCAACG GGGTGGCGCA CCTGCTCGCG AACCAGGTCG ACACCCAGAT CCCCGCGATC
CTCGACGGTG ACGTCGCCAA CGATGCGCAG GTGCTGGTCG GGGGCATCCT CGCCTCGGTC
AACCGCCGGG TGAACAAGAA CGGGTTGCCC TGGGCCTCAG CACAATTGGA GGATCTGACC
GGCGGGATCG AGGTGCTGTT CTTCCCGCAG ACCTACTCGG TGTTCGGCGC GGAGATCGCC
GACGACGTGG TGGTGCTGGT GAAGGCCAAG GTGGCCGCTC GCGACGACCG CATCGCGCTG
ATCGCCCACG AACTCGTCGT GCCCGACTTC TCCAGCGCGC AGGCCGACCG GCCCCTTGCG
GTCAGCCTGC CCACCCGGCA GTGCACGGTC GACAAGGTCA CCGCGCTGAA GCAGGTGCTG
GCCAACCATC CCGGCACCTC CCAGGTGCAC CTGCGGTTGA TCAGCGGTGA GCGGATCACG
ACGCTCGAAC TCGACCAGTC ACTGCGGGTG ACGCCCTCGT CGGCGCTGAT GGGGGATCTC
AAGGCGCTGC TCGGCCCTGG CTGTCTCGGG GGTTGA
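
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be compared with the values in the Gene Information section. A minimal sketch of that cleanup follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGTGGTT CAGACGGTCG GTCCTCGGGG TCTTTTGTTC ACCTGCACAA CCACACCGAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applied to all lines of the gene sequence, `len(clean_sequence(...))` can be compared with the stated gene length of 3576 bp, and `gc_fraction` with the stated GC content.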
 
Protein sequence
MSGSDGRSSG SFVHLHNHTE YSMLDGAAKV KPMLAEAQRL EMPAIGMTDH GNMFGASEFY 
NAATDAGIKP IIGIEAYIAP ASRFETKRVL WGDPSQKSDD VSGSGSYTHM TMVAENATGL
RNLFKLSSLA SFEGQLGKWS RMDAEIIAEH AEGIIATTGC PSGEVQTRLR LGHQREALEA
AAKWREIFGP QNFFLELMDH GLDIERRVRE GLLEIGQKLG IPPLATNDCH YVTRDASQNH
EALLCIQTGK TLSDPTRFKF DGDGYYLKSA AEMRALWDSQ VPGACDSTLL IAERVQSYAD
VWAPRDRMPI FPVPEGHDQA SWLHHEVMAG LKRRFSAVSG GVVPNDYIER AEYEIKVICD
KGFPSYFLIV ADLINYAKSV DIRVGPGRGS AAGSLVAYAL GITNIDPIPH GLLFERFLNP
ERPSAPDIDI DFDDRRRGEM LRYAANKWGS DRVAQVITFG TIKTKAALKD SARVHYGQPG
FAIADRITKA LPPPIMAKDI PVSGITDPTH ERYKEAAEVR ALIDTDPDVR TIYETARGLE
GLVRNAGVHA CAVIMSSEPL IDAIPLWRRP QDGAVITGWD YPSCEAIGLL KMDFLGLRNL
TIIGDCIENI KANRGVDLDL ESLALDDPKA YELLGRGDTL GVFQLDGGPM RDLLRRMQPT
EFNDIVAVLA LYRPGPMGMN AHNDYADRKN GRQPIKPIHP ELEEPLKEIL AETYGLIVYQ
EQIMFIAQKV ASYTMGKADA LRKAMGKKKL EVLEAEYQGF REGMTANGFS EAAVKALWDT
ILPFAGYAFN KSHAAGYGLV SYWTAYLKAN YPAEYMAGLL TSVGDDKDKA AVYLADCRRL
GITVLPPDVN ESVQNFASVG DDIRFGLGAV RNVGANVVAS LVNTRAEKGK YSDFSDYLNK
IDIAACNKKV TESLIKAGAF DSLGHPRKGL FLIHTDAVDS VLGTKKAEAM GQFDLFGSGD
GSGADAGDSA FSIKVPDEEW EDKHKLALER EMLGLYVSGH PLNGVAHLLA NQVDTQIPAI
LDGDVANDAQ VLVGGILASV NRRVNKNGLP WASAQLEDLT GGIEVLFFPQ TYSVFGAEIA
DDVVVLVKAK VAARDDRIAL IAHELVVPDF SSAQADRPLA VSLPTRQCTV DKVTALKQVL
ANHPGTSQVH LRLISGERIT TLELDQSLRV TPSSALMGDL KALLGPGCLG G
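
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below builds the codon table from the standard NCBI base ordering (TCAG) and translates the first 30 bases and the last 6 bases of the gene. It assumes table 11 assigns the same amino acid to each codon as the standard code, and it does not handle alternative start codons, which is not an issue here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks of the gene sequence; compare with the protein's first block.
print(translate("ATGAGTGGTT CAGACGGTCG GTCCTCGGGG"))  # MSGSDGRSSG
# Last six bases of the gene: the final glycine followed by the TGA stop codon.
print(translate("GGTTGA"))  # G*
```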