Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_3057 |
Symbol | |
ID | 4610891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | - |
Start bp | 3199074 |
End bp | 3201836 |
Gene Length | 2763 bp |
Protein Length | 920 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639792727 |
Product | DNA polymerase I |
Protein accession | YP_939041 |
Protein GI | 119869089 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.826907 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGGCGG GGGTGCATAG AGTTGAGCCC GTGAGCCCAG CCAGCACCTC GACCGACAAG AAGACGGCCA CCAGCCAGAA GGCGGACGAC AAGCCGACAC TGATGCTGCT GGACGGCAAC TCGCTGGCGT ACCGCGCCTT CTACGCACTC CCCGCCGAGA ACTTCAAGAC CCAGGGCGGC CTGACCACCA ACGCGGTCTA CGGATTCACC GCGATGCTGA TCAACCTGCT CCGCGACGAG CAGCCCTCCC ATGTGGCCGC CGCGTTCGAC GTCTCCCGGC AGACGTTCCG CGTCGACAAG TACCCGGAGT ACAAGGCCGG CCGCTCGTCG ACTCCGGACG AGTTCCGCGG GCAGATCGAC ATCGCCAAGG AGGTGCTCGT CGCGCTTGGC ATCGCCGTGC TGGCCGAACC CGGCTTCGAG GCCGACGACA TCATCGCCAC GCTCGCCACC CAGGCCGAGG GTGAGGGTTA CCGGGTGCTC GTCGTGACCG GTGACCGCGA CGCGCTCCAG CTCGTCAGCG ACGACGTGAC GGTGCTCTAT CCCCGTAAGG GCGTCAGCGA CCTGACCCGC TTCACGCCCG ACGCGGTGCA GGAGAAGTAC GGGCTGACGC CGCAGCAGTA CCCGGACTTC GCCGCGCTGC GGGGCGATCC GAGCGACAAC CTGCCCGGCA TCCCGGGCGT GGGGGAGAAG ACGGCGACCA AGTGGATCGC CGAGTACGGG TCGCTGCAGG CCCTGGTGGA CAACGTCGAC CAGGTCAAGG GCAAGGTCGG CGATGCTCTG CGGGCCAACC TGTCCCACGT GGTGCTCAAC CGCGAACTCA CCGACCTCGT CAAGGACGTC CCGCTTCCGC ACACTCCCGA CACCCTGCGG GTGCAGCCGT GGGACCGCGA TCAGCTCCAC CGGCTGTTCG ACGACCTCGA GTTCCGGGTG CTGCGCGACC GCCTGTTCGA GACTCTGGCG GCCGTCGAAC CCGAGGTCGA ACACGGCTTC GACGTGCGGG GCAAGGCGCT CGAACGGGGT GAACTCGCGG CCTGGCTGTC CGAACACAGC CTCGGCAACC GGTTCGGGCT CGCGGTCGTC GGCACCCATC TCGCCTACGA CGCCGACGCC ACCGCGCTGG CCATCGTCGC CGCCGACGGC GACGGTCGCT ACATCGACAC GACCTCGCTC GACCCGGACG ACGAGGCGGC GCTGGCGTCG TGGCTGGCCG ACCCGGGCCC GCCGAAGGCG CTGCACGAAG CCAAGCTCGC CATGCACGAC CTCGCCGGTC GCGGCTGGAA GCTCGCCGGG GTCACCTCCG ACACGGCGCT GGCCGCCTAC CTGGTGCGGC CCGGGCAGCG CAGCTTCAGC CTCGACGATC TGTCGCTGCG CTACCTGCGT CGGGAACTGC GCGCGGACAA CCCTGCGCAA CAACAACTCT CACTGCTCGA CGACAGTGAC GGCGGCGACG ACCAAGCCGT CCAGACGCTG ATCCTGCGGG CCGTGGCGGT GCTCGACCTC GCCGACGCGC TCGACGAGGA ACTCGCCCGC ATCGACTCGT CGTCGCTACT GGGCCGGATG GAGTTGCCGG TGCAGCGGGT GCTCGCCGAG ATGGAGCACA CCGGTATCGC CGTCGACATC GATCAACTGC GGCAGTTGCA GAGCGAGTTC GCCGACCTGA TCCGCGATGC CGCCGAGGCG GCCTACGCGG TGATCGGCAA GCAGATCAAC CTCGGCTCGC CCAAACAGCT GCAGGCGGTG CTGTTCGACG AACTCGAGAT GCCCAAGACT AAGCGGACCA AGACCGGATA CACCACGGAT GCCGATGCAC TGCAGTCGTT GTTCGACAAG ACCGGTCATC CTTTCCTGCA GCATCTGCTG GCCCATCGCG ACGCCACCCG GCTCAAGGTG ACCGTCGACG GCCTCTTGAA CTCCGTCGCC TCCGACGGCC GTATCCACAC GACGTTCAAC CAGACGATCG CCGCGACCGG TCGGCTGTCG TCGACCGAAC CGAATCTGCA GAACATCCCG ATCCGCACCG AGGCGGGCCG GCGTATCCGC GACGCCTTCG TGGTCGGAGA CGGATACACC GAAGTCATGA CCGCCGACTA CAGCCAGATC GAGATGCGCA TCATGGCGCA CCTGTCACAG GACGCCGGTC TGATCGAGGC GTTCAACACC GGAGAGGACC TGCACTCGTT CGTCGCCTCG CGCGCGTTCG ACGTGCCGAT CGACGAGGTG ACCCCGGACC TGCGCCGCCG GGTCAAGGCG ATGTCGTACG GGTTGGCCTA CGGGCTGAGC GCGTACGGAT TGGCCGCGCA GCTCAAGATC TCCACCGAAG AGGCCAAGGT CCAGATGGAC CAGTACTTCG CCCGTTTCGG CGGTATCCGC GATTACCTGC GCGACGTCGT CGACCAGGCC CGCAAGGACG GCTACACCTC GACGGTGTTC GGTCGCAGGC GGTATCTGCC CGAACTCGAC AGCAGCAACC GCAACGTCCG CGAGGCCGCG GAGCGGGCCG CGCTCAACGC CCCGATCCAG GGCAGCGCCG CGGACATCAT CAAGGTCGCG ATGATCGACG TCGATCAGGC GATCAAGGAC GCCGGCCTCT CGTCACGGAT GCTGCTGCAG GTGCACGACG AGTTGCTGTT CGAGGTCGCC GACGGCGAGC GCGACACGCT CGAGAAGCTG GTCCGCGACA AGATGGGCAG CGCGTACGCC CTCGACGTTC CGCTCGAGGT CAGTGTCGGG TACGGCCGGA GCTGGGACGC GGCGGCGCAC TGA
|
Protein sequence | MSAGVHRVEP VSPASTSTDK KTATSQKADD KPTLMLLDGN SLAYRAFYAL PAENFKTQGG LTTNAVYGFT AMLINLLRDE QPSHVAAAFD VSRQTFRVDK YPEYKAGRSS TPDEFRGQID IAKEVLVALG IAVLAEPGFE ADDIIATLAT QAEGEGYRVL VVTGDRDALQ LVSDDVTVLY PRKGVSDLTR FTPDAVQEKY GLTPQQYPDF AALRGDPSDN LPGIPGVGEK TATKWIAEYG SLQALVDNVD QVKGKVGDAL RANLSHVVLN RELTDLVKDV PLPHTPDTLR VQPWDRDQLH RLFDDLEFRV LRDRLFETLA AVEPEVEHGF DVRGKALERG ELAAWLSEHS LGNRFGLAVV GTHLAYDADA TALAIVAADG DGRYIDTTSL DPDDEAALAS WLADPGPPKA LHEAKLAMHD LAGRGWKLAG VTSDTALAAY LVRPGQRSFS LDDLSLRYLR RELRADNPAQ QQLSLLDDSD GGDDQAVQTL ILRAVAVLDL ADALDEELAR IDSSSLLGRM ELPVQRVLAE MEHTGIAVDI DQLRQLQSEF ADLIRDAAEA AYAVIGKQIN LGSPKQLQAV LFDELEMPKT KRTKTGYTTD ADALQSLFDK TGHPFLQHLL AHRDATRLKV TVDGLLNSVA SDGRIHTTFN QTIAATGRLS STEPNLQNIP IRTEAGRRIR DAFVVGDGYT EVMTADYSQI EMRIMAHLSQ DAGLIEAFNT GEDLHSFVAS RAFDVPIDEV TPDLRRRVKA MSYGLAYGLS AYGLAAQLKI STEEAKVQMD QYFARFGGIR DYLRDVVDQA RKDGYTSTVF GRRRYLPELD SSNRNVREAA ERAALNAPIQ GSAADIIKVA MIDVDQAIKD AGLSSRMLLQ VHDELLFEVA DGERDTLEKL VRDKMGSAYA LDVPLEVSVG YGRSWDAAAH
|
| |