Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mjls_3026 |
Symbol | |
ID | 4878739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. JLS |
Kingdom | Bacteria |
Replicon accession | NC_009077 |
Strand | - |
Start bp | 3161126 |
End bp | 3163915 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640140325 |
Product | DNA polymerase I |
Protein accession | YP_001071296 |
Protein GI | 126435605 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.248484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0929498 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGGCGG GGGTGCATAG AGTTGAGCCC GTGAGCCCAG CCAGCACCTC GACCGACAAG AAGACGGCCA CCAGCCAGAA GGCGGACGAC AAGCCGACAC TGATGCTGCT GGACGGCAAC TCGCTGGCGT ACCGCGCCTT CTACGCACTC CCCGCCGAGA ACTTCAAGAC CCAGGGCGGC CTGACCACCA ACGCGGTCTA CGGATTCACC GCGATGCTGA TCAACCTCCT CCGCGACGAG CAGCCCTCCC ATGTGGCCGC CGCGTTCGAC GTCTCCCGGC AGACGTTCCG CGTCGACAAG TACCCCGAGT ACAAGGCCGG CCGCTCGTCG ACTCCGGACG AGTTCCGCGG GCAGATCGAC ATCGCCAAGG AGGTGCTCGT CGCGCTGGGC ATCGCCGTGC TGGCCGAACC CGGCTTCGAG GCCGACGACA TCATCGCCAC GCTCGCCACC CAGGCCGAGG GTGAGGGTTA CCGGGTGCTC GTCGTGACCG GTGACCGCGA CGCGCTCCAG CTCGTCAGCG ACGACGTGAC CGTGCTCTAT CCCCGTAAGG GCGTCAGCGA CCTGACCCGC TTCACGCCCG ACGCGGTGCA GGAGAAGTAC GGGCTGACGC CGCAGCAGTA CCCGGACTTC GCCGCGCTGC GGGGCGATCC GAGCGACAAC CTGCCCGGCA TCCCGGGCGT GGGGGAGAAG ACGGCGACCA AGTGGATCGC CGAGTACGGT TCGCTGCAGG CCCTGGTCGA CAACGTCGAC CAGGTCAAGG GCAAGGTCGG CGATGCTCTG CGGGCCAACC TGTCTCACGT GGTGCTCAAT CGCGAACTCA CCGACCTCGT CAAGGACGTC CCGCTTCCGC ACACTCCCGA CACTCTGCGG GTGCAGCCGT GGGACCGCGA TCAGATCCAC CGGCTGTTCG ACGACCTCGA GTTCCGGGTG CTGCGCGACC GCCTGTTCGA GACTCTGGCG GCCGTCGAAC CCGAGGTCGA ACACGGCTTC GACGTGCGGG GCAAGGCGCT CGAACGGGGT GAACTCGCGG CCTGGCTGTC CGAACACAGC CTCGGCAACC GGTTCGGGCT CGCGGTCGTC GGCACCCATC TCGCCTACGA CGCCGACGCC ACCGCGCTGG CCATCGTCGC TGCCGACGGC GACGGTCGCT ACATCGACAC GACCTCGCTC GACCCGGACG ACGAGGCGGC GCTGGCGTCG TGGCTGGCCG ACCCGGGCCC GCCGAAGGCG CTGCACGAAG CCAAGCTCGC CATGCACGAC CTCGCCGGCC GCGGCTGGAA GCTCGCCGGG GTCACCTCCG ACACGGCGCT GGCCGCCTAC CTGGTGCGGC CCGGGCAGCG CAGCTTCAGC CTCGACGATC TGTCGTTGCG CTACCTGCGT CGGGAACTGC GCGCGGACAA CCCTGCGCAA CAACAACTCT CACTGCTCGA CGACAGTGAC GGCGGCGACG ACCAGGCCGT CCAGACGCTG ATCCTGCGGG CCGTGGCGGT ACTCGACCTC GCCGACGCCC TCGACGAGGA ACTCGCCCGC ATCGACTCGT CGTCGCTACT GGGCCGGATG GAGTTGCCGG TGCAGCGGGT GCTCGCCGAG ATGGAGCACA CCGGTATCGC CGTCGACATC GATCAACTGC GGCAGTTGCA GAGCGAGTTC GCCGACCTGA TCCGCGACGC CGCCGAGGCG GCCTACGCGG TGATCGGCAA GCAGATCAAC CTCGGCTCGC CCAAACAGCT GCAGGCGGTG CTGTTCGACG AACTCGCGAT GCCCAAGACG AAGCGGACCA AGACCGGATA CACCACGGAC GCCGATGCGC TGCAGTCGTT GTTCGACAAG ACCGGTCATC CTTTCCTGCA GCATCTGCTG GCCCATCGCG ACGCCACCCG GCTCAAGGTG ACCGTCGACG GCCTCCTGAA CTCCGTCGCC TCCGACGGCC GCATCCACAC GACGTTCAAC CAGACGATCG CCGCGACCGG TCGGCTGTCG TCGACCGAAC CGAATCTGCA GAACATCCCG ATCCGCACCG AGGCGGGCCG GCGTATCCGC GACGCCTTCG TGGTGGAAAG TCGCCCTCCG GGCTCCGGCG GCGGCGCGGT CTTCACCGAA CTCATGACCG CCGACTACAG CCAGATCGAG ATGCGCATCA TGGCACACCT GTCACAGGAC GCCGGTCTGA TCGAGGCGTT CAACACCGGA GAGGACCTGC ACTCGTTCGT CGCCTCGCGC GCGTTCGACG TGCCGATCGA CGAGGTGACC CCGGACCTGC GCCGCCGGGT CAAGGCGATG TCCTACGGGT TGGCCTACGG GCTGAGCGCG TACGGATTGG CCGCGCAGCT CAAGATCTCC ACCGAAGAGG CCAAGGTCCA GATGGACCAG TACTTCGCCC GTTTCGGCGG TATCCGCGAC TACCTGCGCG ACGTCGTCGA CCAGGCCCGC AAGGACGGCT ACACCTCGAC GGTGTTCGGT CGCCGGCGGT ACCTGCCCGA ACTCGACAGC AGCAACCGCA ACGTCCGCGA GGCCGCGGAG CGGGCCGCGC TCAACGCCCC GATCCAGGGC AGCGCCGCGG ACATCATCAA GGTCGCGATG ATCGACGTCG ATCAGGCGAT CAAGGACGCC GGCCTCTCGT CACGGATGCT GCTGCAGGTG CACGACGAGT TGCTGTTCGA GGTCGCCGAC GGCGAGCGCG ACACGCTCGA GAAGCTGGTC CGCGACAAGA TGGGCAGCGC GTACGCCCTC GACGTGCCGC TCGAGGTCAG TGTCGGGTAC GGCCGGAGCT GGGACGCGGC GGCGCACTGA
|
Protein sequence | MSAGVHRVEP VSPASTSTDK KTATSQKADD KPTLMLLDGN SLAYRAFYAL PAENFKTQGG LTTNAVYGFT AMLINLLRDE QPSHVAAAFD VSRQTFRVDK YPEYKAGRSS TPDEFRGQID IAKEVLVALG IAVLAEPGFE ADDIIATLAT QAEGEGYRVL VVTGDRDALQ LVSDDVTVLY PRKGVSDLTR FTPDAVQEKY GLTPQQYPDF AALRGDPSDN LPGIPGVGEK TATKWIAEYG SLQALVDNVD QVKGKVGDAL RANLSHVVLN RELTDLVKDV PLPHTPDTLR VQPWDRDQIH RLFDDLEFRV LRDRLFETLA AVEPEVEHGF DVRGKALERG ELAAWLSEHS LGNRFGLAVV GTHLAYDADA TALAIVAADG DGRYIDTTSL DPDDEAALAS WLADPGPPKA LHEAKLAMHD LAGRGWKLAG VTSDTALAAY LVRPGQRSFS LDDLSLRYLR RELRADNPAQ QQLSLLDDSD GGDDQAVQTL ILRAVAVLDL ADALDEELAR IDSSSLLGRM ELPVQRVLAE MEHTGIAVDI DQLRQLQSEF ADLIRDAAEA AYAVIGKQIN LGSPKQLQAV LFDELAMPKT KRTKTGYTTD ADALQSLFDK TGHPFLQHLL AHRDATRLKV TVDGLLNSVA SDGRIHTTFN QTIAATGRLS STEPNLQNIP IRTEAGRRIR DAFVVESRPP GSGGGAVFTE LMTADYSQIE MRIMAHLSQD AGLIEAFNTG EDLHSFVASR AFDVPIDEVT PDLRRRVKAM SYGLAYGLSA YGLAAQLKIS TEEAKVQMDQ YFARFGGIRD YLRDVVDQAR KDGYTSTVFG RRRYLPELDS SNRNVREAAE RAALNAPIQG SAADIIKVAM IDVDQAIKDA GLSSRMLLQV HDELLFEVAD GERDTLEKLV RDKMGSAYAL DVPLEVSVGY GRSWDAAAH
|
| |