Gene Mjls_3026 details

Gene Information

Locus tag: Mjls_3026
Symbol:
ID: 4878739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 3161126
End bp: 3163915
Gene Length: 2790 bp
Protein Length: 929 aa
Translation table: 11
GC content: 68%
IMG OID: 640140325
Product: DNA polymerase I
Protein accession: YP_001071296
Protein GI: 126435605
COG category: [L] Replication, recombination and repair
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
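
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information table.
start_bp = 3161126
end_bp = 3163915
protein_length_aa = 929

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases. Assumption: the last codon is a
# stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 2790
print(remainder)                # 0
print(expected_protein_length)  # 929
```

Under these assumptions the computed values agree with the reported gene length of 2790 bp and protein length of 929 aa.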


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.248484
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0929498
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGGCGG GGGTGCATAG AGTTGAGCCC GTGAGCCCAG CCAGCACCTC GACCGACAAG 
AAGACGGCCA CCAGCCAGAA GGCGGACGAC AAGCCGACAC TGATGCTGCT GGACGGCAAC
TCGCTGGCGT ACCGCGCCTT CTACGCACTC CCCGCCGAGA ACTTCAAGAC CCAGGGCGGC
CTGACCACCA ACGCGGTCTA CGGATTCACC GCGATGCTGA TCAACCTCCT CCGCGACGAG
CAGCCCTCCC ATGTGGCCGC CGCGTTCGAC GTCTCCCGGC AGACGTTCCG CGTCGACAAG
TACCCCGAGT ACAAGGCCGG CCGCTCGTCG ACTCCGGACG AGTTCCGCGG GCAGATCGAC
ATCGCCAAGG AGGTGCTCGT CGCGCTGGGC ATCGCCGTGC TGGCCGAACC CGGCTTCGAG
GCCGACGACA TCATCGCCAC GCTCGCCACC CAGGCCGAGG GTGAGGGTTA CCGGGTGCTC
GTCGTGACCG GTGACCGCGA CGCGCTCCAG CTCGTCAGCG ACGACGTGAC CGTGCTCTAT
CCCCGTAAGG GCGTCAGCGA CCTGACCCGC TTCACGCCCG ACGCGGTGCA GGAGAAGTAC
GGGCTGACGC CGCAGCAGTA CCCGGACTTC GCCGCGCTGC GGGGCGATCC GAGCGACAAC
CTGCCCGGCA TCCCGGGCGT GGGGGAGAAG ACGGCGACCA AGTGGATCGC CGAGTACGGT
TCGCTGCAGG CCCTGGTCGA CAACGTCGAC CAGGTCAAGG GCAAGGTCGG CGATGCTCTG
CGGGCCAACC TGTCTCACGT GGTGCTCAAT CGCGAACTCA CCGACCTCGT CAAGGACGTC
CCGCTTCCGC ACACTCCCGA CACTCTGCGG GTGCAGCCGT GGGACCGCGA TCAGATCCAC
CGGCTGTTCG ACGACCTCGA GTTCCGGGTG CTGCGCGACC GCCTGTTCGA GACTCTGGCG
GCCGTCGAAC CCGAGGTCGA ACACGGCTTC GACGTGCGGG GCAAGGCGCT CGAACGGGGT
GAACTCGCGG CCTGGCTGTC CGAACACAGC CTCGGCAACC GGTTCGGGCT CGCGGTCGTC
GGCACCCATC TCGCCTACGA CGCCGACGCC ACCGCGCTGG CCATCGTCGC TGCCGACGGC
GACGGTCGCT ACATCGACAC GACCTCGCTC GACCCGGACG ACGAGGCGGC GCTGGCGTCG
TGGCTGGCCG ACCCGGGCCC GCCGAAGGCG CTGCACGAAG CCAAGCTCGC CATGCACGAC
CTCGCCGGCC GCGGCTGGAA GCTCGCCGGG GTCACCTCCG ACACGGCGCT GGCCGCCTAC
CTGGTGCGGC CCGGGCAGCG CAGCTTCAGC CTCGACGATC TGTCGTTGCG CTACCTGCGT
CGGGAACTGC GCGCGGACAA CCCTGCGCAA CAACAACTCT CACTGCTCGA CGACAGTGAC
GGCGGCGACG ACCAGGCCGT CCAGACGCTG ATCCTGCGGG CCGTGGCGGT ACTCGACCTC
GCCGACGCCC TCGACGAGGA ACTCGCCCGC ATCGACTCGT CGTCGCTACT GGGCCGGATG
GAGTTGCCGG TGCAGCGGGT GCTCGCCGAG ATGGAGCACA CCGGTATCGC CGTCGACATC
GATCAACTGC GGCAGTTGCA GAGCGAGTTC GCCGACCTGA TCCGCGACGC CGCCGAGGCG
GCCTACGCGG TGATCGGCAA GCAGATCAAC CTCGGCTCGC CCAAACAGCT GCAGGCGGTG
CTGTTCGACG AACTCGCGAT GCCCAAGACG AAGCGGACCA AGACCGGATA CACCACGGAC
GCCGATGCGC TGCAGTCGTT GTTCGACAAG ACCGGTCATC CTTTCCTGCA GCATCTGCTG
GCCCATCGCG ACGCCACCCG GCTCAAGGTG ACCGTCGACG GCCTCCTGAA CTCCGTCGCC
TCCGACGGCC GCATCCACAC GACGTTCAAC CAGACGATCG CCGCGACCGG TCGGCTGTCG
TCGACCGAAC CGAATCTGCA GAACATCCCG ATCCGCACCG AGGCGGGCCG GCGTATCCGC
GACGCCTTCG TGGTGGAAAG TCGCCCTCCG GGCTCCGGCG GCGGCGCGGT CTTCACCGAA
CTCATGACCG CCGACTACAG CCAGATCGAG ATGCGCATCA TGGCACACCT GTCACAGGAC
GCCGGTCTGA TCGAGGCGTT CAACACCGGA GAGGACCTGC ACTCGTTCGT CGCCTCGCGC
GCGTTCGACG TGCCGATCGA CGAGGTGACC CCGGACCTGC GCCGCCGGGT CAAGGCGATG
TCCTACGGGT TGGCCTACGG GCTGAGCGCG TACGGATTGG CCGCGCAGCT CAAGATCTCC
ACCGAAGAGG CCAAGGTCCA GATGGACCAG TACTTCGCCC GTTTCGGCGG TATCCGCGAC
TACCTGCGCG ACGTCGTCGA CCAGGCCCGC AAGGACGGCT ACACCTCGAC GGTGTTCGGT
CGCCGGCGGT ACCTGCCCGA ACTCGACAGC AGCAACCGCA ACGTCCGCGA GGCCGCGGAG
CGGGCCGCGC TCAACGCCCC GATCCAGGGC AGCGCCGCGG ACATCATCAA GGTCGCGATG
ATCGACGTCG ATCAGGCGAT CAAGGACGCC GGCCTCTCGT CACGGATGCT GCTGCAGGTG
CACGACGAGT TGCTGTTCGA GGTCGCCGAC GGCGAGCGCG ACACGCTCGA GAAGCTGGTC
CGCGACAAGA TGGGCAGCGC GTACGCCCTC GACGTGCCGC TCGAGGTCAG TGTCGGGTAC
GGCCGGAGCT GGGACGCGGC GGCGCACTGA
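
The GC content reported in the Gene Information section can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C among the A, C, G and T bases of the sequence; `gc_content` is a hypothetical helper name, and whether the record's value was derived exactly this way is not stated in the source.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace and any character other than A, C, G, T is ignored, so the
    sequence can be pasted with its 10-base spacing and line breaks intact.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, used here only as a short sample.
first_block = "GTGTCGGCGG GGGTGCATAG AGTTGAGCCC GTGAGCCCAG CCAGCACCTC GACCGACAAG"
print(f"{gc_content(first_block):.1f}%")
```

To compare against the reported figure, pass the complete gene sequence rather than the single sample line.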
 
Protein sequence
MSAGVHRVEP VSPASTSTDK KTATSQKADD KPTLMLLDGN SLAYRAFYAL PAENFKTQGG 
LTTNAVYGFT AMLINLLRDE QPSHVAAAFD VSRQTFRVDK YPEYKAGRSS TPDEFRGQID
IAKEVLVALG IAVLAEPGFE ADDIIATLAT QAEGEGYRVL VVTGDRDALQ LVSDDVTVLY
PRKGVSDLTR FTPDAVQEKY GLTPQQYPDF AALRGDPSDN LPGIPGVGEK TATKWIAEYG
SLQALVDNVD QVKGKVGDAL RANLSHVVLN RELTDLVKDV PLPHTPDTLR VQPWDRDQIH
RLFDDLEFRV LRDRLFETLA AVEPEVEHGF DVRGKALERG ELAAWLSEHS LGNRFGLAVV
GTHLAYDADA TALAIVAADG DGRYIDTTSL DPDDEAALAS WLADPGPPKA LHEAKLAMHD
LAGRGWKLAG VTSDTALAAY LVRPGQRSFS LDDLSLRYLR RELRADNPAQ QQLSLLDDSD
GGDDQAVQTL ILRAVAVLDL ADALDEELAR IDSSSLLGRM ELPVQRVLAE MEHTGIAVDI
DQLRQLQSEF ADLIRDAAEA AYAVIGKQIN LGSPKQLQAV LFDELAMPKT KRTKTGYTTD
ADALQSLFDK TGHPFLQHLL AHRDATRLKV TVDGLLNSVA SDGRIHTTFN QTIAATGRLS
STEPNLQNIP IRTEAGRRIR DAFVVESRPP GSGGGAVFTE LMTADYSQIE MRIMAHLSQD
AGLIEAFNTG EDLHSFVASR AFDVPIDEVT PDLRRRVKAM SYGLAYGLSA YGLAAQLKIS
TEEAKVQMDQ YFARFGGIRD YLRDVVDQAR KDGYTSTVFG RRRYLPELDS SNRNVREAAE
RAALNAPIQG SAADIIKVAM IDVDQAIKDA GLSSRMLLQV HDELLFEVAD GERDTLEKLV
RDKMGSAYAL DVPLEVSVGY GRSWDAAAH
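
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this under stated assumptions: translation table 11 assigns the same amino acids as the standard genetic code, and an alternative start codon such as GTG is read as methionine (M) when it opens the CDS. `translate_cds` and its `has_start` parameter are hypothetical names for this example, and the set of start codons shown is a partial list.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a partial set of start codons accepted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, has_start: bool = True) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and has_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


first_block = "GTGTCGGCGG GGGTGCATAG AGTTGAGCCC GTGAGCCCAG CCAGCACCTC GACCGACAAG"
last_block = "GGCCGGAGCT GGGACGCGGC GGCGCACTGA"
print(translate_cds(first_block))                   # MSAGVHRVEPVSPASTSTDK
print(translate_cds(last_block, has_start=False))   # GRSWDAAAH
```

The two outputs match the first 20 and last 9 residues of the protein sequence listed above. The initial GTG codon explains why the protein begins with M even though GTG otherwise encodes valine.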