Gene Mkms_0821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0821 
Symbol 
ID4614841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp884509 
End bp889140 
Gene Length4632 bp 
Protein Length1543 aa 
Translation table11 
GC content62% 
IMG OID639790497 
ProductDEAD/DEAH box helicase domain-containing protein 
Protein accessionYP_936827 
Protein GI119866875 
COG category[R] General function prediction only 
COG ID[COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.301369 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACC GTCTCGACCC CCTCAAGACA GCGGACCAGA TCGAGGGCTC CTACAAGCGA 
TACCTCAAGA CGCTGCTGGC GCCGCGAGAT GAAGCGCTCG CCACCGCCTT CGACACCGAG
GTCGACTCCA GCACGATGCT CACCAAGGGG CCAATCCTGG AACTGACGCC CCCTTACGAG
ACAGGTGCGA CGTGCCGACA GCTGATCGAG GAAGGCGTTC TCCACCGTGA CTTCGCGCGG
CTCGACAGCC ACGCGTTCCG CATCGATCAG CCACTGTACG TGCACCAGGA ATCTGCGGTG
CGGAAGTTCC TGTCGGGCCG GAACCTGGTT GTCAGTACGG GAACCGGCTC CGGCAAGACC
GAGAGCTTCC TCATCCCGAT CATCAACACG TTGCTCGAGG AGTCAGCACG GGGCACACTC
GGCCCGGGAG TGAGGGCTCT TCTGCTCTAC CCGATGAACG CCCTGGCCAA TGATCAACTG
AAACGGCTGC GAGGAATGCT CGCCGGGGTG CCGGACATCA CGTTCGGCCG CTACACCGGC
GAGACGCGAG ACGACGCACG CACCGCGGAA AACGATTTCC TCCAGTACAA CCCGGGTCAA
CTGCGTCTGA GCAATGAACT GCTCAGTCGA GAGGAGATGC GGAGTAGTCC GCCACACCTG
CTGCTGACGA ACTACGCAAT GCTCGAGTAC CTACTCCTGA GGCCCCTGGA CATCGACCTG
TTTGACGGTC CCCATGCGGG CACCTGGCGC TTCATCGTCA TGGACGAAGC TCACGTCTAC
GACGGGGCAC AAGGGTCGGA AGTGGCACTG CTGATTCGCC GCCTCAAACA ACGCGTCGCC
CCCGATTCGA ACATCCAATG CATCGCCACG TCGGCGTCGT TGACAGGATC GGTCCGCAAC
GACCCTCGCG GAGAGGCGAT GGACTTCGCT TCCAACCTCT TCGACGCACC TTTCGAGTAC
GTCGAAGGTG ATGCGAACAG GCAGGACCTC GTCGAGCCGA CGCGCAAAAG ACACCTGGCA
ACACCCGAAT GGCGACTGAC GGGCGAGCAA TTGCTGGCAC TACGTAGCGG GTCCACCACA
CTCACCCAGA TCATCGGTCC AGGTGCCGAC CCGGCCGAGG CGCTCGCCCG TGAGCAATCG
ATCATCGAAC TCAAGGACGC GCTCAGCGGC GGACCGGACG CCGTGCGATC TCTGCGAGAG
AAGCTCTGGC CCGACAATCC GCGATCTGCC GAATATCTCG ATGCGCTGGT CGAACTCGGC
AGCAGCACAT GCGATGAAGC GGGACACCCG GTTCTGTCCG CCCGATACCA TTTCTTCGTG
CGCGCCACGG AAGGTGCATT CGTCAGCTTC AACGATGACG GACCACGGAT ATTTCTCAGC
CGACATGAAG TCGATCCTGC TACCGGCCGC GCAGTGTTCG AGTTCGGAAC GTGTACTCGC
TGCGGAGCCG TTCATCTCGC GGGAGAACTC GAGCACCGCG ACCGGCGGGA GTACTTCACG
CCTTCGAAGA AGGCGGATGC GTCGGTCAAC TGGTTGGTGC TTGCTGACGG TGATCTTGAC
GTGGTGGTCG ACGAGGACGA AGCCACGTTG GCCAGCGATG AGCCGAAGAA TCTTGCCGAT
CCGACCACGC GGCGGCTGTG TACCGGCTGC GGTCAACTCA CTGCAGCCGA CGCGGCCCGC
TGCGCGACAA CGAATTGTCG AGCCGGACAG ATGCTGCTCG TTCGTGAACA TCCGCGGCCG
ACGCGGATCA TGAGCCGCTG CACTGAGTGC GGAACGCAAT CTCGGCAGGG GATCCGGCGC
CTGCGCACCG ATGTGAATGC GGCGCCGGCC GTCGTGACGA CGGCTCTCTA CCAACAGCTT
CCCGAGGCAT CCGGCGACGC GGCCGACAAC GTGGGTGGCG GACGCAAACT GCTGATGTTC
TCCGACTCTC GCCAAGCTGC GGCGTTCGCG GCGCCCTACC TGGACCGAAC GTACTCCCGG
ATGCTCGAGC GTCGCTACAT CACCGAGGCA CTCCGAGATC CCGTGGCGGC GACGAGTGAG
CTCACGGTCG GCGACCTGGC GATCCTGGCG CGCGAGAAGG CGCAAGCCGC CGGTCATTTC
GATCGCAGAT TGGGAAGCAT CGAGATCGCC CAAGCCGTGA ACCAGTGGAT CTCCGGGGAG
TTGATGACCC TCGAGACGCG GCAATCCCTG GAAGGCCTCG GCCTCATGCG GGTGGCTCTT
CGGCGCGAGC CATCAGTACC ACTTCGAGGG TTCACGTCGC TTGGCCTCAC CGAGGAAGAG
GCATGGGCGT TGATGAACGA ACTCGTCAAG ACCGTGCGAC TGCAGGGCGC CATCACGGTA
CTCGACCGTG TCGACATCAA AGATGAGCGG TTCGCACCCC GGAACACACG GGTGCGGATG
CGGTCAGCCG GGTCCGACAG AGCGCGGCAG GTCATCAGCT GGAACCCCAG CGGCACCGGA
ACCACGAATG GGCGTATCAC GCTGCTTAGG AAGGTCTTCG AGAGTCTGTC GAACAAGACC
CCAGTGGAGA AGGTTCTCGA GGGTTGTTGG CGCCTGCTCG AGTCGGGCGG ATTCCTCGTC
GCCGAATCTG ATCGAGTGCT CGGCCAGGTC TACCAGCTCG ACCACAGCAT GCTCACGGTG
ACGAATGGTG CCAATTGCGA ATGGTACCGG TGCGATACGT GTCGGTTGTT GACTGCGTTC
TCGATCCGCG ACGTGTGCCC GAACAGTCGC TGCACCGGAC GATTGAGGCT GTACGAGGTC
CCTGCTTCCG GCAGGGATTC GAACCATTAC CGCGTGGTGT ATCAAACGAT GACTTCCGCG
CCGTTGAGGG CGCACGAGCA CACAGCGCAG TGGGATGCCA AGACGGCGGC CAACATCCAG
CGCGAGTTCA TCACGGGAAG GGTCAACGTG CTTTCGTGCT CGACAACATT CGAGCTCGGG
GTCGACGTCG GTGATCTTCA ATCTGTGGTC ATGCGGAACA TGCCGCCGAA GACGGCGAAC
TACGTGCAGC GCGCGGGTCG CGCGGGTCGC CGCGCGGCGT CAGCAGCTCT GGTGGTCACT
TACGCCAACC GATCTGCACA TGACCTGGCG ATGTATCAGG ACCCGAATGC CATGATCGCC
GGTCGGATGC GGATTCCGTG GGTACCGGTC GACAACGCGC GAATCGCCCG ACGCCATGCG
CATTCCGTGG CTTTGGCCGC GTACTTCCGG CATTCCTACG AGCAGCGTAG GGAGCGGTGG
AAGTCAGCCG GTGAATTCTT CTCGCCAGGT CCGAACGACG AAGACTCGCC GGCGCGTCGC
GTGGGCCGGT ATCTATCGCC GATGCCCTCG GCCGTGCAGG AAGCTCTCTG CGCGGCACTG
CCTCGAAGCG TGCATGCGGA AATCGGTATC GCCGACGGTT CCTGGGTTGG CGAGCTGGTC
GCTCTCCTCA ACTCCGTTGA AGACGAGCTG ACCAAGGATA TCGCCGATAT CGAGGAGCGG
ATCGAAGAGA CGATTACGGA ACGCAAGTTC TGGCTGAGTA AGCGCCTCGA AGATACGAAA
CGAACGATTG TCGGACGTGA ACTGCTTGGA TACCTCGCGA ATCGGAACAT TCTGCCCAAG
TATGGCTTTC CCGTCGACAC AGTCGAGCTC AGTACGCTGG CCTCGGCCGA CCCAGTGGGT
CGTCAACTGG ACTTGTCCCG CGATCTCAGC CTGGCGATCT ATGACTATGC ACCCGGGAAT
GAGGTCGTGG CGGGCGGCAA GGTATGGACG TCAGTTGGAT TGAAGCGGCG TCCCGGAAAG
GAGCTGGTTC GCCACAAGTA TCGCGTCTGC CCGACGTGCG GACGTTTCCA GCGTGGGCAG
GAGCTCGATC CGGCGGATGT GTGCCCCAGC TGTGGAGAGC CGTTCCGATC GATCGGCACA
ATGGTCATCC CGGAGTTTGG CTTCATTGCC GCGAGTGAGA CGCGAGAGGT TGGGAGCGCG
CCACCTGAGC GGCGTTGGCA CGGCGGCAGT TACGTCGAGA CGCCGGGCGA TGACGTCGGT
TCCTATCGGT GGTCCGGCCC CGGCGGGCTT CGTGTCAACG CCCGCGCGGG CGTGCGAGCC
TGGTTAGCGG TCGTGTCGGA CGGCAGGGGC GACGGGTTCC AGCTGTGCCA GTGGTGCGGA
TGGGCGAAAC CCGCCGAAAG AGGCAGCCGG CGTCGCAAGC ATCACCAGCC GGACACCAAT
AAGGAATGCG ATGGCCCGCT GGAGAAGATC TCGCTCGGAC ACCGCTACCA GTCGGATGTC
GCCGAGTTCA CCTTCGACGG ACTGCAGTAC CGCAAAGACC ACGAATCCAA TTGGTTGTCA
GCGCTGTACG CGATATTGGA AGGCGCGTCC TATGCGCTGG AGATCAGTCG CGACGACATC
GACGGGGCAT TGTCGTGGAG TGCAGACCAC CGGCGGAGCA TCGTGGTTTT CGACACGGTG
CCCGGCGGAG CCGGATCGGC GAAGAAGATC GCGGAAAACA TCGGTGAGGT CTTGAACGCC
GCGGTGAAAC GTGTGACCGA ATGCGACTGC GGGGTAGAGA CGTCGTGCTA CGGGTGTCTG
CGGTCATTTC GCAATGCTCG TTTCCATGAA CAGCTTTCGC GAGGGGCAGC ACTGCAAATT
CTTGGAAGGT AG
 
Protein sequence
MNDRLDPLKT ADQIEGSYKR YLKTLLAPRD EALATAFDTE VDSSTMLTKG PILELTPPYE 
TGATCRQLIE EGVLHRDFAR LDSHAFRIDQ PLYVHQESAV RKFLSGRNLV VSTGTGSGKT
ESFLIPIINT LLEESARGTL GPGVRALLLY PMNALANDQL KRLRGMLAGV PDITFGRYTG
ETRDDARTAE NDFLQYNPGQ LRLSNELLSR EEMRSSPPHL LLTNYAMLEY LLLRPLDIDL
FDGPHAGTWR FIVMDEAHVY DGAQGSEVAL LIRRLKQRVA PDSNIQCIAT SASLTGSVRN
DPRGEAMDFA SNLFDAPFEY VEGDANRQDL VEPTRKRHLA TPEWRLTGEQ LLALRSGSTT
LTQIIGPGAD PAEALAREQS IIELKDALSG GPDAVRSLRE KLWPDNPRSA EYLDALVELG
SSTCDEAGHP VLSARYHFFV RATEGAFVSF NDDGPRIFLS RHEVDPATGR AVFEFGTCTR
CGAVHLAGEL EHRDRREYFT PSKKADASVN WLVLADGDLD VVVDEDEATL ASDEPKNLAD
PTTRRLCTGC GQLTAADAAR CATTNCRAGQ MLLVREHPRP TRIMSRCTEC GTQSRQGIRR
LRTDVNAAPA VVTTALYQQL PEASGDAADN VGGGRKLLMF SDSRQAAAFA APYLDRTYSR
MLERRYITEA LRDPVAATSE LTVGDLAILA REKAQAAGHF DRRLGSIEIA QAVNQWISGE
LMTLETRQSL EGLGLMRVAL RREPSVPLRG FTSLGLTEEE AWALMNELVK TVRLQGAITV
LDRVDIKDER FAPRNTRVRM RSAGSDRARQ VISWNPSGTG TTNGRITLLR KVFESLSNKT
PVEKVLEGCW RLLESGGFLV AESDRVLGQV YQLDHSMLTV TNGANCEWYR CDTCRLLTAF
SIRDVCPNSR CTGRLRLYEV PASGRDSNHY RVVYQTMTSA PLRAHEHTAQ WDAKTAANIQ
REFITGRVNV LSCSTTFELG VDVGDLQSVV MRNMPPKTAN YVQRAGRAGR RAASAALVVT
YANRSAHDLA MYQDPNAMIA GRMRIPWVPV DNARIARRHA HSVALAAYFR HSYEQRRERW
KSAGEFFSPG PNDEDSPARR VGRYLSPMPS AVQEALCAAL PRSVHAEIGI ADGSWVGELV
ALLNSVEDEL TKDIADIEER IEETITERKF WLSKRLEDTK RTIVGRELLG YLANRNILPK
YGFPVDTVEL STLASADPVG RQLDLSRDLS LAIYDYAPGN EVVAGGKVWT SVGLKRRPGK
ELVRHKYRVC PTCGRFQRGQ ELDPADVCPS CGEPFRSIGT MVIPEFGFIA ASETREVGSA
PPERRWHGGS YVETPGDDVG SYRWSGPGGL RVNARAGVRA WLAVVSDGRG DGFQLCQWCG
WAKPAERGSR RRKHHQPDTN KECDGPLEKI SLGHRYQSDV AEFTFDGLQY RKDHESNWLS
ALYAILEGAS YALEISRDDI DGALSWSADH RRSIVVFDTV PGGAGSAKKI AENIGEVLNA
AVKRVTECDC GVETSCYGCL RSFRNARFHE QLSRGAALQI LGR