Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_0821 |
Symbol | |
ID | 4614841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 884509 |
End bp | 889140 |
Gene Length | 4632 bp |
Protein Length | 1543 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639790497 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_936827 |
Protein GI | 119866875 |
COG category | [R] General function prediction only |
COG ID | [COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.301369 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACC GTCTCGACCC CCTCAAGACA GCGGACCAGA TCGAGGGCTC CTACAAGCGA TACCTCAAGA CGCTGCTGGC GCCGCGAGAT GAAGCGCTCG CCACCGCCTT CGACACCGAG GTCGACTCCA GCACGATGCT CACCAAGGGG CCAATCCTGG AACTGACGCC CCCTTACGAG ACAGGTGCGA CGTGCCGACA GCTGATCGAG GAAGGCGTTC TCCACCGTGA CTTCGCGCGG CTCGACAGCC ACGCGTTCCG CATCGATCAG CCACTGTACG TGCACCAGGA ATCTGCGGTG CGGAAGTTCC TGTCGGGCCG GAACCTGGTT GTCAGTACGG GAACCGGCTC CGGCAAGACC GAGAGCTTCC TCATCCCGAT CATCAACACG TTGCTCGAGG AGTCAGCACG GGGCACACTC GGCCCGGGAG TGAGGGCTCT TCTGCTCTAC CCGATGAACG CCCTGGCCAA TGATCAACTG AAACGGCTGC GAGGAATGCT CGCCGGGGTG CCGGACATCA CGTTCGGCCG CTACACCGGC GAGACGCGAG ACGACGCACG CACCGCGGAA AACGATTTCC TCCAGTACAA CCCGGGTCAA CTGCGTCTGA GCAATGAACT GCTCAGTCGA GAGGAGATGC GGAGTAGTCC GCCACACCTG CTGCTGACGA ACTACGCAAT GCTCGAGTAC CTACTCCTGA GGCCCCTGGA CATCGACCTG TTTGACGGTC CCCATGCGGG CACCTGGCGC TTCATCGTCA TGGACGAAGC TCACGTCTAC GACGGGGCAC AAGGGTCGGA AGTGGCACTG CTGATTCGCC GCCTCAAACA ACGCGTCGCC CCCGATTCGA ACATCCAATG CATCGCCACG TCGGCGTCGT TGACAGGATC GGTCCGCAAC GACCCTCGCG GAGAGGCGAT GGACTTCGCT TCCAACCTCT TCGACGCACC TTTCGAGTAC GTCGAAGGTG ATGCGAACAG GCAGGACCTC GTCGAGCCGA CGCGCAAAAG ACACCTGGCA ACACCCGAAT GGCGACTGAC GGGCGAGCAA TTGCTGGCAC TACGTAGCGG GTCCACCACA CTCACCCAGA TCATCGGTCC AGGTGCCGAC CCGGCCGAGG CGCTCGCCCG TGAGCAATCG ATCATCGAAC TCAAGGACGC GCTCAGCGGC GGACCGGACG CCGTGCGATC TCTGCGAGAG AAGCTCTGGC CCGACAATCC GCGATCTGCC GAATATCTCG ATGCGCTGGT CGAACTCGGC AGCAGCACAT GCGATGAAGC GGGACACCCG GTTCTGTCCG CCCGATACCA TTTCTTCGTG CGCGCCACGG AAGGTGCATT CGTCAGCTTC AACGATGACG GACCACGGAT ATTTCTCAGC CGACATGAAG TCGATCCTGC TACCGGCCGC GCAGTGTTCG AGTTCGGAAC GTGTACTCGC TGCGGAGCCG TTCATCTCGC GGGAGAACTC GAGCACCGCG ACCGGCGGGA GTACTTCACG CCTTCGAAGA AGGCGGATGC GTCGGTCAAC TGGTTGGTGC TTGCTGACGG TGATCTTGAC GTGGTGGTCG ACGAGGACGA AGCCACGTTG GCCAGCGATG AGCCGAAGAA TCTTGCCGAT CCGACCACGC GGCGGCTGTG TACCGGCTGC GGTCAACTCA CTGCAGCCGA CGCGGCCCGC TGCGCGACAA CGAATTGTCG AGCCGGACAG ATGCTGCTCG TTCGTGAACA TCCGCGGCCG ACGCGGATCA TGAGCCGCTG CACTGAGTGC GGAACGCAAT CTCGGCAGGG GATCCGGCGC CTGCGCACCG ATGTGAATGC GGCGCCGGCC GTCGTGACGA CGGCTCTCTA CCAACAGCTT CCCGAGGCAT CCGGCGACGC GGCCGACAAC GTGGGTGGCG GACGCAAACT GCTGATGTTC TCCGACTCTC GCCAAGCTGC GGCGTTCGCG GCGCCCTACC TGGACCGAAC GTACTCCCGG ATGCTCGAGC GTCGCTACAT CACCGAGGCA CTCCGAGATC CCGTGGCGGC GACGAGTGAG CTCACGGTCG GCGACCTGGC GATCCTGGCG CGCGAGAAGG CGCAAGCCGC CGGTCATTTC GATCGCAGAT TGGGAAGCAT CGAGATCGCC CAAGCCGTGA ACCAGTGGAT CTCCGGGGAG TTGATGACCC TCGAGACGCG GCAATCCCTG GAAGGCCTCG GCCTCATGCG GGTGGCTCTT CGGCGCGAGC CATCAGTACC ACTTCGAGGG TTCACGTCGC TTGGCCTCAC CGAGGAAGAG GCATGGGCGT TGATGAACGA ACTCGTCAAG ACCGTGCGAC TGCAGGGCGC CATCACGGTA CTCGACCGTG TCGACATCAA AGATGAGCGG TTCGCACCCC GGAACACACG GGTGCGGATG CGGTCAGCCG GGTCCGACAG AGCGCGGCAG GTCATCAGCT GGAACCCCAG CGGCACCGGA ACCACGAATG GGCGTATCAC GCTGCTTAGG AAGGTCTTCG AGAGTCTGTC GAACAAGACC CCAGTGGAGA AGGTTCTCGA GGGTTGTTGG CGCCTGCTCG AGTCGGGCGG ATTCCTCGTC GCCGAATCTG ATCGAGTGCT CGGCCAGGTC TACCAGCTCG ACCACAGCAT GCTCACGGTG ACGAATGGTG CCAATTGCGA ATGGTACCGG TGCGATACGT GTCGGTTGTT GACTGCGTTC TCGATCCGCG ACGTGTGCCC GAACAGTCGC TGCACCGGAC GATTGAGGCT GTACGAGGTC CCTGCTTCCG GCAGGGATTC GAACCATTAC CGCGTGGTGT ATCAAACGAT GACTTCCGCG CCGTTGAGGG CGCACGAGCA CACAGCGCAG TGGGATGCCA AGACGGCGGC CAACATCCAG CGCGAGTTCA TCACGGGAAG GGTCAACGTG CTTTCGTGCT CGACAACATT CGAGCTCGGG GTCGACGTCG GTGATCTTCA ATCTGTGGTC ATGCGGAACA TGCCGCCGAA GACGGCGAAC TACGTGCAGC GCGCGGGTCG CGCGGGTCGC CGCGCGGCGT CAGCAGCTCT GGTGGTCACT TACGCCAACC GATCTGCACA TGACCTGGCG ATGTATCAGG ACCCGAATGC CATGATCGCC GGTCGGATGC GGATTCCGTG GGTACCGGTC GACAACGCGC GAATCGCCCG ACGCCATGCG CATTCCGTGG CTTTGGCCGC GTACTTCCGG CATTCCTACG AGCAGCGTAG GGAGCGGTGG AAGTCAGCCG GTGAATTCTT CTCGCCAGGT CCGAACGACG AAGACTCGCC GGCGCGTCGC GTGGGCCGGT ATCTATCGCC GATGCCCTCG GCCGTGCAGG AAGCTCTCTG CGCGGCACTG CCTCGAAGCG TGCATGCGGA AATCGGTATC GCCGACGGTT CCTGGGTTGG CGAGCTGGTC GCTCTCCTCA ACTCCGTTGA AGACGAGCTG ACCAAGGATA TCGCCGATAT CGAGGAGCGG ATCGAAGAGA CGATTACGGA ACGCAAGTTC TGGCTGAGTA AGCGCCTCGA AGATACGAAA CGAACGATTG TCGGACGTGA ACTGCTTGGA TACCTCGCGA ATCGGAACAT TCTGCCCAAG TATGGCTTTC CCGTCGACAC AGTCGAGCTC AGTACGCTGG CCTCGGCCGA CCCAGTGGGT CGTCAACTGG ACTTGTCCCG CGATCTCAGC CTGGCGATCT ATGACTATGC ACCCGGGAAT GAGGTCGTGG CGGGCGGCAA GGTATGGACG TCAGTTGGAT TGAAGCGGCG TCCCGGAAAG GAGCTGGTTC GCCACAAGTA TCGCGTCTGC CCGACGTGCG GACGTTTCCA GCGTGGGCAG GAGCTCGATC CGGCGGATGT GTGCCCCAGC TGTGGAGAGC CGTTCCGATC GATCGGCACA ATGGTCATCC CGGAGTTTGG CTTCATTGCC GCGAGTGAGA CGCGAGAGGT TGGGAGCGCG CCACCTGAGC GGCGTTGGCA CGGCGGCAGT TACGTCGAGA CGCCGGGCGA TGACGTCGGT TCCTATCGGT GGTCCGGCCC CGGCGGGCTT CGTGTCAACG CCCGCGCGGG CGTGCGAGCC TGGTTAGCGG TCGTGTCGGA CGGCAGGGGC GACGGGTTCC AGCTGTGCCA GTGGTGCGGA TGGGCGAAAC CCGCCGAAAG AGGCAGCCGG CGTCGCAAGC ATCACCAGCC GGACACCAAT AAGGAATGCG ATGGCCCGCT GGAGAAGATC TCGCTCGGAC ACCGCTACCA GTCGGATGTC GCCGAGTTCA CCTTCGACGG ACTGCAGTAC CGCAAAGACC ACGAATCCAA TTGGTTGTCA GCGCTGTACG CGATATTGGA AGGCGCGTCC TATGCGCTGG AGATCAGTCG CGACGACATC GACGGGGCAT TGTCGTGGAG TGCAGACCAC CGGCGGAGCA TCGTGGTTTT CGACACGGTG CCCGGCGGAG CCGGATCGGC GAAGAAGATC GCGGAAAACA TCGGTGAGGT CTTGAACGCC GCGGTGAAAC GTGTGACCGA ATGCGACTGC GGGGTAGAGA CGTCGTGCTA CGGGTGTCTG CGGTCATTTC GCAATGCTCG TTTCCATGAA CAGCTTTCGC GAGGGGCAGC ACTGCAAATT CTTGGAAGGT AG
|
Protein sequence | MNDRLDPLKT ADQIEGSYKR YLKTLLAPRD EALATAFDTE VDSSTMLTKG PILELTPPYE TGATCRQLIE EGVLHRDFAR LDSHAFRIDQ PLYVHQESAV RKFLSGRNLV VSTGTGSGKT ESFLIPIINT LLEESARGTL GPGVRALLLY PMNALANDQL KRLRGMLAGV PDITFGRYTG ETRDDARTAE NDFLQYNPGQ LRLSNELLSR EEMRSSPPHL LLTNYAMLEY LLLRPLDIDL FDGPHAGTWR FIVMDEAHVY DGAQGSEVAL LIRRLKQRVA PDSNIQCIAT SASLTGSVRN DPRGEAMDFA SNLFDAPFEY VEGDANRQDL VEPTRKRHLA TPEWRLTGEQ LLALRSGSTT LTQIIGPGAD PAEALAREQS IIELKDALSG GPDAVRSLRE KLWPDNPRSA EYLDALVELG SSTCDEAGHP VLSARYHFFV RATEGAFVSF NDDGPRIFLS RHEVDPATGR AVFEFGTCTR CGAVHLAGEL EHRDRREYFT PSKKADASVN WLVLADGDLD VVVDEDEATL ASDEPKNLAD PTTRRLCTGC GQLTAADAAR CATTNCRAGQ MLLVREHPRP TRIMSRCTEC GTQSRQGIRR LRTDVNAAPA VVTTALYQQL PEASGDAADN VGGGRKLLMF SDSRQAAAFA APYLDRTYSR MLERRYITEA LRDPVAATSE LTVGDLAILA REKAQAAGHF DRRLGSIEIA QAVNQWISGE LMTLETRQSL EGLGLMRVAL RREPSVPLRG FTSLGLTEEE AWALMNELVK TVRLQGAITV LDRVDIKDER FAPRNTRVRM RSAGSDRARQ VISWNPSGTG TTNGRITLLR KVFESLSNKT PVEKVLEGCW RLLESGGFLV AESDRVLGQV YQLDHSMLTV TNGANCEWYR CDTCRLLTAF SIRDVCPNSR CTGRLRLYEV PASGRDSNHY RVVYQTMTSA PLRAHEHTAQ WDAKTAANIQ REFITGRVNV LSCSTTFELG VDVGDLQSVV MRNMPPKTAN YVQRAGRAGR RAASAALVVT YANRSAHDLA MYQDPNAMIA GRMRIPWVPV DNARIARRHA HSVALAAYFR HSYEQRRERW KSAGEFFSPG PNDEDSPARR VGRYLSPMPS AVQEALCAAL PRSVHAEIGI ADGSWVGELV ALLNSVEDEL TKDIADIEER IEETITERKF WLSKRLEDTK RTIVGRELLG YLANRNILPK YGFPVDTVEL STLASADPVG RQLDLSRDLS LAIYDYAPGN EVVAGGKVWT SVGLKRRPGK ELVRHKYRVC PTCGRFQRGQ ELDPADVCPS CGEPFRSIGT MVIPEFGFIA ASETREVGSA PPERRWHGGS YVETPGDDVG SYRWSGPGGL RVNARAGVRA WLAVVSDGRG DGFQLCQWCG WAKPAERGSR RRKHHQPDTN KECDGPLEKI SLGHRYQSDV AEFTFDGLQY RKDHESNWLS ALYAILEGAS YALEISRDDI DGALSWSADH RRSIVVFDTV PGGAGSAKKI AENIGEVLNA AVKRVTECDC GVETSCYGCL RSFRNARFHE QLSRGAALQI LGR
|
| |