Gene Mjls_2251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_2251 
Symbol 
ID4877971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp2348737 
End bp2352039 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content60% 
IMG OID640139548 
Producthelicase domain-containing protein 
Protein accessionYP_001070528 
Protein GI126434837 
COG category[L] Replication, recombination and repair 
COG ID[COG1111] ERCC4-like helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.788605 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGAACC TCTTTGACAA CCTCAGCGAC GACACACGCT TGAAGGACGC ACTGCGCGTC 
TCGCTCAAGG ACTTCGACAC GGTCGACGTG GCGACCGGCT ACCTCGATTT GCGCGGTTGG
TCAACCATGG CCGACATCGT CGACACAAAG GCTCGAACAG GTGCAGAGCC TGTTGCTCGG
GTACTCATTG GGATGGTCGC GCCAGCTGAC TCGCAGGAAA TCCTCGACGG CCTGCAGCGC
GAGGTTCAGC CCCCGGGCTA CGGGGCAGAT ATACACGACC GGGAGAAGGC ACTTGCCCGC
CGCGACCAGC TCGTTAAGCA TCTGCGGAAC CAGCTGATGC GTGGGCTGGC CACCGTCGAA
GGACAGCGGA CCCTCCAGGC GCTGAAGCGG CAGTTGGAGG ACGGCGCAGT GCAGATGAAG
GTCTTCACCG AGAAGCCACT GCACGGAAAG GCCTACCTGT TTCACGCCCC GGGAAAGAGC
CACGGCTCAC GATGGGCTTA CGTCGGGTCG TCGAACCTCA CTCACGCCGG ATTGACGACG
AATCTAGAGC TGAACATCGA TGTCCAAGAC TCCGACGCCA ACACCAAGAT CGCCGAGTGG
TTCGAAGCAC GGTGGACCGA CCGGTTCTCG CTGCCCATCA CCGCTGAGAT CATCCAGCTG
ATCTCCGAAT CGTGGGCGGC TGGCCTACAG CCCACCCCAT TTGAGGTCTA TCTCAAGGTC
TGCCACGCAC TGTCCCAAGA TGCCCGGGAT GGACTCGGCT ACGTCTTGCC CACATCGATG
CAGAACCTGC TGCTCGACTA TCAGGAAAGT GCAGTCCGCA CGCTCGCTCG ACGCATTGTT
CGCCGAGGCG GAACCATGCT GGGCGACGTC GTTGGTCTCG GCAAGACGCT GACGGCCATC
GCGACTGCCT TGATGCTCCA GTCCGCCGAG GACTATTCGA CGCTGGTCTT GTGCCCGAAG
AATCTCGAAC TCATGTGGAC CAAGCACCTC GACAAGTATG AGATCAACGG GCGTGTGGTC
CCGTACTCGA TGGCCGACAG AGTGCTTCCT GAGCTCAAAC GATTCAATCT GGTCATCTGC
GATGAGTCGC ACAACCTGCG CAACAGCAGC ACCATTGCCT ACGAAGCCAT CCACGAGTAC
ATCCGCCGCA ACAGCTCCAA GGTGTTGCTT CTTACCGCAA CCCCCTACAA CCTCGCGTTC
CAGGACGTCG CCAGCCAGAT CGCGCTGTAT ATCGATGACG ACGAAGACCT CGGCATCGTC
CCCACAGCAG CACTCGCCGC AGAGCCCGGG CTCCGCGACA AAGTCGACGG GAAGATCAAC
ACACTCGCCG CCTTCCGACG CTCCGACCAC GCAGAGGACT GGCGCCGGCT GATGAGCGAC
CACCTTGTGC GGCGAACCCG CAGCTTCATC AAGCGAACGG CGAAGACAGA ACCTGTCACC
CTGCCCGACG GAACCACCGA GCAACAGCCG TACCTCCAGT TTGCGAACGG TGAGAAGTTC
CACTTCCCAA CCCGGATCGC CCGCCCGCTT AGCCACGACT TCGCCGACGA CGATCCCGCA
AAACTCATGG AAGACAACGA CACGCTCAAC GCCGTGAGCA CTTTGGCGCT GCCACGCTAC
CGGCTAGCCG ACTACGACAA CCCACGCGCA CCGCACACAG ACTCAGACAC GAAGGTCCTC
GATGACATTC GCTCCGGGCG TGGAAACGTC AGCGGGTTTG TCCGCACCGG ATTGTTCAAG
CGGTTGTCCT CGTCCGGGCA CTCGTTCATC GTGTCGTTGC AGCGCCAACG GGCACGCAAC
GAGCTGTTCC TGCATGCGAT CGACAACGAG CTCCCCATCC CCATTGGGTC GTTCACCGAC
AAGCAGTTCG CGGTCAGCGA TGAAGACGTC GAGGATGACC CAACACCGCA CGGCTCGCTC
GCAAGCCGCT ACGAGGAGCT CCGCAAGAAC CTCCCGGCAA AGACGAAATG GGTCAACTCC
ACAGTCTTCA AGACGACGCT GCGCAAGGAT CTCGAACGCG ACAACCAGAT GCTCACGACA
ATGCTCGATC GGTTTGGCAG CTGGGACTCG ACCCGAGACT CCAAAATCAA CGCATTGGTC
GACCTGCTCC GCAACGACCA CCCCGGCGAG AAGGTTCTCG TTTTCACCGA GTACGTGGAT
ACTGCCGAAT ACGTTGCCCA AGCACTACAG GAAGCCAGCG TGGAAAAGGT TGCCGTAGTC
TCAGGAAACA CCGACGACCC CGCCGATATT GCCCGCCGAT TCTCACCGCA CTCCAACCGG
ATCCCAGGGC AGGAAGAACT TCCTGAAGTA GACCCAGCGG ACCCCATCGA CGTGCTCGTC
GCGACCGACG TGCTCTCCGA AGGCCAGAAC CTGCAAGACT CCCACATCGT CGTCAACTAC
GACCTGCCGT GGGCGATCAT CCGACTCATC CAACGTGCCG GCCGTGTCGA CCGCGTCGGC
CAGAAGTCAG ACCAAGTCTT CGTCTACCTC ATCAGCCACG AGAACGTTGA AGCCCAGATC
AACCTCCGAC AGCGCATCCG CGCACGGCTG GGTGCCGCGG CCGAGGCATT CGGTGCCGAC
GAACAGTTCT TCGGCGGAGA CACCGAAATC AAGATCCTCG ACGACTTCTA CAAGGGCAAG
GTGAGCGACG ATGCTGACGA AGCCGACAAC GAGGCCGATG CCGTCAGCGA GGCATGGGTC
GTCTGGTCCA ACGCACAGAC CAAACACCCG CAGATCGCTA AAAAGGTTCT CGGCATGCAA
GACATGGTCC ACAGTACCCG CGAGCAATAC ATCGACGAGA ACGCCGGCAG CGTGACGTGC
TTCGTCAGTA CGGAGTCCGG CGTCGACGCA TTTGCGACAT CGACCACGAA CGCCGACGGC
TCAACGGCAG AGAGCCTACT CACACCGCTG GAAGCCATGC GCACCTTCCG TGCCCAAATC
GATACGCCCA CAGCGGAACT GCGGCCCGAT CACTTCGACC GACAAACAGC CCTCGTCCAC
GGGCCCCTCA CAATTGAAGC GATCGCGGCT GGAAACCTCA AGGGAATCCG AAAGTGGGTG
TGGGAACGTC TCGCTGGAAC ATTGTTCGCC CAGAAGGCAA CTGACGCGCT CAACGCGCTA
CACGCACAGC CCCTCACCGA GCACGCAACA ATGCGGCTTA GTCAGGCCCG TCGAAACCGC
TACAGCGTGG ATGACATTGC CGACCTACTC AACCAGCTCC ATGAAGAAGA CCGCCTGGTC
ATCAAGTCAT CGGAGACTGA CAACATCAAA CTCGTCTGCT CGATCGGAGT ACGTGAAGCA
TGA
 
Protein sequence
MPNLFDNLSD DTRLKDALRV SLKDFDTVDV ATGYLDLRGW STMADIVDTK ARTGAEPVAR 
VLIGMVAPAD SQEILDGLQR EVQPPGYGAD IHDREKALAR RDQLVKHLRN QLMRGLATVE
GQRTLQALKR QLEDGAVQMK VFTEKPLHGK AYLFHAPGKS HGSRWAYVGS SNLTHAGLTT
NLELNIDVQD SDANTKIAEW FEARWTDRFS LPITAEIIQL ISESWAAGLQ PTPFEVYLKV
CHALSQDARD GLGYVLPTSM QNLLLDYQES AVRTLARRIV RRGGTMLGDV VGLGKTLTAI
ATALMLQSAE DYSTLVLCPK NLELMWTKHL DKYEINGRVV PYSMADRVLP ELKRFNLVIC
DESHNLRNSS TIAYEAIHEY IRRNSSKVLL LTATPYNLAF QDVASQIALY IDDDEDLGIV
PTAALAAEPG LRDKVDGKIN TLAAFRRSDH AEDWRRLMSD HLVRRTRSFI KRTAKTEPVT
LPDGTTEQQP YLQFANGEKF HFPTRIARPL SHDFADDDPA KLMEDNDTLN AVSTLALPRY
RLADYDNPRA PHTDSDTKVL DDIRSGRGNV SGFVRTGLFK RLSSSGHSFI VSLQRQRARN
ELFLHAIDNE LPIPIGSFTD KQFAVSDEDV EDDPTPHGSL ASRYEELRKN LPAKTKWVNS
TVFKTTLRKD LERDNQMLTT MLDRFGSWDS TRDSKINALV DLLRNDHPGE KVLVFTEYVD
TAEYVAQALQ EASVEKVAVV SGNTDDPADI ARRFSPHSNR IPGQEELPEV DPADPIDVLV
ATDVLSEGQN LQDSHIVVNY DLPWAIIRLI QRAGRVDRVG QKSDQVFVYL ISHENVEAQI
NLRQRIRARL GAAAEAFGAD EQFFGGDTEI KILDDFYKGK VSDDADEADN EADAVSEAWV
VWSNAQTKHP QIAKKVLGMQ DMVHSTREQY IDENAGSVTC FVSTESGVDA FATSTTNADG
STAESLLTPL EAMRTFRAQI DTPTAELRPD HFDRQTALVH GPLTIEAIAA GNLKGIRKWV
WERLAGTLFA QKATDALNAL HAQPLTEHAT MRLSQARRNR YSVDDIADLL NQLHEEDRLV
IKSSETDNIK LVCSIGVREA