Gene Mjls_5182 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_5182 
Symbol 
ID4880880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp5430680 
End bp5433553 
Gene Length2874 bp 
Protein Length957 aa 
Translation table11 
GC content68% 
IMG OID640142493 
ProductDNA topoisomerase I 
Protein accessionYP_001073437 
Protein GI126437746 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00490347 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGAATT GGAGCAGGAT CCAGTTGGCT GACGGTAGCC CAAGAGGCGG AAACGGCGGC 
GGTGAGCCGC CGGCGCGCAG AGCGAACGGC AGCGTGCGGC GACTCGTCAT TGTCGAGTCG
CCGACGAAGG CGCGCAAAAT CGCCGGTTAC CTGGGCTCGA ATTACATCGT CGAATCGTCG
CGTGGACACA TCCGCGACCT GCCCCGGGCC GCCGCTGACG TGCCCGCGAA GTACAAGTCG
GAGCCGTGGG CCCGCCTCGG CGTCGACGTC GAACACGACT TCGAACCGCT CTACATCATC
AGCCCGGACA AGAAGAGCAC CGTCGCGGAT CTGAAGGACA AGCTCAAGAA CGTCGACGAG
CTCTATCTGG CGACGGACGG CGACCGCGAG GGCGAGGCCA TCGCGTGGCA CCTGCTCGAA
ACGCTCAAAC CGCGCATCCC GGTCAAGCGG ATGGTGTTCC ACGAGATCAC CGAGCCCGCG
ATCCGTGCGG CCGCCGAAGA CCCCCGCGAC CTGGACAACG ACCTGGTCGA CGCGCAGGAG
ACCCGCCGCA TCCTCGACCG CCTCTACGGC TACGAGGTCA GCCCGGTGCT GTGGAAGAAG
GTGGCGCCGA AGCTGTCGGC CGGACGCGTG CAGTCGGTGG CGACCCGGAT CATCGTGCAG
CGCGAACGTG AGCGCATGGC GTTCCGCACC GCCGGCTACT GGGATGTGAG CGCCGAACTG
GACGCCAGCG TCTCCGATCC GCAGGCCACC CCGCCGACGT TCACGGCGAA ACTCAACAGC
GTCGACGGAC GCCGCGTGGC CACCGGCCGC GATTTCGACT CGCTCGGTCA GGTGCGCAAA
CCCGACGAGG TGCTGGTCCT CGACGAGGCC GCCGCCGGGG CGTTGGCGGC GGGTCTGCAG
GCTGCGCAGC TGTCGGTGTC CTCCGTCGAG CAGAAGCCCT ACACGCGCAG GCCCTACGCA
CCGTTCATGA CCTCGACGCT GCAGCAGGAG GCCGGCCGCA AGCTGCGCTT CTCGTCGGAG
CGCACGATGA GCATCGCCCA GCGCCTGTAC GAGAACGGCT ACATCACCTA CATGCGTACC
GACTCGACCA CGCTGTCGCA GTCGGCCATC GACGCGGCGC GCAACCAAGC CCGCCAGCTC
TACGGCGAGG AGTACGTGCA CCCGACGGCG CGCCAGTACA CCCGCAAGGT GAAGAACGCG
CAGGAGGCCC ACGAGGCGAT CCGCCCCGCG GGGGATGTGT TCCAGACCCC CGGGCAGCTG
CACGCGCAGC TCGACACCGA CGAGTTCCGG CTCTACGAGC TGATCTGGCA GCGCACCGTC
GCCTCGCAGA TGGCCGATGC GCGCGGCACC ACGCTGTCGC TGCGCATCGC CGGGGACTCG
CGGGACGGAC AGTCGGTGGT GTTCTCCGCC AGCGGGCGCA CCATCACCTT CGCCGGCTTC
CTCAAGGCCT ACGTGGAGAG TATCGACGAA CTCGCCGGCG GCGAGTCCGA CGACGCCGAG
AGCCGCCTGC CGAACCTGAC CCAGGGGCAG CGCGTCGACG CCAAGGAGCT CACCCCCGCC
GGGCACCAGA CCAGCCCGCC CGCCCGCTAC ACCGAGGCGT CGCTCATCAA GGCCCTCGAG
GATCTCGGCA TCGGCCGGCC GTCGACCTAT TCGTCGATCA TCAAGACCAT CCAGGACCGC
GGCTACGTCC ACAAGAAGGG CAGCGCGCTG GTCCCGTCGT GGGTGGCGTT CGCCGTGATC
GGGTTGCTGG AACAGCATTT CGGCCGTCTG GTGGACTACG GGTTCACCGC CGCGATGGAG
GACGAACTCG ACGAGATCGC CTCCGGCACC GAGCGAAGGA CCAACTGGCT CAAGAACTTC
TACTTCGGCG GTGAGCACGG CGTCGGCGAT TCGATCGCGC GCGCGGGTGG GCTGAAGAAG
CTGGTCGGCG TCAACCTCGA GGAAATCGAC GCGCGAGAAG TCAACTCCAT CAAACTCTTC
GACGATGCAG AGGGACGTCC CATCTACGTG CGGGTGGGCA AGAACGGCCC CTACCTGGAG
CGGATGGTCG CCGACGAGGA GAACCCGGGT GAGCTCAAAC CCCAGCGCGC CAACCTCAAG
GACGAGCTGA CGCCGGACGA GTTGACCCTT GAGCTGGCCG AAAAGCTGTT CTCCACACCG
CAAGAGGGCC GCACGCTGGG TGTCGACCCG GAGACCGGAC ACGAGATCGT CGCCAAGGAC
GGCCGCTACG GGCCGTATGT GACCGAGGTG CTGCCGGCGC CTCCGGAGGA GCCGGAAGAC
GGTGCGCCTG CGAAGAAGGG CAAGAAGCCG ACCGGTCCCA AACCGCGGAC CGGTTCGCTG
CTGCGCACCA TGGACCTCGA GACCGTCACG CTCGACGACG CACTCAAACT GCTGTCGCTG
CCGCGGGTGG TGGGAGTCGA TCCCAACACC GGTGAGGAGA TCACCGCGCA GAACGGCCGG
TACGGGCCAT ACCTCAAGCG CGGCACCGAC TCTCGGTCGC TCGCCACCGA AGAGCAGATG
TTCACCATCA CCCTCGACGA GGCGTTGAAG ATCTACGCCG AGCCGAAGCG CCGCGGCCGG
CAGGGCGCGG CGACGCCGCC GCTGCGCGAA CTGGGCGTCG ACCCCGTCTC GGAGAAGCCG
ATGGTGATCA AGGACGGCCG CTTCGGGCCG TACGTCACCG ACGGTGAGAC CAACGCCAGC
CTGCGCAAGG GCGACGACGT CATGTCGATC ACCGATGCGC GCGCCTCGGA ACTGCTCGCC
GACCGGCGGG CCCGCGGACC GGTCAAGAAG AAGGCCGCGG CCAAGAAGGC GCCGGCGAAG
AAGACCGCGG CCAAGAAGAC CGCGGCGAAG AAGGCGTCCG CCAAGAAGGC GTAG
 
Protein sequence
MKNWSRIQLA DGSPRGGNGG GEPPARRANG SVRRLVIVES PTKARKIAGY LGSNYIVESS 
RGHIRDLPRA AADVPAKYKS EPWARLGVDV EHDFEPLYII SPDKKSTVAD LKDKLKNVDE
LYLATDGDRE GEAIAWHLLE TLKPRIPVKR MVFHEITEPA IRAAAEDPRD LDNDLVDAQE
TRRILDRLYG YEVSPVLWKK VAPKLSAGRV QSVATRIIVQ RERERMAFRT AGYWDVSAEL
DASVSDPQAT PPTFTAKLNS VDGRRVATGR DFDSLGQVRK PDEVLVLDEA AAGALAAGLQ
AAQLSVSSVE QKPYTRRPYA PFMTSTLQQE AGRKLRFSSE RTMSIAQRLY ENGYITYMRT
DSTTLSQSAI DAARNQARQL YGEEYVHPTA RQYTRKVKNA QEAHEAIRPA GDVFQTPGQL
HAQLDTDEFR LYELIWQRTV ASQMADARGT TLSLRIAGDS RDGQSVVFSA SGRTITFAGF
LKAYVESIDE LAGGESDDAE SRLPNLTQGQ RVDAKELTPA GHQTSPPARY TEASLIKALE
DLGIGRPSTY SSIIKTIQDR GYVHKKGSAL VPSWVAFAVI GLLEQHFGRL VDYGFTAAME
DELDEIASGT ERRTNWLKNF YFGGEHGVGD SIARAGGLKK LVGVNLEEID AREVNSIKLF
DDAEGRPIYV RVGKNGPYLE RMVADEENPG ELKPQRANLK DELTPDELTL ELAEKLFSTP
QEGRTLGVDP ETGHEIVAKD GRYGPYVTEV LPAPPEEPED GAPAKKGKKP TGPKPRTGSL
LRTMDLETVT LDDALKLLSL PRVVGVDPNT GEEITAQNGR YGPYLKRGTD SRSLATEEQM
FTITLDEALK IYAEPKRRGR QGAATPPLRE LGVDPVSEKP MVIKDGRFGP YVTDGETNAS
LRKGDDVMSI TDARASELLA DRRARGPVKK KAAAKKAPAK KTAAKKTAAK KASAKKA