Gene Mkms_4882 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4882 
Symbol 
ID4615842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5116324 
End bp5119197 
Gene Length2874 bp 
Protein Length957 aa 
Translation table11 
GC content68% 
IMG OID639794574 
ProductDNA topoisomerase I 
Protein accessionYP_940862 
Protein GI119870910 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.010742 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAATT GGAGCAGGAT CCAGTTGGCT GACGGTAGCC CAAGAGGCGG AAACGGCGGC 
GGTGAGCCGC CGGCGCGCAG AGCGAACGGC AGCGTGCGGC GACTCGTCAT TGTCGAGTCG
CCGACGAAGG CGCGCAAAAT CGCCGGTTAC CTGGGCTCGA ATTACATCGT CGAATCGTCG
CGTGGACACA TCCGCGACCT GCCCCGGGCC GCCGCTGACG TGCCCGCGAA GTACAAGTCG
GAGCCGTGGG CCCGCCTCGG CGTCGACGTC GAACACGACT TCGAACCGCT CTACATCATC
AGCCCGGACA AGAAGAGCAC CGTCGCGGAT CTGAAGGACA AGCTCAAGAA CGTCGACGAG
CTCTATCTGG CGACGGACGG CGACCGCGAG GGCGAGGCCA TCGCGTGGCA CCTGCTCGAA
ACGCTCAAAC CGCGCATCCC GGTCAAGCGG ATGGTGTTCC ACGAGATCAC CGAGCCCGCG
ATCCGTGCGG CCGCCGAAGA CCCCCGCGAC CTCGACAACG ACCTGGTCGA CGCGCAGGAG
ACCCGCCGCA TCCTCGACCG CCTCTACGGC TACGAGGTCA GCCCGGTGCT GTGGAAGAAG
GTGGCGCCGA AGCTGTCGGC CGGACGCGTG CAGTCGGTGG CGACCCGGAT CATCGTGCAG
CGCGAACGTG AGCGCATGGC GTTCCGCACC GCCGGCTACT GGGATGTGAG CGCCGAACTG
GACGCCAGCG TCTCCGATCC GCAGGCCACC CCGCCGACGT TCACGGCGAA ACTCAACAGC
GTCGACGGAC GCCGCGTGGC CACCGGCCGC GATTTCGACT CGCTCGGTCA GGTGCGCAAA
CCCGACGAGG TGCTGGTCCT CGACGAGGCC GCCGCCGGGG CGTTGGCGGC GGGTCTGCAG
GCTGCGCAGC TGTCGGTGTC CTCCGTCGAG CAGAAGCCCT ACACGCGCAG GCCCTACGCA
CCGTTCATGA CCTCGACGCT GCAGCAGGAG GCCGGCCGCA AGCTGCGCTT CTCGTCGGAG
CGCACGATGA GCATCGCCCA GCGCCTGTAC GAGAACGGCT ACATCACCTA CATGCGTACC
GACTCGACCA CGCTGTCGCA GTCGGCCATT GACGCGGCGC GCAACCAAGC CCGCCAGCTC
TACGGCGAGG AGTACGTGCA CCCGACGGCG CGCCAGTACA CCCGCAAGGT GAAGAACGCG
CAGGAGGCCC ACGAGGCGAT CCGCCCCGCG GGGGATGTGT TCCAGACCCC CGGGCAGCTG
CACGCGCAGC TCGACACCGA CGAGTTCCGG CTCTACGAGC TGATCTGGCA GCGCACCGTC
GCCTCACAGA TGGCCGATGC GCGCGGCACC ACGCTGTCGC TGCGCATCGC CGGGGACTCG
CGGGACGGAC AGTCGGTGGT GTTCTCCGCC AGCGGGCGCA CCATCACCTT CGCCGGCTTC
CTCAAGGCCT ACGTGGAGAG CATCGACGAA CTCGCCGGCG GCGAGTCCGA CGACGCCGAG
AGCCGCCTGC CGAACCTGAC CCAGGGGCAG CGCGTCGACG CCAAGGAGCT CACCCCCGCC
GGGCACCAGA CCAGCCCGCC CGCCCGCTAC ACCGAGGCGT CGCTCATCAA GGCCCTCGAG
GATCTCGGCA TCGGCCGGCC GTCGACCTAC TCGTCGATCA TCAAGACCAT CCAGGACCGC
GGCTACGTCC ACAAGAAGGG CAGCGCGCTG GTCCCGTCGT GGGTGGCGTT CGCCGTGATC
GGGTTGCTCG AACAGCATTT CGGCCGTCTG GTGGACTACG GGTTCACCGC CGCGATGGAG
GACGAACTCG ACGAGATCGC CTCCGGCACC GAGCGAAGGA CCAACTGGCT CAAGAACTTC
TACTTCGGCG GTGAGCACGG CGTCGGCGAT TCGATCGCGC GCGCGGGTGG GCTGAAGAAG
CTGGTCGGCG TCAACCTCGA GGAGATCGAC GCGCGAGAAG TCAACTCCAT CAAGCTCTTC
GACGATGCGG AGGGACGTCC CATCTACGTG CGGGTGGGCA AGAACGGCCC CTACCTGGAG
CGGATGGTCG CCGACGAGGA GAACCCGGGT GAGCTCAAAC CCCAGCGCGC CAACCTCAAA
GACGAGCTGA CACCGGACGA GCTGACCCTC GAGCTGGCCG AAAAGCTGTT CTCCACACCG
CAAGAGGGCC GCACGCTGGG CGTCGACCCG GAGACCGGAC ACGAGATCGT CGCCAAGGAC
GGCCGCTACG GACCGTATGT GACCGAGGTG CTGCCCGCGC CTCCCGAGGA GCCGGAGGAC
GGTGCGCCGG CGAAGAAGGG CAAGAAGCCG ACCGGGCCCA AACCGCGGAC CGGTTCGCTG
CTGCGCACCA TGGACCTCGA GACCGTCACG CTCGACGACG CACTCAAACT GCTGTCGCTG
CCGCGGGTGG TGGGAGTCGA TCCCAACACC GGTGAGGAGA TCACCGCGCA GAACGGCCGG
TACGGGCCAT ACCTCAAGCG CGGCACCGAC TCTCGGTCGC TGGCCACCGA GGAGCAGATG
TTCACCATCA CCCTCGACGA GGCGTTGAAG ATCTACGCCG AGCCGAAGCG CCGCGGCCGG
CAGGGCGCGG CGACGCCGCC GCTGCGCGAA CTGGGCGTCG ACCCTGTCTC GGAGAAGCCG
ATGGTGATCA AGGACGGCCG CTTCGGGCCG TACGTCACCG ACGGTGAGAC CAACGCCAGC
CTGCGCAAGG GCGACGACGT CATGTCGATC ACCGATGCGC GCGCCTCGGA ACTGCTCGCC
GACCGGCGGG CCCGCGGACC GGTCAAGAAG AAGGCCGCGG CCAAGAAGGC GCCGGCGAAG
AAGACCGCGG CCAAGAAGAC CGCGGCGAAG AAGGCGTCCG CCAAGAAGGC GTAG
 
Protein sequence
MKNWSRIQLA DGSPRGGNGG GEPPARRANG SVRRLVIVES PTKARKIAGY LGSNYIVESS 
RGHIRDLPRA AADVPAKYKS EPWARLGVDV EHDFEPLYII SPDKKSTVAD LKDKLKNVDE
LYLATDGDRE GEAIAWHLLE TLKPRIPVKR MVFHEITEPA IRAAAEDPRD LDNDLVDAQE
TRRILDRLYG YEVSPVLWKK VAPKLSAGRV QSVATRIIVQ RERERMAFRT AGYWDVSAEL
DASVSDPQAT PPTFTAKLNS VDGRRVATGR DFDSLGQVRK PDEVLVLDEA AAGALAAGLQ
AAQLSVSSVE QKPYTRRPYA PFMTSTLQQE AGRKLRFSSE RTMSIAQRLY ENGYITYMRT
DSTTLSQSAI DAARNQARQL YGEEYVHPTA RQYTRKVKNA QEAHEAIRPA GDVFQTPGQL
HAQLDTDEFR LYELIWQRTV ASQMADARGT TLSLRIAGDS RDGQSVVFSA SGRTITFAGF
LKAYVESIDE LAGGESDDAE SRLPNLTQGQ RVDAKELTPA GHQTSPPARY TEASLIKALE
DLGIGRPSTY SSIIKTIQDR GYVHKKGSAL VPSWVAFAVI GLLEQHFGRL VDYGFTAAME
DELDEIASGT ERRTNWLKNF YFGGEHGVGD SIARAGGLKK LVGVNLEEID AREVNSIKLF
DDAEGRPIYV RVGKNGPYLE RMVADEENPG ELKPQRANLK DELTPDELTL ELAEKLFSTP
QEGRTLGVDP ETGHEIVAKD GRYGPYVTEV LPAPPEEPED GAPAKKGKKP TGPKPRTGSL
LRTMDLETVT LDDALKLLSL PRVVGVDPNT GEEITAQNGR YGPYLKRGTD SRSLATEEQM
FTITLDEALK IYAEPKRRGR QGAATPPLRE LGVDPVSEKP MVIKDGRFGP YVTDGETNAS
LRKGDDVMSI TDARASELLA DRRARGPVKK KAAAKKAPAK KTAAKKTAAK KASAKKA