Gene Information |
Locus tag | Mjls_5182 |
Symbol | |
ID | 4880880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. JLS |
Kingdom | Bacteria |
Replicon accession | NC_009077 |
Strand | - |
Start bp | 5430680 |
End bp | 5433553 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640142493 |
Product | DNA topoisomerase I |
Protein accession | YP_001073437 |
Protein GI | 126437746 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
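The coordinate and length fields listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 5430680
END_BP = 5433553


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 2874
print(protein_length_aa(length))  # 957
```

Under these assumptions the results agree with the Gene Length (2874 bp) and Protein Length (957 aa) fields in the record.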
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00490347 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGAATT GGAGCAGGAT CCAGTTGGCT GACGGTAGCC CAAGAGGCGG AAACGGCGGC GGTGAGCCGC CGGCGCGCAG AGCGAACGGC AGCGTGCGGC GACTCGTCAT TGTCGAGTCG CCGACGAAGG CGCGCAAAAT CGCCGGTTAC CTGGGCTCGA ATTACATCGT CGAATCGTCG CGTGGACACA TCCGCGACCT GCCCCGGGCC GCCGCTGACG TGCCCGCGAA GTACAAGTCG GAGCCGTGGG CCCGCCTCGG CGTCGACGTC GAACACGACT TCGAACCGCT CTACATCATC AGCCCGGACA AGAAGAGCAC CGTCGCGGAT CTGAAGGACA AGCTCAAGAA CGTCGACGAG CTCTATCTGG CGACGGACGG CGACCGCGAG GGCGAGGCCA TCGCGTGGCA CCTGCTCGAA ACGCTCAAAC CGCGCATCCC GGTCAAGCGG ATGGTGTTCC ACGAGATCAC CGAGCCCGCG ATCCGTGCGG CCGCCGAAGA CCCCCGCGAC CTGGACAACG ACCTGGTCGA CGCGCAGGAG ACCCGCCGCA TCCTCGACCG CCTCTACGGC TACGAGGTCA GCCCGGTGCT GTGGAAGAAG GTGGCGCCGA AGCTGTCGGC CGGACGCGTG CAGTCGGTGG CGACCCGGAT CATCGTGCAG CGCGAACGTG AGCGCATGGC GTTCCGCACC GCCGGCTACT GGGATGTGAG CGCCGAACTG GACGCCAGCG TCTCCGATCC GCAGGCCACC CCGCCGACGT TCACGGCGAA ACTCAACAGC GTCGACGGAC GCCGCGTGGC CACCGGCCGC GATTTCGACT CGCTCGGTCA GGTGCGCAAA CCCGACGAGG TGCTGGTCCT CGACGAGGCC GCCGCCGGGG CGTTGGCGGC GGGTCTGCAG GCTGCGCAGC TGTCGGTGTC CTCCGTCGAG CAGAAGCCCT ACACGCGCAG GCCCTACGCA CCGTTCATGA CCTCGACGCT GCAGCAGGAG GCCGGCCGCA AGCTGCGCTT CTCGTCGGAG CGCACGATGA GCATCGCCCA GCGCCTGTAC GAGAACGGCT ACATCACCTA CATGCGTACC GACTCGACCA CGCTGTCGCA GTCGGCCATC GACGCGGCGC GCAACCAAGC CCGCCAGCTC TACGGCGAGG AGTACGTGCA CCCGACGGCG CGCCAGTACA CCCGCAAGGT GAAGAACGCG CAGGAGGCCC ACGAGGCGAT CCGCCCCGCG GGGGATGTGT TCCAGACCCC CGGGCAGCTG CACGCGCAGC TCGACACCGA CGAGTTCCGG CTCTACGAGC TGATCTGGCA GCGCACCGTC GCCTCGCAGA TGGCCGATGC GCGCGGCACC ACGCTGTCGC TGCGCATCGC CGGGGACTCG CGGGACGGAC AGTCGGTGGT GTTCTCCGCC AGCGGGCGCA CCATCACCTT CGCCGGCTTC CTCAAGGCCT ACGTGGAGAG TATCGACGAA CTCGCCGGCG GCGAGTCCGA CGACGCCGAG AGCCGCCTGC CGAACCTGAC CCAGGGGCAG CGCGTCGACG CCAAGGAGCT CACCCCCGCC GGGCACCAGA CCAGCCCGCC CGCCCGCTAC ACCGAGGCGT CGCTCATCAA GGCCCTCGAG GATCTCGGCA TCGGCCGGCC GTCGACCTAT TCGTCGATCA TCAAGACCAT CCAGGACCGC GGCTACGTCC ACAAGAAGGG CAGCGCGCTG GTCCCGTCGT GGGTGGCGTT CGCCGTGATC GGGTTGCTGG AACAGCATTT CGGCCGTCTG GTGGACTACG GGTTCACCGC CGCGATGGAG 
GACGAACTCG ACGAGATCGC CTCCGGCACC GAGCGAAGGA CCAACTGGCT CAAGAACTTC TACTTCGGCG GTGAGCACGG CGTCGGCGAT TCGATCGCGC GCGCGGGTGG GCTGAAGAAG CTGGTCGGCG TCAACCTCGA GGAAATCGAC GCGCGAGAAG TCAACTCCAT CAAACTCTTC GACGATGCAG AGGGACGTCC CATCTACGTG CGGGTGGGCA AGAACGGCCC CTACCTGGAG CGGATGGTCG CCGACGAGGA GAACCCGGGT GAGCTCAAAC CCCAGCGCGC CAACCTCAAG GACGAGCTGA CGCCGGACGA GTTGACCCTT GAGCTGGCCG AAAAGCTGTT CTCCACACCG CAAGAGGGCC GCACGCTGGG TGTCGACCCG GAGACCGGAC ACGAGATCGT CGCCAAGGAC GGCCGCTACG GGCCGTATGT GACCGAGGTG CTGCCGGCGC CTCCGGAGGA GCCGGAAGAC GGTGCGCCTG CGAAGAAGGG CAAGAAGCCG ACCGGTCCCA AACCGCGGAC CGGTTCGCTG CTGCGCACCA TGGACCTCGA GACCGTCACG CTCGACGACG CACTCAAACT GCTGTCGCTG CCGCGGGTGG TGGGAGTCGA TCCCAACACC GGTGAGGAGA TCACCGCGCA GAACGGCCGG TACGGGCCAT ACCTCAAGCG CGGCACCGAC TCTCGGTCGC TCGCCACCGA AGAGCAGATG TTCACCATCA CCCTCGACGA GGCGTTGAAG ATCTACGCCG AGCCGAAGCG CCGCGGCCGG CAGGGCGCGG CGACGCCGCC GCTGCGCGAA CTGGGCGTCG ACCCCGTCTC GGAGAAGCCG ATGGTGATCA AGGACGGCCG CTTCGGGCCG TACGTCACCG ACGGTGAGAC CAACGCCAGC CTGCGCAAGG GCGACGACGT CATGTCGATC ACCGATGCGC GCGCCTCGGA ACTGCTCGCC GACCGGCGGG CCCGCGGACC GGTCAAGAAG AAGGCCGCGG CCAAGAAGGC GCCGGCGAAG AAGACCGCGG CCAAGAAGAC CGCGGCGAAG AAGGCGTCCG CCAAGAAGGC GTAG
Protein sequence | MKNWSRIQLA DGSPRGGNGG GEPPARRANG SVRRLVIVES PTKARKIAGY LGSNYIVESS RGHIRDLPRA AADVPAKYKS EPWARLGVDV EHDFEPLYII SPDKKSTVAD LKDKLKNVDE LYLATDGDRE GEAIAWHLLE TLKPRIPVKR MVFHEITEPA IRAAAEDPRD LDNDLVDAQE TRRILDRLYG YEVSPVLWKK VAPKLSAGRV QSVATRIIVQ RERERMAFRT AGYWDVSAEL DASVSDPQAT PPTFTAKLNS VDGRRVATGR DFDSLGQVRK PDEVLVLDEA AAGALAAGLQ AAQLSVSSVE QKPYTRRPYA PFMTSTLQQE AGRKLRFSSE RTMSIAQRLY ENGYITYMRT DSTTLSQSAI DAARNQARQL YGEEYVHPTA RQYTRKVKNA QEAHEAIRPA GDVFQTPGQL HAQLDTDEFR LYELIWQRTV ASQMADARGT TLSLRIAGDS RDGQSVVFSA SGRTITFAGF LKAYVESIDE LAGGESDDAE SRLPNLTQGQ RVDAKELTPA GHQTSPPARY TEASLIKALE DLGIGRPSTY SSIIKTIQDR GYVHKKGSAL VPSWVAFAVI GLLEQHFGRL VDYGFTAAME DELDEIASGT ERRTNWLKNF YFGGEHGVGD SIARAGGLKK LVGVNLEEID AREVNSIKLF DDAEGRPIYV RVGKNGPYLE RMVADEENPG ELKPQRANLK DELTPDELTL ELAEKLFSTP QEGRTLGVDP ETGHEIVAKD GRYGPYVTEV LPAPPEEPED GAPAKKGKKP TGPKPRTGSL LRTMDLETVT LDDALKLLSL PRVVGVDPNT GEEITAQNGR YGPYLKRGTD SRSLATEEQM FTITLDEALK IYAEPKRRGR QGAATPPLRE LGVDPVSEKP MVIKDGRFGP YVTDGETNAS LRKGDDVMSI TDARASELLA DRRARGPVKK KAAAKKAPAK KTAAKKTAAK KASAKKA