Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_5396 |
Symbol | |
ID | 4648076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 5777114 |
End bp | 5779981 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639808872 |
Product | DNA topoisomerase I |
Protein accession | YP_956173 |
Protein GI | 120406344 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.585871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0823046 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTGACG AGGAACGCGG CAGCGGTAAG AACGGCGCCG AGCCGCGCAG GGGGAATGGC TCGTCGGTGC GGAGACTCGT CATCGTCGAG TCGCCGACCA AGGCGCGCAA GATCGCAGGT TACCTGGGGT CCAACTACAT CGTCGAGTCC TCACGCGGGC ACATTCGGGA CCTGCCGCGC AACGCCGCCG ACGTCCCGGC GAAGTACAAA TCCGAGCCGT GGGCCCGCCT GGGCGTCAAC GTCGAGCACA ACTTCGAGCC GCTCTACATC ATCAGCCCGG ACAAGAAGAG CACCGTCGCC GACCTGAAGG ACAAGCTCAA GAACGTCGAC GAGCTCTATC TGGCCACCGA CGGTGACCGC GAGGGCGAGG CCATCGCCTG GCACCTGCTG GAGACGCTGA AACCGCGCAT CCCGGTCAAG CGGATGGTGT TCCACGAGAT CACCGAGCCC GCGATCCGCG CGGCCGCCGA AGACCCGCGC GACCTGGACA ACGACCTGGT CGACGCGCAG GAGACCCGAC GCATCCTCGA CCGTCTCTAC GGCTACGAGG TCAGCCCCGT GCTGTGGAAG AAGGTCGCGC CGAAGCTGTC GGCGGGCCGG GTCCAGTCCG TGGCCACCCG CATCATCGTG CAGCGCGAAC GCGAGCGGAT GGCGTTCCGC AGCGCCGGGT ACTGGGACGT CACCGCCGAG CTGGACGCCA GCGTCTCCGA CGCGCAGGCC AGCCCGCCCA CCTTCGTCGC AAAGCTCAAC ACGGTCGACG GGCGCCGCGT GGCCACCGGC CGCGACTTCG ACTCCCTCGG CGCGGTCCGC AAGCCCGACG AGGTGCTCGT CCTCGACGAG GCCGCCGCCA ACGCCTTGGC CACCGGTCTG CGGGGCGCGC AGCTGGCGGT CTCGTCGGTC GAGCAGAAGC CGTACACGCG CCGCCCGTAC GCGCCGTTCA TGACGTCGAC GCTGCAACAG GAGGCGGGCC GCAAGCTGCG GTTCACGTCG GAGCGCACGA TGAGCATCGC GCAGCGGCTC TACGAGAACG GCTACATCAC CTACATGCGT ACCGACTCGA CCACGCTGTC GCAGTCGGCC ATTGACGCCG CCCGCAATCA GGCCCGTCAG CTCTACGGCG AGGAATACGT CCACCCGACG CCGCGCCAGT ACACCCGCAA GGTCAAGAAC GCGCAGGAGG CGCACGAGGC GATCCGCCCC GCCGGTGACG TGTTCCAGAC CCCCGGTCAG CTGCACAGCC AGCTCGACAC CGACGAGTTC CGCCTCTACG AGCTGATCTG GCAGCGCACC GTCGCCTCGC AGATGGCCGA TGCCCGCGGC ACCACGCTGA GCCTGCGGAT CGCCGGAGCC GCCCCGGCGA CGACATTGGG CGGAGGTACC GCGTCCGACG TCCAGGTGGT GTTCAACGCC AGCGGCCGCA CCATCACGTT CCCTGGCTTC CTGAAGGCCT ACGTCGAGAG CATCGACGAC CTGGCCGGCG GCGAGGCCGA CGACGCCGAG AGCAGGCTGC CCAACCTCAC CCAGGGTCAG CGGGTGGACG CCAAGGGGCT GACCGCCGAC GGCCACACCA CCTCGCCGCC CGCGCGCTAC ACCGAGGCCT CTCTGATCAA GGCGCTGGAG GATCTCGGCA TCGGCCGGCC GTCGACGTAC AGCTCGATCA TCAAGACCAT CCAGGACCGC GGTTACGTCC ACAAGAAGGG CAGCGCGCTG GTTCCGTCGT GGGTGGCGTT CGCCGTCATC GGTCTGCTCG AGCAGCACTT CGGGCGTCTG GTCGACTACG ACTTCACCGC CGCGATGGAG GACGAGCTCG ACGAGATCGC AGCAGGGCAC GAGCGACGCA CCAACTGGCT CAACAACTTC TACTTCGGTG GCGAGCACGG CGCGGACGGT TCGATCGCCC GCTCGGGCGG GCTCAAGAAG CTCGTCGGTG GCAACCTCGA AGAGATCGAC GCGCGAGAAG TCAACTCCAT CAAGCTCTTC GACGATGCCG AAGGCCGCGC GGTCAACGTG CGCGTCGGAC GCAACGGTGC CTATCTCGAG CGCATGGTGG CCGATCCGGA CAACCCCGGT GAGCTCAAAC CGCAGCGGGC CAACCTCAAG GACGAGCTGA CGCCTGACGA GCTGACCCTT GAGCTGGCCG AAAAGCTCTT CGCCACACCG CAAGAGGGCC GTTCGCTGGG TGTCGACCCG GCGACCGGGC ACGAGATCGT CGCCAAGGAC GGCCGTTACG GCCCGTATGT CACCGAGGTG CTTCCTGAAC CGCCCGACGA GGGCGAAGCG GGCGCCACGG CGAAGAAGGG CAAGAAGCCG ACCGGGCCGA AGCCGCGTAC CGGTTCGCTG CTTCGCTCGA TGGATCTGGA GACCGTCACG CTGGAGGACG CGCTTCGGCT GCTGTCGCTG CCGCGGGTGG TCGGCGTCGA TCCGGCCAGC GGTGAGGAGA TCACCGCGCA GAACGGCCGG TACGGCCCAT ATCTCAAGCG CGGCACCGAC TCCCGGTCTC TTGCCACCGA GGAGCAGATG TTCGACATCA CCCTCGAGGA GGCGTTGAAG ATCTACGCCG AGCCGAAGCG TCGCGGTCGG CAGGGGGCGG CGACCCCGCC GCTGCGCGAG CTCGGCGTCG ACCCGGTGTC GGAGAAGCCG ATGGTGATCA AGGACGGCCG GTTCGGGCCC TACGTCACCG ACGGTGAGAC CAACGCCAGC CTGCGCAAGG GCGACGACGT GCTGTCGATC ACCGACGCGC GGGCGTCCGA GCTGCTGGCC GACCGCCGCG CCCGGGGTCC GGTCAAGAAG AAGGCCGTCA AGAAGGCGCC TGCCAAGAAG ACGCCCGCGA AGAAGACCGC TGCCAAGAAG GCCGCGAAGA AGGCCTGA
|
Protein sequence | MADEERGSGK NGAEPRRGNG SSVRRLVIVE SPTKARKIAG YLGSNYIVES SRGHIRDLPR NAADVPAKYK SEPWARLGVN VEHNFEPLYI ISPDKKSTVA DLKDKLKNVD ELYLATDGDR EGEAIAWHLL ETLKPRIPVK RMVFHEITEP AIRAAAEDPR DLDNDLVDAQ ETRRILDRLY GYEVSPVLWK KVAPKLSAGR VQSVATRIIV QRERERMAFR SAGYWDVTAE LDASVSDAQA SPPTFVAKLN TVDGRRVATG RDFDSLGAVR KPDEVLVLDE AAANALATGL RGAQLAVSSV EQKPYTRRPY APFMTSTLQQ EAGRKLRFTS ERTMSIAQRL YENGYITYMR TDSTTLSQSA IDAARNQARQ LYGEEYVHPT PRQYTRKVKN AQEAHEAIRP AGDVFQTPGQ LHSQLDTDEF RLYELIWQRT VASQMADARG TTLSLRIAGA APATTLGGGT ASDVQVVFNA SGRTITFPGF LKAYVESIDD LAGGEADDAE SRLPNLTQGQ RVDAKGLTAD GHTTSPPARY TEASLIKALE DLGIGRPSTY SSIIKTIQDR GYVHKKGSAL VPSWVAFAVI GLLEQHFGRL VDYDFTAAME DELDEIAAGH ERRTNWLNNF YFGGEHGADG SIARSGGLKK LVGGNLEEID AREVNSIKLF DDAEGRAVNV RVGRNGAYLE RMVADPDNPG ELKPQRANLK DELTPDELTL ELAEKLFATP QEGRSLGVDP ATGHEIVAKD GRYGPYVTEV LPEPPDEGEA GATAKKGKKP TGPKPRTGSL LRSMDLETVT LEDALRLLSL PRVVGVDPAS GEEITAQNGR YGPYLKRGTD SRSLATEEQM FDITLEEALK IYAEPKRRGR QGAATPPLRE LGVDPVSEKP MVIKDGRFGP YVTDGETNAS LRKGDDVLSI TDARASELLA DRRARGPVKK KAVKKAPAKK TPAKKTAAKK AAKKA
|
| |