Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlut_18080 |
Symbol | |
ID | 7984773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Micrococcus luteus NCTC 2665 |
Kingdom | Bacteria |
Replicon accession | NC_012803 |
Strand | - |
Start bp | 1953032 |
End bp | 1955866 |
Gene Length | 2835 bp |
Protein Length | 944 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644806755 |
Product | DNA topoisomerase I |
Protein accession | YP_002957845 |
Protein GI | 239918287 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCCACTC CCCCCGCTTC CGTGACCGCT TCCGTGCCCG AGGGCAAGAA GCTGGTCATC GTGGAGTCCC CGGCCAAGAG CAAGTCCATC GCGAAGTACC TCAACGCGGT GGGCTCCGGC TGGGTGGTGG ACTCCTCCGT GGGGCACATC CGTGACCTGC CCAAGCCCTC GGACCTGCCG GCGGACATGA AGAAGGGCCC GTTCGGGAAG TTCGCCGTGG ACACGGAGCA CGGCTTCACC CCGTACTACG TGGTGCACCC GGACAAGAAG AAGAAGGTCA CGGAGCTCAA GCGCGAGCTC AAGGACGCCG AGGCCCTCTA CCTCGCGACG GATGCGGACC GGGAGGGCGA GGCCATCGCC TGGCACCTGC TGGACACCCT CAAGCCCACG GTGCCGGTGT ACCGGATGAC CTTCGGCGAG ATCACCCAGG AGGGCGTGCG CCGCGGCCTG GAGAACATCC GTGAGCTGGA CACGGCCCTC GTGGACGCCC AGGAGACGCG CCGCATCCTC GACCGCCTGT TCGGCTACGA GCTCTCGCCG GTGCTGTGGC GCAAGGTCTC CCGCGGGCTC TCGGCGGGCC GCGTGCAGTC GGTGGCCACC CGCCTCGTCG TCGAGCGCGA GCGGGAGCGC ATGGCCTTCG TCTCCGCCCA CTACTGGGGC GTGGAGGGCA CCTTCACGGT GGCCGAGGGC GCACAGGCCG CGGACGCCGT GGGCAGCTCG TTCACGGCCC GCCTCAGCAC GGTGGACGGC GAGCGCGTGG CCACGGGCCG CGACTTCGAC GACGCCGGGC GGCTCAAGGC CTCCGCCGGC AAGAAGGTGA CCCACCTGGA CGAGGCCGGC GCCCGCTCGC TGGCCGAGGG TCTGCGCGGG GCGGACTTCA CGGTCGACTC GGTGGAGGAC AAGCCGTACA CCCGCCGCCC GGCCGCCCCC TTCACCACCT CCACGCTGCA GCAGGAGGCC GCCCGCAAGC TCCGCATGAG CTCCCGCGTC TCCATGCAGG TGGCCCAGCG CCTGTACGAG AACGGCTACA TCACGTACAT GCGCACGGAC TCGGTGAACC TCTCGCAGGA GGCCGTGCAG GCCGCCCGGC GCCAGGCCAC CGAGCTCTAC GGGGCGGACG CCGTGCCGGC CCAGCCGCGC GTCTACGCCA AGCGCAACGA GACCGCGCAG GAGGCCCACG AGGCCATCCG GCCCGCGGGG GACTCCTTCC GGACGCCGGC CCAGGTCAAG GCCGAGCTGC GCCCGGACGA GTTCCGCCTG TACGAGCTGA TCTGGAAGCG CACCGTGGCC TCGCAGATGG CCGACGCCAA GGGCTACACC GCCACGCTGC GGCTCTCGGC CGCCGCGCAG GACGGCCGCC TCGCGCAGTT CAGCGCCTCG GGCACGGTCA TCACGTTCAA GGGCTTCCTC GACGCGTACG AGGAGGGCCG GGACGCCGAG GTCGACGGTG AGAACGCGGA GGCCAAGGAC CGCCGCCTGC CGCAGGTGGC CCAGGGCGAG CGCCTCACCG GTGACCCGAT CGAGGCCACC GGGCACGAGA CCTCCCCGCC GGCCCGCTAC ACCGAGGCGT CGATCGTCGC CGAGCTGGAG CGCCGCGAGA TCGGGCGTCC CTCCACGTAC GCCCCGACGA TCTCCACGAT CATGGACCGC GGCTACGTGT CCAAGCGCGG CACCGCGCTC GTCCCGTCGT GGACCGCGTT CGCGGTGATC GGACTGCTCG AGGACTACTT CGCGACGTAC GTGGACTACG ACTTCACGGC CCGCATGGAG GACGACCTCG ACCGGATCGC GGCGGGCGAG CTCGGCCGCG AGGCCTGGCT GCAGACCTTC TACTTCGGGG CCCCCGCCGC GGAGACGGAG AAGGGCGTCG AGGGTCTCAA GCACGTGGTG GACAACCTCG GCGAGATCGA CGCGCGGGCC GTGAACTCGC TGCCGATCAC CGAGGACATC ACCCTGCGGG TGGGCCAGTA CGGCCCGTAC CTGGAGCGGT CGCTGCCGGC CGACGCGGAG GCGGGCGCCA CCCCGCCGCG CGCCAACGTC CCCGAGGACC TGGCCCCGGA CGAGCTCACC CCGGCCAAGG CCGAGGAGCT GTTCGCCACC GCCCGGCCCT CCGAGCGCGA GCTCGGCACG GACCCGGAGA CGGGCCGGAC CGTCGTCGCC AAGGACGGGC GCTACGGGCC CTACGTCACC GAGGTCATCC CCGAGATGAC CGAGGAGGAG CTGCAGGCCT GGCTCGACGC CCAGCCCACC GAGTACTACA AGAACGGCAA GCCCAAGCCG AAGAAGAAGC CGGCCAAGGA GAAGCCCCGC ACCGGCTCCC TCTTCACGAG CATGTCCGTG GACACCGTGA CCCTCGAGCA GGCGCTGCAG CTGATGTCCC TGCCGCGCGT CGTGGGCGCG GACGCCGAGG GCGAGGTCAT CACCGCCCAG AACGGCCGCT TCGGGCCCTA CCTGAAGAAG GGCTCGGACT CGCGCTCGCT CGAGTCCGAG GACCAGATCT TCAGCATCAC GCAGGAGCAG GCCCTCGAGA TCTACGCCCA GCCGAAGCAG CGTGGCCGCG GGGCCGCCAA GCCGCCGCTG GCCGAGTTCG GCGAGGACCC GGTCTCCGGC AAGAAGGTCA CCGTGAAGGA GGGCCGCTTC GGCCCGTACG TCACGGACGG GGTCACCAAC ATCACCGTCC CGCGCGACCG CCAGCCCGAG GAGCTCACGG CGGAGGAGGC CTACCAGCTG CTCGCGGACA AGCGTGCCAA GGGCCCGGCC ACCCGCGGCG GTGCCAAGAA GCCGGCGGCG CGCAAGGCCC CGGCCAAGAA GACCACCGCG CGGAAGAAGG CCTGA
|
Protein sequence | MSTPPASVTA SVPEGKKLVI VESPAKSKSI AKYLNAVGSG WVVDSSVGHI RDLPKPSDLP ADMKKGPFGK FAVDTEHGFT PYYVVHPDKK KKVTELKREL KDAEALYLAT DADREGEAIA WHLLDTLKPT VPVYRMTFGE ITQEGVRRGL ENIRELDTAL VDAQETRRIL DRLFGYELSP VLWRKVSRGL SAGRVQSVAT RLVVERERER MAFVSAHYWG VEGTFTVAEG AQAADAVGSS FTARLSTVDG ERVATGRDFD DAGRLKASAG KKVTHLDEAG ARSLAEGLRG ADFTVDSVED KPYTRRPAAP FTTSTLQQEA ARKLRMSSRV SMQVAQRLYE NGYITYMRTD SVNLSQEAVQ AARRQATELY GADAVPAQPR VYAKRNETAQ EAHEAIRPAG DSFRTPAQVK AELRPDEFRL YELIWKRTVA SQMADAKGYT ATLRLSAAAQ DGRLAQFSAS GTVITFKGFL DAYEEGRDAE VDGENAEAKD RRLPQVAQGE RLTGDPIEAT GHETSPPARY TEASIVAELE RREIGRPSTY APTISTIMDR GYVSKRGTAL VPSWTAFAVI GLLEDYFATY VDYDFTARME DDLDRIAAGE LGREAWLQTF YFGAPAAETE KGVEGLKHVV DNLGEIDARA VNSLPITEDI TLRVGQYGPY LERSLPADAE AGATPPRANV PEDLAPDELT PAKAEELFAT ARPSERELGT DPETGRTVVA KDGRYGPYVT EVIPEMTEEE LQAWLDAQPT EYYKNGKPKP KKKPAKEKPR TGSLFTSMSV DTVTLEQALQ LMSLPRVVGA DAEGEVITAQ NGRFGPYLKK GSDSRSLESE DQIFSITQEQ ALEIYAQPKQ RGRGAAKPPL AEFGEDPVSG KKVTVKEGRF GPYVTDGVTN ITVPRDRQPE ELTAEEAYQL LADKRAKGPA TRGGAKKPAA RKAPAKKTTA RKKA
|
| |