Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_4027 |
Symbol | |
ID | 5060509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 4576864 |
End bp | 4579695 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640476288 |
Product | DNA topoisomerase I |
Protein accession | YP_001160835 |
Protein GI | 145596538 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.563129 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGAGCA ATACTGGAAC CACCCGTCTG GTCATCGTCG AGTCCCCGGC GAAGGCCAAG ACGATCTCGG GCTACCTCGG CCCGGGGTAC GTCGTGGAGG CCAGTTTCGG GCACGTCCGC GACCTGCCGC GCAATGCCGC CGATGTGCCG GCGAAGTACA AGGGCGAGGC GTGGGCCCGG CTCGGCGTCG ATGTCGACAA CGGCTTCCAC GCCCTCTACG TCGTCTCGTC GGACCGCCGG CAGCAGATCA GCAAGTTGAC CAAGCTGGCC CGGGAGGTTG ACGAGATCCT CCTGGCCACG GATGAGGACC GGGAGGGCGA GGCGATCGCC TGGCACCTGG TGGAGACGCT CAAGCCGAAG GTTCCGGTCA AGCGGATGGT CTTCCACGAG ATCACCAAGC CGGCGATCCA GGCCGCGGTG GCGAACCCGC GCGAGATCGA CCGCGATCTG GTCGATGCCC AGGAGGCCCG CCGCATCCTC GACCGGTTGT ACGGCTACGA GGTGTCCCCG GTGCTGTGGA AGAAGGTCAT GCCGAAGCTC TCGGCGGGCC GGGTGCAGTC GGTGGCGACC CGCATCGTGG TCGAGCGGGA GCGGCAGCGG ATGGCGTTCC GCACCGCCGA GTACTGGGAC ATCCTCGCCA CGCTCGCCGG CGAACAGCCC GGCGAGGGCC CCCGTACCTT CAACGCCACC CTGGTCGCGC TCAACGGTGA CCGGATCGCC ACCGGCAAGG ACTTCGAGCC GACCACCGGG CGGGTCCGTC CCGGTGCCGG CGTGGTGCAC CTCGACGAGG GCGGGGCCCG GGGGCTGGCC GCCCGGCTGG CGGATCGGCC GTTCACCGTC ACCCGGGTCG AGGAGAAGCC GTACCGTCGC CGTCCGTACG CGCCGTTCAT CACCTCGACG CTGCAACAGG AGTCGGCCCG CAAGCTGCGC TTCTCCTCCC AGCAGACGAT GCGGACCGCG CAGCGGCTCT ACGAGAACGG CTACATCACC TATATGCGTA CCGACTCGGT GAACCTGTCG GAGACCGCCA TCGCGGCGGC TCGCCGGCAG ATCGTCGAGC TGTACGGGGA ACGTAGCGTT CCACCGGAGC CGCGTCGCTA CACCGGCAAG GTGAAGAACG CGCAGGAGGC GCATGAGGCG ATCCGCCCCG CGGGCGACAC CTTCCGTACC CCTGGTGAGC TGCGCAACGA GCTGTCGGTT GAGGAGTACA AGCTTTACGA ACTGATCTGG CGGCGCACCA TCGCCTCCCA GATGACCGAT GCTGTGGGCT CCAGCGTTTC GGTGCGGATT CGTGCCGTCA CGGCCGCCGG TGAGGAGGCG GACTTCGGCG CCACCGGTAA GACCATCACC GACCCGGGTT TCCTCCGTGC CTACGTGGAG TCCAGCGACG ACGAGAACGC CGAGGCGGAG GATGCCGAGC GGCGTCTGCC ACATCTGGTG AAGGACCAGC CGCTGACCGG CGAGCAGCTC GCCGCGCAGG GGCACCACAC CCAGCCGCCC GCCCGTTACA CCGAGGCGTC CCTGGTCAAG GCGCTGGAGG AGCTGGGTAT CGGCCGCCCC TCCACCTATG CGTCGATTAT GCAGACCATC CAGGACCGGG GGTATGTCGC CAAGCGTGGC CAGGCGATGA TCCCGTCGTT CCTGGCCTTC GCCGTGGTCG GGCTGATGGA GCGGCACTAC CCGCGCCTGA TCGACTACGG CTTCACCGCC AGCATGGAGA ACGAGCTGGA CGAGATCGCC GGTGGCGACC ACGCGGCGGT GGACTTCCTG ACCGCGTTCT ACTTCGGCAT CACCAACGGC GCCGGTGACC AGGACATTGC CCGCTCCGGT GGGCTGAAGA AGCTGGTCAC GGAGAATCTG AGTGAGATCG ACGCGCGGAG CGTCAACTCG ATTCCGCTCT TCAGCGACGA CGAGGGGCGG CCAGTCGTCG TCCGGGTGGG CCGCTACGGG CCGTACCTAC AGCGGGAGTT GCCCGGCGAG GCGGCCGCCG CCGACGGTGA GGAGGGCGGC GGCCAGGGCG ACCGGGCTCC GATTCCGGAG GGGCTGGCCC CGGATGAGCT GACCCCGGAG AAGGTGCACG AGCTGTTCCT CGGTGGCGGT GGCGAGCGCA AGCTCGGGGA GGACCCGGCC ACCGGGGAGC CGATTCTGCT CAAGTCGGGC CGGTTCGGCC CCTACGTGGC CAGTGGGGAG CGTAAGTCGT CCCTGCTGCG CTCCCAGTCG CCGGACGCCC TCACCCTCGA CGAGGCACTG CGGCTGTTGA GCCTGCCTCG GCTGGTCGGT ACTGACCCGG AGGGCAACGA GGTCTTCGCC AACAACGGCC GCTACGGCCC GTACGTCAAG CGGGGCGAGG AGTTCCGGTC GCTGGAGTCC GAAGAGAAGA TGTTCACGGT CACGCTGGAC GAGGCGTTGG CTCTGCTGGC CGCTCCGAAG ACCCGGCAGC GTCGGGCCCC CGCGCCTCCG CTGCGGGAGT TGGGCGCCGA CCCGTTGACC GAGAAGCCGC TGGTCATCAA GGACGGCCGG TTCGGGCCGT ACGTGACCGA CGGGGAGACC AACGCGTCGT TGCGGCGTGG TCAGACGCCG GAGGCACTGA GCCTGGAGCA GGCGTCGGAG ATGCTGGCCG AGAAGCGGGC GAAGGGCCCG GCGCCGCGGA AGAAGGCGGC CAAGAAGGCC GCGCCGGCCA AGAAGGCGAC GCCGGCCAAG AAGGCGACGA AGAAGGCGGC AGCGACGAAG AAGACCGCCG CGGCGACCAA GTCAGCGAAG AAGACCGCGA CGGTGAACAG CACCGCGGCC AAGCAGGCGA AGCCGAAGCG GGCGGGCAGC GCGGCCGAGT GA
|
Protein sequence | MPSNTGTTRL VIVESPAKAK TISGYLGPGY VVEASFGHVR DLPRNAADVP AKYKGEAWAR LGVDVDNGFH ALYVVSSDRR QQISKLTKLA REVDEILLAT DEDREGEAIA WHLVETLKPK VPVKRMVFHE ITKPAIQAAV ANPREIDRDL VDAQEARRIL DRLYGYEVSP VLWKKVMPKL SAGRVQSVAT RIVVERERQR MAFRTAEYWD ILATLAGEQP GEGPRTFNAT LVALNGDRIA TGKDFEPTTG RVRPGAGVVH LDEGGARGLA ARLADRPFTV TRVEEKPYRR RPYAPFITST LQQESARKLR FSSQQTMRTA QRLYENGYIT YMRTDSVNLS ETAIAAARRQ IVELYGERSV PPEPRRYTGK VKNAQEAHEA IRPAGDTFRT PGELRNELSV EEYKLYELIW RRTIASQMTD AVGSSVSVRI RAVTAAGEEA DFGATGKTIT DPGFLRAYVE SSDDENAEAE DAERRLPHLV KDQPLTGEQL AAQGHHTQPP ARYTEASLVK ALEELGIGRP STYASIMQTI QDRGYVAKRG QAMIPSFLAF AVVGLMERHY PRLIDYGFTA SMENELDEIA GGDHAAVDFL TAFYFGITNG AGDQDIARSG GLKKLVTENL SEIDARSVNS IPLFSDDEGR PVVVRVGRYG PYLQRELPGE AAAADGEEGG GQGDRAPIPE GLAPDELTPE KVHELFLGGG GERKLGEDPA TGEPILLKSG RFGPYVASGE RKSSLLRSQS PDALTLDEAL RLLSLPRLVG TDPEGNEVFA NNGRYGPYVK RGEEFRSLES EEKMFTVTLD EALALLAAPK TRQRRAPAPP LRELGADPLT EKPLVIKDGR FGPYVTDGET NASLRRGQTP EALSLEQASE MLAEKRAKGP APRKKAAKKA APAKKATPAK KATKKAAATK KTAAATKSAK KTATVNSTAA KQAKPKRAGS AAE
|
| |