Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4408 |
Symbol | |
ID | 5705510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4981377 |
End bp | 4984226 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273827 |
Product | DNA topoisomerase I |
Protein accession | YP_001539176 |
Protein GI | 159039923 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0610462 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGAGCA ATGCTGGAAC CACCCGTCTG GTCATCGTCG AGTCCCCGGC GAAGGCCAAG ACGATCTCGG GCTACCTCGG CCCGGGGTAC GTCGTGGAGG CCAGTTTCGG GCACGTCCGC GACCTGCCGC GCAACGCCGC CGACGTGCCG GCGAAGTACA AGGGCGAGGC GTGGGCCCGG CTCGGCGTCG ATGTCGACAA CGGCTTCCAC GCCCTCTACG TCGTCTCGTC GGACCGCCGG CAGCAGATCA GCAAGTTGAC CAAGCTGGCC CGGGAGGTTG ACGAGATTCT CCTGGCCACG GATGAGGACC GGGAGGGCGA GGCGATCGCC TGGCACCTGG TGGAGACGCT CAAGCCGAAG GTGCCGGTCA AGCGGATGGT CTTCCACGAG ATCACCAAGC CGGCGATCCA GGCCGCGGTG GCGAACCCGC GTGAGATCGA CCGGGACCTG GTCGACGCGC AGGAGGCCCG CCGCATCCTC GACCGGTTGT ACGGCTACGA GGTGTCACCG GTGCTGTGGA AGAAGGTCAT GCCGAAGCTT TCGGCGGGCC GGGTGCAGTC GGTGGCGACC CGCACCGTGG TCGAGCGGGA GCGCCAGCGG ATGGCGTTCC GTACTGCCGA GTACTGGGAC ATCCTGGCCA CGCTGGCCGT CGAACAGGCC GGCGAGGGCC CCCGCACCTT CAACGCCACC CTGGTGGCGC TCAACGGTGA CCGGATCGCC GCCGGCAAGG ACTTCGAGCC GACCACCGGG CGGGTCCGTC CCGGCGCGGG TGTCGTCCAC CTCGACGAGA GCGGGGCCCA CGGGCTGGCC GCCCGGTTGG CAGGTCGGCC ATTCACCGTC ACCCGGGTCG AGGAGAAGCC CTACCGTCGC CGCCCCTACG CGCCGTTCAT CACCTCCACC CTGCAACAGG AGTCGGCCCG CAAACTGCGC TTCTCCTCCC AGCAGACGAT GCGCACCGCG CAGCGACTCT ATGAGAACGG CTACATCACC TATATGCGAA CCGACTCGGT GAACCTGTCG GAGACCGCCA TCGCGGCGGC CCGCCGGCAG ATCGTCGAGT TGTACGGCGA GCGCAGTGTC CCGCCGGAGC CGCGCCGCTA CACCGGCAAG GTGAAGAACG CGCAGGAGGC GCATGAGGCG ATCCGCCCCG CGGGGGACAC CTTCCGCACC CCCGGTGAGC TGCGCAACGA ACTCTCGGTT GAGGAGTACA AGCTCTACGA GTTGATCTGG CGGCGCACCA TCGCCTCCCA GATGACCGAC GCGGTGGGTT TCAGTGTCTC GGTGCGTATC CGCGCCGTCA CCTCCGCCGG TGAGGAGGCC GACTTCGGCG CCACCGGCAA GACCATCACC GACCCGGGTT TTCTCCGTGC CTATGTGGAG TCCAGCGACG ACGAGAACGC CGAGGCGGAG GACGCCGAGC GGCGTCTGCC GACGCTGGTG AAGGACCAGC CGCTTACCGG CGAGCAGCTG GTGGCGCAGG GTCACCACAC CCAGCCGCCC GCCCGCTACA CCGAGGCGTC CCTGGTCAAA GCGCTGGAGG AACTGGGCAT CGGTCGCCCG TCGACGTACG CGTCGATCAT GCAGACGATC CAGGACCGGG GGTACGTCGT CAAGCGTGGC CAGGCGATGA TCCCGTCCTT CCTGGCGTTC GCCGTGGTCG GGCTGATGGA GCGGCACTAC CCACGCCTGA TCGACTATGG CTTCACCGCC AGCATGGAGA ACGAGCTGGA CGAGATCGCC GGTGGCGACC ACGCGGCGGT CGATTTCCTC ACCGCGTTCT ACTTCGGCAC CACCAACGGC GCCGGTGACC AGGACATCGC CCGTTCCGGT GGGCTGAAGA AGCTGGTCAC CGAGAACCTC AGCGACATCG ACGCACGGAG CGTCAACTCG ATACCCCTGT TCACCGACGA ACAGGGACGG GTCGTCGTCG TCCGGGTGGG CCGCTACGGG CCGTACCTGC AGCGGGAACT ACCCGGCGAG GCGGCGACGC CCGCCGACGG TGAGGAGGGC GGCGGCCAGG GCGACCGGGC ACCGATTCCA GACGGTCTGG CACCCGACGA GCTGACCCCG GAGAAGGTGC ACGAGCTGTT CCTCGGCGGT GGGGGCGAGC GCAAGCTCGG CGAGGACCCG GCCACCGGAG AGCCGATCCT GCTCAAGTCG GGCCGGTTCG GCCCGTACGT GGCCAGCGGG GAGCGTAAGT CGTCACTGCT GCGCTCCCAG TCGCCGGACG CGCTCACCCT CGAGGAGGCG TTGCGACTGC TGAGCCTGCC TCGCCTGGTC GGTGTCGACC CGGAGGGCAA CGAGGTCTTC GCCAACAACG GCCGCTACGG ACCGTATGTC AAGCGGGGCG AGGAGTTCCG GTCGCTGGAC TCCGAGGAGA AGATGTTCAC GGTCACGCTG GATGAGGCGC TGGCCCTGCT GGCCGCCCCG AAGACCCGGC AGCGCCGGGC GCCCGCGCCT CCACTGCGGG AACTGGGCGC CGACCCGCTG ACCGAGAAAC CGCTGGTCAT CAAGGATGGG CGGTTCGGGC CGTACGTGAC CGACGGTGAG ACCAACGCGT CATTGCGGCG CGGGCAGACG CCGGAGGCGT TGAGCCTGGA GCAGGCGTCG GAGATGCTCG CCGAGAAGCG GGCGAAGGGT CCGGCGCCGC GGAAACGGGC GGCCAAGAAG GCCACGCCGG CCAAGAGGTC CACGCCGGCG AAGAAGGCGG CGGCGACGAA GACCACCGCG ACGGGCAAAG CCACGAAGAA GACCGCCGCG GCGACCAAGG TGGCGAAGAA GACGGCGACG GCGAAGACCA CAACGGCCAA GCAGGCGAAA CCGAAGCGGG CGAGCAGCAC GACCGAGTGA
|
Protein sequence | MPSNAGTTRL VIVESPAKAK TISGYLGPGY VVEASFGHVR DLPRNAADVP AKYKGEAWAR LGVDVDNGFH ALYVVSSDRR QQISKLTKLA REVDEILLAT DEDREGEAIA WHLVETLKPK VPVKRMVFHE ITKPAIQAAV ANPREIDRDL VDAQEARRIL DRLYGYEVSP VLWKKVMPKL SAGRVQSVAT RTVVERERQR MAFRTAEYWD ILATLAVEQA GEGPRTFNAT LVALNGDRIA AGKDFEPTTG RVRPGAGVVH LDESGAHGLA ARLAGRPFTV TRVEEKPYRR RPYAPFITST LQQESARKLR FSSQQTMRTA QRLYENGYIT YMRTDSVNLS ETAIAAARRQ IVELYGERSV PPEPRRYTGK VKNAQEAHEA IRPAGDTFRT PGELRNELSV EEYKLYELIW RRTIASQMTD AVGFSVSVRI RAVTSAGEEA DFGATGKTIT DPGFLRAYVE SSDDENAEAE DAERRLPTLV KDQPLTGEQL VAQGHHTQPP ARYTEASLVK ALEELGIGRP STYASIMQTI QDRGYVVKRG QAMIPSFLAF AVVGLMERHY PRLIDYGFTA SMENELDEIA GGDHAAVDFL TAFYFGTTNG AGDQDIARSG GLKKLVTENL SDIDARSVNS IPLFTDEQGR VVVVRVGRYG PYLQRELPGE AATPADGEEG GGQGDRAPIP DGLAPDELTP EKVHELFLGG GGERKLGEDP ATGEPILLKS GRFGPYVASG ERKSSLLRSQ SPDALTLEEA LRLLSLPRLV GVDPEGNEVF ANNGRYGPYV KRGEEFRSLD SEEKMFTVTL DEALALLAAP KTRQRRAPAP PLRELGADPL TEKPLVIKDG RFGPYVTDGE TNASLRRGQT PEALSLEQAS EMLAEKRAKG PAPRKRAAKK ATPAKRSTPA KKAAATKTTA TGKATKKTAA ATKVAKKTAT AKTTTAKQAK PKRASSTTE
|
| |