Gene Information |
Locus tag | Caci_8444 |
Symbol | |
ID | 8339824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 9772296 |
End bp | 9775130 |
Gene Length | 2835 bp |
Protein Length | 944 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644961531 |
Product | DNA topoisomerase I |
Protein accession | YP_003119108 |
Protein GI | 256397544 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
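The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but the listed lengths are consistent with it) and that the CDS ends in one untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 9772296
end_bp = 9775130
gene_length_bp = 2835
protein_length_aa = 944

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons, the last of which
# is a stop codon that contributes no amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 2835 0 944
```

Under these assumptions the computed span matches the listed Gene Length, and the codon count minus the stop codon matches the listed Protein Length.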
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0264145 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGGCA AGGGTTCAGC GGACTCGCGC GGCTCGACGC GCCGGCTGGT CATCGTCGAG TCGCCGGCCA AGGCGAAGAC GATCAAGGGC TACCTCGGTG CGGGCTACAC CGTCGAGGCC TCCGTCGGCC ACATCCGGGA CCTGCCGGCC GGCGCGGACG AGGTGCCGGA GAAGTACAAG GGCACCTCCA TGGGCCGGCT CGGCGTGGAC GTGGACGGCG ACTTCGAGCC GCTGTACCTG GTCAACGCCG ACAAGCGCAA GCAGGTCGCC AAGCTCAAGG ACCTGCTCAA GGAAGCCGAC GAACTCCTGC TCGCCACCGA CGAGGACCGC GAGGGCGAGG CCATCGCCTG GCACCTGCAG GAGGTGCTCA AGCCCAAGGT CCCCTCCAAG CGCATGGTGT TCCACGAGAT CACCCGCGAG GCGATCCAGC AGGCCGTCAG CAACACCCGC GACATCAACC TGCAGCTGGT CGACGCCCAG GAGACCCGGC GCATCCTGGA CCGGCTCTAC GGCTACGAGG TCTCCCCGGT GCTGTGGAAG AAGGTCCGCA CCGGGCTGTC CGCCGGCCGC GTGCAGTCCG TGGCCACCCG GATGGTGGTG GACCGGGAGC GGGAGCGCAT TGCGTTCACC GCGGCCGAGT ACTGGGACCT GACCGGCTCC TTCGAGACCC TGAAGGCGCC GGCGCCGGGC GATCCGCGCG GGATGACGGC GCGGCTGGCC TCCGTGGACG GCAAGCGCGT CGCCTCCGGC CGCGACTTCG GACCGGACGG GCAGCTCAAG TCCGGCAGCC AGAACGTCGC CCACCTCACC GAGGTCACCG CCAAGGCGCT GGCCGCGGCG CTGCGGGACG CCGACTTCAG CGTGCGCGGC GTGGAGCGCA AGCCCTACCG GCGCTCGCCG TACGCCCCGT TCCGGACCAC CACGCTGCAG CAGGAGGCCA GCCGCAAGCT CGGCATGGAC TCCAAGCGCA CCATGCGGGT CGCGCAGAGC CTGTACGAGA ACGGCTACAT CACTTATATG CGTACTGACA GCATCACGCT GTCGGACACC GCGCTGAACG CCTCGCGGAC CCAGGTGCGC GAGCTGTACG GCGCCGACTA CCTGCCGGAC GTCCCGCGCC GCTACGACTC CAAGGTGAAG AACGCGCAGG AGGCGCACGA GGCGATCCGC CCCTCCGGCG ACACGTTCCG CACCCCGGCG CAGACCGGCC TGAAGGGCGA CGAGTTCCGC CTGTACGAGC TGATCTGGAT GCGCACCGTC GCCTCGCAGA TGAAGGACGC GACCGGGCAC ACCGTGACGG TGAAGGTCGG CGGCGCCGCC TCCGACGGCC GGGACGTGGA GTTCAGCGCC AGCGGCCGCA TCATCTCCTT CCACGGCTTC CTGAAGGCCT ACGTGGAGGG CACCGACGAC CCGGACGCCG CGCTGGACGA CTCCGAGCAG CGGCTGCCGG CCGTGGCCGA GGGCGACGCG CTGACCACCA CCAAGGTCAC CGCGGACGGG CACTCGACCA AGCCGCCGGC GCGCTTCACC GAGGCCTCGC TGATCAAGGA GATGGAAGAG CGCGAGATCG GCCGGCCCTC GACGTACTCC ACGATCCTGG GCACGATCCT GGACCGCGGG TACGCCTTCA AGAAGGGCAC GGCGCTGGTC CCGTCCTACA TCGCCTTCGC GGTGGTCGGG CTGCTCGAGA ACCACTTCGG CGACCTGGTG AACTACGAGT TCACCGCGCG CATGGAGGAC GACCTGGACC GCATCGCCCG CGGCGAGGCG CAGCGCGTCC CGTGGCTGCG GCGCTTCTAC 
TTCGGCCCGA CCGGCGAGGA GCCGGGCGCC GCCCCGGCCG CGCTGAAGAG CGGCGGCGGC GACGGCGCGG TCTTCGACCA CCTCGGCGGC CTGAAGGACC TGGTCACCGA CCTGGGCAAC ATCGACGCCC GGGAGGTGAA CTCCTTCCCG GTGGGCGAGG ACGGCATCAT CCTGCGCGTG GGCCGCTTCG GCCCGTACAT CGAGCGCAAC CTGGAGGACG GAACCCAGCA GCGCGCGAGC GTCCCGGACG ACCTGCCGCC GGACGAGCTG ACCCCGGCCT TCGCCGAGGA GCTGTTCCTG CAGCCCAGCG GCGACCGCGA ACTCGGCAAG GACCCCTCGA CCGGGTTCGA GGTCGTGGCC AAGGCCGGCC GCTTCGGCCC GTACGTCACC GAGATCCTCC CCGAGGGCAC CCCGACCCGC GGCAAGAACG CGGTGAAGGC CCGCACCGGC TCGCTGTTCA AGAACATGGG CCTGGACACC GTGACGCTGG AGGAGGCGCT GCAGCTGCTG TCGCTGCCGC GCGTCGTCGG CGCCGACCCG GAATCCGGCG AGGAGATCAC GGTCCAGAAC GGCCGCTACG GCCCGTACCT GAAGAAGGGC GCGGACTCCC GCTCGATCAC CTCCGAGGAG CAGATCTTCA CGATCACCCT CGAGGAAGCC CTCGAGATCT ACAAGCAGCC CAAGGCCCGC GGCCGCGGCG CCGCCAAGCC GCCGCTGCGC GAGATGGGCC CGGACCCGGT CTCCGGCAAG CCGATCGTGA TCAAGTCGGG CTTCTACGGC GAGTACCTGA CCGACGGCGA GACCAACGTG ACCATCCCCA AGAGCGAGAC GGTCGAGGAC ATCACCCCGG CGAGGGCCTA CGAGCTCCTC GCCGAGAAGC GCGCCAAGGG ACCGGCGAAG AAGACGGCGA AGAAGGCGCC CGCGAAGAAG ACCGCCGCGA AGAAGACGGC GGCTTCGTCG GGGACGAAGA CCGCGAAGGC CACGGCCGCG AAGAAGACCG CCGCGAAGAA GACGGCGAAT TCGGGGACGA AGTAG
|
Protein sequence | MAGKGSADSR GSTRRLVIVE SPAKAKTIKG YLGAGYTVEA SVGHIRDLPA GADEVPEKYK GTSMGRLGVD VDGDFEPLYL VNADKRKQVA KLKDLLKEAD ELLLATDEDR EGEAIAWHLQ EVLKPKVPSK RMVFHEITRE AIQQAVSNTR DINLQLVDAQ ETRRILDRLY GYEVSPVLWK KVRTGLSAGR VQSVATRMVV DRERERIAFT AAEYWDLTGS FETLKAPAPG DPRGMTARLA SVDGKRVASG RDFGPDGQLK SGSQNVAHLT EVTAKALAAA LRDADFSVRG VERKPYRRSP YAPFRTTTLQ QEASRKLGMD SKRTMRVAQS LYENGYITYM RTDSITLSDT ALNASRTQVR ELYGADYLPD VPRRYDSKVK NAQEAHEAIR PSGDTFRTPA QTGLKGDEFR LYELIWMRTV ASQMKDATGH TVTVKVGGAA SDGRDVEFSA SGRIISFHGF LKAYVEGTDD PDAALDDSEQ RLPAVAEGDA LTTTKVTADG HSTKPPARFT EASLIKEMEE REIGRPSTYS TILGTILDRG YAFKKGTALV PSYIAFAVVG LLENHFGDLV NYEFTARMED DLDRIARGEA QRVPWLRRFY FGPTGEEPGA APAALKSGGG DGAVFDHLGG LKDLVTDLGN IDAREVNSFP VGEDGIILRV GRFGPYIERN LEDGTQQRAS VPDDLPPDEL TPAFAEELFL QPSGDRELGK DPSTGFEVVA KAGRFGPYVT EILPEGTPTR GKNAVKARTG SLFKNMGLDT VTLEEALQLL SLPRVVGADP ESGEEITVQN GRYGPYLKKG ADSRSITSEE QIFTITLEEA LEIYKQPKAR GRGAAKPPLR EMGPDPVSGK PIVIKSGFYG EYLTDGETNV TIPKSETVED ITPARAYELL AEKRAKGPAK KTAKKAPAKK TAAKKTAASS GTKTAKATAA KKTAAKKTAN SGTK
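As a rough consistency check between the two sequences above, the first 30 and last 15 nucleotides of the gene sequence can be translated by hand. This is a minimal sketch, not a full translator: `PARTIAL_CODON_TABLE` holds only the codons that occur in these two excerpts, `START_CODONS` is a subset of the initiation codons of translation table 11, and `translate` is a hypothetical helper written for this illustration. An initiation codon such as GTG is read as M when it opens the CDS.

```python
# Only the codons needed for the two excerpts below (not a complete table).
PARTIAL_CODON_TABLE = {
    "GCA": "A", "GGC": "G", "AAG": "K", "GGT": "G", "TCA": "S",
    "GCG": "A", "GAC": "D", "TCG": "S", "CGC": "R",
    "GGG": "G", "ACG": "T",
}
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Subset of the initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, begins_cds):
    """Translate an in-frame excerpt; stop at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if codon in STOP_CODONS:
            break
        if i == 0 and begins_cds and codon in START_CODONS:
            protein.append("M")  # initiation codon is read as Met
        else:
            protein.append(PARTIAL_CODON_TABLE[codon])
    return "".join(protein)


# First three 10-base groups and last two groups of the gene sequence.
first_30 = "GTGGCAGGCA AGGGTTCAGC GGACTCGCGC".replace(" ", "")
last_15 = "TCGGGGACGA AGTAG".replace(" ", "")

print(translate(first_30, begins_cds=True))   # MAGKGSADSR
print(translate(last_15, begins_cds=False))   # SGTK
```

Both results agree with the start (MAGKGSADSR) and end (SGTK) of the listed protein sequence, and the final codon TAG is a stop codon.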