Gene Caci_8444 details

Gene Information

Locus tag: Caci_8444
Symbol:
ID: 8339824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 9772296
End bp: 9775130
Gene Length: 2835 bp
Protein Length: 944 aa
Translation table: 11
GC content: 70%
IMG OID: 644961531
Product: DNA topoisomerase I
Protein accession: YP_003119108
Protein GI: 256397544
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01051] DNA topoisomerase I, bacterial
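
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; the record states neither convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 9772296
end_bp = 9775130
gene_length_bp = 2835
protein_length_aa = 944


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Amino acids encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the assumptions stated above.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, the reported gene length follows from the coordinates, and the protein length follows from the gene length with one codon reserved for the stop signal.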


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0264145
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAGGCA AGGGTTCAGC GGACTCGCGC GGCTCGACGC GCCGGCTGGT CATCGTCGAG 
TCGCCGGCCA AGGCGAAGAC GATCAAGGGC TACCTCGGTG CGGGCTACAC CGTCGAGGCC
TCCGTCGGCC ACATCCGGGA CCTGCCGGCC GGCGCGGACG AGGTGCCGGA GAAGTACAAG
GGCACCTCCA TGGGCCGGCT CGGCGTGGAC GTGGACGGCG ACTTCGAGCC GCTGTACCTG
GTCAACGCCG ACAAGCGCAA GCAGGTCGCC AAGCTCAAGG ACCTGCTCAA GGAAGCCGAC
GAACTCCTGC TCGCCACCGA CGAGGACCGC GAGGGCGAGG CCATCGCCTG GCACCTGCAG
GAGGTGCTCA AGCCCAAGGT CCCCTCCAAG CGCATGGTGT TCCACGAGAT CACCCGCGAG
GCGATCCAGC AGGCCGTCAG CAACACCCGC GACATCAACC TGCAGCTGGT CGACGCCCAG
GAGACCCGGC GCATCCTGGA CCGGCTCTAC GGCTACGAGG TCTCCCCGGT GCTGTGGAAG
AAGGTCCGCA CCGGGCTGTC CGCCGGCCGC GTGCAGTCCG TGGCCACCCG GATGGTGGTG
GACCGGGAGC GGGAGCGCAT TGCGTTCACC GCGGCCGAGT ACTGGGACCT GACCGGCTCC
TTCGAGACCC TGAAGGCGCC GGCGCCGGGC GATCCGCGCG GGATGACGGC GCGGCTGGCC
TCCGTGGACG GCAAGCGCGT CGCCTCCGGC CGCGACTTCG GACCGGACGG GCAGCTCAAG
TCCGGCAGCC AGAACGTCGC CCACCTCACC GAGGTCACCG CCAAGGCGCT GGCCGCGGCG
CTGCGGGACG CCGACTTCAG CGTGCGCGGC GTGGAGCGCA AGCCCTACCG GCGCTCGCCG
TACGCCCCGT TCCGGACCAC CACGCTGCAG CAGGAGGCCA GCCGCAAGCT CGGCATGGAC
TCCAAGCGCA CCATGCGGGT CGCGCAGAGC CTGTACGAGA ACGGCTACAT CACTTATATG
CGTACTGACA GCATCACGCT GTCGGACACC GCGCTGAACG CCTCGCGGAC CCAGGTGCGC
GAGCTGTACG GCGCCGACTA CCTGCCGGAC GTCCCGCGCC GCTACGACTC CAAGGTGAAG
AACGCGCAGG AGGCGCACGA GGCGATCCGC CCCTCCGGCG ACACGTTCCG CACCCCGGCG
CAGACCGGCC TGAAGGGCGA CGAGTTCCGC CTGTACGAGC TGATCTGGAT GCGCACCGTC
GCCTCGCAGA TGAAGGACGC GACCGGGCAC ACCGTGACGG TGAAGGTCGG CGGCGCCGCC
TCCGACGGCC GGGACGTGGA GTTCAGCGCC AGCGGCCGCA TCATCTCCTT CCACGGCTTC
CTGAAGGCCT ACGTGGAGGG CACCGACGAC CCGGACGCCG CGCTGGACGA CTCCGAGCAG
CGGCTGCCGG CCGTGGCCGA GGGCGACGCG CTGACCACCA CCAAGGTCAC CGCGGACGGG
CACTCGACCA AGCCGCCGGC GCGCTTCACC GAGGCCTCGC TGATCAAGGA GATGGAAGAG
CGCGAGATCG GCCGGCCCTC GACGTACTCC ACGATCCTGG GCACGATCCT GGACCGCGGG
TACGCCTTCA AGAAGGGCAC GGCGCTGGTC CCGTCCTACA TCGCCTTCGC GGTGGTCGGG
CTGCTCGAGA ACCACTTCGG CGACCTGGTG AACTACGAGT TCACCGCGCG CATGGAGGAC
GACCTGGACC GCATCGCCCG CGGCGAGGCG CAGCGCGTCC CGTGGCTGCG GCGCTTCTAC
TTCGGCCCGA CCGGCGAGGA GCCGGGCGCC GCCCCGGCCG CGCTGAAGAG CGGCGGCGGC
GACGGCGCGG TCTTCGACCA CCTCGGCGGC CTGAAGGACC TGGTCACCGA CCTGGGCAAC
ATCGACGCCC GGGAGGTGAA CTCCTTCCCG GTGGGCGAGG ACGGCATCAT CCTGCGCGTG
GGCCGCTTCG GCCCGTACAT CGAGCGCAAC CTGGAGGACG GAACCCAGCA GCGCGCGAGC
GTCCCGGACG ACCTGCCGCC GGACGAGCTG ACCCCGGCCT TCGCCGAGGA GCTGTTCCTG
CAGCCCAGCG GCGACCGCGA ACTCGGCAAG GACCCCTCGA CCGGGTTCGA GGTCGTGGCC
AAGGCCGGCC GCTTCGGCCC GTACGTCACC GAGATCCTCC CCGAGGGCAC CCCGACCCGC
GGCAAGAACG CGGTGAAGGC CCGCACCGGC TCGCTGTTCA AGAACATGGG CCTGGACACC
GTGACGCTGG AGGAGGCGCT GCAGCTGCTG TCGCTGCCGC GCGTCGTCGG CGCCGACCCG
GAATCCGGCG AGGAGATCAC GGTCCAGAAC GGCCGCTACG GCCCGTACCT GAAGAAGGGC
GCGGACTCCC GCTCGATCAC CTCCGAGGAG CAGATCTTCA CGATCACCCT CGAGGAAGCC
CTCGAGATCT ACAAGCAGCC CAAGGCCCGC GGCCGCGGCG CCGCCAAGCC GCCGCTGCGC
GAGATGGGCC CGGACCCGGT CTCCGGCAAG CCGATCGTGA TCAAGTCGGG CTTCTACGGC
GAGTACCTGA CCGACGGCGA GACCAACGTG ACCATCCCCA AGAGCGAGAC GGTCGAGGAC
ATCACCCCGG CGAGGGCCTA CGAGCTCCTC GCCGAGAAGC GCGCCAAGGG ACCGGCGAAG
AAGACGGCGA AGAAGGCGCC CGCGAAGAAG ACCGCCGCGA AGAAGACGGC GGCTTCGTCG
GGGACGAAGA CCGCGAAGGC CACGGCCGCG AAGAAGACCG CCGCGAAGAA GACGGCGAAT
TCGGGGACGA AGTAG
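
The GC content field can, in principle, be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown here; `gc_fraction` is a hypothetical helper, not part of any database tooling, and the record does not say how its 70% figure was rounded.

```python
def gc_fraction(sequence):
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Demonstration on the first 10-base block only; paste the full gene
# sequence in place of this string to compare against the reported value.
first_block = "GTGGCAGGCA"
print(f"{gc_fraction(first_block):.0%}")
```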
 
Protein sequence
MAGKGSADSR GSTRRLVIVE SPAKAKTIKG YLGAGYTVEA SVGHIRDLPA GADEVPEKYK 
GTSMGRLGVD VDGDFEPLYL VNADKRKQVA KLKDLLKEAD ELLLATDEDR EGEAIAWHLQ
EVLKPKVPSK RMVFHEITRE AIQQAVSNTR DINLQLVDAQ ETRRILDRLY GYEVSPVLWK
KVRTGLSAGR VQSVATRMVV DRERERIAFT AAEYWDLTGS FETLKAPAPG DPRGMTARLA
SVDGKRVASG RDFGPDGQLK SGSQNVAHLT EVTAKALAAA LRDADFSVRG VERKPYRRSP
YAPFRTTTLQ QEASRKLGMD SKRTMRVAQS LYENGYITYM RTDSITLSDT ALNASRTQVR
ELYGADYLPD VPRRYDSKVK NAQEAHEAIR PSGDTFRTPA QTGLKGDEFR LYELIWMRTV
ASQMKDATGH TVTVKVGGAA SDGRDVEFSA SGRIISFHGF LKAYVEGTDD PDAALDDSEQ
RLPAVAEGDA LTTTKVTADG HSTKPPARFT EASLIKEMEE REIGRPSTYS TILGTILDRG
YAFKKGTALV PSYIAFAVVG LLENHFGDLV NYEFTARMED DLDRIARGEA QRVPWLRRFY
FGPTGEEPGA APAALKSGGG DGAVFDHLGG LKDLVTDLGN IDAREVNSFP VGEDGIILRV
GRFGPYIERN LEDGTQQRAS VPDDLPPDEL TPAFAEELFL QPSGDRELGK DPSTGFEVVA
KAGRFGPYVT EILPEGTPTR GKNAVKARTG SLFKNMGLDT VTLEEALQLL SLPRVVGADP
ESGEEITVQN GRYGPYLKKG ADSRSITSEE QIFTITLEEA LEIYKQPKAR GRGAAKPPLR
EMGPDPVSGK PIVIKSGFYG EYLTDGETNV TIPKSETVED ITPARAYELL AEKRAKGPAK
KTAKKAPAKK TAAKKTAASS GTKTAKATAA KKTAAKKTAN SGTK
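
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the Gene Information fields. The sketch below illustrates that relationship on the first and last stretches of the gene. It assumes the NCBI table 11 codon assignments and that an alternative start codon such as GTG is read as methionine at the first position; `START_CODONS` is a deliberately small, assumed subset, and `translate_cds` is an illustrative name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed subset of initiation codons; table 11 permits others as well.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for position in range(0, len(dna), 3):
        codon = dna[position:position + 3]
        if position == 0 and codon in START_CODONS:
            protein.append("M")  # initiation codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence in this record (60 bases).
first_line = "GTGGCAGGCA AGGGTTCAGC GGACTCGCGC GGCTCGACGC GCCGGCTGGT CATCGTCGAG"
print(translate_cds(first_line))  # MAGKGSADSRGSTRRLVIVE
```

The output matches the first 20 residues of the protein sequence, and applying the same function to the final 15 bases of the gene (`TCGGGGACGA AGTAG`) yields `SGTK`, the last four residues.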