Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TC0012 |
Symbol | topA |
ID | 1245536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlamydia muridarum Nigg |
Kingdom | Bacteria |
Replicon accession | NC_002620 |
Strand | + |
Start bp | 17361 |
End bp | 19958 |
Gene Length | 2598 bp |
Protein Length | 865 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637060801 |
Product | DNA topoisomerase I/SWI domain fusion protein |
Protein accession | NP_296396 |
Protein GI | 15834637 |
COG category | [B] Chromatin structure and dynamics [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA [COG0551] Zn-finger domain associated with topoisomerase type I [COG5531] SWIB-domain-containing proteins implicated in chromatin remodeling |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.25431 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT CCTTAATCAT CGTTGAGTCC CCAGCTAAGA TCAAAACCTT ACGAAAATTG TTAGGAGAAG GGTTTATTTT CGACTCTTCT TTGGGTCATA TTGTAGATCT TCCAGCAAAA GGGTTTGGTA TTGATATCGA AAAAGGATTT ATTCCGGACT ACCAAATTCT AGAGGGGAAG GAAGAGGTTA TTCGGAAAAT TTGTGCTGAA GCCAAGAAAT GCGACGTTGT TTATCTTGCT CCCGATCCAG ACCGAGAGGG GGAGGCTATA GCATGGCATA TTGCCAATCA GTTGCCTAAG AATACCAAAA TTCAGAGAAT TTCTTTTAAT GCGATCACTA AAGGGGCTGT TACAGAAGCG TTGAAGCATC CCCGAGAAAT TGATATGGCG TTGGTTAATG CTCAGCAGGC GAGACGTTTT TTAGATCGCA TAGTGGGGTA TAAAATTTCT CCAATCTTGG GTCGCAAGTT GCAGCGATGG TCGGGAGTAT CTGCAGGAAG AGTGCAGTCT GTAGCTTTGA AATTAGTTGT AGATCGAGAA TACGCCATAG AACAATTTGT TCCTGTAGAG TTTTGGAATA TTCGCGTTAA TCTCCAAGAT CCTAAAAGTC AGAAAACGTT TTGGGCGCAT TTACATTCTG TAAATGGAAA AAAATGGGAA AAAGAAATTC CAGAGGGGAA GTCTTCTGAA GACGTAGTCT TAATTGATTC TAAAGAAAAA GCGGATGATT TAGTTTCCCT ATTAGAATCA GCAACTTATT GCGTAGATCG CGTTGAATCT AAAGAGAAAA AGCGCAACGC ATATCCTCCA TTTATTACTT CTACGTTACA GCAGGAAGCC AGTCGACATT ATCGTTTTTC TTCCTCTAGA ACGATGAATA TCGCTCAAAC TTTATACGAA GGGGTGGATT TAGATAGTCA AGGCGCTGTG GGATTAATTA CTTACATGAG AACCGATTCC GTTCGGACAG ATCCTGAAGC TATCAAACAG GTGCGTAAGT ATATAGAGAA CCATTTTGGA AAAGAATATG TGCCTTCTTC TCCAAATATG TATGCCACGA AGAAAATGGC TCAGGATGCT CATGAGGCTA TACGTCCTAC AGATGTTTCT CTTTCTCCCG AATCCATACG TACAAAGTTA ACAGAAGATC AGTACAAACT TTACTCTTTG ATATGGAAGC GTTTTGTAGC TTCACAAATG ATTGCGGCCA TCTATGATAC TCTTGCCATT CAAATAGCGA CAAATAAAGG CATAGATCTC CGGGCTACAG GTTCTTGCTT AAAGTTTAAG GGGTTTTTAG CTGTCTATGA AGAGAAAAGA GATGAAGAAG GAGATGAGGA TGAGAATATT CAGCTTCCTA AGCTCCATGA GCGTGATGAG CTAAAAAAAG AGGAAATAGA AGCTGAACAA TCGCATACTA AACCTCTACC GCGTTTCACA GAGGCCTCTT TAGTTAAGGA ATTAGAAAAA TCTGGCATAG GAAGACCTTC TACTTATGCT ACGATTATGA ATAAAATTCA GAGTCGGGAA TATACATTAA AAGAAGGCCA ACGATTACGT CCTACCGAGC TTGGAAAGGT AGTTTGTCAA TTTTTAGAGA CGAATTTCCC TCGGATTATG GACATTGGTT TTACGGCTAA AATGGAAGAT GAGCTAGAGC TTATCGCTGA TAATAAAAAG CCTTGGAAAC AGCTGTTACA AGAATTTTGT GAATTATTCC TTCCTTTTGT AGTGACGGCT GAAAAAGAAG CTTTTATTCC TCGTATTGTC ACAGAAATGG ACTGTCCAAG ATGTCATAAA GGGAAACTAG TAAAAATTTG GTCTAAAAAT CGATACTTCT TTGGTTGTTC GGAATACCCT ACCTGTGATT ACAAAACTTC GGAAGAAGAG CTCACTTTCG ATAAAAGCGA GTATGCAGAC GATACTCCTT GGGATGCTCC TTGCGCTCTT TGTGGGGGGC AAATGAAAGT GCGACATGGG AAGTTTGGAA GCTTTCTCGG GTGCGAGAAT TACCCACAAT GTCATTATAT TGTGAATCTT TTTAAAAAAG GAGAAGCTGG CTCAGAGCCT GAAGAGATCG TATCATGTCC TGCAGAAGGC TGTACTGGTC ATCTCGTTAA AAGAAGATCG CGGTTTAATA AAATGTTTTA TTCCTGTTCA GAGTATCCTG CGTGCAGCGT TATTGGGAAC TCTGTAGATG CAGTCATTGA AAAATATACA GGAACTCCTA AAACTCCTTA TGAGAAGAAG ATAAAAGCTA AAAAAGCAAC AGCTTCTAAA AAGGGAAAAG CAACAAAAGG AAAGGCTTCT ACTACGAAAA CAAAGAAAAA ATCTACTGTA ACGACGAAAA ACAGAAAGAC TGCAACCTAT ACACCTTCTT CTGCTTTAGC TGCTGTTATT GGTCCTGACC CAATAGATGG TTTCCCCGAA GCTACGAAAA AAATCTGGGC GTATATCAAA GAGCAGGGAT TGCAATCGCC GAATAACAAA AGAGTCATTG TCCCTGATAG CAAAATGAAG CATGTAATCG GTGATGATCC GATTGATATG TTCGCACTAT CTAAAAAAAT ACAAGCGCAT TTAACGAAGC AAGAGTAA
|
Protein sequence | MKKSLIIVES PAKIKTLRKL LGEGFIFDSS LGHIVDLPAK GFGIDIEKGF IPDYQILEGK EEVIRKICAE AKKCDVVYLA PDPDREGEAI AWHIANQLPK NTKIQRISFN AITKGAVTEA LKHPREIDMA LVNAQQARRF LDRIVGYKIS PILGRKLQRW SGVSAGRVQS VALKLVVDRE YAIEQFVPVE FWNIRVNLQD PKSQKTFWAH LHSVNGKKWE KEIPEGKSSE DVVLIDSKEK ADDLVSLLES ATYCVDRVES KEKKRNAYPP FITSTLQQEA SRHYRFSSSR TMNIAQTLYE GVDLDSQGAV GLITYMRTDS VRTDPEAIKQ VRKYIENHFG KEYVPSSPNM YATKKMAQDA HEAIRPTDVS LSPESIRTKL TEDQYKLYSL IWKRFVASQM IAAIYDTLAI QIATNKGIDL RATGSCLKFK GFLAVYEEKR DEEGDEDENI QLPKLHERDE LKKEEIEAEQ SHTKPLPRFT EASLVKELEK SGIGRPSTYA TIMNKIQSRE YTLKEGQRLR PTELGKVVCQ FLETNFPRIM DIGFTAKMED ELELIADNKK PWKQLLQEFC ELFLPFVVTA EKEAFIPRIV TEMDCPRCHK GKLVKIWSKN RYFFGCSEYP TCDYKTSEEE LTFDKSEYAD DTPWDAPCAL CGGQMKVRHG KFGSFLGCEN YPQCHYIVNL FKKGEAGSEP EEIVSCPAEG CTGHLVKRRS RFNKMFYSCS EYPACSVIGN SVDAVIEKYT GTPKTPYEKK IKAKKATASK KGKATKGKAS TTKTKKKSTV TTKNRKTATY TPSSALAAVI GPDPIDGFPE ATKKIWAYIK EQGLQSPNNK RVIVPDSKMK HVIGDDPIDM FALSKKIQAH LTKQE
|
| |