Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1674 |
Symbol | topA |
ID | 4206445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1869794 |
End bp | 1871896 |
Gene Length | 2103 bp |
Protein Length | 700 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 642566224 |
Product | DNA topoisomerase I |
Protein accession | YP_698989 |
Protein GI | 110801774 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA [COG0551] Zn-finger domain associated with topoisomerase type I |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial [TIGR01057] DNA topoisomerase I, archaeal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.121954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCAAA AATTAGTAAT TGTTGAGTCA CCAGCTAAAG CCAAGACAAT AGAAAAATAT TTAGGTAAAA ATTATGTAGT CGAAGCTTCA ATGGGACATG TAAGAGATTT GCCAAAAAGC CAATTAGGCG TTGATATAGA AAATGACTAT AATCCTAAAT ATATAACAAT AAGAGGAAAA GGTGAACTTT TAAGCAAACT TAGAAAGTTA GCTAAAAAAA GTGATAAGAT ATACCTTGCA ACTGACCCTG ATAGAGAAGG GGAAGCTATA TCTTGGCACT TAGCTAATGT GCTTAAGATA GATGAAAACG AGAATTGTAG AATAGAATTT AATGAAATAA CAAAGGATGC AGTTAAAAAT TCAATAAAGC ATCCAAGAAA AATAAATTGT AATTTAGTAG ATGCTCAGCA AGCAAGAAGA GTGTTAGACA GATTGGTTGG ATATGAAATA AGTCCACTCC TATGGAGGAA TGTAAAATGG GGATTGAGTG CTGGAAGGGT TCAGTCAGCA GCATTAAAAC TTATATGTGA TAGAGAAGAA GAAATAAAGA AGTTCAATCC AGAGGAGTAT TGGACTGTTG ATGTTAAACT TAAAAAAGGA AAGAAGTCTT TTCCTGTTAA GTTAACAACT AAAAATAAGA AAAAGATAGA AATAAAAAAT AAAGAACAAG CAGATCAAAT AATAGATGAA CTTAAAGAAA ATGAATATAT AGTAAGCAAG ATAAAAAAAG GAACTAAGAA TAAAAATCCT TTAGCTCCAT TTACTACAAG TACTCTTCAA CAAGAGGCAA GTAAAAAGCT TAACTTTATG ACAAAGAAAA CAATGTCAGT AGCTCAACAA CTTTATGAAG GGGTTGAAGT TAAGAAATTT GGAACTGTGG GTTTAATAAC TTATATGAGA ACTGATTCTG TTAGAATCTC AAAGGAAGCT CAAGAAAAAG CTCTTAACTT TATAGATGAA ACATATGGAA AAGAGTATGT TCCAGAGGAG CCAAGAGTAT ATAAAGGAAA GAAAAATATA CAAGATGCCC ATGAAGCTAT AAGACCTACA TATGTTGAAA TAACACCAGA GATTGCTAAG ACAAACTTAA GTAATGATCA ATATAAGTTA TATGCTTTAA TATGGAAAAG ATTCATAGCA AGCCAAATGG CTACATGTAT ATTAAATACT AATAGTTTAG AGATTAAAAA TGGTGATTAT ACATTAAGAG CTAGTGGATC TACCATAAAA TTTGATGGTT TCATGAAAGT TTATGAGTAT ATATCAGGAG AAGAGGAAGA ATCAGTACTT TTACCTGAAT TAGAAGAAAA TGAAGTTTTA AAAGAAGAAT CAATTGAAGG TAAACAACAT TTTACTCAAC CACCTGCTAG ATATTCAGAG GCTGCTTTTG TTAAGCTTTT AGAGGAAAAA GGTATTGGAA GACCAAGTAC TTACGTTCCA ACAATATCTA CATTAATAAG TAGAAAATAT GTAGATAGAG AGAAAAAGAT TCTTATACCA ACAGAATTAG GATTTATAGT AAATGATATA CTTTCAAATT ATTTTAAGCA GATAGTTGAT ACTGACTTTA CTGCAGAAAT GGAAGTTAAG CTTGATAATG TTGAAGCTGG AAAAGAAAGT TGGACTCATA TAGTAGATGA ATTCTTTACT CCATTAAAAG AAGATATAGA AAAGGCAGAA AAAGAGATAT CTAAGGTTAT TATAGAAGAT AAAGTCAGTT ATGTACCTTG TGATAAATGT GGAAGACTTA TGGTTATTAA GCATGGAAGA TTTGGAGATT TCTTAGCTTG CCCTGGATAT CCAGAGTGTC AAAACACAAA ACCTATAGTT GAAGAGGTTG ATGCAAACTG TCCATTATGT GGAGGAAAGA TTTTAGTTAA GAGAAGTAAA AAGGGAAACA GATTCTATGG ATGTAGTAAT TATCCAGAGT GTAATTTCGT AAGTTGGTAT GAGCCAACAA ATGAAAAATG TCCAGAATGT AGTTCATATA TGGTTAAGAG ATACTCTAAG AGTAAAGGTG AATATTTACA GTGTAGCGAT AAAGAATGTA AATATGAAAA AATAATTGAA AAAAATAATG ATGAAAATAA CTCAGAAAAG TAG
|
Protein sequence | MGQKLVIVES PAKAKTIEKY LGKNYVVEAS MGHVRDLPKS QLGVDIENDY NPKYITIRGK GELLSKLRKL AKKSDKIYLA TDPDREGEAI SWHLANVLKI DENENCRIEF NEITKDAVKN SIKHPRKINC NLVDAQQARR VLDRLVGYEI SPLLWRNVKW GLSAGRVQSA ALKLICDREE EIKKFNPEEY WTVDVKLKKG KKSFPVKLTT KNKKKIEIKN KEQADQIIDE LKENEYIVSK IKKGTKNKNP LAPFTTSTLQ QEASKKLNFM TKKTMSVAQQ LYEGVEVKKF GTVGLITYMR TDSVRISKEA QEKALNFIDE TYGKEYVPEE PRVYKGKKNI QDAHEAIRPT YVEITPEIAK TNLSNDQYKL YALIWKRFIA SQMATCILNT NSLEIKNGDY TLRASGSTIK FDGFMKVYEY ISGEEEESVL LPELEENEVL KEESIEGKQH FTQPPARYSE AAFVKLLEEK GIGRPSTYVP TISTLISRKY VDREKKILIP TELGFIVNDI LSNYFKQIVD TDFTAEMEVK LDNVEAGKES WTHIVDEFFT PLKEDIEKAE KEISKVIIED KVSYVPCDKC GRLMVIKHGR FGDFLACPGY PECQNTKPIV EEVDANCPLC GGKILVKRSK KGNRFYGCSN YPECNFVSWY EPTNEKCPEC SSYMVKRYSK SKGEYLQCSD KECKYEKIIE KNNDENNSEK
|
| |