Gene Information |
Locus tag | BCG9842_B1312 |
Symbol | topA |
ID | 7186585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 3818736 |
End bp | 3820814 |
Gene Length | 2079 bp |
Protein Length | 692 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643551727 |
Product | DNA topoisomerase I |
Protein accession | YP_002447397 |
Protein GI | 218898986 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA [COG0551] Zn-finger domain associated with topoisomerase type I |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
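The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; neither convention is stated in the record. The helper names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp
length_bp = gene_length(3818736, 3820814)
print(length_bp)                  # 2079, matches "Gene Length"
print(protein_length(length_bp))  # 692, matches "Protein Length"
```

Under these assumptions the three fields agree: 2079 bp is 693 codons, and dropping the stop codon leaves 692 residues.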
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 8.76522e-06 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 7.60195e-25 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTCAGATT ACCTCGTGAT CGTGGAGTCG CCTTCTAAGG CGAAGACCAT TGAGAAATAT TTAGGGAAAA AATACAAAGT TGTCGCGTCT ATGGGACATG TTCGCGATTT GCCAAAAAGT CAAATGGGGA TAGAAGTAAA GAACAACTTC ACCCCGAAGT ATATTACCAT TCGTGGTAAA GGTCCCGTCT TAAAAGATTT AAAATCAGCG GCGAAAAAAG CAAAGAAAGT CTATCTCGCG GCCGATCCAG ACCGCGAAGG GGAAGCAATT GCTTGGCATT TAGCGAATAC GTTAAATGTG GACGTTGAAT CAGATTGTCG GGTTGTGTTT AATGAGATTA CGAAAGATGC AATCAAAGAA TCATTTAAAC ATCCTCGTGC AATTAATATG GATTTAGTAG ATGCACAACA AGCAAGACGT ATACTTGATC GTCTTGTTGG TTACAATATT AGTCCTTTAT TATGGAAGAA AGTAAAAAAA GGATTAAGTG CAGGGCGTGT ACAATCTGTA GCAGTTCGTT TAATCATCGA ACGTGAAAAG GAAATTCAAA GCTTTGAACC TGAAGAATTC TGGACAATTA AAACAGAATT TGTAAAAGGA AAAGACACAT TTGAAGCAAG CTTTTACGGT GTAGATGGGG AAAAGGTTCA ATTAACGAAT GAAACGCAAG TGAATGAAAT AATTGAACAG ATGAAAGACA ATGCGTTTTC AGTTGAAAAT GTAACGCGCA AAGAGCGAAA ACGTAATCCT GCATTACCGT TTACAACATC TTCCTTGCAA CAAGAAGCAG CACGTAAGTT AAACATGCGA GCAAAGAAAA CGATGATGCT TGCACAGCAA TTGTATGAAG GGATAGATCT TGGAAAACAA GGAACTGTAG GTCTTATTAC GTATATGAGA ACTGATTCAA CACGTATCTC AGAAACAGCT CAAACAGAGG CTCGTACTTA CATCACTGAA GCGTATGGTA CGGAATACAT AGGAACAGAA AAGAAGAAAG AAACGAAAAA GTCAAATGCA CAAGATGCAC ATGAGGCAAT TCGTCCTACT TCGGTAATGA GAAAGCCAGA GGAACTAAAA AGTTTCTTAA GTCGTGATCA ACTTCGATTA TATAAATTGA TTTGGGAGCG ATTTGTTGCA AGTCAAATGG CGTCTGCTAT AATGGATACT GTGACAGCGA GACTCATTAA TAACAATGTT CAGTTCCGTG CAAGTGGATC GGTTGTAAAG TTCCCAGGAT TTATGAAAGT GTATGTAGAG TCGAAAGATG ATGGTGCTGA AGAAAAGGAT AAGATGTTGC CGCCTTTAGA AGTAGGGGAA ACTGTATTTT CGAAGGATTT AGAACCGAAG CAACATTTTA CACAACCTCC TCCGCGCTAT ACAGAGGCTC GTCTAGTAAG AACACTTGAA GAACTTGGAA TTGGAAGACC GTCGACTTAT GTACCTACAC TTGAAACGAT TCAAAAACGT GGATATGTAG GCTTGGATAA TAAACGCTTC GTTCCGACTG AACTTGGTGA AATAGTAATT GAACTTATTT TAGAATTTTT CCCGGAAATT ATTAACATTG AATTTACTGC CAATATGGAG CAAAGCCTTG ATGAAGTGGA AGAAGGAAAT GCCAATTGGG TGAAAATTGT TGATGATTTC TACGTAGGAT TTGAACCGCG CTTAGAAAAA GCGGAAAAAG AAATGCGTGA AGTGGAAATT AAAGATGAGC CAGCTGGGGA AGACTGTGAA TTATGCGGAC ATCCAATGGT CTTTAAAATG GGTAAATACG GGAAGTTTAT GGCTTGTTCG 
AATTTCCCTG ATTGTCGTAA TACAAAACCG ATTGTGAAAG AAATCGGTGT AACTTGTCCG AAATGTGAAG AAGGACAAAT TATTGAGCGT CGTAGTAACA AAAAGAAACG CCTTTTCTAT GGATGCGGTA CGTATCCAGA ATGTGACTTT GTATCTTGGG ATAAGCCGAT TGGTCGTAAA TGTCCGAAGT GTGAAGGCAT GCTTGTAGAG AAGAAGTTGA AAAAAGGCGT GCAAGTACAA TGTATTTCGT GCGATTATGA AGAAGAACAA CAAATGTGA
|
Protein sequence | MSDYLVIVES PSKAKTIEKY LGKKYKVVAS MGHVRDLPKS QMGIEVKNNF TPKYITIRGK GPVLKDLKSA AKKAKKVYLA ADPDREGEAI AWHLANTLNV DVESDCRVVF NEITKDAIKE SFKHPRAINM DLVDAQQARR ILDRLVGYNI SPLLWKKVKK GLSAGRVQSV AVRLIIEREK EIQSFEPEEF WTIKTEFVKG KDTFEASFYG VDGEKVQLTN ETQVNEIIEQ MKDNAFSVEN VTRKERKRNP ALPFTTSSLQ QEAARKLNMR AKKTMMLAQQ LYEGIDLGKQ GTVGLITYMR TDSTRISETA QTEARTYITE AYGTEYIGTE KKKETKKSNA QDAHEAIRPT SVMRKPEELK SFLSRDQLRL YKLIWERFVA SQMASAIMDT VTARLINNNV QFRASGSVVK FPGFMKVYVE SKDDGAEEKD KMLPPLEVGE TVFSKDLEPK QHFTQPPPRY TEARLVRTLE ELGIGRPSTY VPTLETIQKR GYVGLDNKRF VPTELGEIVI ELILEFFPEI INIEFTANME QSLDEVEEGN ANWVKIVDDF YVGFEPRLEK AEKEMREVEI KDEPAGEDCE LCGHPMVFKM GKYGKFMACS NFPDCRNTKP IVKEIGVTCP KCEEGQIIER RSNKKKRLFY GCGTYPECDF VSWDKPIGRK CPKCEGMLVE KKLKKGVQVQ CISCDYEEEQ QM
|
| |
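The gene sequence is displayed in space-separated blocks of ten bases, so it has to be cleaned before any length or composition check. The following is a minimal sketch of that cleanup, with hypothetical helper names; it is not the database's own method. It strips whitespace, computes the GC fraction, and checks that the sequence ends in one of the three stop codons of translation table 11 (TAA, TAG, TGA).

```python
# Stop codons used by NCBI translation table 11 (bacterial, archaeal, plant plastid)
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the record's gene sequence, used here only as a short demo input
raw = "ATGTCAGATT ACCTCGTGAT"
seq = clean_sequence(raw)
print(len(seq))  # 20
print(f"{gc_fraction(seq):.0%}")
```

Applied to the full gene sequence, `len(clean_sequence(...))` should equal the reported gene length, `ends_with_stop` should return `True`, and the GC fraction can be compared against the rounded "GC content" field.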