Gene BCG9842_B1312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B1312 
SymboltopA 
ID7186585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp3818736 
End bp3820814 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content39% 
IMG OID643551727 
ProductDNA topoisomerase I 
Protein accessionYP_002447397 
Protein GI218898986 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000876522 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value7.60195e-25 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTCAGATT ACCTCGTGAT CGTGGAGTCG CCTTCTAAGG CGAAGACCAT TGAGAAATAT 
TTAGGGAAAA AATACAAAGT TGTCGCGTCT ATGGGACATG TTCGCGATTT GCCAAAAAGT
CAAATGGGGA TAGAAGTAAA GAACAACTTC ACCCCGAAGT ATATTACCAT TCGTGGTAAA
GGTCCCGTCT TAAAAGATTT AAAATCAGCG GCGAAAAAAG CAAAGAAAGT CTATCTCGCG
GCCGATCCAG ACCGCGAAGG GGAAGCAATT GCTTGGCATT TAGCGAATAC GTTAAATGTG
GACGTTGAAT CAGATTGTCG GGTTGTGTTT AATGAGATTA CGAAAGATGC AATCAAAGAA
TCATTTAAAC ATCCTCGTGC AATTAATATG GATTTAGTAG ATGCACAACA AGCAAGACGT
ATACTTGATC GTCTTGTTGG TTACAATATT AGTCCTTTAT TATGGAAGAA AGTAAAAAAA
GGATTAAGTG CAGGGCGTGT ACAATCTGTA GCAGTTCGTT TAATCATCGA ACGTGAAAAG
GAAATTCAAA GCTTTGAACC TGAAGAATTC TGGACAATTA AAACAGAATT TGTAAAAGGA
AAAGACACAT TTGAAGCAAG CTTTTACGGT GTAGATGGGG AAAAGGTTCA ATTAACGAAT
GAAACGCAAG TGAATGAAAT AATTGAACAG ATGAAAGACA ATGCGTTTTC AGTTGAAAAT
GTAACGCGCA AAGAGCGAAA ACGTAATCCT GCATTACCGT TTACAACATC TTCCTTGCAA
CAAGAAGCAG CACGTAAGTT AAACATGCGA GCAAAGAAAA CGATGATGCT TGCACAGCAA
TTGTATGAAG GGATAGATCT TGGAAAACAA GGAACTGTAG GTCTTATTAC GTATATGAGA
ACTGATTCAA CACGTATCTC AGAAACAGCT CAAACAGAGG CTCGTACTTA CATCACTGAA
GCGTATGGTA CGGAATACAT AGGAACAGAA AAGAAGAAAG AAACGAAAAA GTCAAATGCA
CAAGATGCAC ATGAGGCAAT TCGTCCTACT TCGGTAATGA GAAAGCCAGA GGAACTAAAA
AGTTTCTTAA GTCGTGATCA ACTTCGATTA TATAAATTGA TTTGGGAGCG ATTTGTTGCA
AGTCAAATGG CGTCTGCTAT AATGGATACT GTGACAGCGA GACTCATTAA TAACAATGTT
CAGTTCCGTG CAAGTGGATC GGTTGTAAAG TTCCCAGGAT TTATGAAAGT GTATGTAGAG
TCGAAAGATG ATGGTGCTGA AGAAAAGGAT AAGATGTTGC CGCCTTTAGA AGTAGGGGAA
ACTGTATTTT CGAAGGATTT AGAACCGAAG CAACATTTTA CACAACCTCC TCCGCGCTAT
ACAGAGGCTC GTCTAGTAAG AACACTTGAA GAACTTGGAA TTGGAAGACC GTCGACTTAT
GTACCTACAC TTGAAACGAT TCAAAAACGT GGATATGTAG GCTTGGATAA TAAACGCTTC
GTTCCGACTG AACTTGGTGA AATAGTAATT GAACTTATTT TAGAATTTTT CCCGGAAATT
ATTAACATTG AATTTACTGC CAATATGGAG CAAAGCCTTG ATGAAGTGGA AGAAGGAAAT
GCCAATTGGG TGAAAATTGT TGATGATTTC TACGTAGGAT TTGAACCGCG CTTAGAAAAA
GCGGAAAAAG AAATGCGTGA AGTGGAAATT AAAGATGAGC CAGCTGGGGA AGACTGTGAA
TTATGCGGAC ATCCAATGGT CTTTAAAATG GGTAAATACG GGAAGTTTAT GGCTTGTTCG
AATTTCCCTG ATTGTCGTAA TACAAAACCG ATTGTGAAAG AAATCGGTGT AACTTGTCCG
AAATGTGAAG AAGGACAAAT TATTGAGCGT CGTAGTAACA AAAAGAAACG CCTTTTCTAT
GGATGCGGTA CGTATCCAGA ATGTGACTTT GTATCTTGGG ATAAGCCGAT TGGTCGTAAA
TGTCCGAAGT GTGAAGGCAT GCTTGTAGAG AAGAAGTTGA AAAAAGGCGT GCAAGTACAA
TGTATTTCGT GCGATTATGA AGAAGAACAA CAAATGTGA
 
Protein sequence
MSDYLVIVES PSKAKTIEKY LGKKYKVVAS MGHVRDLPKS QMGIEVKNNF TPKYITIRGK 
GPVLKDLKSA AKKAKKVYLA ADPDREGEAI AWHLANTLNV DVESDCRVVF NEITKDAIKE
SFKHPRAINM DLVDAQQARR ILDRLVGYNI SPLLWKKVKK GLSAGRVQSV AVRLIIEREK
EIQSFEPEEF WTIKTEFVKG KDTFEASFYG VDGEKVQLTN ETQVNEIIEQ MKDNAFSVEN
VTRKERKRNP ALPFTTSSLQ QEAARKLNMR AKKTMMLAQQ LYEGIDLGKQ GTVGLITYMR
TDSTRISETA QTEARTYITE AYGTEYIGTE KKKETKKSNA QDAHEAIRPT SVMRKPEELK
SFLSRDQLRL YKLIWERFVA SQMASAIMDT VTARLINNNV QFRASGSVVK FPGFMKVYVE
SKDDGAEEKD KMLPPLEVGE TVFSKDLEPK QHFTQPPPRY TEARLVRTLE ELGIGRPSTY
VPTLETIQKR GYVGLDNKRF VPTELGEIVI ELILEFFPEI INIEFTANME QSLDEVEEGN
ANWVKIVDDF YVGFEPRLEK AEKEMREVEI KDEPAGEDCE LCGHPMVFKM GKYGKFMACS
NFPDCRNTKP IVKEIGVTCP KCEEGQIIER RSNKKKRLFY GCGTYPECDF VSWDKPIGRK
CPKCEGMLVE KKLKKGVQVQ CISCDYEEEQ QM