Gene BCB4264_A3931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCB4264_A3931 
SymboltopA 
ID7099832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus B4264 
KingdomBacteria 
Replicon accessionNC_011725 
Strand
Start bp3853017 
End bp3855095 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content38% 
IMG OID643471454 
ProductDNA topoisomerase I 
Protein accessionYP_002368634 
Protein GI218232018 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000332787 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGATT ACCTCGTAAT CGTGGAGTCG CCTTCTAAGG CGAAGACCAT TGAGAAATAT 
TTAGGGAAAA AATACAAAGT TGTCGCGTCT ATGGGACATG TTCGCGATTT GCCAAAAAGT
CAAATGGGGA TAGAAGTAAA GAACAACTTC ACCCCGAAGT ATATTACCAT TCGTGGTAAA
GGTCCCGTCT TAAAAGATTT AAAATCAGCG GCGAAAAAAG CAAAGAAAGT CTATCTCGCG
GCCGATCCAG ACCGTGAAGG GGAAGCAATT GCTTGGCATT TAGCGAATAC GTTAAATGTG
GACGTTGAAT CAGATTGTCG GGTTGTGTTT AATGAGATTA CGAAAGATGC AATCAAAGAA
TCATTTAAAC ATCCTCGTGC AATTAATATG GATTTAGTGG ATGCACAACA AGCAAGACGT
ATACTTGATC GTCTTGTTGG TTACAATATT AGTCCTTTAT TATGGAAGAA AGTAAAAAAA
GGATTAAGTG CAGGACGTGT ACAATCTGTA GCAGTTCGTT TAATCATCGA GCGTGAAAGG
GAAATTCAAA ACTTTGAACC TGAAGAATTC TGGACAATTA AAACAGAATT TGTAAAAGGA
AAAGACACAT TTGAAGCAAG CTTTTACGGT GTAGATGGTG AAAAAGTTCA ATTAACAAAT
GAAACGCAAG TGAATGAAAT AATTGAACAG ATGAAAGACA ATGCGTTTTC AGTTGAAAAT
GTAACGCGCA AAGAGCGAAA ACGTAATCCT GCATTACCGT TTACAACATC TTCCTTGCAA
CAAGAAGCAG CACGTAAGTT AAACATGCGA GCAAAGAAAA CAATGATGCT TGCACAGCAA
TTGTATGAAG GGATAGATCT TGGAAAACAA GGAACTGTAG GTCTTATTAC GTATATGAGA
ACTGATTCAA CACGTATCTC AGAAACAGCT CAAACAGAGG CTCGTACTTA CATCACTGAA
GCGTATGGTA CGGAATACAT AGGAACAGAA AAGAAGAAAG AAACGAAAAA GTCAAATGCA
CAAGATGCGC ATGAGGCAAT TCGTCCTACT TCAGTAATGA GAAAGCCAGA GGAACTAAAA
AGTTTCTTAG GTCGTGATCA ACTTCGATTA TATAAATTGA TTTGGGAGCG ATTTGTTGCA
AGTCAAATGG CGTCTGCTAT AATGGATACT GTGACAGCGA GACTCATTAA TAACAATGTT
CAGTTCCGTG CAAGTGGATC GGTTGTAAAG TTCCCAGGAT TTATGAAAGT GTATGTAGAG
TCAAAAGATG ATGGTGCTGA AGAAAAGGAT AAGATGTTGC CACCTTTAGA AGTAGGGGAA
ACTGTATTTT CAAAGGATTT AGAACCAAAG CAACATTTTA CACAACCTCC TCCGCGCTAT
ACAGAGGCTC GCCTAGTAAG AACACTTGAA GAACTTGGAA TTGGAAGACC GTCAACATAT
GTACCTACAC TTGAAACAAT TCAAAAGCGT GGGTATGTAG GTTTGGATAA TAAACGCTTC
GTTCCGACTG AACTTGGTGA AATAGTAATT GAACTTATTT TAGAGTTTTT CCCAGAAATT
ATTAACATTG AATTTACTGC CAATATGGAG CAAAGCCTTG ATGAAGTAGA AGAAGGAAAT
GCGAATTGGG TAAAAATTGT TGATGATTTC TACGTAGGTT TTGAACCGCG TTTAGAAAAA
GCGGAAAAAG AAATGCGTGA AGTGGAAATT AAAGATGAAC CAGCTGGGGA AGATTGTGAA
TTATGTAATC ACCCAATGGT CTTTAAAATG GGTAAATACG GGAAATTTAT GGCTTGCTCG
AATTTCCCAG ATTGTCGTAA TACAAAGCCG ATTGTGAAAG AAATCGGTGT TACTTGTCCA
AAGTGTGATA AGGGTCAAAT TATTGAACGC CGTAGTAATA AAAAGAAACG TCTTTTCTAT
GGGTGCGGTA CGTATCCGGA ATGCGACTTT GTATCTTGGG ATAAGCCAAT TGGTCGTAAG
TGTCCGAAGT GCGAAGGCAT GCTTGTAGAG AAAAAGTTGA AAAAAGGCGT GCAAGTACAA
TGTATTTCGT GCGATTATGA AGAAGAACAA CAAATGTGA
 
Protein sequence
MSDYLVIVES PSKAKTIEKY LGKKYKVVAS MGHVRDLPKS QMGIEVKNNF TPKYITIRGK 
GPVLKDLKSA AKKAKKVYLA ADPDREGEAI AWHLANTLNV DVESDCRVVF NEITKDAIKE
SFKHPRAINM DLVDAQQARR ILDRLVGYNI SPLLWKKVKK GLSAGRVQSV AVRLIIERER
EIQNFEPEEF WTIKTEFVKG KDTFEASFYG VDGEKVQLTN ETQVNEIIEQ MKDNAFSVEN
VTRKERKRNP ALPFTTSSLQ QEAARKLNMR AKKTMMLAQQ LYEGIDLGKQ GTVGLITYMR
TDSTRISETA QTEARTYITE AYGTEYIGTE KKKETKKSNA QDAHEAIRPT SVMRKPEELK
SFLGRDQLRL YKLIWERFVA SQMASAIMDT VTARLINNNV QFRASGSVVK FPGFMKVYVE
SKDDGAEEKD KMLPPLEVGE TVFSKDLEPK QHFTQPPPRY TEARLVRTLE ELGIGRPSTY
VPTLETIQKR GYVGLDNKRF VPTELGEIVI ELILEFFPEI INIEFTANME QSLDEVEEGN
ANWVKIVDDF YVGFEPRLEK AEKEMREVEI KDEPAGEDCE LCNHPMVFKM GKYGKFMACS
NFPDCRNTKP IVKEIGVTCP KCDKGQIIER RSNKKKRLFY GCGTYPECDF VSWDKPIGRK
CPKCEGMLVE KKLKKGVQVQ CISCDYEEEQ QM