Gene BCAH820_3845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_3845 
SymboltopA 
ID7191621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp3683190 
End bp3685268 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content39% 
IMG OID643557256 
ProductDNA topoisomerase I 
Protein accessionYP_002452795 
Protein GI218904961 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value6.270330000000001e-60 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTCAGATT ACCTCGTAAT CGTGGAGTCG CCTTCTAAGG CGAAGACCAT TGAGAAATAT 
TTAGGGAAAA AATACAAAGT TGTCGCGTCT ATGGGACACG TTCGCGATTT ACCTAAAAGC
CAAATGGGGA TAGAAGTAAA GAACAACTTC ACCCCGAAGT ATATTACCAT TCGTGGTAAA
GGTCCCGTCT TAAAAGACTT AAAATCAGCG GCAAAAAAAG CAAAGAAAGT CTATCTCGCG
GCCGATCCAG ACCGCGAAGG AGAAGCTATT GCTTGGCATT TAGCCAATAC GTTAAATGTG
GACGTTGAAT CAGATTGCCG AGTTGTGTTT AATGAGATTA CAAAAGATGC AATAAAAGAA
TCATTCAAAC ATCCTCGTGC AATTAACATG GATTTAGTAG ATGCACAACA AGCAAGACGT
ATACTAGATC GTCTTGTTGG TTACAATATT AGTCCTTTAT TATGGAAGAA AGTAAAAAAA
GGATTAAGTG CAGGGCGCGT ACAATCGGTA GCAGTTCGTT TAATTATCGA ACGTGAAAGA
GAAATTCAAA GTTTCGAGCC TGAAGAATTC TGGACAATTA AAACAGAATT TGTGAAAGGG
AAAGACACAT TTGAAGCAAG CTTTTACGGT GTAGATGGTG AAAAAGTTCA ATTAACGAAT
GAAACACAAG TGAATGAAAT AATTGAACAG CTGAAAGATA ATGCGTTCTC AGTTGAAAAT
GTAACGCGAA AAGAGCGAAA ACGTAATCCT GCATTACCAT TTACAACATC TTCCTTGCAA
CAAGAGGCAG CGCGTAAGTT AAACATGCGA GCAAAGAAAA CGATGATGCT TGCGCAGCAA
TTGTATGAAG GGATCGATCT TGGAAAACAA GGAACTGTAG GTCTTATTAC GTATATGAGA
ACTGATTCAA CACGTATCTC AGAAACAGCT CAAACAGAGG CACGTACTTA TATTACTGAG
GCGTATGGTA CGGAATACAT AGGAGCAGAA AAGAAGAAAG AAACGAAGAA GTCGAACGCA
CAAGATGCGC ATGAGGCAAT TCGTCCTACT TCGGTAATGA GAAAGCCAGA GGAATTAAAG
AGTTTCTTAA GTCGTGATCA ACTTCGATTG TATAAATTGA TTTGGGAGCG ATTTGTTGCA
AGTCAAATGG CGTCTGCTAT AATGGATACT GTGACAGCGA GACTCATTAA TAACAATGTT
CAGTTCCGTG CAAGTGGATC GGTTGTAAAG TTCCCAGGAT TTATGAAAGT GTATGTAGAG
TCGAAAGATG ACGGGGCGGA AGAAAAGGAT AAGATGTTGC CACCTTTAGA AGTAGGGGAA
ACCGTATTTT CGAAGGATTT AGAACCGAAG CAGCACTTTA CACAACCTCC GCCGCGCTAT
ACTGAGGCTC GTCTAGTAAG AACACTTGAA GAGCTTGGAA TTGGAAGACC GTCGACTTAC
GTACCGACAC TTGAAACGAT TCAAAAACGT GGATATGTCG GTTTGGATAA TAAACGCTTC
GTTCCAACTG AACTTGGTGA AATAGTAATT GAACTTATTT TAGAGTTTTT CCCAGAAATT
ATTAACATTG AATTTACTGC TAACATGGAG CAAAGCCTCG ATGAAGTAGA AGAAGGAAAT
GCAAATTGGG TAAAAATTGT TGATGATTTC TACGTAGGGT TTGAACCGCG TTTAGAAAAA
GCGGAAAAAG AAATGCGTGA AGTAGAAATT AAAGATGAAC CAGCTGGGGA AGACTGTGAA
TTATGTAATC ACCCAATGGT CTTTAAAATG GGTAAATACG GGAAATTTAT GGCTTGCTCG
AATTTCCCAG ATTGTCGTAA TACAAAACCG ATTGTGAAAG AAATCGGTGT TACTTGTCCG
AAATGCGATA AAGGTCAAAT TATTGAACGT CGTAGTAATA AAAAGAAACG TCTGTTCTAT
GGATGCGGTA CGTATCCAGA GTGTGACTTT GTATCTTGGG ATAAGCCGAT TGGCCGTAAA
TGTCCGAAGT GTGAAGGTAT GCTAGTAGAG AAGAAGTTGA AAAAAGGCGT ACAAGTACAA
TGTATTTCGT GTGATTATGA AGAAGAACAA CAAATGTGA
 
Protein sequence
MSDYLVIVES PSKAKTIEKY LGKKYKVVAS MGHVRDLPKS QMGIEVKNNF TPKYITIRGK 
GPVLKDLKSA AKKAKKVYLA ADPDREGEAI AWHLANTLNV DVESDCRVVF NEITKDAIKE
SFKHPRAINM DLVDAQQARR ILDRLVGYNI SPLLWKKVKK GLSAGRVQSV AVRLIIERER
EIQSFEPEEF WTIKTEFVKG KDTFEASFYG VDGEKVQLTN ETQVNEIIEQ LKDNAFSVEN
VTRKERKRNP ALPFTTSSLQ QEAARKLNMR AKKTMMLAQQ LYEGIDLGKQ GTVGLITYMR
TDSTRISETA QTEARTYITE AYGTEYIGAE KKKETKKSNA QDAHEAIRPT SVMRKPEELK
SFLSRDQLRL YKLIWERFVA SQMASAIMDT VTARLINNNV QFRASGSVVK FPGFMKVYVE
SKDDGAEEKD KMLPPLEVGE TVFSKDLEPK QHFTQPPPRY TEARLVRTLE ELGIGRPSTY
VPTLETIQKR GYVGLDNKRF VPTELGEIVI ELILEFFPEI INIEFTANME QSLDEVEEGN
ANWVKIVDDF YVGFEPRLEK AEKEMREVEI KDEPAGEDCE LCNHPMVFKM GKYGKFMACS
NFPDCRNTKP IVKEIGVTCP KCDKGQIIER RSNKKKRLFY GCGTYPECDF VSWDKPIGRK
CPKCEGMLVE KKLKKGVQVQ CISCDYEEEQ QM