Gene GBAA_3971 details


Gene Information

Locus tag: GBAA_3971
Symbol: topA
ID: 2818374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 3648430
End bp: 3650508
Gene Length: 2079 bp
Protein Length: 692 aa
Translation table: 11
GC content: 39%
IMG OID: 637790685
Product: DNA topoisomerase I
Protein accession: YP_020610
Protein GI: 47529261
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
        [COG0551] Zn-finger domain associated with topoisomerase type I
TIGRFAM ID: [TIGR01051] DNA topoisomerase I, bacterial
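
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3648430, 3650508)
print(length)                     # 2079
print(protein_length_aa(length))  # 692
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.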


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000571341
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGATT ACCTCGTAAT CGTGGAGTCG CCTTCTAAGG CGAAGACCAT TGAGAAATAT 
TTAGGGAAAA AATACAAAGT TGTCGCGTCT ATGGGACACG TTCGCGATTT ACCTAAAAGC
CAAATGGGGA TAGAAGTAAA GAACAACTTC ACCCCGAAGT ATATTACCAT TCGTGGTAAA
GGTCCCGTCT TAAAAGACTT AAAATCAGCG GCAAAAAAAG CAAAGAAAGT CTATCTCGCG
GCCGATCCAG ACCGCGAAGG AGAAGCTATT GCTTGGCATT TAGCCAATAC GTTAAATGTG
GACGTTGAAT CAGATTGCCG AGTTGTGTTT AATGAGATTA CAAAAGATGC AATAAAAGAA
TCATTCAAAC ATCCTCGTGC GATTAACATG GATTTAGTAG ATGCACAACA AGCAAGACGT
ATACTAGATC GTCTTGTTGG TTACAATATT AGTCCTTTAT TATGGAAGAA AGTAAAAAAA
GGATTAAGTG CAGGGCGCGT ACAATCGGTA GCAGTTCGTT TAATTATCGA ACGTGAAAGA
GAAATTCAAA GTTTCGAGCC TGAAGAATTC TGGACAATTA AAACAGAATT TGTGAAAGGG
AAAGACACAT TTGAAGCAAG TTTTTACGGT GTAGATGGTG AAAAAGTTCA ATTAACGAAT
GAAACACAAG TGAATGAAAT AATTGAACAG CTGAAAGATA ATGCGTTCTC AGTTGAAAAT
GTAACGCGAA AAGAGCGAAA ACGTAATCCT GCATTACCAT TTACAACATC TTCCTTGCAA
CAAGAGGCAG CGCGTAAGTT AAACATGCGA GCAAAGAAAA CGATGATGCT TGCGCAGCAA
TTGTATGAAG GGATCGATCT TGGAAAACAA GGAACTGTAG GTCTTATTAC GTATATGAGA
ACTGATTCAA CACGTATCTC AGAAACAGCT CAAACAGAGG CACGTACTTA TATTACTGAG
GCGTATGGTA CGGAATACAT AGGAGCAGAA AAGAAGAAAG AAACGAAGAA GTCGAACGCA
CAAGATGCGC ATGAGGCAAT TCGTCCTACT TCGGTAATGA GAAAGCCAGA GGAATTAAAG
AGTTTCTTAA GTCGTGATCA ACTTCGATTG TATAAATTGA TTTGGGAGCG ATTTGTTGCA
AGTCAAATGG CGTCTGCTAT AATGGATACT GTGACAGCGA GACTCATTAA TAACAATGTT
CAGTTCCGTG CAAGTGGATC GGTTGTAAAG TTCCCAGGAT TTATGAAAGT GTATGTAGAG
TCGAAAGATG ACGGGGCTGA AGAAAAGGAT AAGATGTTGC CACCTTTAGA AGTAGGGGAA
ACCGTATTTT CGAAGGATTT AGAACCGAAG CAGCACTTTA CACAACCTCC GCCGCGCTAT
ACTGAGGCTC GTCTAGTAAG AACGCTTGAA GAGCTTGGAA TTGGAAGACC GTCGACTTAC
GTACCGACAC TTGAAACGAT TCAAAAACGT GGATATGTCG GTTTGGATAA TAAACGCTTC
GTTCCAACTG AACTTGGTGA AATAGTAATT GAACTTATTT TAGAGTTTTT CCCAGAAATT
ATTAACATTG AATTTACTGC TAACATGGAG CAAAGCCTCG ATGAAGTAGA AGAAGGAAAT
GCAAATTGGG TAAAAATTGT TGATGATTTC TACGTAGGGT TTGAACCGCG TTTAGAAAAA
GCGGAAAAAG AAATGCGTGA AGTAGAAATT AAAGATGAAC CAGCTGGGGA AGACTGTGAA
TTATGTAATC ACCCAATGGT CTTTAAAATG GGTAAATACG GGAAATTTAT GGCTTGCTCG
AATTTCCCAG ATTGTCGTAA TACAAAACCG ATTGTGAAAG AAATCGGTGT TACTTGTCCG
AAATGCGATA AAGGTCAAAT TATTGAACGT CGTAGTAATA AAAAGAAACG CCTTTTCTAT
GGATGCGGTA CGTATCCAGA GTGTGACTTT GTATCTTGGG ATAAGCCGAT TGGCCGTAAA
TGTCCGAAGT GTGAAGGTAT GCTAGTAGAG AAGAAGTTGA AAAAAGGCGT ACAAGTACAA
TGTATTTCGT GTGATTATGA AGAAGAACAA CAAATGTGA
 
Protein sequence
MSDYLVIVES PSKAKTIEKY LGKKYKVVAS MGHVRDLPKS QMGIEVKNNF TPKYITIRGK 
GPVLKDLKSA AKKAKKVYLA ADPDREGEAI AWHLANTLNV DVESDCRVVF NEITKDAIKE
SFKHPRAINM DLVDAQQARR ILDRLVGYNI SPLLWKKVKK GLSAGRVQSV AVRLIIERER
EIQSFEPEEF WTIKTEFVKG KDTFEASFYG VDGEKVQLTN ETQVNEIIEQ LKDNAFSVEN
VTRKERKRNP ALPFTTSSLQ QEAARKLNMR AKKTMMLAQQ LYEGIDLGKQ GTVGLITYMR
TDSTRISETA QTEARTYITE AYGTEYIGAE KKKETKKSNA QDAHEAIRPT SVMRKPEELK
SFLSRDQLRL YKLIWERFVA SQMASAIMDT VTARLINNNV QFRASGSVVK FPGFMKVYVE
SKDDGAEEKD KMLPPLEVGE TVFSKDLEPK QHFTQPPPRY TEARLVRTLE ELGIGRPSTY
VPTLETIQKR GYVGLDNKRF VPTELGEIVI ELILEFFPEI INIEFTANME QSLDEVEEGN
ANWVKIVDDF YVGFEPRLEK AEKEMREVEI KDEPAGEDCE LCNHPMVFKM GKYGKFMACS
NFPDCRNTKP IVKEIGVTCP KCDKGQIIER RSNKKKRLFY GCGTYPECDF VSWDKPIGRK
CPKCEGMLVE KKLKKGVQVQ CISCDYEEEQ QM
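
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced the record: it builds the codon table shared by the standard code and translation table 11 (the two differ only in which codons may act as start codons), strips the block spacing used above, and stops at the first in-frame stop codon. The name `translate` is illustrative. Because this gene starts with ATG, no special handling of alternative start codons is needed here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTCAGATT ACCTCGTAAT CGTGGAGTCG"))  # MSDYLVIVES
# Last nine bases of the gene sequence, ending in the TGA stop codon.
print(translate("CAAATGTGA"))  # QM
```

The two outputs match the first ten and last two residues of the protein sequence listed above.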