Gene GWCH70_1102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1102 
Symbol 
ID7977590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1153254 
End bp1155329 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content45% 
IMG OID644798055 
ProductDNA topoisomerase I 
Protein accessionYP_002949228 
Protein GI239826604 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000153126 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGACT ATCTTGTCAT CGTGGAATCG CCAGCGAAAG CGAAGACGAT CGAACGATAT 
TTAGGAAAAA AATATAAAGT AAAAGCTTCG ATGGGACATG TTCGCGATCT GCCAAAAAGC
CAAATGGGCG TTGATATAAA TAACGGCTAT GAGCCAAAGT ATATTACGAT TCGCGGTAAA
GGGCCAATCA TTAAAGAATT AAAAACAGCA GCGAAAAAAG CAAAAAAAGT GTTTCTTGCC
GCGGACCCGG ACCGCGAAGG GGAAGCGATT GCTTGGCATT TAGCCAATAT GCTCGACCTT
GATATTCATT CCGACTGCCG CGTTGTATTT AACGAGATTA CGAAGGATGC GGTTAAAGAG
TCATTTCAAC ATCCACGTCC GATCAATATG AATCTTGTTG ACGCGCAGCA AGCGCGCCGA
GTGTTGGACC GGCTGGTTGG ATACAACATT AGCCCGCTTC TTTGGAAAAA GGTGAAGAAA
GGATTGAGCG CGGGGCGTGT TCAATCTGTA GCGCTGCGTT TGATCATCGA CCGCGAAAAA
GAAATTAAAC AATTTCAGCC GGAAGAGTAT TGGACGATTC AAGCCGAATT TGTAAAAGGA
AATGAAACGT TTACTGCTTC TTTTTACGGA GTGGATGGGC AAAAGCTTGA ATTAAAAAAG
GAAGCAGACG TTGCCGCGAT TTTACAACGC ATAAACGGCA ACCACTTTAC GGTGACATCG
GTGGCAAAAA AGGAGCGGAA ACGAAATCCA GTGCCGCCGT TTACAACGTC TTCCTTGCAG
CAAGAAGCAG CGCGCAAGCT TAATTTTCGA ACGAAGAAAA CGATGATGAT CGCCCAGCAG
CTATATGAAG GAATCGATCT TGGCAGTGAA GGAACGGTCG GCTTAATTAC CTATATGCGT
ACAGACTCGA CAAGAGTATC AGAAAGCGCA CGGCAAGAGG CACTATCTTA TATAGAAGCG
ACGTTTGGAA AAGAATTTGT CGCACAAGAA AAGCGAAAAG AAAAGAAAAA TGCCAATGCG
CAAGATGCGC ATGAAGCGAT TCGTCCGACA TCTGCATTTC GCGAGCCGGA AAAGGTAAAG
CCATATTTAA CCCGCGATCA ATTTCGGTTG TATAAGTTAA TTTGGGAACG TTTTATCGCA
AGCCAAATGG CAGCCGCACT GTTAGATACG ATGAGCATTG AACTTGAAAA TGAAGGGGTG
ATCTTTCGGG CAAGCGGCTC GAAAGTAAAA TTTCCTGGTT TTATGAAAGT ATATGTAGAG
GGAACGGATG ACCAAACGGA TGAACAAGAT CGCCTTCTTC CGGATTTGCA GGAAGGGGAA
ACTGTTTTCA GCAAAGATAT TGAACCAAAG CAGCATTTTA CTCAGCCGCC TCCTCGCTAT
ACGGAAGCGC GGCTTGTGAA AACGCTAGAA GAGCTTGGCA TCGGCCGGCC GTCTACGTAC
GCGCCGACGC TTGATACGAT TCAAAAACGA AACTATGTCG TGCTAGAAAA TAAACGTTTT
GTTCCAACAG AACTTGGAGA AATCGTGTTA GAACTAATGT TAGAGTTTTT CCCAGAAATC
ATTGACGTGG AGTTTACAGC GAAAATGGAG AAAAATTTGG ATGAAATCGA GGAAGGAAAA
GTAGAATGGG TGAAAGTGGT CGACGAATTT TACCAGGAAT TTGAAAAGCG GCTGCAAACC
GCGGAAAAGG AAATGAAAGA AGTCGAGATT AAAGACGAGC CGGCGGGAGT CGACTGCGAA
GTGTGCGGAA GCCCAATGGT ATATAAAATG GGGCGATTCG GCAAATTTGT CGCCTGCTCC
AATTTCCCGG AATGCCGCAA TACAAAGCCG ATCGTTAAGG AAATCGGGGT AAAATGTCCG
AAATGCCGCG AAGGAAATAT TGTGGAGCGC AGCAGTAAGA AAAAGCGGAT TTTTTATGGC
TGCGACCGTT TTCCACAATG CGATTTCGTC TCGTGGGATA AACCGCTTGC CCGCCCTTGC
CCGAAATGCG GCGGCTTGCT AGTGGAAAAG AAACTGAAAA AAGGCGTGCA AGTGCAATGT
ACGGCATGTG ATTACGAAGA AGCACCACAA TCTTGA
 
Protein sequence
MSDYLVIVES PAKAKTIERY LGKKYKVKAS MGHVRDLPKS QMGVDINNGY EPKYITIRGK 
GPIIKELKTA AKKAKKVFLA ADPDREGEAI AWHLANMLDL DIHSDCRVVF NEITKDAVKE
SFQHPRPINM NLVDAQQARR VLDRLVGYNI SPLLWKKVKK GLSAGRVQSV ALRLIIDREK
EIKQFQPEEY WTIQAEFVKG NETFTASFYG VDGQKLELKK EADVAAILQR INGNHFTVTS
VAKKERKRNP VPPFTTSSLQ QEAARKLNFR TKKTMMIAQQ LYEGIDLGSE GTVGLITYMR
TDSTRVSESA RQEALSYIEA TFGKEFVAQE KRKEKKNANA QDAHEAIRPT SAFREPEKVK
PYLTRDQFRL YKLIWERFIA SQMAAALLDT MSIELENEGV IFRASGSKVK FPGFMKVYVE
GTDDQTDEQD RLLPDLQEGE TVFSKDIEPK QHFTQPPPRY TEARLVKTLE ELGIGRPSTY
APTLDTIQKR NYVVLENKRF VPTELGEIVL ELMLEFFPEI IDVEFTAKME KNLDEIEEGK
VEWVKVVDEF YQEFEKRLQT AEKEMKEVEI KDEPAGVDCE VCGSPMVYKM GRFGKFVACS
NFPECRNTKP IVKEIGVKCP KCREGNIVER SSKKKRIFYG CDRFPQCDFV SWDKPLARPC
PKCGGLLVEK KLKKGVQVQC TACDYEEAPQ S