Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0303 |
Symbol | |
ID | 7977422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 345884 |
End bp | 347161 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644797296 |
Product | DNA topoisomerase IV subunit A, contains toprim domain protein |
Protein accession | YP_002948496 |
Protein GI | 239825872 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02679] conserved hypothetical protein TIGR02679 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCAT TGCCGGAACC AGCGGCGGAA GCGGCGCAAT TTTTTCGCGG TGAAGCTGGA TTTCGCCGCC TTTTTTGCGA AATGAAAAAG AAGTACGAAT CGCTCGGCCG CATTGGCGGC ACCATTTCCC TTACGAATTT TACGAAAGAA GAAAAAGAAG CGATTGCGGC GTTTTTTGGG AAGGAAATGG CGCGCGTTTC GCTGCAGGCG TTTGCTAAAC AACTGGATAA GACAAGATTT GCGGGTATTT CGCTGGAAGA GCTGTTAGTC CATTATTTTG GGGAACCACT CGTCGCAAAC AAAGAAAGGC GTGAAGCGGA ACAGCAGGAA AAAGCGGCGT TTTTCCAGCG GCTGATCGCC GCTTACCCGA GTGGGGCGGA ATGGCTTACG GCAGCGCTGG AGCATCCGAA CGAATACCGT CTTCTTCACC AAGCGTACGC CCAGCAGCGC GATGAGCTGT ATGCTTCCAT TTGCGCAGTT TGCGAAGCGC TTCGCCAGCT TCCGAAGCAA GGAACGTACG AACGCCTGCC GCTATTCGCG CAAAAAGTAA CCGGAGACCC GCACGCGTTT GATTTGCACA CGTTTCAAGG AAAACTGCTG CTATCCGCAT TGGAATTTTA CTCTCGAAAA AAATATCAGC TTTCGTCTGT AGAAGAAGTC AATGAACTAT TGCAATCGTT CGGCATATTG CGTGAGGACA TCCTCAATTT CGTGACGTGC GCGGGAATTT TGGCAGAAAC GAAAGAGGGC ATTCATCCCG TATTTTCCGC CGCTTGTGAA ACGAATATGG CGCTGAATGT GCCGCTGAGG GAGATAGTGG CACTAACCCG CGCGTATCCA GCGAAAGGAA ATGCCGTCTT TGTCGTAGAA AACGCCGGCG TTTTTTCTGA ACTGCTAGAC GAAGCGATGC CGCTTGTCAG CACAAACGGC CAACTCAATT TGGCGACGCT ATTGCTTCTT GACTTGCTTG TGGAGTCGGG CGCACGTCTT TATTATTCGG GCGACTTTGA TCCAGAAGGT CTGAAAATGG CCGATCGGCT TGCCGGGCGA TACGGCGAGA ATCTTTCCCT TTGGCATTTC ACGTGTGAGG ACTACTTTGC CTCGATGCCG AGTGTGTCGC TTTCTGAGGA GCGGCTGGCT AAGCTCCAAT CGATTTCCTC TCCCAAACTG CAGCCAGTAA AGCAGGAGAT CGAGCGGTGC AAGAAAGCGG GGTATCAGGA GGCGATTTTG CCGGTGCTTC GGAAAGATAT AGAGAAGGAA AAGGCGGAGT TAAGTTAA
|
Protein sequence | MKSLPEPAAE AAQFFRGEAG FRRLFCEMKK KYESLGRIGG TISLTNFTKE EKEAIAAFFG KEMARVSLQA FAKQLDKTRF AGISLEELLV HYFGEPLVAN KERREAEQQE KAAFFQRLIA AYPSGAEWLT AALEHPNEYR LLHQAYAQQR DELYASICAV CEALRQLPKQ GTYERLPLFA QKVTGDPHAF DLHTFQGKLL LSALEFYSRK KYQLSSVEEV NELLQSFGIL REDILNFVTC AGILAETKEG IHPVFSAACE TNMALNVPLR EIVALTRAYP AKGNAVFVVE NAGVFSELLD EAMPLVSTNG QLNLATLLLL DLLVESGARL YYSGDFDPEG LKMADRLAGR YGENLSLWHF TCEDYFASMP SVSLSEERLA KLQSISSPKL QPVKQEIERC KKAGYQEAIL PVLRKDIEKE KAELS
|
| |