Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1864 |
Symbol | |
ID | 7976485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1928366 |
End bp | 1929646 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644798698 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_002949868 |
Protein GI | 239827244 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000939746 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTGTA CACAAAACTA TAAAATTGAT CAAGTTACGG AACAAACGCT TGTCGTGGGC ATCGATATCG CGAAACGAAC CCACTACGCC TGCTTCGTGG ATGACCGGGG GCGCGTGCTT CGCAAGTCGT TCCCGATCTT CCAGTCGAAA GAGGGGTTTC AGCAGCTGTA TAAAGCGATT CAGGAGGCGA TGCAAGCGTT TGGGAAGTCA GAGGTGATCG TCGCGGTGGA GCCGACCGGG CACTACTGGT TGAACCTGGC CTACTTCCTC GAGGAGCACG GGATCCCGTT GGTCATGGTC AACCCGGCGC ATGTGTGCCG GTCGAAAGAA CTTGATGACA ACCTGCCGAC GAAACACGAC GCCAAAGACG CCCTGGTCAT TGCCAGACTG GCAAAAGACG GACGATTCCT CGTCCCCCGG CTGCTGCACG AGATAGAAGC CGATTTGCGC GTGGGAAGCA CGCTCAAAGA GAAGCTCCGC AAGGAACAGA CGGCGGTGAA AAACGCGATC GTCCGCTGGA CCGACCGGTA TTTTCCGGAG TTTTGGACGG TGTTTCGCGA TCTGGGAAAA ACGGCGCTTT CGGTGTTGGA GTGGACGCCG TTTCCGGCCG ATATGGCGGG TCGGACCGCC GAGGAGCTCA TCGAGGTGTA CCGGCAAAGC GAAGGGCTGA AATGCCCGCA GAAGGCCAAA ATTCAGGCGT TGATCAACGC CGCGAAGGAC TCCATTGGGG TGACGGAAGG GACGACGATG GCCCGGTTTG AGATCGCCGC GCTCGTCCGC CGATACCGCC AATTGGAGGC GGAGATCGCC GCGTTGGACG CCGAGTTGAA GGCATTGGTT CAAACGACGA TGGAGTATCA ATGGCTGAAA ACGGTCGACG GGTTGGGAGA CGCCACGATT ATCGATCTGT TGGCGGAGAT CGGCAGCTTC GCCCATTATC GGGACCCGCG CCAATTGGTG AAGTTGGCGG GCCTGACGCT CAAGGAGAAC TCCTCCGGCC AGCGCAAAGG GCAAAAGCAC ATCTCCAAAC GGGGACGGAA ACGGCTGCGC TCGGTGCTGT TTCGGGCGAT GATTCCGCTG ATCCGGCACA ACGAGGCGTT TCGCGAGCTG CATGAATATT ACACGACCCG ATCCGTCAAC CCGCTGACCG GAAAGCAATC CATCGTCGCC TTGTGCCGGA AGCTGTTGAA TGTGCTGTTT GCGATTTGTA CGAAGAAACA AGCGTTTGAC GCGGAGCGAA TGAAACAGGA CGTCTTGTCC CAGGTGCAAC GGGCGGCCTA A
|
Protein sequence | MNCTQNYKID QVTEQTLVVG IDIAKRTHYA CFVDDRGRVL RKSFPIFQSK EGFQQLYKAI QEAMQAFGKS EVIVAVEPTG HYWLNLAYFL EEHGIPLVMV NPAHVCRSKE LDDNLPTKHD AKDALVIARL AKDGRFLVPR LLHEIEADLR VGSTLKEKLR KEQTAVKNAI VRWTDRYFPE FWTVFRDLGK TALSVLEWTP FPADMAGRTA EELIEVYRQS EGLKCPQKAK IQALINAAKD SIGVTEGTTM ARFEIAALVR RYRQLEAEIA ALDAELKALV QTTMEYQWLK TVDGLGDATI IDLLAEIGSF AHYRDPRQLV KLAGLTLKEN SSGQRKGQKH ISKRGRKRLR SVLFRAMIPL IRHNEAFREL HEYYTTRSVN PLTGKQSIVA LCRKLLNVLF AICTKKQAFD AERMKQDVLS QVQRAA
|
| |