Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2089 |
Symbol | |
ID | 7979252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2163189 |
End bp | 2164469 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644798906 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_002950066 |
Protein GI | 239827442 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000334168 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTGTA CACAAAACTA TAAAATTGAT CAAGTCACCG AACAAACGCT TGTCGTGGGC ATCGATATCG CGAAACGAAC CCACTACGCC TGCTTCGTGG ATGACCGGGG GCGTGTGCTT CGCAAATCGT TCCCGATCTT CCAGTCGAAA GAGGGCTTTC GCCAGCTGTA TGAAGCGATT CAGGAGGCGA TGCAAGCGTT CGGAAAGCCG CAGGTGATCG TCGCCGTGGA GCCGACCGGG CACTACTGGT TGAACCTGGC CTACTTCCTC GAGGAGCACG GGATCCCGTT GGTCATGGTC AACCCGGCGC ATGTGTGCCG GTCGAAAGAA CTCGATGACA ACCTGCCGAC GAAACATGAC GCCAAAGACG CCCTAGTCAT CGCCAGACTG GCGAAAGACG GACGATTCCT CGTCCCCCGG CTGTTGCACG AGATCGAAGC CGATTTGCGC GTCGGGAGCA CGCTCAAAGA GAAGCTCCGC AAGGAACAGA CGGCGGTGAA AAACGCGATC GTCCGCTGGA CCGACCGATA TTTTCCGGAG TTTTGGACGG TGTTTCGCGA CCTGGGGAAA ACGGCGCTTT CGGTGCTGGA GTGGACGCCG CTTCCAGCCG ATATGGCCGG CCGGGCGGTG GAGGAGCTTC TTGAGGTGTA CCGGCAAAGC GAAGGGCTGA AATGCCCGCA GAAGGCCAAA ATTCAGGCGT TGATCGACGC CGCGAAGGAC TCGATTGGGG TGACGGAAGG GACGACGATG GCCCGGTTTG AGATCGCCGC GCTCGTCCGC CGATACCGCC AATTGGAGGC TGAGGTCGCC GCGTTGGACG CCGAGTTGAA GGCGTTGGTT CAAACGACGA TGGAGTATCA ATGGCTGAAA ACGGTCGACG GGTTGGGAGA CGCCACGATC ATCGATCTTC TGGCGGAGAT CGGCAGCTTC GCCCATTATC GGGACCCGCG TCAATTGGTG AAGTTGGCGG GCCTGACGCT CAAGGAGAAT TCCTCCGGCC AGCGCAAAGG GCAAAAGCAC ATCTCCAAAC GGGGACGGAA ACGGCTGCGA TCGGTGCTGT TTCGGGCGAT GATTCCGCTG ATTCGGCATA ACGAGGCGTT TCGCGAGCTG CATGAGTATT ATACGACCCG ATCCGTCAAT CCGCTGACCG GAAAGCAGTC CATCGTCGCC TTGTGCCGGA AGCTGTTGAA TGTGCTGTTT GCGATTTGTA CGAAGAAACA AGCGTTTGAC GCGGAGCGAA TGAAACAGGA CGTCTTGTCC CAGGTGCAAC GGGCGGCCTA A
|
Protein sequence | MNCTQNYKID QVTEQTLVVG IDIAKRTHYA CFVDDRGRVL RKSFPIFQSK EGFRQLYEAI QEAMQAFGKP QVIVAVEPTG HYWLNLAYFL EEHGIPLVMV NPAHVCRSKE LDDNLPTKHD AKDALVIARL AKDGRFLVPR LLHEIEADLR VGSTLKEKLR KEQTAVKNAI VRWTDRYFPE FWTVFRDLGK TALSVLEWTP LPADMAGRAV EELLEVYRQS EGLKCPQKAK IQALIDAAKD SIGVTEGTTM ARFEIAALVR RYRQLEAEVA ALDAELKALV QTTMEYQWLK TVDGLGDATI IDLLAEIGSF AHYRDPRQLV KLAGLTLKEN SSGQRKGQKH ISKRGRKRLR SVLFRAMIPL IRHNEAFREL HEYYTTRSVN PLTGKQSIVA LCRKLLNVLF AICTKKQAFD AERMKQDVLS QVQRAA
|
| |