Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1542 |
Symbol | |
ID | 7976625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1617429 |
End bp | 1618709 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644798433 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_002949606 |
Protein GI | 239826982 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000639136 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTGTA CACAAAACTA TAAAATTGAT CAAGTTACGG AACAAACGCT TGTCGTGGGC ATCGATATCG CGAAACGAAC CCACTACGCC TGCTTCGTGG ATGACCGGGG GCGCGTGCTT CGCAAGTCGT TCCCGATCTT CCAGTCGAAA GAGGGGTTTC AGCAGCTGTA TAAAGCGATT CAGGAGGCGA TGCAAGCGTT TGGGAAGTCA GAGGTGATCG TCGCGGTGGA GCCGACCGGG CACTACTGGT TGAACCTGGC CTACTTCCTC GAGGAGCACG GGATCCCGTT GGTCATGGTC AACCCGGCGC ATGTGTGCCG GTCGAAAGAA CTCGATGACA ACCTGCCGAC GAAACACGAC GCCAAAGACG CCCTGGTCAT TGCCAGACTG GCAAAAGACG GACGATTCCT CGTCCCCCGG CTGCTGCACG AGATCGAAGC CGATTTGCGC GTGGGAAGCA CGCTCAAAGA GAAGCTCCGC AAGGAACAGA CGGCGGTGAA AAATGCGATC ATCCGCTGGA CCGACCGGTA TTTTCCGGAG TTTTGGACGG TGTTTCGCGA TCTGGGAAAA ACGGCGCTTT CGGTGTTGGA GTGGACGCCG TTTCCGGCCG ATATGGCGGG TCGGACCGCC GAGGAGCTCA TCGAGGTGTA CCGGCAAAGC GAAGGGCTGA AATGCCCGCA GAAGGCCAAA ATTCAGGCGT TGATCAACGC CGCGAAGGAC TCCATTGGGG TGACGGAAGG GACAGCGATG GCCCGGTTTG AGATCGCCGC GCTCGTCCGC CGATACCGCC AATTGGAGGC GGAGATCGCC GCGTTGGACG CCGAGTTGAA GGCATTGGTT CAAACGACGA TGGAGTACCA ATGGTTGAAA ACGGTCGACG GGTTGGGAGA CGCCACGATC ATCGATCTGT TGGCGGAGAT CGGCAGCTTC GCCCATTATC GGGACCCGCG CCAATTGGTG AAGTTGGCGG GCCTGACGCT CAAGGAGAAC TCCTCCGGCC AGCGCAAAGG GCAAAAGCAC ATCTCCAAAC GGGGACGGAA ACGGCTGCGC TCGGTGCTGT TTCGGGCGAT GATTCCGCTG ATCCGGCACA ACGAGGCGTT TCGCGAGCTG CATGAATATT ACACGACCCG ATCCGTCAAC CCGCTGACCG GAAAGCAGTC CATCGTCGCC TTGTGCCGGA AGCTGTTGAA TGTGCTGTTT GCGATTTGTA CGAAGAAACA AGCCTTTGAC GCGGAGCGAA TGAAGCAGGA CGTCTTGTCC CAAGTGCAAC GGGCGGCCTA A
|
Protein sequence | MNCTQNYKID QVTEQTLVVG IDIAKRTHYA CFVDDRGRVL RKSFPIFQSK EGFQQLYKAI QEAMQAFGKS EVIVAVEPTG HYWLNLAYFL EEHGIPLVMV NPAHVCRSKE LDDNLPTKHD AKDALVIARL AKDGRFLVPR LLHEIEADLR VGSTLKEKLR KEQTAVKNAI IRWTDRYFPE FWTVFRDLGK TALSVLEWTP FPADMAGRTA EELIEVYRQS EGLKCPQKAK IQALINAAKD SIGVTEGTAM ARFEIAALVR RYRQLEAEIA ALDAELKALV QTTMEYQWLK TVDGLGDATI IDLLAEIGSF AHYRDPRQLV KLAGLTLKEN SSGQRKGQKH ISKRGRKRLR SVLFRAMIPL IRHNEAFREL HEYYTTRSVN PLTGKQSIVA LCRKLLNVLF AICTKKQAFD AERMKQDVLS QVQRAA
|
| |