Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2047 |
Symbol | |
ID | 7977283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 2107288 |
End bp | 2108568 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644798865 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_002950035 |
Protein GI | 239827411 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000186467 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTGTA CACAAAACTA TAAAATTGAT CAAGTAACCG AACAAACGCT TGTCGTGGGC ATTGATATCG CGAAACGAAC CCACTACGCC TGCTTCGTGG ATGACCGGGG GCGCGTGCTT CGCAAATCGT TCCCGATCTT CCAGTCGAAA GAGGGGTTTC AACAGCTGTA TAAAGCGATT CAGGGGGCGA TGCAAGCGTT CGGGAAGTCA GAGGTGATCG TCGCCGTGGA GCCGACCGGG CACTACTGGT TGAACCTGGC CTACTTCCTC GAGGAGCACG GGATCCCGTT GGTCATGGTC AACCCGGCGC ATGTGTGCCG GTCGAAAGAA CTTGATGACA ACCTGCCGAC GAAACACGAC GCCAAAGACG CCCTGGTCAT TGCCAGACTG GCAAAAGACG GACGATTCCT CGTTCCCCGG CTGCTGCACG AGATTGAAGC CGATTTGCGC GTGGGGAGCA CGCTCAAAGA GAAGCTCCGC AAGGAACAGA CGGCGGTGAA AAACGCGATC GTCCGCTGGA CGGATCGGTA TTTTCCAGAG TTTTGGACCG TGTTTCGTGA CTTGGGGAAA ACGGCGCTTT CGGTGTTGGA GTGGACGCCG CTTCCGGCTG ATATGGCCGG CCGGACGGTG GAGGAGCTTC TTGAGGTGTA CCGGCAAAGC GAAGGGATGA AATGCCCGCA GAAGGCCAAA ATTCAGGCGT TGATCAACAC CGCGAAGGAC TCGATTGGGG TGACGGAAGG GACAGCGATG GCCCGGTTTG AGATCGCCGC GCTCGTCCGT CGATACCGCC AATTGGAGGC GGAGATCGCT GCACTGGACG CCGAGTTGAA GGCATTGGTT CAAACGACGA TGGAGTACCA ATGGTTGAAA ACGGTCGACG GGTTGGGAGA CGCCACGATC ATCGATCTGC TGGCGGAGAT CGGCAGCTTC GCCCATTATC GGGACCCGCG CCAATTGGTG AAGTTGGCGG GCCTGACGCT CAAGGAGAAC TCCTCCGGCC AGCGCAAAGG GCAAAAGCAC ATCTCCAAAC GGGGACGGAA ACGGTTGCGC TCGGTGCTGT TTCGGGCGAT GATTCCGCTG ATTCGGCATA ACGAGGCGTT TCGCGAGCTG CATGAGTATT ATACGACCCG ATCCGTCAAT CCGCTGACCG GAAAGCAGTC CATCGTCGCC TTGTGCCGGA AGCTGTTGAA TGTGCTGTTT GCGATTTGTA CGAAGAAACA AGCGTTTGAC GCGGAGCGAA TGAAACAGGA CGTCTTGTCC CAGGTGCAAC GGGCGGCCTA A
|
Protein sequence | MNCTQNYKID QVTEQTLVVG IDIAKRTHYA CFVDDRGRVL RKSFPIFQSK EGFQQLYKAI QGAMQAFGKS EVIVAVEPTG HYWLNLAYFL EEHGIPLVMV NPAHVCRSKE LDDNLPTKHD AKDALVIARL AKDGRFLVPR LLHEIEADLR VGSTLKEKLR KEQTAVKNAI VRWTDRYFPE FWTVFRDLGK TALSVLEWTP LPADMAGRTV EELLEVYRQS EGMKCPQKAK IQALINTAKD SIGVTEGTAM ARFEIAALVR RYRQLEAEIA ALDAELKALV QTTMEYQWLK TVDGLGDATI IDLLAEIGSF AHYRDPRQLV KLAGLTLKEN SSGQRKGQKH ISKRGRKRLR SVLFRAMIPL IRHNEAFREL HEYYTTRSVN PLTGKQSIVA LCRKLLNVLF AICTKKQAFD AERMKQDVLS QVQRAA
|
| |