Gene Information |
Locus tag | GWCH70_3103 |
Symbol | |
ID | 7979176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3122792 |
End bp | 3124450 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644799890 |
Product | transposase |
Protein accession | YP_002951029 |
Protein GI | 239828405 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5421] Transposase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000596563 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTTC AAGTCAAAAA GGTCTATCGC AATTCTTATT TGAATATAAT AAGTGCCCTA TTCAAGAAAC TGGGTCTGCC TCAATTGATT GACCATCTCG TGCCCGTCGA TCCGCAGTGC CAAACGCGAG TCAGCGATGC CGTTCAGGCC ATCCTCTACA ATGTGTTTGA CGGCCGGCAA GCCCTTGTTC ACTTGGAACA TTGGGCTCAG GAGGTCGATT GTGAGAAACT CATCCGTCCC GATCTCCATC CTTCCTGGTT GAACGACGAT GCGTTGGCCC GTCATCTCGA TCGCCTGTAT GAGGCTGGCA TTCACAACGT CATCAGCACT TGCTTGATTC ATATTTATCG AAAAGAAGGC CTTTCCCTCC GAGCCTTCCA CGCCGATACG ACGGACAAGA CCGTTTACGG CGCGTATGAA TCGGCCTCGT TAGAGGCCTT ACAAATCACA CATGGCTACA ACCGCCATCA TCGTTGGCAA AAACAGATCG GTTTCGGACT GGTCGGCAAC GAGGACGGCA TCCCGTTTTA CGGCGATGTG CACGATGGCA ACCTGCCCGA TAAAACATGG AATCCCGAGG TGCTGTCTCG TGTCCATGAA CAGCTGAAGC AGGCCAAAAT CGAAGACGAA TGGATTTACG TGGCCGATTC CGCCGCGATG ACGAAAGAGA CCCTGGCGCA AACCAAAGCG GCCAACGCCT TTTTGATCAC CAGAGGCCCT TCGTCGCTCC GGATCGTGAA AACCGCGCTG GCCGAAGCGG ATGTTGAGGA CACGACGTGG AGCGATCCCT TTACGTTGGC GGAGAGAAAC GGCGCCACGT ACCGGGTATG GGAAACGGCC TCGACGTATG AAGGCCACCC CGTTCGGCTG ATCGTTGTTG AATCGAGCGC GCTCGACCAG CGAAAAGGAA AGACGCTTGA AAAAGAACGA ACCAAAGAAG CGGAGCTTCT TCGCGAGGAA CAAGCCCGTT GGGAGCGTCA CCCCTTCTCC TGCCGGGAAG ATGCCGAACA AGCCTTGGCG TCCCTCAAGG CGTCCCTTCG CCCCCGGTTT CATCGGGTTG AGGCCGCGGT CGAAGAGATC GTACGCCTGA AAAAACGGCG CGGACGGCCG AAAAAAGGGG CGGAACCCGA GGTGGAGACG CTGTATTTCT TGCACCTTGA CGTCGAATTC GACCAAGACG CGTGGGAACA GGCGAGACGG AAAGCGTCCC GGTTTGTCCT TGTCACGACC GTTCCGAAGG AATGGAAGGG CCAACCCATG GATGCCCAAG AGATCTTGAA GCTGTATAAA GGGCAGATCT CGGTGGAAAT GAACTTCGCT TTTTTGAAAG ATCCGTTTTT CACGGATGAG ATTTACGTCA AAAAACCAGA ACGGGTCGCA GTATTAGGCT ATTTGTTTCT GTTATCCTTG GCTATTTACC GCGTTTTTCA GCGCCGAGTG CGTCAGTTTA TTACTCCAGA ACACCCGTTG AAGGGTCCTG GAGGCCGCAA GCTGACCCGG CCGACGGGAC AGGCGATTTT TCAGCTGTTT CAATATGTGA ACGTCGTCCT GTTCAAGCTG CCGGATGGGC GCATCCAACG CTCACTGGAT CGCTCCCTTA CCCCTGATCA GCGAAGGATT CTGCAGGGAT TGGGCATGGA TGAGAGCATC TACGTGTAA
|
Protein sequence | MNVQVKKVYR NSYLNIISAL FKKLGLPQLI DHLVPVDPQC QTRVSDAVQA ILYNVFDGRQ ALVHLEHWAQ EVDCEKLIRP DLHPSWLNDD ALARHLDRLY EAGIHNVIST CLIHIYRKEG LSLRAFHADT TDKTVYGAYE SASLEALQIT HGYNRHHRWQ KQIGFGLVGN EDGIPFYGDV HDGNLPDKTW NPEVLSRVHE QLKQAKIEDE WIYVADSAAM TKETLAQTKA ANAFLITRGP SSLRIVKTAL AEADVEDTTW SDPFTLAERN GATYRVWETA STYEGHPVRL IVVESSALDQ RKGKTLEKER TKEAELLREE QARWERHPFS CREDAEQALA SLKASLRPRF HRVEAAVEEI VRLKKRRGRP KKGAEPEVET LYFLHLDVEF DQDAWEQARR KASRFVLVTT VPKEWKGQPM DAQEILKLYK GQISVEMNFA FLKDPFFTDE IYVKKPERVA VLGYLFLLSL AIYRVFQRRV RQFITPEHPL KGPGGRKLTR PTGQAIFQLF QYVNVVLFKL PDGRIQRSLD RSLTPDQRRI LQGLGMDESI YV
|
| |
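The coordinate and length fields in this record are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene sequence includes the terminal stop codon (the sequence above ends in TAA); both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the CDS
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_blocks(sequence: str) -> str:
    """Remove the spaces that split the displayed sequence into 10-letter blocks."""
    return "".join(sequence.split())


# Values copied from the record above.
length_bp = gene_length(3122792, 3124450)       # 1659
length_aa = expected_protein_length(length_bp)  # 552
first_blocks = strip_blocks("ATGAACGTTC AAGTCAAAAA")
```

Under these assumptions the computed values match the Gene Length (1659 bp) and Protein Length (552 aa) fields. Because the gene is on the minus strand, the displayed gene sequence is presumably already given in coding orientation, since it begins with ATG and translates to the listed protein start (MNVQVKKVYR).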