Gene Information |
Locus tag | GWCH70_0246 |
Symbol | |
ID | 7976111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 271442 |
End bp | 272815 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644797239 |
Product | transposase IS4 family protein |
Protein accession | YP_002948442 |
Protein GI | 239825818 |
COG category | |
COG ID | |
TIGRFAM ID | |
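The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 271442
END_BP = 272815
GENE_LENGTH_BP = 1374
PROTEIN_LENGTH_AA = 457


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Both checks agree with the values reported in the record.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```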
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000788871 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGATT TCCCGATTCG GTTTGTATTG ACAGATGAAG CGATTACCCC AAGTGCTGGG CTTGCTCTCG TTGGCTACTT ACTCCATCGA ACGAAACTGG ATAAACGGGT AAACGCACTT CGGCTTCCAA CGGTTCGTCG AGAAGTGCAC ATTTCCCATA GCGATGTCAT TCGCTCGATG ATTGGCTTGC TTGCCACAGG AAAAACGGAT TTCGATCATA TCGAAGCGTA TCGTCAGGAC GATATCTTTT CGGCATCGAT GGGGATTCAG CACGTGCCTT CCTCTCCAAC CTTGCGACAA CGACTCGATC AGCTCGCTTG TCTTCCGATG ACCGAAACGA TTCTTTGGGA GGAGTCCATA CGTCTGTTGA TTCAACGACA TGCCACTTTG TCCCCTTGTT GGACCAAAGG AAAGACGACA TGGCTTCCCC TTGATATAGA TGGTTCCCCA TTTGACAACT CCGATACGAA AAAAGAAGGA GTCAGTCGAA CGTATAAAGG ATTTGACGGT TTTACACCGT TGTTTGCGTA TGCAGGGAAG GAAGGGTATC TCGTTCATGC CGAATTGCGT CCAGGGAAAC AACATGTGCA AGACAACATG CCCTCGTTTT TAGTCACCGC TATCCGTCGA GCTCGTCAAC TGACTTCATC TCGTCTACTT GTTCGCATGG ATGCAGGAAA CGATGCAGAA GCGAATGTGC ACGTATGCCT AAAGGAAGAC GTGGACTTTG TCATCAAGCG AAACTTACGC CGAGAATCGA AAGCGCTTTG GTTCCAGATC GCTTCGCAAA AGGGCAGACG CGTCGATGAT GGACAAAGCG AAGGAGTACA AACCTATGAG CTATGCCTTC CACAGAAGGC AGCGATCGAT GGAAACACGT ATACGTACGT TCAAGTCACC CAAGTGACGG AACGGACGAT GGAACGCAAT GGACAGCTGA TGCTCGTTCC TGATTATGAA GTGGAAAGCT ATTGGGTGCG GCTCAAAGGA TACGAGCATG TTCGAATGAG CGATGTGCTC GCGTTGTATC ATGACCATGC GACATGCGAA CAGTTTCATA GCGAACTGAA GAGCGACTTA GATTTAGAGC GGCTTCCATC TGGGAAGATG AAAACGAATG CGCTCGTGTT GGTCATGGGA GCCTTCGTTT ACAATCTTCT TCGTCTGATT GGACAAGATC TATTAAGCGA TCCGAGACAT CCGTTGCACC ACAAAGTGAA ACGCCGTCGC ATCAAGACGA TTATTCAGAC GGTGATCACG ATGGCAGGTC GACTCGTCCG CCGATCACGA CAGATCTGGA TGAAACTGAC GCGAAGGAGT GGGTACAGTA TACTCCTACT GAATGTGTAT CAAAAATGGA AAGAGGCAAG ATAA
|
Protein sequence | MKDFPIRFVL TDEAITPSAG LALVGYLLHR TKLDKRVNAL RLPTVRREVH ISHSDVIRSM IGLLATGKTD FDHIEAYRQD DIFSASMGIQ HVPSSPTLRQ RLDQLACLPM TETILWEESI RLLIQRHATL SPCWTKGKTT WLPLDIDGSP FDNSDTKKEG VSRTYKGFDG FTPLFAYAGK EGYLVHAELR PGKQHVQDNM PSFLVTAIRR ARQLTSSRLL VRMDAGNDAE ANVHVCLKED VDFVIKRNLR RESKALWFQI ASQKGRRVDD GQSEGVQTYE LCLPQKAAID GNTYTYVQVT QVTERTMERN GQLMLVPDYE VESYWVRLKG YEHVRMSDVL ALYHDHATCE QFHSELKSDL DLERLPSGKM KTNALVLVMG AFVYNLLRLI GQDLLSDPRH PLHHKVKRRR IKTIIQTVIT MAGRLVRRSR QIWMKLTRRS GYSILLLNVY QKWKEAR
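The sequences above are printed in 10-character blocks separated by spaces, so they need to be cleaned before any computation. As a minimal sketch, assuming the usual definition of GC content as the fraction of G and C bases, the helpers below strip the spacing and compute that fraction; `clean_sequence` and `gc_content` are hypothetical names chosen for illustration. The record reports 49% for this gene, but the source does not state whether that figure is rounded or truncated.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage with the first two blocks of the gene sequence above.
prefix = "ATGAAAGATT TCCCGATTCG"
print(clean_sequence(prefix))   # blocks joined into one string
print(gc_content(prefix))       # GC fraction of these 20 bases only
```

Passing the full Gene sequence field to `gc_content` gives a value that can be compared with the GC content field.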