Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0876 |
Symbol | |
ID | 7977879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 939002 |
End bp | 940438 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644797842 |
Product | transposase IS66 |
Protein accession | YP_002949015 |
Protein GI | 239826391 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGACGG TACAACAAGC TGTATTTACA GTTGAGAGCT TAATCGGCAA AGTTCAACAA CAAAAACAGC TCATTCATCA ACTCATTCAA GAAAATGAAC ATTTGCGTCA CGAAAACAAA CAACTACGCA AAGAAAATGA ACAACTGAAG TACCGTGTTC AAGAGCTGGA AGCACGCACG AAAAAAAACA GCTCCAATAG CCATTTGCCC CCATCTTCTG ACCGTTTTGA GAAAAAGCGT TCCTCCCGCG AGCCGTCTGG CAAAAAGCCT GGTGGGCAAG AGGGACATGA GGGGAAGACG CTCCGTCCAC ATCATCGTGT CGTCCACCGT GTGCATACGT GTCAAGGATG TGGAGCTTCT TTGCGTGAAG TCAAACCGTT CAAAGTAGAT ATCCGTCAAG TGTTTGATGT CCCTCCTGTG GCGATCGAGG TGACACAACA TGAACGTGAA GTGAAATCGT GTCCACATTG TCGATGCGTG CAACAAGCCG AATTCCCATC CCATGTCACG AATCATGTGC AATACGGTCC ACGGCTCACG GCGCTCGTTG TTTATTTACA TCATATCCAA TTGATCCCGT ACAAGCGTTT AAGTGATACA ATCGAAGCGT TATATCAACA CTCGATTAGT ACGGGGACTC TTGCCAATAT GGTGAAACGA GGACGCGAAT CGTTGGAATC AAATATGGAC ATCATCGAAG ACGCCTTACT TGAATCCAAC ATCCTGCATG TCGATGAAAC GAGTTTGCGC ATCAATGGGA AACTCGCATG GGTGCATGTC GCGTGTACAT CGAGATATAC ATACTTGGCT CCTCACGCTT CTCGTGGAAA AAAAGCGACC GATGAGATCG GGATTCTTCC CCGATATGAA GGGACGATGA TGCACGATGC GTTCGGTACA TATCCGAAAT ACACACATGC CACCCATGCC CTTTGTCATG CCCACCATTT GCGTGAGTTA AAAGGATTCA TCGAACAAGG GCATACGTGG GCGATGCGCA TGACCACGTT TCTGTTAGCC GCCAAGCAAG CCGTCGAAGC CCATCACGGT GCACTTTCCG AAGAAGAAGC GAGACGGTGG GAACGAGTGT ATGATCGCAT CCTAGAAAGA GCACAACACC GATTAGAAAC GATGACGCCT CTTCCGAAAA AAGCACTCGC TTTTGTTCGA CGCCTTCAAA AACGAAAGGA AGAAGCGCTG CGTTTCTTAC GTGAAGTACA TGTTCCCTTT GATAACAACC AAGCCGAACG CGATCTTCGC ATGGTCAAAG TCAAAGAGAA CATTTCGGGT ACGTTTCGCG AAGAAACATT CGCGCAGTCG TTTTGCATCG CAAGAAGCAT CGTTTCCACA CTGACGAAAC ACGAAAAAAA CGTGTGGGAT TCGTTATGTC TTCTGTTGGC AGGCGAAACG ATCGATCGAG TTCTTTCCGC TACCTAG
|
Protein sequence | MLTVQQAVFT VESLIGKVQQ QKQLIHQLIQ ENEHLRHENK QLRKENEQLK YRVQELEART KKNSSNSHLP PSSDRFEKKR SSREPSGKKP GGQEGHEGKT LRPHHRVVHR VHTCQGCGAS LREVKPFKVD IRQVFDVPPV AIEVTQHERE VKSCPHCRCV QQAEFPSHVT NHVQYGPRLT ALVVYLHHIQ LIPYKRLSDT IEALYQHSIS TGTLANMVKR GRESLESNMD IIEDALLESN ILHVDETSLR INGKLAWVHV ACTSRYTYLA PHASRGKKAT DEIGILPRYE GTMMHDAFGT YPKYTHATHA LCHAHHLREL KGFIEQGHTW AMRMTTFLLA AKQAVEAHHG ALSEEEARRW ERVYDRILER AQHRLETMTP LPKKALAFVR RLQKRKEEAL RFLREVHVPF DNNQAERDLR MVKVKENISG TFREETFAQS FCIARSIVST LTKHEKNVWD SLCLLLAGET IDRVLSAT
|
| |