Gene Information |
Locus tag | GWCH70_3130 |
Symbol | |
ID | 7976774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 3158102 |
End bp | 3159238 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644799916 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_002951055 |
Protein GI | 239828431 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
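The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends with a stop codon (the record lists "Is gene spliced: No", and the gene sequence ends in TAG). The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3158102
end_bp = 3159238
gene_length_bp = 1137
protein_length_aa = 378


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both checks agree with the record: 3159238 - 3158102 + 1 = 1137 bp, and 1137 / 3 = 379 codons, one of which is the stop codon, leaving 378 residues.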
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.743243 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGTCA TCTATCCTCG CTGCGCAGGA TTGGATGTTC ATACCGAAAC TATTGTCGCT TGTGCCCTAT GGGAAGAAGA GGGGGAGATT CAAAAGGAGA TCCAAACGTT CTCAACGTTC TCGAAAGGGC TTGGCGACCT GCTCGATTGG CTCGAAGACC ATGGGGTCAC CCATGTCGCC ATGGAATCCA CCGGCGTGTA TTGGAAACCG GTCTTTGCCT TCCTCGAGGG CTATGTCGAC TTGACTTTGG CCAATCCGCA GCGGATCAAA AATGTCCCGG GAAGAAAAAC CGATGTCTCT GACGCCGAGT GGATCGCCAA GCTGCTCCGC CATGGACTCA TTGAAAAAAG TTTCGTCCCC CCAGCGCCGA TTCGTGAACT GCAGGATTTT ACCCGCCTGC GCAAAAAGTG GGTCGGACAG TTGAGTTCAG AGAAAAACCG GATTCAAAAA GTGCTGGAGT CTTCCAATGT CAAACTGGGC TCCGTCCTCT CGGATCTCTT CGGCGTTTCC GGACGAAACA TCCTTGCCCG GCTGCTCGAG AAGGGATACG TGGACAAGGA CGAGCTGGAT GAATGCCTGC GCGGCAGGCT CAAAACGAAA AAGCAGGCGG TGTACGATTC GCTGCTGGGC ACCTTGACCG AACATGAGCT CTATCTCCTT CGCCTCTTGT GGAAACACGT GGAGGAGTTG GAGCGGCTCA TCGAAGAAGT CGACCAGCAC ATCGACCGCC TGCTCGAGCC GTATCGGGAG GAAGTGGACT TACTGATGAC CATGCCTGGA ATCAAAAAAC AAACCGCCGC CGTCATCATC GCCGAGATGG GAACCGACAT GAGCGTCTTT GAAACGCCGG AACGGGCGGC TTCATGGACG GGGTTGTCCC CCGGCAACCA TGAAAGCGCC GGAAAGCGAA AGAGCACGCG CACCACCAAA GGCAATCCCC ATCTTCGATC AGCGTTATGC GAGGCGGCAT GGTCAGCAGC TCGATCCAAG ATGCATCCTT TGTCCCGAAA ATTTTGGTCG TTGGCGGCCC GGTGCGGGAA GAAAAAAGCC CTCATCGCCA CGGCTCGACG AATGTTGGTG ATCATCTTTT GCATGATCTC CCGCAAAGAG CCGTTCCGCC AACCACAACT GATTTAG
|
Protein sequence | MDVIYPRCAG LDVHTETIVA CALWEEEGEI QKEIQTFSTF SKGLGDLLDW LEDHGVTHVA MESTGVYWKP VFAFLEGYVD LTLANPQRIK NVPGRKTDVS DAEWIAKLLR HGLIEKSFVP PAPIRELQDF TRLRKKWVGQ LSSEKNRIQK VLESSNVKLG SVLSDLFGVS GRNILARLLE KGYVDKDELD ECLRGRLKTK KQAVYDSLLG TLTEHELYLL RLLWKHVEEL ERLIEEVDQH IDRLLEPYRE EVDLLMTMPG IKKQTAAVII AEMGTDMSVF ETPERAASWT GLSPGNHESA GKRKSTRTTK GNPHLRSALC EAAWSAARSK MHPLSRKFWS LAARCGKKKA LIATARRMLV IIFCMISRKE PFRQPQLI
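The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the tool that produced the record. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are allowed), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only short fragments of the sequence are used here; the full strings from the record can be pasted in with their spaces, since whitespace is stripped first.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence, as printed in the record.
first_blocks = "ATGGATGTCA TCTATCCTCG CTGCGCAGGA"
print(translate(first_blocks))
```

The first three blocks of the gene sequence translate to MDVIYPRCAG, which matches the first block of the protein sequence above.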