Gene Information |
Locus tag | GWCH70_2090 |
Symbol | |
ID | 7977333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 2164827 |
End bp | 2165963 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644798907 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_002950067 |
Protein GI | 239827443 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
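The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag GWCH70_2090.
length = gene_length(2164827, 2165963)
print(length)                  # 1137
print(protein_length(length))  # 378
```

Under these assumptions the computed values agree with the Gene Length (1137 bp) and Protein Length (378 aa) fields of the record.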
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000104804 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGTCA TCTATCCTCG CTGCGCAGGA TTGGATGTTC ATGCCGAAAC CATCGTCGCC TGCGCGCTAT GGGAAGAAGA TGGACACATT CAAAAGGACA TTCAAACATT CTCCACGTTC TCGAAAGGAC TTGGCGACCT GCTCGAGTGG CTCGAAGACC ATGGGGTCAC CCATGTCGCC ATGGAATCCA CCGGCGTGTA TTGGAAACCG GTCTTTGCCT TCCTCGAGGG CTATGTCGAC TTGACTTTGG CCAATCCGCA GCGGATCAAA AATGTCCCGG GAAGAAAAAC CGATGTCTCT GACGCCGAGT GGATCGCCAA GCTGCTCCGT CATGGACTCA TTGAAAAAAG TTTCGTCCCC CCAGCGCCGA TTCGTGAACT GCGGGATTTT ACCCGCCTCC GCAAAAAGTG GGTCGGACAG TTGAGTTCAG AGAAAAACCG GATTCAAAAA GTGCTGGAGT CTTCCAATGT CAAACTGGGC TCCGTCCTCT CGGATCTCTT CGGCGTTTCC GGACGAAACA TCCTTGCCCG GCTGCTCGAG AAGGGATACG TGGACAAGGA CGAGCTGGAT CAATGCCTGC GCGGAAGGCT CAAAAAGAAA AAGCAAGCGG TGTACGATTC GCTGCTCGGC ACCTTGACCG AACACGAGCT CCGTCTCCTT CGCCTCTTGT GGAAACACGT TGAGGAATTG GAGCAGTTAA TCGAAGAAGT CGACCAGCAC ATCGACCGCC TGCTCGAGCC GTATCGCGAG GAAGTCGAAT TGCTGATGAC CATGCCCGGA GTGAAAAAAC AAACCGCCGC CGTCATCATA GCCGAGATGG GAACCGACAT GAGCGTCTTT GAAACGCCGG AACGGGCGGC TTCATGGACT GGATTGTCCC CCGGCAACCA TGAAAGCGCC GGAAAGCGAA AGAGCACGCG CACGACAAAA GGCAATCCCC ATCTCCGATC GGCGTTATGC GAGGCGGCAT GGTCAGCAGC TCAATCCAAG ACACATCCCT TGTCCCGAAA ATTTTGGTCG TTGGCGGCCC GATGCGGGAA GAAAAAAGCC CTCATCGCCA CAGCTCGACG AATGTTGGTG ATCATCTTTT GCATGATCTC CCGCAAAGAG TCGTTCCGCC AACCACAACT TATCTAG
|
Protein sequence | MDVIYPRCAG LDVHAETIVA CALWEEDGHI QKDIQTFSTF SKGLGDLLEW LEDHGVTHVA MESTGVYWKP VFAFLEGYVD LTLANPQRIK NVPGRKTDVS DAEWIAKLLR HGLIEKSFVP PAPIRELRDF TRLRKKWVGQ LSSEKNRIQK VLESSNVKLG SVLSDLFGVS GRNILARLLE KGYVDKDELD QCLRGRLKKK KQAVYDSLLG TLTEHELRLL RLLWKHVEEL EQLIEEVDQH IDRLLEPYRE EVELLMTMPG VKKQTAAVII AEMGTDMSVF ETPERAASWT GLSPGNHESA GKRKSTRTTK GNPHLRSALC EAAWSAAQSK THPLSRKFWS LAARCGKKKA LIATARRMLV IIFCMISRKE SFRQPQLI
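The gene and protein sequences above can be cross-checked with a short sketch like the one below. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code. Table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. The helper names `clean`, `gc_fraction` and `translate` are hypothetical and introduced only for illustration.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon ordering (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces that group the record's sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGGATGTCA TCTATCCTCG CTGCGCAGGA"))  # MDVIYPRCAG
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values to compare against the Protein sequence and GC content fields.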