Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3127 |
Symbol | |
ID | 7976771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 3155301 |
End bp | 3156521 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644799913 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_002951052 |
Protein GI | 239828428 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGTCT TGTATGAACG CTGTTGCGGA TTGGACGTGC ATAAGCAATC GATTACGGCT TGCGCCCTTA CCCCTGAAGG AAAAGAGATT CGCACGTTTG GTACGCTGAC CGACGATCTC GAGGAGTTGG TGGATTGGCT GAAAGAAAAA AAGGTGACGC ACGTTGCCAT GGAGTCGACG GGCGTATATT GGAAGCCAGT GTATAATCTC CTCGAAGCAG AGCCGATCGA AGTGCTTGTC GTCAATGCCC AACACATCAA AGCGGTTCCC GGGCGAAAGA CCGATGTCAA AGATGCCGAA TGGATCGCGG ACTTGCTTCG CCATGGATTG CTAAAAGGGA GTTACATCCC TCATCGGGCT CAGCGGGAGC TCCGGGAACT GGTCCGTTAT CGGCGCAGTT TGATCGAGGA ACGGGCACGG GAGCTCAACC GCATCCAAAA AGTGCTGGAA GGAGCCAATA TCAAGCTTTC TTCGGTCGTA TCCGACATCA ACGGGATGTC GGCCCGGCTC ATCATTCGCG CCCTTATCGA AGGAAAGGAC GATCCGGCGG CCCTCGCCCA GCTCGCCAAA GGGCGGCTGA AACAAAAAAC GGAAGAGCTC CGGCGCGCAT TGAAAGGAGT GATGGGGCCG CATCAACGCA TGATGCTGGC CGAGCAATGG CGTCATGTGG AGTATTTAGA TGAAGCGATT GCCCGGTTGG ATCGGGAAAT CGAGGAACGA ACGAGCCCTT TTCATGAAGC GCTGGAGCTC ATCGATACGA TCCCGGGAGT GGGGCGGCAA AGCGCGGAAC AAATTGTAGC GGAAATCGGG ACGGACATGA GCCGGTTCCC TACCGCCGCG CACTTGGCCT CATGGGCCGG AATGGCTCCC GGGAATCATG AGAGTGCAGG GAAACGGTTG TCAGGTCGAA CGAGGAAAGG GAACAAGAAG CTGAGGTCGT GCCTCGTGGA ATGCGCCCGT GCCGCCGCCC GAACGAAGAA CACGTACCTA TCGGCCAAGT ATCATCGGAT CGCCAAACGA AGAGGAGCGA ATCGAGCGAG TGTCGCGGTC GGGAGAACGA TTTTAGAAAT GATCTATTAT ATCTTAACTC GAAAGGAACC GTATAGAGAG TTGGGAGCCG ACTACTGGGA TCGGCAGCGA GAAGCGCGCA TCGTGCGTCA AACGGTGAAA CGATTAGAGG GGTTAGGGTA CGAAGTGAAA CTGGAAAAAA CGAGTGCATA G
|
Protein sequence | MRVLYERCCG LDVHKQSITA CALTPEGKEI RTFGTLTDDL EELVDWLKEK KVTHVAMEST GVYWKPVYNL LEAEPIEVLV VNAQHIKAVP GRKTDVKDAE WIADLLRHGL LKGSYIPHRA QRELRELVRY RRSLIEERAR ELNRIQKVLE GANIKLSSVV SDINGMSARL IIRALIEGKD DPAALAQLAK GRLKQKTEEL RRALKGVMGP HQRMMLAEQW RHVEYLDEAI ARLDREIEER TSPFHEALEL IDTIPGVGRQ SAEQIVAEIG TDMSRFPTAA HLASWAGMAP GNHESAGKRL SGRTRKGNKK LRSCLVECAR AAARTKNTYL SAKYHRIAKR RGANRASVAV GRTILEMIYY ILTRKEPYRE LGADYWDRQR EARIVRQTVK RLEGLGYEVK LEKTSA
|
| |