Gene Information |
Locus tag | GWCH70_2180 |
Symbol | |
ID | 7976984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2241885 |
End bp | 2243072 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644798995 |
Product | transposase IS4 family protein |
Protein accession | YP_002950155 |
Protein GI | 239827531 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000421211 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGAT TAGCACATCA CCAAGGAATC CACAAGTTTT TCTTCACGCT GGGGTTGACG CTGCAGCTTT CCAAACCGGT CATCAAGCAT CTCATTCATA TTGTCGATGC CTTGACCACC AAGGGATTCT CGGGAACATT GACTGATATT CATTACTGGA GCTTTCATCC GAATCATCGA ACGACGCTCA GTCACTTTTT CACGAAAAGC CCTTGGAACG AGGAAAGGCT GCTTGGGAAG CTTCAAGAGT GGATCCTTTC CCAGGTCGAA CGACTGGCCA AACGGAAGAA TCAACCCCTT TTTGTTTCGA TTGATGATAC GATTTGCCAA AAAACAAAGC CTTCGTCACG GGCTGTGCAC GCCATTCAAG GGTGCGACTG GCACTACTCG CATAAAGATC ATCAATCGGT CTGGGGGCAT TCGCTCGTTT GGCTGATGGT GCACACCTTC ACGCAGGCGT TCCCATTTGC GTTCCGCCTG TATGACAAGA AAGCGGGAAA AAGCAAGATC GACCTGGCGA TCGAGATGCT TTCCTCGCTC AAGGTGAAGC GGGCTCAGCC GGTGTATGTG CTCATGGATT CGTGGTATCC GTCCAAAAAG CTCATCGAAG CCTGTCTGAA ACAGGGATTC CATGTCATCG CGATGCTCAA GACGAACCGG ATTCTCTACC CGAAAGGCAT CGCCATCCAA GCCAAGCAGT TTGCCCGCTA TATCGAGTCC AAAGACACCC GCCTCGTCAC GGTGGGGCAG GAGCGTTATC GCGTGTATCG CTATGAGGGG GCCATCCATG GCCTCGATGA CGCGGTGGTG CTGCTGGCTT GGAAGGCGGA TCAGCCGATG GCGCCGGAAC ATCTTCATTG CATCTTGAGC ACCGACCGGG AACTCGGGGA CGAAGACATC TTGCGTTACT ACGCCCAGCG CTGGACGATC GAGTGCTTTT TCCGGCAGGC GAAAGATCAA CTGAAGCTGG ATGGATACCG CGTTCGCCAC ATTCGGGCGG TGAAACGGTA TTGGGCGGTG GTGCTGTTGG CCTGCGTGTA CAGCATCGCC GAATCCCGAC AAAACCTCTC CACCGGGCTG GAGCTTCTTC GGTCGCGGAA AGACCACAGC GTCGTCGAGT TCATTTATGA CGCTGCGAAG CAAGATATTC CCATTGATGT GATCAAAAAA CAGCTCCGTA TCGCGTAA
|
Protein sequence | MNRLAHHQGI HKFFFTLGLT LQLSKPVIKH LIHIVDALTT KGFSGTLTDI HYWSFHPNHR TTLSHFFTKS PWNEERLLGK LQEWILSQVE RLAKRKNQPL FVSIDDTICQ KTKPSSRAVH AIQGCDWHYS HKDHQSVWGH SLVWLMVHTF TQAFPFAFRL YDKKAGKSKI DLAIEMLSSL KVKRAQPVYV LMDSWYPSKK LIEACLKQGF HVIAMLKTNR ILYPKGIAIQ AKQFARYIES KDTRLVTVGQ ERYRVYRYEG AIHGLDDAVV LLAWKADQPM APEHLHCILS TDRELGDEDI LRYYAQRWTI ECFFRQAKDQ LKLDGYRVRH IRAVKRYWAV VLLACVYSIA ESRQNLSTGL ELLRSRKDHS VVEFIYDAAK QDIPIDVIKK QLRIA
|
| |
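The length fields in this record follow from the coordinates by simple arithmetic. The sketch below is a minimal consistency check, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the final codon of the gene sequence (TAA here) is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(2241885, 2243072)
print(length_bp)                  # 1188, matches "Gene Length"
print(protein_length(length_bp))  # 395, matches "Protein Length"
```

The strand (-) does not affect this calculation, since both coordinates are given on the replicon and only their difference is used.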