Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1770 |
Symbol | |
ID | 7978686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1843267 |
End bp | 1844601 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644798610 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002949782 |
Protein GI | 239827158 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAGATAG TTAATTTACT GATGGTTGGT GTTTTAATTG CTTTAACGGC ATTCTTTGTT GCATCCGAAT TTGCCATTGT TAAAGTACGC AGTACACGTA TTGATCAACT GATAGCTGAA GGAAATCGAA ACGCTAGGGC TGCAAAACGA GTCATTAGCA ATTTAGATGA ATATTTATCG GCTTGCCAGC TTGGTATTAC TATTACCGCA TTGGGGCTTG GTTGGCTTGG TGAACCAACG GTGGAGCATT TGCTTCACCC TGTATTTGAA CGAATGAATT TAAGTGAGTC CGTAGCGTCT TTCCTTTCTT TTGCGATTGC ATTTGCAACG ATTACCTTTT TGCATGTGGT CGTTGGAGAA CTGGCTCCGA AAACATTGGC AATTCAAAAA GCAGAAACCA TTACGCTGTT ATGTTCAAGA CCTTTAATCT TTTTCTACAA AATAATGTAT CCGTTTATTT GGGCATTAAA CGGCTCTGCT CGCGTCATTA CCGGATTATT TGGGTTAAAG CCAGCATCCG AACATGAAGT GGCCCATTCA GAGGAAGAAT TGCGCTTGAT TTTATCTGAA AGTTATAAAA GTGGAGAAAT TAATCAGTCG GAATATAAAT ACGTAAACAA TATTTTTGAG TTTGACGATC GGATTGCCAA AGAAATCATG GTACCGCGTA CAGAAATTGT CGCACTGGAT AAAAACCGTT CCATTGCGGA ATATTTTGAA ACGATAAAGC AAGAAAAATA TACGCGCTAT CCTGTTATAG ACGGAGATAA AGACCATATC ATTGGGATGG TCAACATAAA AGAAATATTA ACGGATTGTA TTCAAAATCC AAAGGCCACT GAAAAGAAGT TAGGTGATTA TATCCGCCCG ATCATACAAG TAATCGAATC CATCCCTATT CATGACTTGC TCGTAAAAAT GCAGCGTGAA CGAGTTCATA TGGCGATCTT AGTTGATGAA TACGGCGGAA CAGCAGGGCT TGTGACGGTT GAGGATATAT TAGAGGAAAT TGTCGGTGAG ATTCAAGATG AGTTTGATAT TGACGAAGTT CCGATGATTC GCAAAGTGAA CGAACACACC ACCATTGTTG ATGGAAAAGT ATTGATTGAA GATGTAAATG ACTTACTTGG AACGGATATC GATGATACGG ATGTTGATAC GATTGGTGGT TGGATTTTAA CAGAAAAATT TGATATTCAA CAAGGCGGCA TTATTTCCTA CGGCGATTAT GAATTTAAGG TATTAAAAAT GGAAGGGCAT CACGTCCAAT TAGTTGAAAT CACAAAACGC GTGAAGGCTC CTTCTCTTAT TGTACAAGAA GCGGCAATAG AATAG
|
Protein sequence | MEIVNLLMVG VLIALTAFFV ASEFAIVKVR STRIDQLIAE GNRNARAAKR VISNLDEYLS ACQLGITITA LGLGWLGEPT VEHLLHPVFE RMNLSESVAS FLSFAIAFAT ITFLHVVVGE LAPKTLAIQK AETITLLCSR PLIFFYKIMY PFIWALNGSA RVITGLFGLK PASEHEVAHS EEELRLILSE SYKSGEINQS EYKYVNNIFE FDDRIAKEIM VPRTEIVALD KNRSIAEYFE TIKQEKYTRY PVIDGDKDHI IGMVNIKEIL TDCIQNPKAT EKKLGDYIRP IIQVIESIPI HDLLVKMQRE RVHMAILVDE YGGTAGLVTV EDILEEIVGE IQDEFDIDEV PMIRKVNEHT TIVDGKVLIE DVNDLLGTDI DDTDVDTIGG WILTEKFDIQ QGGIISYGDY EFKVLKMEGH HVQLVEITKR VKAPSLIVQE AAIE
|
| |