Gene Information |
Locus tag | GWCH70_0278 |
Symbol | |
ID | 7976143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 316866 |
End bp | 318089 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 644797272 |
Product | Erythromycin esterase |
Protein accession | YP_002948472 |
Protein GI | 239825848 |
COG category | [R] General function prediction only |
COG ID | [COG2312] Erythromycin esterase homolog |
TIGRFAM ID | |
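
The length fields above agree with the coordinates if start and end are read as 1-based, inclusive positions. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic, using hypothetical helper names `gene_length` and `protein_length`:

```python
def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(316866, 318089)
print(length)                  # 1224
print(protein_length(length))  # 407
```

Both values match the Gene Length (1224 bp) and Protein Length (407 aa) rows of the record.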
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTACC GTAAAAATAA TATTTCTTCT GAAATCGATT GGTTAAAAGA TCATACATAC AGTATACAAT TGCATGAATT AGATGGTCAT TCCGATTTGT ATTTTTTAAA ACCTTTACTG CAAAATAAAC GGATCGTTTT TTTGGGAGAG AATGGCCACG GCGTGGCAGA GCATAGCCAG ATTAAAACAA AGATGATTAA GTACTTACAT AAAGAATTGG GGTTTCGAGT TTTAGCATTC GAAAGCAGTT TTAGTGACTG CAGCTTATGC TTTTATCAAC AAGATCAACT GGATATAAGG CAATGGATGA AACATTCTTT GTTTAAAGTC TGGCATACGG AGGAAGTCGA GACTCTCTTT CATTATGTGA AGGAAACACA ATTATCCAAT CAGCCTCTCA TCTTAACAGG ATTGGATATT CAGCCTGCCT CAAACGATCA TATTACTGGC AAATTTCTCT CAAAAATGTT TTCAAATATG GATAAGGCCT ATGGTGAGAA AATAGTAGAG TTGGAACAAG AAATGCTTTA CCAATATGTC AATTCCCGTA GTCCGTCCAT TTCCAAAAAG GAACGTAAGC GAAAATGCAA AGAATGGATG GCTCTTTATC AAGAATTCCT CTTTTTATTA GATAAAAACA GTCTTGAGTT ACAAAATAAG TTTGGAAAAG ACGCTTTTTT GCTTGTGAAC CGAATTTTGC AAAACCGAAT TTTCCTTATG CAGATGATCT CTTCCCATTT CATGAAAGCG ATAAAAATCC GAAATAAGGC AATGGCTGAC AATATTTCAT GGCTGGCACA AGAGATGTTT CCTAATGAAA AGATAATAAT ATGGGCGCAC AATGGGCATA TAATGAAGCA ATCTAGGAGA TTACTAGGGT TTCTATCGAC ATTTTCTTAT TTACCTCCCA AAATCAAGGA ATTCTCCTAT ACCATTGGCT TCTTTATGTA TAGTGGACAA GCAGCGGAAA ATAATCGCAG TGTATATGAA GTACAGCAGC CTAGCCAAGA CAGCATCGAG TTTCGAATCA AGCAAGTAGG ACATGAAATT GGCTTTTTAG ATATCTCGCG GCAAAGAAAA GTGCCTCAAA ATCGTTGGCT TTTCAAGCAT ACTTACACGA TGCATGAAGG GAAGCAATTA AATCTTATCA AGCCTATTGA TTGTTACGAT GGGCTTGTCG TTTGTGCGAA AACATCTCCT CCAAAATACA TCAATCTAAA TTAA
|
Protein sequence | MFYRKNNISS EIDWLKDHTY SIQLHELDGH SDLYFLKPLL QNKRIVFLGE NGHGVAEHSQ IKTKMIKYLH KELGFRVLAF ESSFSDCSLC FYQQDQLDIR QWMKHSLFKV WHTEEVETLF HYVKETQLSN QPLILTGLDI QPASNDHITG KFLSKMFSNM DKAYGEKIVE LEQEMLYQYV NSRSPSISKK ERKRKCKEWM ALYQEFLFLL DKNSLELQNK FGKDAFLLVN RILQNRIFLM QMISSHFMKA IKIRNKAMAD NISWLAQEMF PNEKIIIWAH NGHIMKQSRR LLGFLSTFSY LPPKIKEFSY TIGFFMYSGQ AAENNRSVYE VQQPSQDSIE FRIKQVGHEI GFLDISRQRK VPQNRWLFKH TYTMHEGKQL NLIKPIDCYD GLVVCAKTSP PKYINLN
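
The protein sequence can be derived from the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, not the database's own pipeline. It assumes the sequence is on the coding strand in frame 1, strips the spaces that group the bases in blocks of ten, and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, first base varying slowest, in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block-of-ten spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above
print(translate("ATGTTTTACC GTAAAAATAA TATTTCTTCT"))  # MFYRKNNISS
```

The ten residues printed match the first block of the protein sequence row.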