Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2619 |
Symbol | |
ID | 7978282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2654664 |
End bp | 2655791 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644799420 |
Product | alkanesulfonate monooxygenase |
Protein accession | YP_002950579 |
Protein GI | 239827955 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000106001 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTAT TATGGTTTAT CCCTTCTCAT GGTGATGGAC GGTATCTTGG AACAACAAAA GGTGGACGCA CTGCGGAGTA CAGTTATTTT CGGCAAATTG CCCAAGCTGC TGACCGGCTT GGCTATAAAG GAGTATTAAT TCCAACGGGA AAATCGTGCG AAGATCCGTG GCTGCTTGCG TCGGCATTAG CGGCTGAAAC CGAGCAATTG CATTTTCTTG TTGCCGTACG GCCGGGATTA ATGTCTCCTA CACTCGCGGC GAGAATGGCT TCTACATTAG ACCGAATTTC AGAAGGTCGG TTGCTTATTA ATGTCGTAGC TGGCGGAGAT CCCGTTGAAC TTGCGGGCGA TGGGCTATTT TTGAGCCATG ATGAGCGCTA TGAAGCAACG GATGAATTTT TAACAGTTTG GAGACGATTG TTAAGCGGCG AAGAAGTCAC TTTAAAAGGA AAACATATCC ATGTCAATGG TGCTAAGCTT CTGTTTCCGC CAATACAAAC ACCGTATCCG CCGATTTATT TTGGAGGGTC GTCATCAGCA GGACAGCTTG TGGCAGCGAA ACATGCCGAT GTTTACTTAA CATGGGGGGA ACCGCCGGCA CAGGTGGAAG AAAAAATTTC TCAAGTGAGA AAATTGGCTG AACAGCAAGG ACGTGCGGTT GAATTTGGCA TTCGACTTCA TATTATTGTG CGCGAAACAG AAAAGGAAGC GTGGAATGCT GCGGAACAGC TCATTCGTTA TGTGGACGAA AAAACTATTC AGGAAGCGCA ACGAGTTTTT TCTCGGTATG ATTCTGTTGG GCAGCAGCGG ATGAGACAAT TGCACAACGG GAGTAGAGAA TCGCTAGAAA TTAGTCCAAA TTTATGGGCG GGAGTCGGTC TTGTCAGAGG AGGAGCGGGA ACCGCTTTAG TCGGAGATCC GGAAACAGTA GCCGAGCGGC TATTAGAGTA TCATCAATTA GGCATCCGAT ACTTTATTTT ATCTGGTTAC CCGCATTTAG AAGAAGCATA CCGGGTAGCT GAACTGCTAT TCCCGCTTTT ACCGTTAAAT CATGGAAAAC AAGCCGCATC GTCTATCCAA GGAGAAATTG TCGGGAATGA GTTTTTCCCT TCATTTATCA AAGTATAA
|
Protein sequence | MELLWFIPSH GDGRYLGTTK GGRTAEYSYF RQIAQAADRL GYKGVLIPTG KSCEDPWLLA SALAAETEQL HFLVAVRPGL MSPTLAARMA STLDRISEGR LLINVVAGGD PVELAGDGLF LSHDERYEAT DEFLTVWRRL LSGEEVTLKG KHIHVNGAKL LFPPIQTPYP PIYFGGSSSA GQLVAAKHAD VYLTWGEPPA QVEEKISQVR KLAEQQGRAV EFGIRLHIIV RETEKEAWNA AEQLIRYVDE KTIQEAQRVF SRYDSVGQQR MRQLHNGSRE SLEISPNLWA GVGLVRGGAG TALVGDPETV AERLLEYHQL GIRYFILSGY PHLEEAYRVA ELLFPLLPLN HGKQAASSIQ GEIVGNEFFP SFIKV
|
| |