Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_5022 |
Symbol | |
ID | 6132883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 5503772 |
End bp | 5504989 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641645158 |
Product | alkanesulfonate monooxygenase |
Protein accession | YP_001771783 |
Protein GI | 170743128 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.036458 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCAC AGACCGACGT CCTCTGGTTC CTCCCGACCC ACGGCGACGG CCGCTACCTC GGCGCGAGCG AGGGCGCCCG GCACGTCTCC CTCGCCTATC TGCGCCAGAT TGCGCAGGCG GCGGACGACC TCGGCTATTT CGGGGTCCTG CTGCCGACCG GGCGGTCCTG CGAGGATTCC TGGATCGTCG CCTCGGCGCT GGCGCCGCTG ACGCGCCGGC TGCGCTTCCT CGTGGCGGTG CGGCCGGGCC TGCAGGAGCC GTCGGCGGCG GCGCGCATGG CCGCGACCCT CGACCGGATC TCCGAGGGCC GCCTGCTCAT CAACGTGGTG ACGGGCGGCG ACCCGGTGGA GCTGCGCGGC GACGGCGTCT TCCTCGACCA CGACGCGCGC TACGCGGTCA CGGACGAGTT CCTGCACATC TGGCGCCGCC TGATGGCGGG CGAGACCGTC ACCTACGCGG GCCGGCACCT GCGCACCGAG GAGGGGCGGC TGATCTTCGG GCCGGTGCAG CAGCCCTCGC CGCCGCTCTA TTTCGGCGGC TCGTCGGAGG CCGGCATCGC GGTCGCGGCC GAGCATTGCG AGGTCTACCT GACCTGGGGC GAGCCCCCCG CCCAGGTGGC CGAGAAGATC GCCCGGGCGC GCGCCGCCGC CGCCGCCAAG GGGAAGACCT TCTCGTTCGG GATCCGGCTG CACGTGATCG TGCGCGAGAC CGAGGGCGCG GCCTGGGAGG CCGCCGAATC CCTGATCTCG CGCCTCGACG ACGCCACCAT CGCGCGGGCG CAGGAGACCC TGAAGCGCCA GGATTCCGTC GGGCAGAGCC GCATGATGGC GCTGCACCGG GGCGACCGCC GCAACCTCGT CGTCGCGCCG AACCTGTGGG CCGGGGTGGG GCTGGTGCGC GGGGGCGCCG GCACCGCCCT GGTGGGCTCG GCCGACCAGG TCGCCGACCG CATGAAGGAG TACATCGACC TCGGCATCGA CCGCTTCATC CTCTCGGGCT ACCCGCACCT GGAGGAGGCC TACCGCTTCG CCGAACTCGT CTTCCCGAAA CTCCCCCTGC GCGACACGAC CGGGACCGCG CCGCGGCGGG CCCGCAACGA CGGGCCCTTC GGCGAGGTGA TCGCCAACGA CATTGTGCCG ACGCCGGGCG CCGGGGCGGC GCCGGGCATC GGGGCGACGC GGAGCGTCGC CGAGGCGCGG CGCGTCGGGG CGCGTTGA
|
Protein sequence | MSAQTDVLWF LPTHGDGRYL GASEGARHVS LAYLRQIAQA ADDLGYFGVL LPTGRSCEDS WIVASALAPL TRRLRFLVAV RPGLQEPSAA ARMAATLDRI SEGRLLINVV TGGDPVELRG DGVFLDHDAR YAVTDEFLHI WRRLMAGETV TYAGRHLRTE EGRLIFGPVQ QPSPPLYFGG SSEAGIAVAA EHCEVYLTWG EPPAQVAEKI ARARAAAAAK GKTFSFGIRL HVIVRETEGA AWEAAESLIS RLDDATIARA QETLKRQDSV GQSRMMALHR GDRRNLVVAP NLWAGVGLVR GGAGTALVGS ADQVADRMKE YIDLGIDRFI LSGYPHLEEA YRFAELVFPK LPLRDTTGTA PRRARNDGPF GEVIANDIVP TPGAGAAPGI GATRSVAEAR RVGAR
|
| |