Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_4409 |
Symbol | |
ID | 4612352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | - |
Start bp | 4635301 |
End bp | 4636431 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639794095 |
Product | alkanesulfonate monooxygenase |
Protein accession | YP_940390 |
Protein GI | 119870438 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCG AACGCATCGC CGACCACGTC AAGTTCGCCT ACTGGGTGCC CAACGTCAGT GGCGGCCTGG TGACCAGCAC CATCGAACAG CGCACCGACT GGAACTACGA GTACAACAAG AAACTCGCCC AGACCGCGGA GAACAACGGC TTCGAATACG CGTTGAGCCA GGTCCGCTAC GAAGCCAGCT ACGGCGCCGA ATACCAGCAC GAATCGACCA GCTTCAGCCT GGCGCTGCTG CTGGCCACCG AGCGGCTCAA GGTGATCGCC GCCGTGCACC CGGGCCTCTG GCAGCCCGCG GTGCTCGCCA AGCTCGGCGC CACCGCCGAC CAGCTGTCGG GCGGCCGGTT TGCCGTCAAC GTCGTCTCCG GCTGGTTCAA GGACGAGTTC ACCCACCTCG GCGAGCCGTG GCTCGAACAC GACGAGCGCT ACCGCCGCAG CGCCGAGTTC CTGCAGGTGC TGCGCAAGAT CTGGACCGAG GACGACGTGG ACTTCCGCGG CGACTTCTAC CGCATCCACG ACTTCACCCT CAAGCCGAAA CCGGTCAACA CACCCGAGCG GCCCAACCCG GAGCTCTTCC AGGGCGGCAA CTCCACCGCC GCGCGGCGCA ACGGCGGCCG CTACGCCGAC TGGTACTTCT CCAACGGCAA GGACTTCGAC GGCATCACCG AACAGATCGT CGAGGTGCGT GACCATGCCC GTGAGGTGGG CCGCGAGGTC AAGGTCGGAC TGAACGGGTT CATCATCGCC CGCGACACCG AGAAGGAGGC GAGGGACACC CTCCGGGAGA TCATCGAGAA GGCCGACAAG CCGGCCGTCG AGGGCTTCCG CAGCGCCGTC CAGCAGGCCG GGAACTCGAC GGCCGACAAG AAGGGAATGT GGGCGGATTC CTCCTTCGAG GATCTGGTCC AGTACAACGA CGGCTTCCGC ACCCAGCTGA TCGGCACGCC CGAGCAGATC GCGGAACGCA TTGCGGCATA CCGCAAACGC GGTGTCGACC TGATCCTCGG CGGCTTCCTG CACTTCCAGG AGGAGATCGA GTACTTCGGT GCCCGGGTGC TGCCGCTCGT GCGCGAGATC GAGGCATCCG ATGCCTCCTC GGCCGACGCC CCGGCGCTCG CCACGGTGTG A
|
Protein sequence | MSTERIADHV KFAYWVPNVS GGLVTSTIEQ RTDWNYEYNK KLAQTAENNG FEYALSQVRY EASYGAEYQH ESTSFSLALL LATERLKVIA AVHPGLWQPA VLAKLGATAD QLSGGRFAVN VVSGWFKDEF THLGEPWLEH DERYRRSAEF LQVLRKIWTE DDVDFRGDFY RIHDFTLKPK PVNTPERPNP ELFQGGNSTA ARRNGGRYAD WYFSNGKDFD GITEQIVEVR DHAREVGREV KVGLNGFIIA RDTEKEARDT LREIIEKADK PAVEGFRSAV QQAGNSTADK KGMWADSSFE DLVQYNDGFR TQLIGTPEQI AERIAAYRKR GVDLILGGFL HFQEEIEYFG ARVLPLVREI EASDASSADA PALATV
|
| |