Gene Mkms_3571 details

Gene Information

Locus tag: Mkms_3571
Symbol:
ID: 4611501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 3760832
End bp: 3761947
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 72%
IMG OID: 639793247
Product: alkanesulfonate monooxygenase
Protein accession: YP_939555
Protein GI: 119869603
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent
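
The listed lengths can be cross-checked against the listed coordinates. The sketch below is only an illustration, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the 1116 bp stated in the record) and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information record.
start_bp = 3760832
end_bp = 3761947

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1116 371
```

Both values agree with the Gene Length and Protein Length fields of the record.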


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.55225
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0903973
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGGAT CGCTGCGGTT CCACTGGTAT CTGCCCACCC ACGGTGACAC CACCACGATC 
GCCGACAACC GGGGCAGTGC GACGGCGCGG GCCGGGTCGC ACCTCGAGCC GACCCTGCAC
AACCTCACCG CGCTGGGACG CGCGGCCGAG GACTTCGGCT TCGAGGCGGT GCTGACACCG
ACCGGGTCGC ACTGCGAGGA CTCCTGGATC GCCACCGCGG CGCTGGCCCA GCACACGCGG
CGGCTGAAGT ACCTCGTGGC GTTCCGGCCC GGTGTGCTGT CACCCACGTT GGCCGCGCAA
CAGGTCAGCA CCTACCAGCG GTTCACCGGT GGCCGATTGG CCCTCAACGT GGTGACCGGT
GGCGACGACG ACGAGATGCG CCGCTACGGC GACGGGATCG ACAAGTCGGC GCGGTACCGG
CGCACCGGGG AGTTCCTCCG GATCGTGCGG GGGATCTGGT CGGAGCCCGA TTTCTCCTTT
CACGGCGAGT TCTACGCCGT CGACCACGCC AGGACGGCGT ATCCGCTCAC CGAGGTACCG
ACCGTCTACT TCGGCGGTTC GTCGCCGGAA GCCATCGAGG TCGCCGCCGA ATTCGCCGAC
GTGTACCTCA CCTGGGGGGA ACCGCCGGAT CAGGTCGCCG AGAAGATCGA CCGGGTGCGG
GCCGCGGCCG TCCGGCACGG GCGCACGCTG CGGTACGGCG TGCGGTTGCA CACCGTCGCG
CGGCCCACCT CCGCACAGGC GTGGCAGCGG GCAGAAGACC TCATCGCGGG GCTGTCGGCC
GATGAGGTCC GCCGGGCACA CGAGCGCTAC CTGGTCAGCG GATCCGAGGG GCAGCGGCGG
ATGGCGGCGC TGACCACCGG TGAACTCGTC GACGCGCGCA GCCTCGAGGT GTACCCGGGA
CTGTGGGCCG GGCCGAGTCT GCTGCGCGAC GGGGCGGGCA CCGCGGCGAC CGGCAGTTAC
GCCGAGGTGG CCGCGGTGTT CGGGGAGTAC GCACGGCTCG GGGTCAGCGA ATTCGTGTTG
TCGGGCTACC CGCAGGCCGA GGAGATCCGC CACGTCGGCG AGGGCGTGCT TCCGCTCCTG
GCCGGCACTC CCGCCGAGGC GGTCCCCGCC GGCTGA
 
Protein sequence
MSGSLRFHWY LPTHGDTTTI ADNRGSATAR AGSHLEPTLH NLTALGRAAE DFGFEAVLTP 
TGSHCEDSWI ATAALAQHTR RLKYLVAFRP GVLSPTLAAQ QVSTYQRFTG GRLALNVVTG
GDDDEMRRYG DGIDKSARYR RTGEFLRIVR GIWSEPDFSF HGEFYAVDHA RTAYPLTEVP
TVYFGGSSPE AIEVAAEFAD VYLTWGEPPD QVAEKIDRVR AAAVRHGRTL RYGVRLHTVA
RPTSAQAWQR AEDLIAGLSA DEVRRAHERY LVSGSEGQRR MAALTTGELV DARSLEVYPG
LWAGPSLLRD GAGTAATGSY AEVAAVFGEY ARLGVSEFVL SGYPQAEEIR HVGEGVLPLL
AGTPAEAVPA G
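
The GC content field in the record is a fraction of G and C bases over the gene sequence. A minimal sketch of how such a value could be recomputed from the space-separated blocks shown above follows; the function name `gc_fraction` is hypothetical, and the source does not say how the database itself derived or rounded the 72% figure.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # The record prints the sequence in blocks of 10 separated by spaces
    # and line breaks, so strip all whitespace first.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence: 5 G + 1 C out of 10 bases.
first_block = "GTGAGCGGAT"
print(gc_fraction(first_block))  # 0.6
```

Passing the full gene sequence block, whitespace included, to the same function gives the overall fraction for comparison with the listed value.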