Gene Mkms_4858 details

Gene Information

Locus tag: Mkms_4858
Symbol:
ID: 4616273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 5086556
End bp: 5087746
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 67%
IMG OID: 639794549
Product: luciferase family protein
Protein accession: YP_940838
Protein GI: 119870886
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID:
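
The start, end, gene length and protein length fields can be cross-checked with a little arithmetic. The sketch below is illustrative and not part of the record. It assumes the coordinates are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions are consistent with the values listed here. The function names are hypothetical.

```python
# Values copied from the gene record.
START_BP = 5086556
END_BP = 5087746
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the stop codon is counted in the gene length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: 5087746 - 5086556 + 1 = 1191 bp, and 1191 / 3 - 1 = 396 aa.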


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCA AACCGCTGAA CTTCGGCGTC TTCATCACAC CGTTCCACCC GGTGGGCCAA 
TCCCCCACCG TCGCATTGGA ATACGACCTC GAGCGGGTGG TGCGGCTGGA CCGGCTCGGC
TTCGACGAGG CGTGGTTCGG CGAACACCAT TCGGGCGGTT ACGAACTCAT CGCCTGCCCG
GAGGTCTTCA TCGCCACGGC CGCCGAACGG ACCAGACACA TCCGCCTCGG CACCGGCGTG
GTGTCGCTGC CCTACCACCA TCCGCTCATG GTCGCCGACC GGTGGGTCCT GCTCGACCAC
CTCACCCGTG GCCGCGTCAT GTTCGGCACC GGGCCGGGCG CGCTGCCGTC GGACGCCTAC
ATGATGGGCC TCGACCCGGT CGAGCAGCGC CGCATGATGC AGGAGTCGCT CGAGGCGATC
CTCGCGTTGT TCCGCGCGGA ACCCGAGGAA CGCATCACCC GCGAGACCGA CTGGTTCACC
CTGCGCGACG CCCAACTTCA CATCCGGCCC TACACCTGGC CGTACCCCGA GATCTCCACC
GCCGCGATGA TCTCCCCGTC CGGACCACGA CTGGCCGGTT CACTCGGGAC GTCGCTGCTG
TCACTGTCGA TGTCGGTGCC GGGTGGATTC GCGGCACTGG AGTCGACGTG GCAGATCGTC
GTCGACCAGG CGGCCAAATC GGGCCGCCCC GAACCCGAAC GGGACAACTG GCGGGTGCTG
TCGATCATGC ACCTGGCCGA CACCCGAGAG CAGGCGATCG ACGACTGCAC GTACGGACTG
GCCGATTTCG CCAACTACTT CGGTGCGGCC GGCTTCGTCC CGCTGTCCAA CAGCGTCGAG
GGGGAACGCT CACCGCGGGA GTTCGCGGCC GAGTACGCCG CACAGGGGAA CTGCTGTATC
GGCACGCCCG ACGACGCCAT CGCCTACATC GACGACCTGC TCGTGAAGTC CGGCGGGTTC
GGGACACTGC TGCTGCTCGG CCACGACTGG GCGTCCCCGG AGGCCACCTA CCACTCCTAC
GACCTGTTCG CGCGAAAGGT GATGCCGCAC TTCAAGGGTC AGCTCACCGC CCCTCGCGCC
TCGCACGACT GGGCCAAGGG TATGCGCGAC CAGTTGCTCG GCCGGGCCGG CGACGCGATC
GTCAAGGCGA TCACCGAACA CACCGACGAG TTGAAGGTGG ACCCGCGGTA G
 
Protein sequence
MSRKPLNFGV FITPFHPVGQ SPTVALEYDL ERVVRLDRLG FDEAWFGEHH SGGYELIACP 
EVFIATAAER TRHIRLGTGV VSLPYHHPLM VADRWVLLDH LTRGRVMFGT GPGALPSDAY
MMGLDPVEQR RMMQESLEAI LALFRAEPEE RITRETDWFT LRDAQLHIRP YTWPYPEIST
AAMISPSGPR LAGSLGTSLL SLSMSVPGGF AALESTWQIV VDQAAKSGRP EPERDNWRVL
SIMHLADTRE QAIDDCTYGL ADFANYFGAA GFVPLSNSVE GERSPREFAA EYAAQGNCCI
GTPDDAIAYI DDLLVKSGGF GTLLLLGHDW ASPEATYHSY DLFARKVMPH FKGQLTAPRA
SHDWAKGMRD QLLGRAGDAI VKAITEHTDE LKVDPR
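
The gene and protein sequences above can be checked against each other, and against the listed GC content, with a short script. This is a minimal sketch using only the Python standard library, not the method the database used. The helper names `clean`, `gc_fraction` and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it removes the spaces and line breaks used in the sequence layout before processing.

```python
from itertools import product

# Amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order for the first, second and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base up to the first stop codon.

    Codons that are not in the table (for example, ones containing N)
    are reported as 'X'; a trailing partial codon is ignored.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, copied from the record.
    print(translate("ATGAGCCGCA AACCGCTGAA CTTCGGCGTC"))
```

To check the whole record, paste the full gene sequence into `translate` and compare the result with `clean` applied to the protein sequence; `gc_fraction` on the same input can be compared with the listed GC content.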