Gene Mkms_4956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4956 
Symbol 
ID4612633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5194883 
End bp5196238 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content68% 
IMG OID639794648 
Productcarotenoid oxygenase 
Protein accessionYP_940935 
Protein GI119870983 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGA CCGACCACGC GCGCGACGCC GTCAGCGCCG ACAACCTGCC GTCGGGAGGC 
GAGTTCTTCC ACAAGGGCAA CTACGCGCCC GTCGCCGACG AACTCACCGC CTTCGACCTG
CCCGTCGAGG GGCAGATCCC GGCCGATCTG CAGGGGTGGT ACCTGCGCAA CGGTCCGAAC
CCGCGGCAGC CGTCCGCGCA CTGGTTCACC GGCGACGGCA TGATCCACGG CGTGCGCATC
GAGAACGGCC GCGCCGCCTG GTACCGCAAC CGGTGGGTGC GCACGGAGAG CTTCGAGCAG
CACTTCCCGG TCTACAACTC CGACGGCAGC CGCAACCTGC ACTCCAGCGT CGCCAACACC
CACGTCGTCA ACCACGCAGG CAAGACCCTG GCGCTCGTCG AATCGTCGCT GCCCTACGAG
ATCACCAACG ACCTGCAGAC CGTGGGCGCC TACGACTTCG GGGGCAAGCT GGTCGACTCG
ATGACGGCGC ACCCGAAGAT CTGTCCGACC ACCGGCGAGT TGCACTTCTT CGGCTACGGC
AACCTCTTCG AGCCCTACGT GACCTATCAC CGGGCCGCCG CCGACGGCGA ACTGACCGTC
AACCGGCCGT TGGACGTCAA GGCGCTGACG ATGATGCACG ACTTCGCGAT GACCAGCGGG
CACGTGGTCT TCATGGACCT GCCGATCGTC TTCGACATGG GCATCGCGCT CGAGGGCAAG
GGTGACATGC CCTACCGCTG GGACGACGAC TACGGCGCCC GCCTCGGCGT ACTGCGCCGC
GACGATCCCT TCGGCGAGGT GCGCTGGTTC GACATCGACC CGTGCTACGT CTTCCACGTC
GCCAACGCCT ACGAGGACGG GAACACGCTG GTGCTGCAGG CCGTGCGCTA CCCCGAACTG
TGGCGCGGCA CAGGCGGATT CGAGGCCGAG GGAGTGCTGT GGAGCTGGAC CCTCGACCTG
GTGACGGGCA CGGTGCGCGA ACGCCAGCTC GACGACCGGG CCGTGGAGTT CCCCCGCATC
GACGACCGGT TGGCGGGTCT GGCGGCCCGG TACGCGGTGT CTGTGGGCGA TCAGCGGTTG
GTGCGCTACG ACCTGACGAG CGGTACGGCG GTCGAACACG CCTTCGGGAC CGCCGACGCG
CCGGGCGGAC CCGGCGAGGC GGTGTTCGTG CCGGCCACCT CGGGACCCGT CGACGAACAG
AACGGGTGGT ATATGGCGTA CGTCTACGAC CCGCAGCGCG ACGGCAGCGA TCTGGTGATC
CTCGACGCCG CCGACTTCGC CGGCCAGCCG GTCGCGAGAA TCAAATTGCC GCAACGGGTT
CCGTACGGTT TCCACGGCAA TTGGATCACC GGATAG
 
Protein sequence
MTETDHARDA VSADNLPSGG EFFHKGNYAP VADELTAFDL PVEGQIPADL QGWYLRNGPN 
PRQPSAHWFT GDGMIHGVRI ENGRAAWYRN RWVRTESFEQ HFPVYNSDGS RNLHSSVANT
HVVNHAGKTL ALVESSLPYE ITNDLQTVGA YDFGGKLVDS MTAHPKICPT TGELHFFGYG
NLFEPYVTYH RAAADGELTV NRPLDVKALT MMHDFAMTSG HVVFMDLPIV FDMGIALEGK
GDMPYRWDDD YGARLGVLRR DDPFGEVRWF DIDPCYVFHV ANAYEDGNTL VLQAVRYPEL
WRGTGGFEAE GVLWSWTLDL VTGTVRERQL DDRAVEFPRI DDRLAGLAAR YAVSVGDQRL
VRYDLTSGTA VEHAFGTADA PGGPGEAVFV PATSGPVDEQ NGWYMAYVYD PQRDGSDLVI
LDAADFAGQP VARIKLPQRV PYGFHGNWIT G