Gene Mkms_3158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3158 
Symbol 
ID4610993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3304497 
End bp3305987 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content67% 
IMG OID639792829 
Productcarotenoid oxygenase 
Protein accessionYP_939142 
Protein GI119869190 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.350488 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGTGG AACGCCTGCA GACCTTCGCC TCGACGCTGC CCGCCGATGA CGACCATCCG 
TACCGCACCG GGCCGTGGCG CCCCCAGGTC ACCGAGTGGC GGGCCGACGA CCTCGAGGTC
GTCGCCGGCG AGGTGCCTGC CGATCTCGAC GGCATGTACC TGCGCAACAC GGAGAACCCG
CTGCATCCGG CCGCGACGGC CTACCACCCG TTCGACGGTG ACGGGATGAT CCACATCGTC
GAGTTCGGCG GGGGAAAAGC GGCCTACCGC AACCGCTTCG TCCGCACCGA CGGCTTCCTC
GCCGAGAACG AGGCCGGGGG ACCGCTGTGG GCCGGGTTCA TCGAGATGCC CTCGGCCGCC
AAACGCGCCG ACGGCTGGGG CGCGCGCACG CGGATGAAGG ACGCGTCGAG CACTGACGTC
GTCGTCCACC GCGGGACGGC GCTGACCAGT TTCTACATGT GCGGCGACCT CTACCAGGTC
GACCCGTACA CCGCCGACAC CCTCGGCAAG GAGACCTGGC ACGGCGACTT CCCGGACTGG
GGGGTGTCGG CGCATCCCAA GATCGACCCG GTCACCGGGG AGCTGCTGTT CTTCAGCTAC
AGCAAGGAAG CGCCTCATCT GCGCTACGGC GTGGTCGACA AGGACGCGAA CCTGGTGCAC
CACACCGACG TCGCGCTGCC CGGGCCGCGG ATGCCGCACG ATATGGCGTT CACCGAGAAC
TACGTGATCC TCAACGACTT CCCGCTGTTC TGGGAGCCGT CGCTGCTGAA GCAGGACATC
CACGCACCGG TCTTCCACCG CGACATGCCG TCGCGTTTCG CCGTGCTGCC CCGCCGCGGT
GACCAGTCGC AGGTGCGGTG GTTCGAGACC GACCCGACGT ATGCCCTGCA CTTCGTCAAC
GCCTACGAGG ACGGTGACGA GATCGTGCTC GACGGGTTCT TCCAGGACAA CCCGTCACCG
TCGACGAAGG GCGCGAAGTC GTTGGAGGAC GCGGCCTTCC GCTACCTGGC ACTCGACGGG
TTCGAATCGC ACCTGCACCG CTGGCGGTTC AACCTCGCCA CGGGGGCGGC CACGGAGGAA
CGGCTGTCGG ACAGCCTCAC CGAATTCGGC ATGATGAACG GTGACTACCA GACCCGGCGG
CACCGCTACG TGTACGCCGC CACCGGCAAA CCGGGCTGGT TCCTGTTCGA CGGGCTGGTC
AAACACGATC TGCGCGACGG TACCGAGGAG CGGATCACGT TCGGCGACGG CGTGTTCGGC
AGCGAGACCG CGATGGCGCC GCGTCAGGAC GGCACCGCCG AGGACGACGG CTACCTCGTC
ACCCTGACCA CGGACATGAA CGACGACGCC TCCTACTGCT TGGTGTTCGA TGCCGCGCGG
ATCGCCGACG GTCCGGTGTG CAAGCTGCGG CTTCCTGAAA GAATCTGCAG CGGAACACAT
TCGACGTGGG TGTCCGGGGC TGAGCTGCGG CGCTGGCACA GCCCGCGGTG A
 
Protein sequence
MRVERLQTFA STLPADDDHP YRTGPWRPQV TEWRADDLEV VAGEVPADLD GMYLRNTENP 
LHPAATAYHP FDGDGMIHIV EFGGGKAAYR NRFVRTDGFL AENEAGGPLW AGFIEMPSAA
KRADGWGART RMKDASSTDV VVHRGTALTS FYMCGDLYQV DPYTADTLGK ETWHGDFPDW
GVSAHPKIDP VTGELLFFSY SKEAPHLRYG VVDKDANLVH HTDVALPGPR MPHDMAFTEN
YVILNDFPLF WEPSLLKQDI HAPVFHRDMP SRFAVLPRRG DQSQVRWFET DPTYALHFVN
AYEDGDEIVL DGFFQDNPSP STKGAKSLED AAFRYLALDG FESHLHRWRF NLATGAATEE
RLSDSLTEFG MMNGDYQTRR HRYVYAATGK PGWFLFDGLV KHDLRDGTEE RITFGDGVFG
SETAMAPRQD GTAEDDGYLV TLTTDMNDDA SYCLVFDAAR IADGPVCKLR LPERICSGTH
STWVSGAELR RWHSPR