Gene Mjls_5235 details


Gene Information

Locus tag: Mjls_5235
Symbol:
ID: 4880933
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 5484932
End bp: 5486287
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 68%
IMG OID: 640142547
Product: carotenoid oxygenase
Protein accession: YP_001073490
Protein GI: 126437799
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID:
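
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the terminal stop codon; the record does not state either convention explicitly. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(5484932, 5486287)
print(length_bp)                  # 1356
print(protein_length(length_bp))  # 451
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields of the record.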


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.751022
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAGA CCGACCACGC GCACGACGCC GTCAGCGCCG ACAACCTGCC GTCGGGAGAC 
GAGTTCTTCC ACAGGGGCAA CTACGCGCCC GTCGCCGACG AACTCACCGC CTTCGACCTG
CCCGTCGAGG GACAGATCCC GGCCGATCTG CAGGGGTGGT ACCTGCGCAA CGGTCCGAAC
CCGCGGCAGC CGTCCGCGCA CTGGTTCACC GGCGACGGGA TGATCCACGG CGTGCGCATC
GAGAACGGCC GCGCCGCCTG GTACCGCAAC CGGTGGGTGC GCACGGAGAG CTTCGAGCAG
CACTTCCCGG TCTACAACTC CGACGGCAGC CGCAACCTGC ACTCCAGCGT CGCCAACACC
CACGTCGTCA ACCACGCAGG CAAGACCCTG GCGCTCGTCG AATCGTCGCT GCCCTACGAG
ATCACCAACG ACCTGCAGAC CGTGGGCGCC TACGACTTCG GGGGCAAGCT GGTCGACTCG
ATGACGGCGC ACCCGAAGAT CTGTCCGACC ACCGGGGAAT TGCACTTCTT CGGCTACGGC
AACCTCTTCG AGCCCTACGT GACCTATCAC CGGGCCGACG CCGACGGCGA ACTGACCGTC
AACCGGCCGC TGGACGTCAA GGCGTTGACG ATGATGCACG ACTTCGCGAT GACCAGTGGG
CACGTGGTCT TCATGGACCT GCCGATCGTC TTCGACATGG GCATCGCGCT CGAGGGCAAG
GGTGACATGC CCTACCGCTG GGACGACGAC TACGGCGCCC GCCTCGGCGT ACTGCGCCGC
GACGACCCCT TCGGCGAGGT GCGCTGGTTC GACATCGACC CGTGCTACGT CTTCCACGTC
GCCAACGCCT ACGAGGACGG GAACACGCTG GTGCTGCAGG CCGTGCGCTA CCCCGAACTG
TGGCGCGGCA CAGGCGGATT CGAGGCCGAG GGAGTGCTGT GGAGCTGGAC CCTCGACCTG
GCGACGGGCA CGGTGCGCGA ACGCCAGCTC GACGACCGGG CCGTGGAGTT CCCCCGCATC
GACGACCGGT TGGCGGGTCT GCCTGCCCGG TACGCGGTGT CGGTGGGCGA TCAGCGGTTG
GTGCGCTACG ACCTGACGAG CGGCACGGCG GTCGAACACG CCTTCGGGAC CGCCGACGCG
CCGGGCGGAC CCGGCGAGGC GGTGTTCGTG CCGGCCACCT CGGGCCCCGC CGACGAACAG
AACGGGTGGT ATATGGCGTA CGTCTACGAC CCGCAGCGCG ACGGCAGCGA TCTGGTGATC
CTCGACGCCG CCGATTTCGG CGCCCCGCCG GTGGCCAGGG TGCAACTGCC GCAACGGGTT
CCGTACGGTT TCCACGGCAA CTGGATCGCT GGGTAG
 
Protein sequence
MTETDHAHDA VSADNLPSGD EFFHRGNYAP VADELTAFDL PVEGQIPADL QGWYLRNGPN 
PRQPSAHWFT GDGMIHGVRI ENGRAAWYRN RWVRTESFEQ HFPVYNSDGS RNLHSSVANT
HVVNHAGKTL ALVESSLPYE ITNDLQTVGA YDFGGKLVDS MTAHPKICPT TGELHFFGYG
NLFEPYVTYH RADADGELTV NRPLDVKALT MMHDFAMTSG HVVFMDLPIV FDMGIALEGK
GDMPYRWDDD YGARLGVLRR DDPFGEVRWF DIDPCYVFHV ANAYEDGNTL VLQAVRYPEL
WRGTGGFEAE GVLWSWTLDL ATGTVRERQL DDRAVEFPRI DDRLAGLPAR YAVSVGDQRL
VRYDLTSGTA VEHAFGTADA PGGPGEAVFV PATSGPADEQ NGWYMAYVYD PQRDGSDLVI
LDAADFGAPP VARVQLPQRV PYGFHGNWIA G
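
The sequences above are printed in blocks of ten characters, so the grouping spaces and line breaks need to be stripped before any analysis. The following is a minimal sketch of that cleanup, together with a GC-content calculation and a stop-codon check. The helper names are hypothetical, and the stop codons TAA, TAG and TGA are those of translation table 11. The example runs on the first two blocks of the gene sequence only; pasting the full listing into `clean_sequence` allows the result to be compared with the 68% GC content reported in the record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two ten-base blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGACCGAGA CCGACCACGC")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 65.0
```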