Gene Mmcs_3098 details

Gene Information

Locus tag: Mmcs_3098
Symbol:
ID: 4111930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 3276215
End bp: 3277705
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 67%
IMG OID: 638032228
Product: carotenoid oxygenase
Protein accession: YP_640261
Protein GI: 108800064
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID:
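
The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the start and end positions are inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3276215
end_bp = 3277705

# Assumption: both coordinates are inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1491 496
```

Both results agree with the Gene Length (1491 bp) and Protein Length (496 aa) fields.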


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGTGG AACGCCTGCA GACCTTCGCC TCGACGCTGC CCGCCGATGA CGACCATCCG 
TACCGCACCG GGCCGTGGCG CCCCCAGGTC ACCGAGTGGC GGGCCGACGA CCTCGAGGTC
GTCGCCGGCG AGGTGCCTGC CGATCTCGAC GGCATGTACC TGCGCAACAC GGAGAACCCG
CTGCATCCGG CCGCGACGGC CTACCACCCG TTCGACGGTG ACGGGATGAT CCACATCGTC
GAGTTCGGCG GGGGAAAAGC GGCCTACCGC AACCGCTTCG TCCGCACCGA CGGCTTCCTC
GCCGAGAACG AGGCCGGGGG ACCGCTGTGG GCCGGGTTCA TCGAGATGCC CTCGGCCGCC
AAACGCGCCG ACGGCTGGGG CGCGCGCACG CGGATGAAGG ACGCGTCGAG CACTGACGTC
GTCGTCCACC GCGGGACGGC GCTGACCAGT TTCTACATGT GCGGCGACCT CTACCAGGTC
GACCCGTACA CCGCCGACAC CCTCGGCAAG GAGACCTGGC ACGGCGACTT CCCGGACTGG
GGGGTGTCGG CGCATCCCAA GATCGACCCG GTCACCGGGG AGCTGCTGTT CTTCAGCTAC
AGCAAGGAAG CGCCTCATCT GCGCTACGGC GTGGTCGACA AGGACGCGAA CCTGGTGCAC
CACACCGACG TCGCGCTGCC CGGGCCGCGG ATGCCGCACG ATATGGCGTT CACCGAGAAC
TACGTGATCC TCAACGACTT CCCGCTGTTC TGGGAGCCGT CGCTGCTGAA GCAGGACATC
CACGCACCGG TCTTCCACCG CGACATGCCG TCGCGTTTCG CCGTGCTGCC CCGCCGCGGT
GACCAGTCGC AGGTGCGGTG GTTCGAGACC GACCCGACGT ATGCCCTGCA CTTCGTCAAC
GCCTACGAGG ACGGTGACGA GATCGTGCTC GACGGGTTCT TCCAGGACAA CCCGTCACCG
TCGACGAAGG GCGCGAAGTC GTTGGAGGAC GCGGCCTTCC GCTACCTGGC ACTCGACGGG
TTCGAATCGC ACCTGCACCG CTGGCGGTTC AACCTCGCCA CGGGGGCGGC CACGGAGGAA
CGGCTGTCGG ACAGCCTCAC CGAATTCGGC ATGATGAACG GTGACTACCA GACCCGGCGG
CACCGCTACG TGTACGCCGC CACCGGCAAA CCGGGCTGGT TCCTGTTCGA CGGGCTGGTC
AAACACGATC TGCGCGACGG TACCGAGGAG CGGATCACGT TCGGCGACGG CGTGTTCGGC
AGCGAGACCG CGATGGCGCC GCGTCAGGAC GGCACCGCCG AGGACGACGG CTACCTCGTC
ACCCTGACCA CGGACATGAA CGACGACGCC TCCTACTGCT TGGTGTTCGA TGCCGCGCGG
ATCGCCGACG GTCCGGTGTG CAAGCTGCGG CTTCCTGAAA GAATCTGCAG CGGAACACAT
TCGACGTGGG TGTCCGGGGC TGAGCTGCGG CGCTGGCACA GCCCGCGGTG A
 
Protein sequence
MRVERLQTFA STLPADDDHP YRTGPWRPQV TEWRADDLEV VAGEVPADLD GMYLRNTENP 
LHPAATAYHP FDGDGMIHIV EFGGGKAAYR NRFVRTDGFL AENEAGGPLW AGFIEMPSAA
KRADGWGART RMKDASSTDV VVHRGTALTS FYMCGDLYQV DPYTADTLGK ETWHGDFPDW
GVSAHPKIDP VTGELLFFSY SKEAPHLRYG VVDKDANLVH HTDVALPGPR MPHDMAFTEN
YVILNDFPLF WEPSLLKQDI HAPVFHRDMP SRFAVLPRRG DQSQVRWFET DPTYALHFVN
AYEDGDEIVL DGFFQDNPSP STKGAKSLED AAFRYLALDG FESHLHRWRF NLATGAATEE
RLSDSLTEFG MMNGDYQTRR HRYVYAATGK PGWFLFDGLV KHDLRDGTEE RITFGDGVFG
SETAMAPRQD GTAEDDGYLV TLTTDMNDDA SYCLVFDAAR IADGPVCKLR LPERICSGTH
STWVSGAELR RWHSPR
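
The gene and protein sequences can be spot-checked against each other by translating the DNA. The sketch below is not the database's own tooling: the names `translate`, `gc_fraction` and `CODON_TABLE` are hypothetical helpers written for this illustration. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted), and it translates only the first 60 bases of the gene sequence.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Amino-acid assignments are the same for table 11 and the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGCGAGTGG AACGCCTGCA GACCTTCGCC TCGACGCTGC CCGCCGATGA CGACCATCCG"
print(translate(first_line))  # MRVERLQTFASTLPADDDHP
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare with the GC content field.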