Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_3158 |
Symbol | |
ID | 4610993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | - |
Start bp | 3304497 |
End bp | 3305987 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639792829 |
Product | carotenoid oxygenase |
Protein accession | YP_939142 |
Protein GI | 119869190 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.350488 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGTGG AACGCCTGCA GACCTTCGCC TCGACGCTGC CCGCCGATGA CGACCATCCG TACCGCACCG GGCCGTGGCG CCCCCAGGTC ACCGAGTGGC GGGCCGACGA CCTCGAGGTC GTCGCCGGCG AGGTGCCTGC CGATCTCGAC GGCATGTACC TGCGCAACAC GGAGAACCCG CTGCATCCGG CCGCGACGGC CTACCACCCG TTCGACGGTG ACGGGATGAT CCACATCGTC GAGTTCGGCG GGGGAAAAGC GGCCTACCGC AACCGCTTCG TCCGCACCGA CGGCTTCCTC GCCGAGAACG AGGCCGGGGG ACCGCTGTGG GCCGGGTTCA TCGAGATGCC CTCGGCCGCC AAACGCGCCG ACGGCTGGGG CGCGCGCACG CGGATGAAGG ACGCGTCGAG CACTGACGTC GTCGTCCACC GCGGGACGGC GCTGACCAGT TTCTACATGT GCGGCGACCT CTACCAGGTC GACCCGTACA CCGCCGACAC CCTCGGCAAG GAGACCTGGC ACGGCGACTT CCCGGACTGG GGGGTGTCGG CGCATCCCAA GATCGACCCG GTCACCGGGG AGCTGCTGTT CTTCAGCTAC AGCAAGGAAG CGCCTCATCT GCGCTACGGC GTGGTCGACA AGGACGCGAA CCTGGTGCAC CACACCGACG TCGCGCTGCC CGGGCCGCGG ATGCCGCACG ATATGGCGTT CACCGAGAAC TACGTGATCC TCAACGACTT CCCGCTGTTC TGGGAGCCGT CGCTGCTGAA GCAGGACATC CACGCACCGG TCTTCCACCG CGACATGCCG TCGCGTTTCG CCGTGCTGCC CCGCCGCGGT GACCAGTCGC AGGTGCGGTG GTTCGAGACC GACCCGACGT ATGCCCTGCA CTTCGTCAAC GCCTACGAGG ACGGTGACGA GATCGTGCTC GACGGGTTCT TCCAGGACAA CCCGTCACCG TCGACGAAGG GCGCGAAGTC GTTGGAGGAC GCGGCCTTCC GCTACCTGGC ACTCGACGGG TTCGAATCGC ACCTGCACCG CTGGCGGTTC AACCTCGCCA CGGGGGCGGC CACGGAGGAA CGGCTGTCGG ACAGCCTCAC CGAATTCGGC ATGATGAACG GTGACTACCA GACCCGGCGG CACCGCTACG TGTACGCCGC CACCGGCAAA CCGGGCTGGT TCCTGTTCGA CGGGCTGGTC AAACACGATC TGCGCGACGG TACCGAGGAG CGGATCACGT TCGGCGACGG CGTGTTCGGC AGCGAGACCG CGATGGCGCC GCGTCAGGAC GGCACCGCCG AGGACGACGG CTACCTCGTC ACCCTGACCA CGGACATGAA CGACGACGCC TCCTACTGCT TGGTGTTCGA TGCCGCGCGG ATCGCCGACG GTCCGGTGTG CAAGCTGCGG CTTCCTGAAA GAATCTGCAG CGGAACACAT TCGACGTGGG TGTCCGGGGC TGAGCTGCGG CGCTGGCACA GCCCGCGGTG A
|
Protein sequence | MRVERLQTFA STLPADDDHP YRTGPWRPQV TEWRADDLEV VAGEVPADLD GMYLRNTENP LHPAATAYHP FDGDGMIHIV EFGGGKAAYR NRFVRTDGFL AENEAGGPLW AGFIEMPSAA KRADGWGART RMKDASSTDV VVHRGTALTS FYMCGDLYQV DPYTADTLGK ETWHGDFPDW GVSAHPKIDP VTGELLFFSY SKEAPHLRYG VVDKDANLVH HTDVALPGPR MPHDMAFTEN YVILNDFPLF WEPSLLKQDI HAPVFHRDMP SRFAVLPRRG DQSQVRWFET DPTYALHFVN AYEDGDEIVL DGFFQDNPSP STKGAKSLED AAFRYLALDG FESHLHRWRF NLATGAATEE RLSDSLTEFG MMNGDYQTRR HRYVYAATGK PGWFLFDGLV KHDLRDGTEE RITFGDGVFG SETAMAPRQD GTAEDDGYLV TLTTDMNDDA SYCLVFDAAR IADGPVCKLR LPERICSGTH STWVSGAELR RWHSPR
|
| |