Gene Information |
Locus tag | Mvan_4940 |
Symbol | |
ID | 4648925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 5284983 |
End bp | 5286596 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639808411 |
Product | carotenoid oxygenase |
Protein accession | YP_955718 |
Protein GI | 120405889 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
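The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5284983
end_bp = 5286596
gene_length_bp = 1614
protein_length_aa = 537

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and encodes no residue.
codons, remainder = divmod(span, 3)
encoded_residues = codons - 1

print(span, remainder, encoded_residues)
```

Under these assumptions the computed span and residue count agree with the Gene Length and Protein Length fields.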
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.97423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0157069 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTCTGC ACAGCCACCC CGAGATCGTC GGCAAGTTCC TCTCGACGCT GCCCGAGGAT GACGACCACC CCTACCGGAC CGGTCCGTGG CGGCCGCAGA CCACCGAATG GGACGACGAC GACCTCAAGG TCGTCGAGGG CCGGATCCCG GACGACCTCG ACGGCGTGTA CCTGCGCAAC ACCGAGAACC CGCTGCACCC GGCGCTGAAG TACTACCACC CGTTCGACGG CGACGGCATG CTGCACATCG TCGGATTCCG TGACGGGAAA GCGTTCTACC GCAACAGGTT CGTCCGGACC GACGCGTTCG CCGAAGAGAA CGAAGCGGGC GGTCCGCTGT GGCCCGGCAT CGCCGAGCCC CTCGAGTTGG CCCGCCGCGA CTACGGGTGG GGTGCGCGCA CCCTGATGAA GGACGCGTCC TCCACCGACG TCGTGGTGCA CCGTGGTGTC GCGCTGACCA GCCACTACCA GTGTGGCGAT CTCTACCGCA TCGACCCGCG CACCGGCGCG GCCCTCGGCA AGGAGGACTG GCACGGCACG TTCCCGTCCG ACTGGGGCGT GTCCGCCCAC CCGAAGATCG ACGACCACAC CGGTGAGCTG CTGTTCTTCA ACTACAGCAA GCAGGCGCCC TACATGCGCT ACGGCGTGGT CAGCCCGGCC GGCGAGGTGG TGCACAACGT CGATGTCCCG CTGCCGGGGC CGCGGCTGCC GCACGACATG GCGTTCACCG AGAACTACGT GATACTCAAC GACTTTCCGC TGTTCTGGGA CACCGGACTG CTCGAACACA ACATTCACTT CCCGCGTTTC CACCGGGACA TGCCGTCGCG GTTCGCCGTG GTCCCCCGCC GCGGCGACAC CTCGCAGATC AAGTGGTTCG AGACCGCTCC CACCTACGTG CTGCACTTCC CCAACGCCTA CGAGGACGGC GACGAGATCG TGCTGGACGG GTTCTTCCAG GGCGATCCCG AGCCCACCGA CGGAGTCGAC AACGGGATGA GCGGCAAATG GCGGCAGATC TTCCGCAGCC TGTCACTGGA CGGTATGCAG ACCCGGCTGC ACCGGTGGCG GTTCAACCTG GTCACCGGCG AGGCCCGCGA AGAGCAGCTG TCGGACAGCA TCACCGAGTT CGGCATGATC AACCCCGCGT ATTCCGGGCG CCCCTACCGC TACACGTATG CGGCCACGGG CAAGCCCGGG TGGTTCCTGT TCGACGGCCT GGTCCGGCAC GACCTGCACA CCGGCACCGA GCAGCGCTAC GGGTTCGGGG ACGGTGTCTA CGGCAGCGAG ACCGCGATGG CACCCCGCGT GGGCAGCGCG GCCGCTGACC ACATTGGCGG CTCGGCCGCC GCCCACATTG GCGGCTCGGC CGCCGCCCAC ATTGGCGGCT CGGCCGCCGA AGACGACGGG TACCTCGTCA CCCTGACCAC CGACATGGCG GCCGACGCGT CGTACTGCCT GGTGTTCGAC GCGGCGCGGG TGGGTGACGG TCCGGTATGC AAACTGCAAC TGCCCGAACG GATCTCCAGC GGCACACATT CGACCTGGGC GCCGGGAGCC GATCTGCCGC AATGGCGCGA GAGCGACTCG GCGGCCGGCG CGATCGGCCT GTAA
Protein sequence | MSLHSHPEIV GKFLSTLPED DDHPYRTGPW RPQTTEWDDD DLKVVEGRIP DDLDGVYLRN TENPLHPALK YYHPFDGDGM LHIVGFRDGK AFYRNRFVRT DAFAEENEAG GPLWPGIAEP LELARRDYGW GARTLMKDAS STDVVVHRGV ALTSHYQCGD LYRIDPRTGA ALGKEDWHGT FPSDWGVSAH PKIDDHTGEL LFFNYSKQAP YMRYGVVSPA GEVVHNVDVP LPGPRLPHDM AFTENYVILN DFPLFWDTGL LEHNIHFPRF HRDMPSRFAV VPRRGDTSQI KWFETAPTYV LHFPNAYEDG DEIVLDGFFQ GDPEPTDGVD NGMSGKWRQI FRSLSLDGMQ TRLHRWRFNL VTGEAREEQL SDSITEFGMI NPAYSGRPYR YTYAATGKPG WFLFDGLVRH DLHTGTEQRY GFGDGVYGSE TAMAPRVGSA AADHIGGSAA AHIGGSAAAH IGGSAAEDDG YLVTLTTDMA ADASYCLVFD AARVGDGPVC KLQLPERISS GTHSTWAPGA DLPQWRESDS AAGAIGL
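The gene and protein sequences can be spot-checked against each other by translating the start of the gene. This is a minimal sketch, not a full translator: the codon map is hypothetical in scope, covering only the codons that occur in the first 30 bases. The standard genetic code and translation table 11 agree on these assignments. The spaces that split the sequence into 10-character blocks are removed before translating.

```python
# First 30 bases of the gene sequence, copied from the record with its block spacing.
gene_prefix = "ATGAGTCTGC ACAGCCACCC CGAGATCGTC"
# First 10 residues of the protein sequence from the record.
protein_prefix = "MSLHSHPEIV"

# Partial codon map (illustration only): just the codons present in gene_prefix.
CODONS = {
    "ATG": "M", "AGT": "S", "CTG": "L", "CAC": "H", "AGC": "S",
    "CCC": "P", "GAG": "E", "ATC": "I", "GTC": "V",
}

def translate(dna: str) -> str:
    """Strip whitespace, then translate complete codons using the partial map."""
    bases = "".join(dna.split()).upper()
    usable = len(bases) - len(bases) % 3  # ignore any trailing partial codon
    return "".join(CODONS[bases[i:i + 3]] for i in range(0, usable, 3))

translated = translate(gene_prefix)
print(translated)
```

Checking the whole sequence would need a complete codon table for translation table 11, for example from a bioinformatics library, rather than this partial map.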