Gene Mvan_4940 details

Gene Information

Locus tag: Mvan_4940
Symbol:
ID: 4648925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5284983
End bp: 5286596
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 68%
IMG OID: 639808411
Product: carotenoid oxygenase
Protein accession: YP_955718
Protein GI: 120405889
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID:
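
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the final codon is an untranslated stop. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information table.
START_BP = 5284983
END_BP = 5286596
GENE_LENGTH_BP = 1614
PROTEIN_LENGTH_AA = 537


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1614
print(expected_protein_length(GENE_LENGTH_BP))  # 537
```

Both results agree with the listed Gene Length (1614 bp) and Protein Length (537 aa).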


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.97423
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0157069
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTGC ACAGCCACCC CGAGATCGTC GGCAAGTTCC TCTCGACGCT GCCCGAGGAT 
GACGACCACC CCTACCGGAC CGGTCCGTGG CGGCCGCAGA CCACCGAATG GGACGACGAC
GACCTCAAGG TCGTCGAGGG CCGGATCCCG GACGACCTCG ACGGCGTGTA CCTGCGCAAC
ACCGAGAACC CGCTGCACCC GGCGCTGAAG TACTACCACC CGTTCGACGG CGACGGCATG
CTGCACATCG TCGGATTCCG TGACGGGAAA GCGTTCTACC GCAACAGGTT CGTCCGGACC
GACGCGTTCG CCGAAGAGAA CGAAGCGGGC GGTCCGCTGT GGCCCGGCAT CGCCGAGCCC
CTCGAGTTGG CCCGCCGCGA CTACGGGTGG GGTGCGCGCA CCCTGATGAA GGACGCGTCC
TCCACCGACG TCGTGGTGCA CCGTGGTGTC GCGCTGACCA GCCACTACCA GTGTGGCGAT
CTCTACCGCA TCGACCCGCG CACCGGCGCG GCCCTCGGCA AGGAGGACTG GCACGGCACG
TTCCCGTCCG ACTGGGGCGT GTCCGCCCAC CCGAAGATCG ACGACCACAC CGGTGAGCTG
CTGTTCTTCA ACTACAGCAA GCAGGCGCCC TACATGCGCT ACGGCGTGGT CAGCCCGGCC
GGCGAGGTGG TGCACAACGT CGATGTCCCG CTGCCGGGGC CGCGGCTGCC GCACGACATG
GCGTTCACCG AGAACTACGT GATACTCAAC GACTTTCCGC TGTTCTGGGA CACCGGACTG
CTCGAACACA ACATTCACTT CCCGCGTTTC CACCGGGACA TGCCGTCGCG GTTCGCCGTG
GTCCCCCGCC GCGGCGACAC CTCGCAGATC AAGTGGTTCG AGACCGCTCC CACCTACGTG
CTGCACTTCC CCAACGCCTA CGAGGACGGC GACGAGATCG TGCTGGACGG GTTCTTCCAG
GGCGATCCCG AGCCCACCGA CGGAGTCGAC AACGGGATGA GCGGCAAATG GCGGCAGATC
TTCCGCAGCC TGTCACTGGA CGGTATGCAG ACCCGGCTGC ACCGGTGGCG GTTCAACCTG
GTCACCGGCG AGGCCCGCGA AGAGCAGCTG TCGGACAGCA TCACCGAGTT CGGCATGATC
AACCCCGCGT ATTCCGGGCG CCCCTACCGC TACACGTATG CGGCCACGGG CAAGCCCGGG
TGGTTCCTGT TCGACGGCCT GGTCCGGCAC GACCTGCACA CCGGCACCGA GCAGCGCTAC
GGGTTCGGGG ACGGTGTCTA CGGCAGCGAG ACCGCGATGG CACCCCGCGT GGGCAGCGCG
GCCGCTGACC ACATTGGCGG CTCGGCCGCC GCCCACATTG GCGGCTCGGC CGCCGCCCAC
ATTGGCGGCT CGGCCGCCGA AGACGACGGG TACCTCGTCA CCCTGACCAC CGACATGGCG
GCCGACGCGT CGTACTGCCT GGTGTTCGAC GCGGCGCGGG TGGGTGACGG TCCGGTATGC
AAACTGCAAC TGCCCGAACG GATCTCCAGC GGCACACATT CGACCTGGGC GCCGGGAGCC
GATCTGCCGC AATGGCGCGA GAGCGACTCG GCGGCCGGCG CGATCGGCCT GTAA
 
Protein sequence
MSLHSHPEIV GKFLSTLPED DDHPYRTGPW RPQTTEWDDD DLKVVEGRIP DDLDGVYLRN 
TENPLHPALK YYHPFDGDGM LHIVGFRDGK AFYRNRFVRT DAFAEENEAG GPLWPGIAEP
LELARRDYGW GARTLMKDAS STDVVVHRGV ALTSHYQCGD LYRIDPRTGA ALGKEDWHGT
FPSDWGVSAH PKIDDHTGEL LFFNYSKQAP YMRYGVVSPA GEVVHNVDVP LPGPRLPHDM
AFTENYVILN DFPLFWDTGL LEHNIHFPRF HRDMPSRFAV VPRRGDTSQI KWFETAPTYV
LHFPNAYEDG DEIVLDGFFQ GDPEPTDGVD NGMSGKWRQI FRSLSLDGMQ TRLHRWRFNL
VTGEAREEQL SDSITEFGMI NPAYSGRPYR YTYAATGKPG WFLFDGLVRH DLHTGTEQRY
GFGDGVYGSE TAMAPRVGSA AADHIGGSAA AHIGGSAAAH IGGSAAEDDG YLVTLTTDMA
ADASYCLVFD AARVGDGPVC KLQLPERISS GTHSTWAPGA DLPQWRESDS AAGAIGL
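
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, along with a GC fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the gene sequence are embedded to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First and last lines of the gene sequence, copied from the listing.
first_line = "ATGAGTCTGC ACAGCCACCC CGAGATCGTC GGCAAGTTCC TCTCGACGCT GCCCGAGGAT"
last_line = "GATCTGCCGC AATGGCGCGA GAGCGACTCG GCGGCCGGCG CGATCGGCCT GTAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(head.startswith("ATG"))  # True: the CDS begins with a start codon
print(tail.endswith("TAA"))    # True: the CDS ends with a stop codon
print(len(head), len(tail))    # 60 54
```

Passing the full joined gene sequence to `gc_fraction` would give a value to compare against the GC content listed in the Gene Information table.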