Gene Mvan_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0203 
Symbol 
ID4647716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp213297 
End bp214691 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content69% 
IMG OID639803713 
Productcarotenoid oxygenase 
Protein accessionYP_951059 
Protein GI120401230 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.297924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCCCG ACGATCTGAG CAGCCTCGGC GTCAGCGAGC CGCAACCCGC CGAACACGAC 
TACCGCGTCG AACAGATCGA CGGGACGATC CCGGCCGGGT TGCGCGGGAC GCTCTACCGC
AACGGCCCCG GTCGCTGGGA GGACCACAAG GGGCGGCCGC TGCGTCATCT GTTCGACGGG
GACGGCATGC TCTCGGCCTT CACCATCGAC GCCGCCGGCG TGCACTACCG CAACCGCTAT
GTGCGTACCC GGCATTTCGG GGGCAGGGGC GGTGTCAGCC ACATGGGTAC CTCGGCGCCG
GGGGGTTGGC GGGCGAACAT CTGCCGGGTG CCACCGAATC TCGCCAACAC CAACGTCGTC
GAGCACGCCG GCCGCCTCTA CGCGTTGTGG GAGGGTGGCC CGCCGCACGA GATCGACCCG
GATACCCTCG AGACCATGGG TGTACGTCGG TTCGGCGGCG AATTACGCTG GCTGGGAAGC
TATTCCGCCC ATCCGAGCTT CTGCCCGAGC AGCGGGGCGA TGTTCAACTT CGGGGTCGAG
CTGATGCCGC GTCCGCACCT GCGCATCTAC CGCACCGACC GGACGGGGCG CCTGCGGCAC
TTCCGCTCGG CCGCGCTGCC GTACGCCGCG ATGGTCCACG ACTTCGCGAT CACCGAACGG
TACATCGTGT TCCTCATCTC ACCGATCATC CCCGACGCGA TGTCGGTGGC GTTGGGGCGC
GCGCCGATCG GTGACACGCT GCGCTACCGC CCCGAGCGGG GCAGTGTGGT TCTCCTGGTG
CCGCGTGCCG GCGGAAAGAT CCGCCGTATC GAGTGCGAGG CGGTTCTGCA GTTCCATCTG
AGCAACGCCT TCGATGACGG AGACGACGTG GTCATCGACG CCATCACCTA CGCCGACGGG
CGGCTGCTCG AACGCATCGC GCGCTTCCAC ACCACCTCGC TGGCCGACAT GCCCTCGCAG
TTCACGCGTT TTCGGGTCGG CGCAACGGGC AGGGTCGGGG CGGAGCCGCT GACCGACAGC
CCGAGTGAGT TCCCCCGCCA TCACCCTGCG CGGGAGGGCC GACCGCACCG CTACGCGTAC
GTCAACACGC GCCGGACGCT CGGCACGCTG TACGACACGG TCACCAAGCT CGACCTGGCC
GATCAGACGG AGCTCAGCTA TCCCGCCCCC GAACCCGGCA ACAGCTTCTG CGAGCCGGTG
TTCGCACCGC GGCCCGGCGC CACGGCCGAG GACGACGGCT GGTTGCTGAC CGTGGAGTAC
CGGGCCGCGC ACAAGACGTC GCGGCTGGTC ATCCTGGATG CCGCGGACCC GTCGCGCGGA
CCGGTCGCCA CGGCTCAACT TGCGAGTCAC ATCCCGCAGG GTTTCCACGG CAACTTCTCC
GCGCGCACCA GCTGA
 
Protein sequence
MGPDDLSSLG VSEPQPAEHD YRVEQIDGTI PAGLRGTLYR NGPGRWEDHK GRPLRHLFDG 
DGMLSAFTID AAGVHYRNRY VRTRHFGGRG GVSHMGTSAP GGWRANICRV PPNLANTNVV
EHAGRLYALW EGGPPHEIDP DTLETMGVRR FGGELRWLGS YSAHPSFCPS SGAMFNFGVE
LMPRPHLRIY RTDRTGRLRH FRSAALPYAA MVHDFAITER YIVFLISPII PDAMSVALGR
APIGDTLRYR PERGSVVLLV PRAGGKIRRI ECEAVLQFHL SNAFDDGDDV VIDAITYADG
RLLERIARFH TTSLADMPSQ FTRFRVGATG RVGAEPLTDS PSEFPRHHPA REGRPHRYAY
VNTRRTLGTL YDTVTKLDLA DQTELSYPAP EPGNSFCEPV FAPRPGATAE DDGWLLTVEY
RAAHKTSRLV ILDAADPSRG PVATAQLASH IPQGFHGNFS ARTS