Gene Mvan_3359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3359 
Symbol 
ID4647642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3575867 
End bp3577375 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content68% 
IMG OID639806837 
Productcarotenoid oxygenase 
Protein accessionYP_954162 
Protein GI120404333 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.120702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.796578 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGCGG GCACCCGGTA CTGGGACGAC ATCGACGGTG AATTCGACAT CCCGGTCGAG 
GAGATCACCG GAACTCTGCC CGAGGGGCTG ACCGGCACGC TGTACCGCAA CGGGTCCGGG
CGCTGGAACA TCGGTTCGTC GCCGGTGGAG AGCATCTACG ACGCCGACGG AATGGTGAGC
GCCTTCGTGC TGAACGGAGC CGGCGTCCGG TTCCGCAACA GGTTCGTCCG CACCAGGCAT
TTCGTCCAGT CGACACGGGC CGGACGGCCC ATCGGCCGCG GCTTCAGCGC GCAGCGGCCG
GGCGGCATCC GGGCCAATGC GCTTCGCCTG CCGGTGAACA CCGCCAATAC CAGCGTGATG
ATGAACAACG ATCGGCTACT GGCCCTTTGG GAGGGCGGCC GGCCGCACGA GCTCGATCCC
GACAGCCTCG ACACCTACGG GATGAGCGAC CTCGGCGGTG CGTTGAGGGG CCCGGTGGGG
GCGTACTCGG CGCACTACAC GCTCGACCCC GAAACCGGCG CCAAGGTCAA CTTCGGTTTC
GACCCGTACT TTCCCCGGAT CGACCTGCGG TGGGCGCTGC GCGGGGTGAG CGGTGCGGAA
CGACGCAGAA GGCTGCGGGA CCTCGTCGGC GAAGCCGTAC CCCGAGTTCG CTTGCGGCTC
TACGAAACCG ACACCGACGG CGTCACCCGC TATCTGCGCG CGGTCCCGCT GCCGGGGATG
GGCATCGTGC ACGACATGGC GCTGACCCGG CGCTACGCGA TCTTCGTCGT CTCGCCGATG
CGGATCAATC CGTGGGCACT CACCGGTCAC GCGTCCTTCT GGGACTCCAT GCGCTTCCTG
ACCGACACAC CGACGTACTT CCTGTTGGCG CCGCGTAGCG GCGGACCGGT GCGGACCGTC
GAGACCGACG CCTTCTACCA CTGGCACTTC ACGAATGCCT ACGACGACGG CGACGACGTC
GTGGTGGAGC TGCCGAGGTT CGCACCGCAC ACGTACGAGG GCGTGAAGAA CTACACCGCG
CATGTCCGGT CCGGCAGCGC CCAGGTCGGC GGCGCGGACC CATCGGATGC CGTGGTGCTG
ACCCGGTTCC GCATCGGATC GTCCGGTCGG GTCACGCGCG AACCGCTCGC CGATTTCGGT
TGCGAGTTCC CGCAGATCGA TCCCCGGCGT TCCACCAGGC GGCACGACTA CTCCTACGTC
GCGGTGCAGG ACCCGGTGCG GTTCCAGGGC CAGGGGATCG CCCGCATCGA CCACCGCAGT
GGAGCCGGCC AGGTGTTCTG TCCGCCCGGA AACGTGCTTG TGGAGCCGAC TTTCGTGCCG
AGGCCCGGCG GCGACGCCGA AGACGACGGC TGGGTGATCA CCGTCGGCTA CCAGGAAAGC
CACCACCGCA GCCGGCTCAT GGTGTTCGAC GCCGCCGGAA TCGCCGACGG CCCCGTCGCC
GAAGCGTGGC TGCCGTTCCA TGTCCCGATG AGCTATCACG GCACATTCAC CTCGCGGGTC
GCCCGCTGA
 
Protein sequence
MVAGTRYWDD IDGEFDIPVE EITGTLPEGL TGTLYRNGSG RWNIGSSPVE SIYDADGMVS 
AFVLNGAGVR FRNRFVRTRH FVQSTRAGRP IGRGFSAQRP GGIRANALRL PVNTANTSVM
MNNDRLLALW EGGRPHELDP DSLDTYGMSD LGGALRGPVG AYSAHYTLDP ETGAKVNFGF
DPYFPRIDLR WALRGVSGAE RRRRLRDLVG EAVPRVRLRL YETDTDGVTR YLRAVPLPGM
GIVHDMALTR RYAIFVVSPM RINPWALTGH ASFWDSMRFL TDTPTYFLLA PRSGGPVRTV
ETDAFYHWHF TNAYDDGDDV VVELPRFAPH TYEGVKNYTA HVRSGSAQVG GADPSDAVVL
TRFRIGSSGR VTREPLADFG CEFPQIDPRR STRRHDYSYV AVQDPVRFQG QGIARIDHRS
GAGQVFCPPG NVLVEPTFVP RPGGDAEDDG WVITVGYQES HHRSRLMVFD AAGIADGPVA
EAWLPFHVPM SYHGTFTSRV AR