Gene Mvan_0546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0546 
Symbol 
ID4644282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp587669 
End bp589069 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content60% 
IMG OID639804051 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_951396 
Protein GI120401567 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATTG TCGGTAAGAA CGACATTCAG CAACTGGTAG CGGCGGGCCA AGAAGGCCTT 
GCTAAGGGCC GTCTGCCCGC CGGGCTGGTC GCCAACGCAG AACTCCACAA GCTCGAAGCT
CAGCGAGTCT TCGGCCGGTG CTGGCAATTC CTGGCCCACG AGACGGAGAT CCCCCAGGCA
GGGGACTACG TGGTCCGATA TCTGGGTGGC GGTTCGATCA TCGTTGTCCG CGGCGAAGAC
GGCGAAGTGC GCGCCATGGC GAACTCGTGT CGGCACCGCG GAACAATGCT GTGCCGCACG
GAGATGGGCA ACACTTCGCA CTTCCGCTGC CCCTATCACG GCTGGACCTA CCGCAATACC
GGAACTCTGG CGGGTGTACC CGCACAAAAA GAGGTCTATG GGGTCGAGAT GGACAAGAAC
GAGTGGAGTC TTACCCAGGT TCCGCGCCTC GAGAACTACC GCGGAATGAT ATTCGGTTGC
CTGGACGAGA AGGCAGAACC TCTCGTTGAT TATTTGGGCG ATATGGCGTG GTATCTGGAC
CTGATCACCC AGAAGTCCAA GGGTGGACTG GAGGTGCGGG GTGAGCCCCA GCGTTGGATC
ATCGACTCCA ACTGGAAGCT CGGCGCGGAC AACTTTGTCG GGGACGCCTA CCACACGTTG
ATGACGCACC GATCGGCGGT CGAGCTCGGT CTGGCTCCGC CCGATCCAAA ATTCGCGTCG
GAGCCGGCGC ATATCAGTCT CTCCAACGGT CACGGCCTCG GCGTCCTCGG GGTGCCGCCC
GGGCAACCGA TGCCGCCCTT TATGAACTAT CCACCCGAGG TCGTCGATGG ACTCGCAGCG
GCTTACGGCG ATCAGGACCG CGCAGACATG CTTCAGCGTT CGGCCTTCAT TCACGGCACG
GTCTTTCCCA ACCTGTCGTT CCTCAACGTC CTCATCGGTA GGGACAAGAA GTCAATGCCA
GTGCCGATGT TGACATTTCG GCTGTGGCGT CCACTGTCAC ACGACACGAT GGAAGTTTGG
TCGTGGTTTC TCGTCGAGAA GGATGCCGAC GAAGAGTTCA AACAGCAGTC GTATGAGACC
TACGTACGAA CGTTCGGCAT CTCCGGTGTG TTCGAACAGG ACGACGCCGA GACTTGGCGC
TCCATCACTG CGGGAACGCA AGGCATTCTC GCAGGCAGCC AGACACTCAA CTTCGAGATG
GGCATGGGTG TGCTGACCAG CGACGACACG TGGAAGGGGC CCGGTCGTCC CCTGTCCAGC
GGGTACGCGG AGCGTAACCA ACGCGAATTC TGGGGTCGCC TGTTGGAGTT ACTCACCGAC
TCAGGCGATG ACGCCAGCGA AACCGAGCCC AAACCCCAAC TACTCGCGCA ATCTCGGACC
AATACAGACG AGGTCGCCTG A
 
Protein sequence
MSIVGKNDIQ QLVAAGQEGL AKGRLPAGLV ANAELHKLEA QRVFGRCWQF LAHETEIPQA 
GDYVVRYLGG GSIIVVRGED GEVRAMANSC RHRGTMLCRT EMGNTSHFRC PYHGWTYRNT
GTLAGVPAQK EVYGVEMDKN EWSLTQVPRL ENYRGMIFGC LDEKAEPLVD YLGDMAWYLD
LITQKSKGGL EVRGEPQRWI IDSNWKLGAD NFVGDAYHTL MTHRSAVELG LAPPDPKFAS
EPAHISLSNG HGLGVLGVPP GQPMPPFMNY PPEVVDGLAA AYGDQDRADM LQRSAFIHGT
VFPNLSFLNV LIGRDKKSMP VPMLTFRLWR PLSHDTMEVW SWFLVEKDAD EEFKQQSYET
YVRTFGISGV FEQDDAETWR SITAGTQGIL AGSQTLNFEM GMGVLTSDDT WKGPGRPLSS
GYAERNQREF WGRLLELLTD SGDDASETEP KPQLLAQSRT NTDEVA