Gene Mflv_0537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_0537 
Symbol 
ID4976587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp566003 
End bp567403 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content60% 
IMG OID640454740 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_001131817 
Protein GI145221139 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.689442 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACTG TCGGTAAGAA CGACATTCAG CAACTGGTAG CGGCCGGCCG AGAAGGCCTT 
GCTAAGGGCC GTCTGCCCGC CGGGCTGGTC GCCAACGCAG AACTCCACAA GCTCGAAGCT
CAGCGAGTCT TCGGCCGGTG CTGGCAATTC CTGGCCCACG AGACGGAGAT CCCCCAGGCA
GGGGACTACG TGGTCCGATA TCTGGGTGGC GGTTCGATCA TCGTTGTCCG CGGCGAAGAC
GGCGAAGTGC GCGCCATGGC GAACTCGTGT CGGCACCGCG GAACAATGCT GTGCCGCACG
GAGATGGGCA ACACTTCGCA CTTCCGCTGC CCCTATCACG GCTGGACCTA CCGCAATACC
GGAACTCTGG CGGGTGTACC CGCACAAAAA GAGGTCTATG GGGTCGAGAT GGACAAGAAC
GAGTGGAGTC TTACCCAGGT TCCGCGCCTC GAGAACTACC GCGGAATGAT ATTCGGTTGC
CTGGACGAGA AGGCAGAACC TCTCGTTGAT TATTTGGGCG ATATGGCGTG GTATCTGGAC
CTGATCACCC AGAAGTCCAA GGGTGGACTG GAGGTGCGGG GTGAGCCCCA GCGTTGGATC
ATCGACTCCA ACTGGAAGCT CGGCGCGGAC AACTTTGTCG GGGACGCCTA CCACACGTTG
ATGACGCACC GATCGGCGGT CGAGCTCGGT CTGGCTCCGC CCGATCCGAA ATTCGCATCG
GAGCCGGCGC ATATCAGTCT CTCCAACGGT CACGGCCTCG GCGTCCTCGG GGTAACGCCC
GGGCAACCGA TGCCGCCCTT TATGAACTAT CCACCCGAGG TCGTCGATGG ACTCGCAGCG
GCTTACGGCG ATCAGGACCG CGCAGACATG CTTCAGCGTT CGGCCTTCAT TCACGGCACG
GTCTTTCCCA ACCTGTCGTT CCTCAACGTC CTCATCGGTA GGGACAAGAA GTCAATGCCA
GTGCCGATGT TGACATTTCG GCTGTGGCGT CCACTGTCAC ACGACACGAT GGAAGTCTGG
TCGTGGTTTC TCGTCGAGAA GGATGCCGAC GAAGAGTTCA AACAGCAGTC GTATGAGACC
TACGTACGAA CGTTCGGCAT CTCCGGTGTG TTCGAACAGG ACGACGCCGA GACTTGGCGC
TCCATCACTG CGGGAACGCA AGGCATTCTC GCAGGCAGCC AGACACTCAA CTTCGAGATG
GGCATGGGTG TGCTGACCAG CGACGACACG TGGAAGGGGC CCGGTCGTCC CCTGTCCAGC
GGGTACGCGG AGCGTAACCA ACGCGAATTC TGGGGTCGCC TGTTGGAGTT ACTCACCGAC
TCAGGCGATG ACGCCAGCGA AACCGAGCCC AAACCCCAAC TACTCGCGCA ATCTCGGACC
AATGCAGACG AGGTCGCCTG A
 
Protein sequence
MSTVGKNDIQ QLVAAGREGL AKGRLPAGLV ANAELHKLEA QRVFGRCWQF LAHETEIPQA 
GDYVVRYLGG GSIIVVRGED GEVRAMANSC RHRGTMLCRT EMGNTSHFRC PYHGWTYRNT
GTLAGVPAQK EVYGVEMDKN EWSLTQVPRL ENYRGMIFGC LDEKAEPLVD YLGDMAWYLD
LITQKSKGGL EVRGEPQRWI IDSNWKLGAD NFVGDAYHTL MTHRSAVELG LAPPDPKFAS
EPAHISLSNG HGLGVLGVTP GQPMPPFMNY PPEVVDGLAA AYGDQDRADM LQRSAFIHGT
VFPNLSFLNV LIGRDKKSMP VPMLTFRLWR PLSHDTMEVW SWFLVEKDAD EEFKQQSYET
YVRTFGISGV FEQDDAETWR SITAGTQGIL AGSQTLNFEM GMGVLTSDDT WKGPGRPLSS
GYAERNQREF WGRLLELLTD SGDDASETEP KPQLLAQSRT NADEVA