Gene Mkms_1716 details


Gene Information

Locus tag: Mkms_1716
Symbol:
ID: 4614004
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 1832737
End bp: 1834137
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 60%
IMG OID: 639791383
Product: ring hydroxylating dioxygenase, alpha subunit
Protein accession: YP_937709
Protein GI: 119867757
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
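
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1832737
end_bp = 1834137

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1401
print(protein_length)   # 466
```

Both results match the Gene Length and Protein Length fields listed in the record.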


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.77849
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTACTG TCGGTAAGAA CGACATTCAG CAACTGGTAG CGGCCGGCCG AGAAGGCCTT 
GCTAAGGGCC GTCTGCCCGC CGGGCTGGTC GCCAACGCAG AACTCCACAA GCTCGAAGCT
CAGCGAGTCT TCGGCCGGTG CTGGCAATTC CTGGCCCACG AGACGGAGAT CCCCCAGGCA
GGGGACTACG TGGTCCGATA TCTGGGTGGC GGTTCGATCA TCGTTGTCCG CGGCGAAGAC
GGCGAAGTGC GCGCCATGGC GAACTCGTGT CGGCACCGCG GAACAATGCT GTGCCGCACG
GAGATGGGCA ACACTTCGCA CTTCCGCTGC CCCTATCACG GCTGGACCTA CCGCAATACC
GGAACTCTGG CGGGTGTACC CGCACAAAAA GAGGTCTATG GGGTCGAGAT GGACAAGAAC
GAGTGGAGTC TTACCCAGGT TCCGCGCCTC GAGAACTACG GCGGAATGAT ATTCGGTTGC
CTGGACGAGA AGGCAGAACC TCTCGTTGAT TATTTGGGCG ATATGGCGTG GTATCTGGAC
CTGATCACCC AGAAGTCCAA GGGTGGACTG GAGGTGCGGG GTGAGCCCCA GCGTTGGATC
ATCGACTCCA ACTGGAAGCT CGGCGCGGAC AACTTTGTCG GGGACGCCTA CCACACGTTG
ATGACGCACC GATCGGCGGT CGAGCTCGGT CTGGCTCCGC CCGATCCGAA ATTCGCATCG
GAGCCGGCGC ATATCAGTCT CTCCAACGGT CACGGCCTCG GCGTCCTCGG GGTAACGCCC
GGGCAACCGA TGCCGCCCTT TATGAACTAT CCACCCGAGA TCGTCGATGG ACTCGCAGCG
GCTTACGGCG ATCAGGACCG CGCAGACATG CTTCAGCGTT CGGCCTTCAT TCACGGCACG
GTCTTTCCCA ACCTGTCGTT CCTCAACGTC CTCATCGGTA GGGACAAGAA GTCAATGCCA
GTGCCGATGT TGACATTTCG GCTGTGGCGT CCGCTGTCAC ACGACACGAT GGAAGTCTGG
TCGTGGTTTC TCGTCGAGAA GGATGCCGAC GAAGAGTTCA AACAGCAGTC GTATGAGACC
TACGTACGAA CGTTCGGCAT CTCCGGTGTG TTCGAACAGG ACGACGCCGA GACTTGGCGC
TCCATCACTG CGGGAACGCA AGGCATTCTC GCAGGCAGCC AGACACTCAA CTTCGAGATG
GGCATGGGTG TGCTGACCAG CGACGACACG TGGAAGGGGC CCGGTCGTCC CCTGTCCAGC
GGGTACGCGG AGCGTAACCA ACGCGAATTC TGGGGTCGCC TGTTGGAGTT ACTCACCGAC
TCAGGCGATG ACGCCAGCGA AACCGAGCCC AAACCCCAAC TACTCGCGCA ATCTCGGACC
AATGCAGACG AGGTCGCCTG A
 
Protein sequence
MSTVGKNDIQ QLVAAGREGL AKGRLPAGLV ANAELHKLEA QRVFGRCWQF LAHETEIPQA 
GDYVVRYLGG GSIIVVRGED GEVRAMANSC RHRGTMLCRT EMGNTSHFRC PYHGWTYRNT
GTLAGVPAQK EVYGVEMDKN EWSLTQVPRL ENYGGMIFGC LDEKAEPLVD YLGDMAWYLD
LITQKSKGGL EVRGEPQRWI IDSNWKLGAD NFVGDAYHTL MTHRSAVELG LAPPDPKFAS
EPAHISLSNG HGLGVLGVTP GQPMPPFMNY PPEIVDGLAA AYGDQDRADM LQRSAFIHGT
VFPNLSFLNV LIGRDKKSMP VPMLTFRLWR PLSHDTMEVW SWFLVEKDAD EEFKQQSYET
YVRTFGISGV FEQDDAETWR SITAGTQGIL AGSQTLNFEM GMGVLTSDDT WKGPGRPLSS
GYAERNQREF WGRLLELLTD SGDDASETEP KPQLLAQSRT NADEVA
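
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a simple GC fraction helper; the function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as a short demonstration.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence above, used only as a short demo.
first_line = "ATGTCTACTG TCGGTAAGAA CGACATTCAG CAACTGGTAG CGGCCGGCCG AGAAGGCCTT"
last_line = "AATGCAGACG AGGTCGCCTG A"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(head[:3])    # start codon: ATG
print(tail[-3:])   # stop codon: TGA
print(len(head))   # 60 bases in the first line
```

Passing the complete gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field of the record.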