Gene Mmcs_1689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_1689 
Symbol 
ID4110524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1827168 
End bp1828568 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content60% 
IMG OID638030809 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_638855 
Protein GI108798658 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.550846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTACTG TCGGTAAGAA CGACATTCAG CAACTGGTAG CGGCCGGCCG AGAAGGCCTT 
GCTAAGGGCC GTCTGCCCGC CGGGCTGGTC GCCAACGCAG AACTCCACAA GCTCGAAGCT
CAGCGAGTCT TCGGCCGGTG CTGGCAATTC CTGGCCCACG AGACGGAGAT CCCCCAGGCA
GGGGACTACG TGGTCCGATA TCTGGGTGGC GGTTCGATCA TCGTTGTCCG CGGCGAAGAC
GGCGAAGTGC GCGCCATGGC GAACTCGTGT CGGCACCGCG GAACAATGCT GTGCCGCACG
GAGATGGGCA ACACTTCGCA CTTCCGCTGC CCCTATCACG GCTGGACCTA CCGCAATACC
GGAACTCTGG CGGGTGTACC CGCACAAAAA GAGGTCTATG GGGTCGAGAT GGACAAGAAC
GAGTGGAGTC TTACCCAGGT TCCGCGCCTC GAGAACTACG GCGGAATGAT ATTCGGTTGC
CTGGACGAGA AGGCAGAACC TCTCGTTGAT TATTTGGGCG ATATGGCGTG GTATCTGGAC
CTGATCACCC AGAAGTCCAA GGGTGGACTG GAGGTGCGGG GTGAGCCCCA GCGTTGGATC
ATCGACTCCA ACTGGAAGCT CGGCGCGGAC AACTTTGTCG GGGACGCCTA CCACACGTTG
ATGACGCACC GATCGGCGGT CGAGCTCGGT CTGGCTCCGC CCGATCCGAA ATTCGCATCG
GAGCCGGCGC ATATCAGTCT CTCCAACGGT CACGGCCTCG GCGTCCTCGG GGTAACGCCC
GGGCAACCGA TGCCGCCCTT TATGAACTAT CCACCCGAGA TCGTCGATGG ACTCGCAGCG
GCTTACGGCG ATCAGGACCG CGCAGACATG CTTCAGCGTT CGGCCTTCAT TCACGGCACG
GTCTTTCCCA ACCTGTCGTT CCTCAACGTC CTCATCGGTA GGGACAAGAA GTCAATGCCA
GTGCCGATGT TGACATTTCG GCTGTGGCGT CCGCTGTCAC ACGACACGAT GGAAGTCTGG
TCGTGGTTTC TCGTCGAGAA GGATGCCGAC GAAGAGTTCA AACAGCAGTC GTATGAGACC
TACGTACGAA CGTTCGGCAT CTCCGGTGTG TTCGAACAGG ACGACGCCGA GACTTGGCGC
TCCATCACTG CGGGAACGCA AGGCATTCTC GCAGGCAGCC AGACACTCAA CTTCGAGATG
GGCATGGGTG TGCTGACCAG CGACGACACG TGGAAGGGGC CCGGTCGTCC CCTGTCCAGC
GGGTACGCGG AGCGTAACCA ACGCGAATTC TGGGGTCGCC TGTTGGAGTT ACTCACCGAC
TCAGGCGATG ACGCCAGCGA AACCGAGCCC AAACCCCAAC TACTCGCGCA ATCTCGGACC
AATGCAGACG AGGTCGCCTG A
 
Protein sequence
MSTVGKNDIQ QLVAAGREGL AKGRLPAGLV ANAELHKLEA QRVFGRCWQF LAHETEIPQA 
GDYVVRYLGG GSIIVVRGED GEVRAMANSC RHRGTMLCRT EMGNTSHFRC PYHGWTYRNT
GTLAGVPAQK EVYGVEMDKN EWSLTQVPRL ENYGGMIFGC LDEKAEPLVD YLGDMAWYLD
LITQKSKGGL EVRGEPQRWI IDSNWKLGAD NFVGDAYHTL MTHRSAVELG LAPPDPKFAS
EPAHISLSNG HGLGVLGVTP GQPMPPFMNY PPEIVDGLAA AYGDQDRADM LQRSAFIHGT
VFPNLSFLNV LIGRDKKSMP VPMLTFRLWR PLSHDTMEVW SWFLVEKDAD EEFKQQSYET
YVRTFGISGV FEQDDAETWR SITAGTQGIL AGSQTLNFEM GMGVLTSDDT WKGPGRPLSS
GYAERNQREF WGRLLELLTD SGDDASETEP KPQLLAQSRT NADEVA