Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_1689 |
Symbol | |
ID | 4110524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | + |
Start bp | 1827168 |
End bp | 1828568 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638030809 |
Product | ring hydroxylating dioxygenase, alpha subunit |
Protein accession | YP_638855 |
Protein GI | 108798658 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.550846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTACTG TCGGTAAGAA CGACATTCAG CAACTGGTAG CGGCCGGCCG AGAAGGCCTT GCTAAGGGCC GTCTGCCCGC CGGGCTGGTC GCCAACGCAG AACTCCACAA GCTCGAAGCT CAGCGAGTCT TCGGCCGGTG CTGGCAATTC CTGGCCCACG AGACGGAGAT CCCCCAGGCA GGGGACTACG TGGTCCGATA TCTGGGTGGC GGTTCGATCA TCGTTGTCCG CGGCGAAGAC GGCGAAGTGC GCGCCATGGC GAACTCGTGT CGGCACCGCG GAACAATGCT GTGCCGCACG GAGATGGGCA ACACTTCGCA CTTCCGCTGC CCCTATCACG GCTGGACCTA CCGCAATACC GGAACTCTGG CGGGTGTACC CGCACAAAAA GAGGTCTATG GGGTCGAGAT GGACAAGAAC GAGTGGAGTC TTACCCAGGT TCCGCGCCTC GAGAACTACG GCGGAATGAT ATTCGGTTGC CTGGACGAGA AGGCAGAACC TCTCGTTGAT TATTTGGGCG ATATGGCGTG GTATCTGGAC CTGATCACCC AGAAGTCCAA GGGTGGACTG GAGGTGCGGG GTGAGCCCCA GCGTTGGATC ATCGACTCCA ACTGGAAGCT CGGCGCGGAC AACTTTGTCG GGGACGCCTA CCACACGTTG ATGACGCACC GATCGGCGGT CGAGCTCGGT CTGGCTCCGC CCGATCCGAA ATTCGCATCG GAGCCGGCGC ATATCAGTCT CTCCAACGGT CACGGCCTCG GCGTCCTCGG GGTAACGCCC GGGCAACCGA TGCCGCCCTT TATGAACTAT CCACCCGAGA TCGTCGATGG ACTCGCAGCG GCTTACGGCG ATCAGGACCG CGCAGACATG CTTCAGCGTT CGGCCTTCAT TCACGGCACG GTCTTTCCCA ACCTGTCGTT CCTCAACGTC CTCATCGGTA GGGACAAGAA GTCAATGCCA GTGCCGATGT TGACATTTCG GCTGTGGCGT CCGCTGTCAC ACGACACGAT GGAAGTCTGG TCGTGGTTTC TCGTCGAGAA GGATGCCGAC GAAGAGTTCA AACAGCAGTC GTATGAGACC TACGTACGAA CGTTCGGCAT CTCCGGTGTG TTCGAACAGG ACGACGCCGA GACTTGGCGC TCCATCACTG CGGGAACGCA AGGCATTCTC GCAGGCAGCC AGACACTCAA CTTCGAGATG GGCATGGGTG TGCTGACCAG CGACGACACG TGGAAGGGGC CCGGTCGTCC CCTGTCCAGC GGGTACGCGG AGCGTAACCA ACGCGAATTC TGGGGTCGCC TGTTGGAGTT ACTCACCGAC TCAGGCGATG ACGCCAGCGA AACCGAGCCC AAACCCCAAC TACTCGCGCA ATCTCGGACC AATGCAGACG AGGTCGCCTG A
|
Protein sequence | MSTVGKNDIQ QLVAAGREGL AKGRLPAGLV ANAELHKLEA QRVFGRCWQF LAHETEIPQA GDYVVRYLGG GSIIVVRGED GEVRAMANSC RHRGTMLCRT EMGNTSHFRC PYHGWTYRNT GTLAGVPAQK EVYGVEMDKN EWSLTQVPRL ENYGGMIFGC LDEKAEPLVD YLGDMAWYLD LITQKSKGGL EVRGEPQRWI IDSNWKLGAD NFVGDAYHTL MTHRSAVELG LAPPDPKFAS EPAHISLSNG HGLGVLGVTP GQPMPPFMNY PPEIVDGLAA AYGDQDRADM LQRSAFIHGT VFPNLSFLNV LIGRDKKSMP VPMLTFRLWR PLSHDTMEVW SWFLVEKDAD EEFKQQSYET YVRTFGISGV FEQDDAETWR SITAGTQGIL AGSQTLNFEM GMGVLTSDDT WKGPGRPLSS GYAERNQREF WGRLLELLTD SGDDASETEP KPQLLAQSRT NADEVA
|
| |