Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_1666 |
Symbol | |
ID | 4110501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | + |
Start bp | 1803818 |
End bp | 1805209 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638030786 |
Product | ring hydroxylating dioxygenase, alpha subunit |
Protein accession | YP_638832 |
Protein GI | 108798635 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCTC ACGTTCTAGG GGCACAAATC GATAGGAAGG TGCGCCCGGT GGATTCGATG GCGCCTGATG CGACGACAAT GCGAACCTTA GAGAATGCGC GCGGCTCCAT CCTAAAGGGT CGCCTCCCTG CGTCTCTCAT CGCTAATGCA GCGCTTTACG AGCTTGAATT GAAGCGAGTA TTTGGTAGGA CCTGGCAGTT TCTCTGCCAC GAAGACGAGA TCCCCAATGC GGGTGACTAT GTAGTGCGCT ACATCGCTGA TAACTCAATT ATTGTCGCGC GGCAGCAGGA TATGACGATT CGGGCGATGT CGAACTCGTG TCGGCACCGC GGCACGCTGC TTTGCCGAAC CGAGTCTGGG AATGAGTCGG CGTTCCAGTG TCCGTACCAC GGTTGGACCT ATCGAAACAA CGGTGATCTC ATCGCGATAC CTGCGCAGCA GGCAGTGTAC GGTGCTGCGT TCGACAAGAG TCGGCTAGGG TTGCGCGCTC TGCCGATGCT GGACTCGTAC GCGGGCCTTG TCTTCGGGTG TGTGTCGGAT GAGGCGCCGG GACTGGATGA GTACCTCGGG GACATGCGCT GGTATCTCGA CTTGATGATG AAGAAGAGCC CGACCGGCCT TGAGGCGTGG GGTGCCCCGC AGCGTTGGGT GATTGACGCG AACTGGAAGA CCGGCGCCGA TAACTTTGTT GGGGACGGCT ATCACACGGT CATGACGCAC CGTTCGATGT GTGAGCTGGG GTTGTTACCG CCCGATAATG TGGCCGTTTC GCCGGCCCAC GTCAGCCTAT CGGGCGGGCA CGGGGCGGGC GTTCTAGGCG CACCACCCGG CATACCCGCA CCGCCGTACA TGGGCTATCC GGAGGAAGTC GTCTCCGGTC TCAGCGAGGG TTACGGCGAT GACGTCCATG GCGAGTTGCT GAAACGGACG ATGTTCATTC ATGGCAATGT GTTCCCGAAC TTGTCCTTCT TGAACGCCTT CATCGCCAAG GACGGGGAGT CTATGCCGGT GCCCATTCTG ACCTTGCGGC AATGGCGTCC CTTGGACGCA GCGCGTATGG AGGTGTGGTC GTGGTTCTTC GTGGAGCGCA ACGCGCCCGA AGAGTTCAAG CAGCAGTCGT TTGAGACTTA TGTTCGGACG TTCGGGGTCG GGGGTGTCTT CGAGCAGGAT GACGCCGAGA TATTCCAGGC TATTACCAAG GGAACACGCG GCGAGTTGGC TGGTGGTGTG GAGCTGAACC TGGAGATGGG ACTGGACAAT CTGGCTCCTG ATCCAACGTG GCTGGGCCCG GGACGACCGT TGGCCAGTGG CTACGCCGAA CAGAATCAGC GCGAGTACTG GAAGCAGTAC TTCGACTATC TGGCCACACC GAGAAGGGAT GAGAACGTAT GA
|
Protein sequence | MSAHVLGAQI DRKVRPVDSM APDATTMRTL ENARGSILKG RLPASLIANA ALYELELKRV FGRTWQFLCH EDEIPNAGDY VVRYIADNSI IVARQQDMTI RAMSNSCRHR GTLLCRTESG NESAFQCPYH GWTYRNNGDL IAIPAQQAVY GAAFDKSRLG LRALPMLDSY AGLVFGCVSD EAPGLDEYLG DMRWYLDLMM KKSPTGLEAW GAPQRWVIDA NWKTGADNFV GDGYHTVMTH RSMCELGLLP PDNVAVSPAH VSLSGGHGAG VLGAPPGIPA PPYMGYPEEV VSGLSEGYGD DVHGELLKRT MFIHGNVFPN LSFLNAFIAK DGESMPVPIL TLRQWRPLDA ARMEVWSWFF VERNAPEEFK QQSFETYVRT FGVGGVFEQD DAEIFQAITK GTRGELAGGV ELNLEMGLDN LAPDPTWLGP GRPLASGYAE QNQREYWKQY FDYLATPRRD ENV
|
| |