Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0996 |
Symbol | |
ID | 5104545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 919242 |
End bp | 920708 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640506895 |
Product | 4-hydroxyphenylacetate 3-hydroxylase |
Protein accession | YP_001191088 |
Protein GI | 146303772 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2368] Aromatic ring hydroxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0222064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.012 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAAGGA AAGGGAGTGA CTACATAGAG AGTATAAGCA AGAATCCGCC TGTGACCTAC TACGAAGGGG AAGTTGTGAG CGACGTTGTA AATCATCCCG CGTTCAGGAT CCCGGTTAAG ACTGTGGCCA GCTACTACGA CCTTCACTGG AAAGTTGACG GGTTGAGGGT CTACAACAGA GACGTCGGTG AGGAAACTAG CATAAGCTTA GTCAGACCCA GGAGCAAGGA GGACCTCCTG AAGTTAGGGG AGGGGCTGGT AAAGATCTAC GAGTTCTATA GGGGTTTCTT CGGAAGAAGC CCAGATTACA TGAACCTATG GACCATGGTC TTCTTCGCCC ACGCTGAGGA CTACTTCGGG AAACACTTCG GTTCGAGGTT CATGGAGAAC GCCATGGAAA TCTATCGGGA ATCCACAAGA AAGGACCACT TTTACACACA CGCCATAGTT GCCCCGATGT ATGACAGATC TAGACCTCCG TCTCAGTGGG AAGATCCGTA TATACAGATA GGTATCACAG AGGAGAGGCC AGAGGGGGTT GTGGTCAGGG GTGCAGCCAT GATTTGTACG GCAGGACCCT ACGCCGAGAT GCTGTGGTAT CTTCCGAACA TGAGGAGGGA CTCTGACCCT AGGTATGCCC TCTACTTCTC AATTCCCACA ACTACCAAGG GAGTAAGGTT CATAGCGAGG AGAGGGTTCC AACCAAGGGA GGGTGGCGAG TTTGAGTATC CCATCTCCTC AAGGTGGGAC GAGGCTGATG CAATCCTGGT CTTGGATAAC GTGTTAGTCC CGTGGGACAG GATCATATTC TTCAAGAAAC CTGAGCTCAT TGAGGACCTC ATGTGGCATA CTGTGGGGCT CAGGGGATGG TTTAACTGGC ACTTCATGAT ACAGCACTAC ACCAGGCTTA AGTTTTTGGC AGGGCTAGCC ATGACTATCA CCGAGGCAGC CGGCACAAGT ACCTTCATCA ACGTTCAAGA GAAAATAGGC GAGATCCTCC TATACGTCGC ACTTAACGAG GCTGCCCTCT ACGGTTCCGT GGCCAGAGCC CAGGAATTAC CCAACATCGT GAGACCCGAT CCCTATATCT CAATCTCAGC TAGTCACTTC AACATGAAGG CAGTACCACG GGCAAACGAG ATCTTGAGGC TCATAAGTGC AGGTTCATCC ATACCAATAC CCGCCGGAGC AAAGGACTTC ACGAACCCCG AGGAAAGGGC TTACCTAGAG AAGTACATGG CAATGAAGGG ATTCGATGCT CTTGAGAGAG TCAAGACCTT CAACCTCCTT TGGGACGTGA TAGGGTCGGA GGTAGGAATG AGATATGAGC AATACGACCG GTTCAGTAGG GGTGATCCCA CGATCAGGTG GGCCCAGACT TACACTGAGG TATTCAGGGA TAGGAGAAAC GAGTTCGTGA AACTGGTCAA GGAAATTCTG GATCAGATGC CCAATCCAAA GGCCTAA
|
Protein sequence | MIRKGSDYIE SISKNPPVTY YEGEVVSDVV NHPAFRIPVK TVASYYDLHW KVDGLRVYNR DVGEETSISL VRPRSKEDLL KLGEGLVKIY EFYRGFFGRS PDYMNLWTMV FFAHAEDYFG KHFGSRFMEN AMEIYRESTR KDHFYTHAIV APMYDRSRPP SQWEDPYIQI GITEERPEGV VVRGAAMICT AGPYAEMLWY LPNMRRDSDP RYALYFSIPT TTKGVRFIAR RGFQPREGGE FEYPISSRWD EADAILVLDN VLVPWDRIIF FKKPELIEDL MWHTVGLRGW FNWHFMIQHY TRLKFLAGLA MTITEAAGTS TFINVQEKIG EILLYVALNE AALYGSVARA QELPNIVRPD PYISISASHF NMKAVPRANE ILRLISAGSS IPIPAGAKDF TNPEERAYLE KYMAMKGFDA LERVKTFNLL WDVIGSEVGM RYEQYDRFSR GDPTIRWAQT YTEVFRDRRN EFVKLVKEIL DQMPNPKA
|
| |