Gene Msed_0996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0996 
Symbol 
ID5104545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp919242 
End bp920708 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content51% 
IMG OID640506895 
Product4-hydroxyphenylacetate 3-hydroxylase 
Protein accessionYP_001191088 
Protein GI146303772 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2368] Aromatic ring hydroxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0222064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.012 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAGGA AAGGGAGTGA CTACATAGAG AGTATAAGCA AGAATCCGCC TGTGACCTAC 
TACGAAGGGG AAGTTGTGAG CGACGTTGTA AATCATCCCG CGTTCAGGAT CCCGGTTAAG
ACTGTGGCCA GCTACTACGA CCTTCACTGG AAAGTTGACG GGTTGAGGGT CTACAACAGA
GACGTCGGTG AGGAAACTAG CATAAGCTTA GTCAGACCCA GGAGCAAGGA GGACCTCCTG
AAGTTAGGGG AGGGGCTGGT AAAGATCTAC GAGTTCTATA GGGGTTTCTT CGGAAGAAGC
CCAGATTACA TGAACCTATG GACCATGGTC TTCTTCGCCC ACGCTGAGGA CTACTTCGGG
AAACACTTCG GTTCGAGGTT CATGGAGAAC GCCATGGAAA TCTATCGGGA ATCCACAAGA
AAGGACCACT TTTACACACA CGCCATAGTT GCCCCGATGT ATGACAGATC TAGACCTCCG
TCTCAGTGGG AAGATCCGTA TATACAGATA GGTATCACAG AGGAGAGGCC AGAGGGGGTT
GTGGTCAGGG GTGCAGCCAT GATTTGTACG GCAGGACCCT ACGCCGAGAT GCTGTGGTAT
CTTCCGAACA TGAGGAGGGA CTCTGACCCT AGGTATGCCC TCTACTTCTC AATTCCCACA
ACTACCAAGG GAGTAAGGTT CATAGCGAGG AGAGGGTTCC AACCAAGGGA GGGTGGCGAG
TTTGAGTATC CCATCTCCTC AAGGTGGGAC GAGGCTGATG CAATCCTGGT CTTGGATAAC
GTGTTAGTCC CGTGGGACAG GATCATATTC TTCAAGAAAC CTGAGCTCAT TGAGGACCTC
ATGTGGCATA CTGTGGGGCT CAGGGGATGG TTTAACTGGC ACTTCATGAT ACAGCACTAC
ACCAGGCTTA AGTTTTTGGC AGGGCTAGCC ATGACTATCA CCGAGGCAGC CGGCACAAGT
ACCTTCATCA ACGTTCAAGA GAAAATAGGC GAGATCCTCC TATACGTCGC ACTTAACGAG
GCTGCCCTCT ACGGTTCCGT GGCCAGAGCC CAGGAATTAC CCAACATCGT GAGACCCGAT
CCCTATATCT CAATCTCAGC TAGTCACTTC AACATGAAGG CAGTACCACG GGCAAACGAG
ATCTTGAGGC TCATAAGTGC AGGTTCATCC ATACCAATAC CCGCCGGAGC AAAGGACTTC
ACGAACCCCG AGGAAAGGGC TTACCTAGAG AAGTACATGG CAATGAAGGG ATTCGATGCT
CTTGAGAGAG TCAAGACCTT CAACCTCCTT TGGGACGTGA TAGGGTCGGA GGTAGGAATG
AGATATGAGC AATACGACCG GTTCAGTAGG GGTGATCCCA CGATCAGGTG GGCCCAGACT
TACACTGAGG TATTCAGGGA TAGGAGAAAC GAGTTCGTGA AACTGGTCAA GGAAATTCTG
GATCAGATGC CCAATCCAAA GGCCTAA
 
Protein sequence
MIRKGSDYIE SISKNPPVTY YEGEVVSDVV NHPAFRIPVK TVASYYDLHW KVDGLRVYNR 
DVGEETSISL VRPRSKEDLL KLGEGLVKIY EFYRGFFGRS PDYMNLWTMV FFAHAEDYFG
KHFGSRFMEN AMEIYRESTR KDHFYTHAIV APMYDRSRPP SQWEDPYIQI GITEERPEGV
VVRGAAMICT AGPYAEMLWY LPNMRRDSDP RYALYFSIPT TTKGVRFIAR RGFQPREGGE
FEYPISSRWD EADAILVLDN VLVPWDRIIF FKKPELIEDL MWHTVGLRGW FNWHFMIQHY
TRLKFLAGLA MTITEAAGTS TFINVQEKIG EILLYVALNE AALYGSVARA QELPNIVRPD
PYISISASHF NMKAVPRANE ILRLISAGSS IPIPAGAKDF TNPEERAYLE KYMAMKGFDA
LERVKTFNLL WDVIGSEVGM RYEQYDRFSR GDPTIRWAQT YTEVFRDRRN EFVKLVKEIL
DQMPNPKA