Gene Msed_1776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1776 
Symbol 
ID5104776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1718690 
End bp1720171 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content49% 
IMG OID640507674 
Product4-hydroxyphenylacetate 3-hydroxylase 
Protein accessionYP_001191855 
Protein GI146304539 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2368] Aromatic ring hydroxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.193392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCCGAA AGGGAATAGA TTTCATAAAA AGCATGAAGG AAGGACACCA CGGGGAGATA 
TACTATAATG GGGAGAAGGT AAGTGACGTA ACAGAGCATC CCGCATTTAG GGCTGCCATA
CACACGGTCT CAGACTACTA CGACCTTCAT TGGAAGGATG AATACAAGGA ATACCTGAGG
GTGTATAACC CTGACGTGGG GGAGGAGACC AGTATAACCT TCTTCAGGCC AAGGAATAAG
GAGGAGTTGA GAAGGCTGAG GATGGGGATA AGTAAGATTT ACGACTTCTA TAGGGGATTC
TTCGGTAGGA GTCCAGATTA CCTAAACGTA TGGACCATGT TATTCTACGC ACATGCTGAC
GACTACTTCG GTAAGGTGTT TGGAAATAAG ATGATGGAGA ACGTGATAGA GATATATAGG
GAGGCAGCTA AACAGGACCT TTTCTACACG CACGCCATTG TCGCCCCGAT GTATGATAGA
TCCAGACCAC CCTCACAATG GGAGGACCCC TACATACAGG TGGGCGTGGT CAGGGAAACC
ACCGAGGGAC TCGTGGTGAG GGGCGCCGCG CTACTATCCA CTGCAGGTCC CTACTCCGAA
AGGTTGTGGT ACCTGCCCAA CATCAAGAGG GACACCGACC CCAGGTATTC CGTGTTCTTC
TCACTTCCCA CGGAAAGTAA GGGTGTAAAG TTCATCTCCA GGAGGGGGTT CCATCCCAAG
GAGGAGTTCG GCGAGTTTGA GTACCCGATT ACCTCTAGAT ATGAGGAACC TGACGCCATT
ATGGTATTGG ACAACGTTTT GATACCTTGG GATAGGGTGA TCTTTTACAA GAAACCTGAG
GAGATAGAGG GGTTCATGTG GCATACGGTG AACCTGAGGG GATGGTTCAA CTGGCACTTC
GTGATTCAGC ACTACTCCAG GACGAAGTTC TTGGCAGGCC TGGCCATAGC CATTGCTGAG
GCCGTGGGGA TTAACAATTT CATCAATGTG CAGGAGAAGT TGGGAGAGAT CCTAATCTAC
CTAGCAATGT ATGAAGCTGG GATGGTGGCA TCAGAGGAAC TAGGGGAGCA ATTACCTGGA
GTATATAGAC CGAATCCGCA GATAGCCATA GCCACTAGCT CCATGGGAAT GAAGGCATTA
CCCAGGATAA ACGAGATACT TAGGTCCATA AGTGCTGGAT CTTCTATCCC CGTTCCAGCG
GGAATTAGGG ACTTTGAAAA CCCCGAGGAG AGAGCTCTCC TGGACAAGTA CCTAGCGTCC
AAGGGGTTGC CAGCCCTAGA GAGGGTGAAG CTGTTCAATA TTCTCTGGGA CACCATAGGT
TCTGAGACCG GGATGAGGTA TGAGCAGTAC GACAGGTTCA GTAGGGGAGA TCCCACAATT
AGGTGGGCTC AGATGTATAC CGAGGTGTAT AAGGACAGAA AGCAAGAGTT TGTGAAGATG
GTGAGGGACA TCATGGATCA GATGCCAAAC CCTAAGGCAT AA
 
Protein sequence
MIRKGIDFIK SMKEGHHGEI YYNGEKVSDV TEHPAFRAAI HTVSDYYDLH WKDEYKEYLR 
VYNPDVGEET SITFFRPRNK EELRRLRMGI SKIYDFYRGF FGRSPDYLNV WTMLFYAHAD
DYFGKVFGNK MMENVIEIYR EAAKQDLFYT HAIVAPMYDR SRPPSQWEDP YIQVGVVRET
TEGLVVRGAA LLSTAGPYSE RLWYLPNIKR DTDPRYSVFF SLPTESKGVK FISRRGFHPK
EEFGEFEYPI TSRYEEPDAI MVLDNVLIPW DRVIFYKKPE EIEGFMWHTV NLRGWFNWHF
VIQHYSRTKF LAGLAIAIAE AVGINNFINV QEKLGEILIY LAMYEAGMVA SEELGEQLPG
VYRPNPQIAI ATSSMGMKAL PRINEILRSI SAGSSIPVPA GIRDFENPEE RALLDKYLAS
KGLPALERVK LFNILWDTIG SETGMRYEQY DRFSRGDPTI RWAQMYTEVY KDRKQEFVKM
VRDIMDQMPN PKA