Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0933 |
Symbol | |
ID | 5104363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 859929 |
End bp | 860987 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506836 |
Product | hydrogenase expression/formation protein HypE |
Protein accession | YP_001191029 |
Protein GI | 146303713 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0309] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR02124] hydrogenase expression/formation protein HypE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.900318 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACTGC CCGATAGAAA GAACGTAATT ACCCTACTCC ACGGGGCTGG AGGGACGTAC ATGCATTCCC TGATAAGGGA CGTCTTTTTG AAGCTAAATG ATGGATTTGG CGAGGTGGGA CTAGAAATGA TGGACGATGC AGCCGTGGTT AATGGAATAG TGTTCACTAC GGATTCGTTC GTCATTAGGC CCATTTTCTT CAGAGGTGGA GATATAGGAA GATTAAGCGT GAGTGGCACA GTAAATGATA TAGCGATGAT GGGGGGAGAT CCTCAGGCCC TGAGTCTTGG AGTAGTATTA GAGGAAGGAT TCCCCAAGGA TATGTTGGAG AAGATAGTTG AAAGCATTAA GAAGACTGCT GAGGAAGCGA ACGTCCACGT AGTTACTGGA GACACGAAGG TCATGGAGAG GGGAAACTTA GATAAAATTG TCATTAACAC CGCCGGAATA GGTACAAGAC CTAGGCAATT GGATCACAAC ATTGAGACCT TGAGGAAAAG TAGGCAACCT TCCCGCTGGT TAGTTCCCAC TAACCTTAGG GATGGAGATA AAATTGTGGT CACCGGTACC CTTGGGGATC ATGCCATAGC GGTCCTATCT TCAAGGGAAG GAGTGGGATT TGAGTCAAAT GTCATGTCCG ATGTTGCCCC TCTCAATAAG ATGATCATGA ACCTACTGGA AGTAGGAGGA ATAGCTGATG CCAAGGATCC AACGAGAGGA GGACTGGCAG ATCTGCTTCA GGACTGGTCG GAAAAGTCTG GACTTGGAAT CTTCATAAGG GAGAGTGACA TCCCAGTGAA GGATGAGGTC AGAGCTGCCG TCGAGTTTCT AGGAATGGAC GTTCTAGAGT TGGGTAATGA GGGAAAGGCC GTCTTAGCAG TTTCGCCTGA ATATGTTAAG GACGTTATGG ACGCGTTACA TTCAGATCCG CTTGGGAAGG ACGCAACAAT AATAGGAGAG GTCAGAAAGG ATTTAGAGGG GGTAATAATG GAGACTGTGG TCGGGGGAAA CAGGTATGTT GGAAGACCCT TAGGGGATCC AGTTCCTAGA ATATGCTAG
|
Protein sequence | MELPDRKNVI TLLHGAGGTY MHSLIRDVFL KLNDGFGEVG LEMMDDAAVV NGIVFTTDSF VIRPIFFRGG DIGRLSVSGT VNDIAMMGGD PQALSLGVVL EEGFPKDMLE KIVESIKKTA EEANVHVVTG DTKVMERGNL DKIVINTAGI GTRPRQLDHN IETLRKSRQP SRWLVPTNLR DGDKIVVTGT LGDHAIAVLS SREGVGFESN VMSDVAPLNK MIMNLLEVGG IADAKDPTRG GLADLLQDWS EKSGLGIFIR ESDIPVKDEV RAAVEFLGMD VLELGNEGKA VLAVSPEYVK DVMDALHSDP LGKDATIIGE VRKDLEGVIM ETVVGGNRYV GRPLGDPVPR IC
|
| |