Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1432 |
Symbol | |
ID | 5104802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1402747 |
End bp | 1403688 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640507320 |
Product | formate hydrogenlyase subunit 4-like protein |
Protein accession | YP_001191513 |
Protein GI | 146304197 |
COG category | [C] Energy production and conversion |
COG ID | [COG0650] Formate hydrogenlyase subunit 4 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAACA CGTTAGTGGA AACACTTATC CAGGTCTCTG CAGTACTTGC CCTCTCTCCT CTCTATCCCG GCATCCTTGA GAAAATGAAA GCTAGGGTGG AAGGAAGAAG AGGTCCTAGC ATATTTCAGC CCTACTACGA TCTACTAAAA CTTAACAAGA AGGAGATGAC CATTCCGAGA AACGCAGGAT GGCTCTTCGT CAATGGACCA TACCTTGTTT TCTCGATTTA TGTCCTTATC TCCTTTGTAA TTCCTGTTGT TTACCCAGAA CCTGTCTATC TTACCCCAGT GGTGGACTTT TTAGGAGGAG CTCTCCTTTT CTCGCTTTCA GGTTTCTTGA AAGTCTATGA GTCCATGGAA AGCTCTAGTA ACCTAGTAAC TCTTGGGGTA TCGAGAAACA TTTCCTTTGC CTATCTTGGA GAGGCCACCC TGCTAACAGT GTTCATAGCA GTTGCGTTGG TAACTGGAAC TAATAATCCC TACATTACAA TGGAGGCTAT TCAATCGCCG ACAGAATATC TCTTCCTGCC TCATATAACA GCCAGTGTGG CCTTTTTCAT GTTATGGTTG TACGAGACTG GGAAACTTCC GTTGGAGAGC TCAGGGATGT CGGAAATGGG AATGATCGAT GATTCTCTAG TATATGAATA CAGCGGAAAG GGTCTGATGC TATTGAGATG GGGAAGCTAC GTGAAGTCTT ACCTTCTTGG GTCTGTGTTG TTAAACGTCT TCCTCATACC CTGGGGAATG CAGACGGGTG TACTTGGGGC CATGGCGGAT GTGGGAATAA TGTTCCTGAA GTGGTTAGTT CTCCTTATGA TAACCGTGGT AATAGAGACA AGCCTCGCGA AGTTCAGGCT ATTTAAGATA CAGGACTTCC TGATTGTGGC CTTAGTGCTC TCGGTGTTTT CGGTTATCCT GACGGTGACC TTGAATGGTT GA
|
Protein sequence | MINTLVETLI QVSAVLALSP LYPGILEKMK ARVEGRRGPS IFQPYYDLLK LNKKEMTIPR NAGWLFVNGP YLVFSIYVLI SFVIPVVYPE PVYLTPVVDF LGGALLFSLS GFLKVYESME SSSNLVTLGV SRNISFAYLG EATLLTVFIA VALVTGTNNP YITMEAIQSP TEYLFLPHIT ASVAFFMLWL YETGKLPLES SGMSEMGMID DSLVYEYSGK GLMLLRWGSY VKSYLLGSVL LNVFLIPWGM QTGVLGAMAD VGIMFLKWLV LLMITVVIET SLAKFRLFKI QDFLIVALVL SVFSVILTVT LNG
|
| |