Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1428 |
Symbol | |
ID | 5104159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1398035 |
End bp | 1399330 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640507316 |
Product | hydrogenase 4 subunit F |
Protein accession | YP_001191509 |
Protein GI | 146304193 |
COG category | [C] Energy production and conversion [P] Inorganic ion transport and metabolism |
COG ID | [COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.289915 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.541686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGGCC TAAACCTAGT GGGATGGATA ATACTAGTTC CCATTTTGGG AAGTGCAGTA GTTTTTAAGG AAAAGTACTC CACGGTAGCC TCTGCTACGA TCACCCTAAT TCTCGTGCTA GCTCTCAAAC CATTTTTACC CATAGTCTCC CAATTTTACG TTAGTAACTT AACTTGGTAT TTTTTGGTCA TGGTGGCCGG TGTCTATCTG CTCTCATCCA TTTACTCCCT CTTCTACGTG GAGAAGAGGG GAGTTCAGGA AAGGAATTAC TATTATTTCC TCAATCTATT CGCCGCCTCC ATGTTGTTCA CACTCTCCGT CAACAATCTT GGACTCATGT GGGTTGGATT AGAGGCCACT ACGATATCGT CCGTGCTTCT AGTAACCTTT GAGGGTACTT CAACTGCACT TGAGGCAGGA TGGAGGTATC TCCTTCTTGT TTCCTCCGGA GTGACTTTCG CATTCATCTC TGTCATCCTT TTCTATTTCG GCCTACACAC CTTAACCATA AGCGAGGTAC TATATCCTCA CTTTTCTCCA TTATTCTCGT TGGCCTCAGC CATTGCACTA TTGGGTTTCG GGACAAAGGC TGGAGTATTC CCAGTAAATA CCTGGCTACC AGATGCTCAT TCGGAAGCTC CTTCACCTGT GAGTGCACTT TTTTCAGGAG TCCTCCTGCC AGTATCTATC TACATTCTTC ACACAATTTT CATCATAGCA CCCCTTCCGG GACTGTATAG CTGGCTAGCT ACTATCTCCA TTCTCATTTC CTCCATAATG ATGGCTAGCC AACGATATTA CAAGAGGCTA TTCGCCTACT CCACGATCGA GAACATGAAT TTTGCCCTTC TTGGGGTTGC AGTGAACTCG TTAACAGGTT TAGTTATACT CTTGGTGACT CACGCATTCG CTAAGGCTGG GGCGTTTTAT GCGTCTGGAT CCCTGCTCAA AAGCCAAGGG ACCAAGGAGA TCAACGAACA TGGACTCTAT GGAAGTCCCT TACTAGCTTC TTCGCTCATC CTTTCATCTC TAGCAGTAAC GGGTGCACCT CCCTTTGGCA ACTTCATTGG AGAGTTCCTG ATTCTCTCCT CAGTCCTCAA ACATGGACTT GTCCCCCAAT TCTGGATTGT AATCATTTCC TTGATAATAT CATTCGTCTG CGTTAACTAT CATGTGTCTA GGATGATCTT TAAGGGAAAA CCATTTCGGG AGAGATACAA GGCACTCTCG ATCATTGCCA TAATTTCAGC TCTGATTTCC TTAATCGTTG GTGTAGTAGG GGTGGTACTA CTATGA
|
Protein sequence | MTGLNLVGWI ILVPILGSAV VFKEKYSTVA SATITLILVL ALKPFLPIVS QFYVSNLTWY FLVMVAGVYL LSSIYSLFYV EKRGVQERNY YYFLNLFAAS MLFTLSVNNL GLMWVGLEAT TISSVLLVTF EGTSTALEAG WRYLLLVSSG VTFAFISVIL FYFGLHTLTI SEVLYPHFSP LFSLASAIAL LGFGTKAGVF PVNTWLPDAH SEAPSPVSAL FSGVLLPVSI YILHTIFIIA PLPGLYSWLA TISILISSIM MASQRYYKRL FAYSTIENMN FALLGVAVNS LTGLVILLVT HAFAKAGAFY ASGSLLKSQG TKEINEHGLY GSPLLASSLI LSSLAVTGAP PFGNFIGEFL ILSSVLKHGL VPQFWIVIIS LIISFVCVNY HVSRMIFKGK PFRERYKALS IIAIISALIS LIVGVVGVVL L
|
| |