Gene Information |
Locus tag | Msed_1431 |
Symbol | |
ID | 5104801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1401038 |
End bp | 1402750 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507319 |
Product | hydrogenase 4 subunit B |
Protein accession | YP_001191512 |
Protein GI | 146304196 |
COG category | [C] Energy production and conversion [P] Inorganic ion transport and metabolism |
COG ID | [COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.832858 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATTCTAG AACTCGACCT AATTTACATA CTATCTGCTC TTTCCTTAGT TACATCCCTC TTCAGCAACA GAATATCTTT AGTGCTTCTC GTGTTAGCTT CTGGGATACT TGCCTTTTAC GGTGTACAGG AACTACCCAT GGGTATATTC TACCTAGTCG CGGGATTGGT ATGGGTTCTG GTCGCTTTAC ACTCCTTGTT CCACTATAGC GATAAGTGGC TCACTATGAC CCTAAGTGGT ACTGTACTAG GGATCATTGT GGTTCTTACC AGCACGAACT ACATCGAGTT TCTCGCGGGA TGGGAAACCA TGACGCTTTT CTCGTTCGTT GGAATAGCGA TCTACAGAAA GGACTGGAAA CCTGCATTAA CTTTCCTTGC GTTCGGGGAG CTGAGCACTG CATTACTGCT AGCCGGTTTC GCCCTTGCCT ACTCTCAGAC AGGTAGTCTA GTATTCGAGA GGTTAAGCAC TCAGCTCCCC CTCATTATAA CGTCAATGGG TTTCATCGTC AAGATGGGAA TATTCCCATT CCTTGTCGTG GAGTGGTTGC CCATAGCCCA CGGAAACGCC AGGTCGGATC TATCAGCAGT TCTAAGCGCA ACGGTAACCA TGACAGGGAT TTACGGAATA TTGAAGATGG AGTCCTTGAG TCCCGTTTCG ACGTATCTGG GAATATTCCT TCTCGCCGTG GGAGCCTTCT CCAACCTGTT TGGTGCCCTA TACTCCTATG TCTCCGATCA TGTTAAGGGA TTGCTAGCGT TTAGCACCAT CGAAAATAAT GGTGCCATGC TAGCTCTGTT GGGAAGCCTA GAGCTTGTGA GCGGAGACTT GAAGGAGTTC GTCACGTTTA GTCTTTTTAC TTACGTAATA GCTCACTCCC TCGCTAAGAC AGGGCTTTTT CTTTCAACTG GATATGTTGA GGGAGAATCG CTGACAACTG CAAGCTCCTT TAGATATGGT CTCTCAGTTC TAGGAGCAGT CCTGATGGCC ATGTCGTTAT CGGGGCTTTT GCCTACCATA GGGGGAATCG CGACGTGGTC ATTGCTGGAG TCGATGTTCA TGGAAGCTAT AACACTACCG CACTTCATCA ACATTGTCCC AATAGTGGCA GGTGTCATGA TAGGCATGGG AGAGGGATTT GCCACCGGAT CCCTTGCGAA GTTCGTATCA TACACTCAAC TGACAAAACC GATCAAGGAC AAACAGGGAC TCATCCTTGC AGTTTCTGGG ATCCTCGTCT TAGTTACTGT GGGCCTGGCG TATCTTCTTT CTCCCTTCAG AACAGAGGTG TCCCAGCTTG GGGTTGGGCT AAATTCCCTT ATCTCCTCGC AATATCAAAG AAGTTTTGGA GGGATTGATC CGCTTTATAT CTTAGTTTCA TGGCCGATTA TCGCATTAAT AGTGTACCTG TCCCTAGGTA AGAGAAAAAT AAGAGTTGTA GACCCTTGGG ATAATGGATC TGCTCAAGGA TTTAGATACA CCTCCTTTGG CATGGCAAAT AACGTGAGAC TGATGTTAAG GGCTTTACTT AGAACCAAGA CTGGATCCCT GGAGACTAGC GCTGACATCT TCTGGCAAGC CATGTTAGTT TTAATCAGAT GGTATCTCAA ATTCTCTAGA ACCTTCTCCA GGAGCTTCAT GAATGGCTCC CTGAGATGGT ACATGGTTTA CATGATAATT GCCATTGTCG TCATAATGGT GATCACGTTA TGA
|
Protein sequence | MILELDLIYI LSALSLVTSL FSNRISLVLL VLASGILAFY GVQELPMGIF YLVAGLVWVL VALHSLFHYS DKWLTMTLSG TVLGIIVVLT STNYIEFLAG WETMTLFSFV GIAIYRKDWK PALTFLAFGE LSTALLLAGF ALAYSQTGSL VFERLSTQLP LIITSMGFIV KMGIFPFLVV EWLPIAHGNA RSDLSAVLSA TVTMTGIYGI LKMESLSPVS TYLGIFLLAV GAFSNLFGAL YSYVSDHVKG LLAFSTIENN GAMLALLGSL ELVSGDLKEF VTFSLFTYVI AHSLAKTGLF LSTGYVEGES LTTASSFRYG LSVLGAVLMA MSLSGLLPTI GGIATWSLLE SMFMEAITLP HFINIVPIVA GVMIGMGEGF ATGSLAKFVS YTQLTKPIKD KQGLILAVSG ILVLVTVGLA YLLSPFRTEV SQLGVGLNSL ISSQYQRSFG GIDPLYILVS WPIIALIVYL SLGKRKIRVV DPWDNGSAQG FRYTSFGMAN NVRLMLRALL RTKTGSLETS ADIFWQAMLV LIRWYLKFSR TFSRSFMNGS LRWYMVYMII AIVVIMVITL
|
| |
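The coordinate, length, and sequence fields in this record can be cross-checked against one another. The following is a minimal sketch, not the database's own method: the helper names (`clean`, `span_length`, `gc_percent`, `translate`) are hypothetical, the start codon set is a simplified subset of the initiators allowed by translation table 11, and the sequences are assumed to be pasted in with the space separators shown above.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 assigns
# the same amino acids as the standard code; it differs in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Assumption: a simplified subset of the table 11 initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group residues and upper-case the result."""
    return "".join(seq.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS, reading an initiator codon as M and stopping at a stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codons are read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends the protein
        residues.append(aa)
    return "".join(residues)


# Values taken from the record above.
gene_length = span_length(1401038, 1402750)   # Start bp, End bp
protein_length = (gene_length - 3) // 3       # drop the stop codon, 3 bases per residue
first_residues = translate(clean("GTGATTCTAG AACTCGACCT A"))  # first 21 bases of the gene
```

To check the whole record, pass the full Gene sequence field through `clean` and compare `translate(...)` with the cleaned Protein sequence field, and compare `gc_percent(...)` with the reported GC content.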