Gene Msed_1428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1428 
Symbol 
ID5104159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1398035 
End bp1399330 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content45% 
IMG OID640507316 
Producthydrogenase 4 subunit F 
Protein accessionYP_001191509 
Protein GI146304193 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.289915 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.541686 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGCC TAAACCTAGT GGGATGGATA ATACTAGTTC CCATTTTGGG AAGTGCAGTA 
GTTTTTAAGG AAAAGTACTC CACGGTAGCC TCTGCTACGA TCACCCTAAT TCTCGTGCTA
GCTCTCAAAC CATTTTTACC CATAGTCTCC CAATTTTACG TTAGTAACTT AACTTGGTAT
TTTTTGGTCA TGGTGGCCGG TGTCTATCTG CTCTCATCCA TTTACTCCCT CTTCTACGTG
GAGAAGAGGG GAGTTCAGGA AAGGAATTAC TATTATTTCC TCAATCTATT CGCCGCCTCC
ATGTTGTTCA CACTCTCCGT CAACAATCTT GGACTCATGT GGGTTGGATT AGAGGCCACT
ACGATATCGT CCGTGCTTCT AGTAACCTTT GAGGGTACTT CAACTGCACT TGAGGCAGGA
TGGAGGTATC TCCTTCTTGT TTCCTCCGGA GTGACTTTCG CATTCATCTC TGTCATCCTT
TTCTATTTCG GCCTACACAC CTTAACCATA AGCGAGGTAC TATATCCTCA CTTTTCTCCA
TTATTCTCGT TGGCCTCAGC CATTGCACTA TTGGGTTTCG GGACAAAGGC TGGAGTATTC
CCAGTAAATA CCTGGCTACC AGATGCTCAT TCGGAAGCTC CTTCACCTGT GAGTGCACTT
TTTTCAGGAG TCCTCCTGCC AGTATCTATC TACATTCTTC ACACAATTTT CATCATAGCA
CCCCTTCCGG GACTGTATAG CTGGCTAGCT ACTATCTCCA TTCTCATTTC CTCCATAATG
ATGGCTAGCC AACGATATTA CAAGAGGCTA TTCGCCTACT CCACGATCGA GAACATGAAT
TTTGCCCTTC TTGGGGTTGC AGTGAACTCG TTAACAGGTT TAGTTATACT CTTGGTGACT
CACGCATTCG CTAAGGCTGG GGCGTTTTAT GCGTCTGGAT CCCTGCTCAA AAGCCAAGGG
ACCAAGGAGA TCAACGAACA TGGACTCTAT GGAAGTCCCT TACTAGCTTC TTCGCTCATC
CTTTCATCTC TAGCAGTAAC GGGTGCACCT CCCTTTGGCA ACTTCATTGG AGAGTTCCTG
ATTCTCTCCT CAGTCCTCAA ACATGGACTT GTCCCCCAAT TCTGGATTGT AATCATTTCC
TTGATAATAT CATTCGTCTG CGTTAACTAT CATGTGTCTA GGATGATCTT TAAGGGAAAA
CCATTTCGGG AGAGATACAA GGCACTCTCG ATCATTGCCA TAATTTCAGC TCTGATTTCC
TTAATCGTTG GTGTAGTAGG GGTGGTACTA CTATGA
 
Protein sequence
MTGLNLVGWI ILVPILGSAV VFKEKYSTVA SATITLILVL ALKPFLPIVS QFYVSNLTWY 
FLVMVAGVYL LSSIYSLFYV EKRGVQERNY YYFLNLFAAS MLFTLSVNNL GLMWVGLEAT
TISSVLLVTF EGTSTALEAG WRYLLLVSSG VTFAFISVIL FYFGLHTLTI SEVLYPHFSP
LFSLASAIAL LGFGTKAGVF PVNTWLPDAH SEAPSPVSAL FSGVLLPVSI YILHTIFIIA
PLPGLYSWLA TISILISSIM MASQRYYKRL FAYSTIENMN FALLGVAVNS LTGLVILLVT
HAFAKAGAFY ASGSLLKSQG TKEINEHGLY GSPLLASSLI LSSLAVTGAP PFGNFIGEFL
ILSSVLKHGL VPQFWIVIIS LIISFVCVNY HVSRMIFKGK PFRERYKALS IIAIISALIS
LIVGVVGVVL L