Gene Msed_1431 details


Gene Information

Locus tag: Msed_1431
Symbol:
ID: 5104801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1401038
End bp: 1402750
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 47%
IMG OID: 640507319
Product: hydrogenase 4 subunit B
Protein accession: YP_001191512
Protein GI: 146304196
COG category: [C] Energy production and conversion; [P] Inorganic ion transport and metabolism
COG ID: [COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit
TIGRFAM ID:
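
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The variable names are ours, not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1401038
end_bp = 1402750

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the codon count.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1713 570
```

Under these assumptions the result agrees with the Gene Length (1713 bp) and Protein Length (570 aa) fields.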


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.832858
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATTCTAG AACTCGACCT AATTTACATA CTATCTGCTC TTTCCTTAGT TACATCCCTC 
TTCAGCAACA GAATATCTTT AGTGCTTCTC GTGTTAGCTT CTGGGATACT TGCCTTTTAC
GGTGTACAGG AACTACCCAT GGGTATATTC TACCTAGTCG CGGGATTGGT ATGGGTTCTG
GTCGCTTTAC ACTCCTTGTT CCACTATAGC GATAAGTGGC TCACTATGAC CCTAAGTGGT
ACTGTACTAG GGATCATTGT GGTTCTTACC AGCACGAACT ACATCGAGTT TCTCGCGGGA
TGGGAAACCA TGACGCTTTT CTCGTTCGTT GGAATAGCGA TCTACAGAAA GGACTGGAAA
CCTGCATTAA CTTTCCTTGC GTTCGGGGAG CTGAGCACTG CATTACTGCT AGCCGGTTTC
GCCCTTGCCT ACTCTCAGAC AGGTAGTCTA GTATTCGAGA GGTTAAGCAC TCAGCTCCCC
CTCATTATAA CGTCAATGGG TTTCATCGTC AAGATGGGAA TATTCCCATT CCTTGTCGTG
GAGTGGTTGC CCATAGCCCA CGGAAACGCC AGGTCGGATC TATCAGCAGT TCTAAGCGCA
ACGGTAACCA TGACAGGGAT TTACGGAATA TTGAAGATGG AGTCCTTGAG TCCCGTTTCG
ACGTATCTGG GAATATTCCT TCTCGCCGTG GGAGCCTTCT CCAACCTGTT TGGTGCCCTA
TACTCCTATG TCTCCGATCA TGTTAAGGGA TTGCTAGCGT TTAGCACCAT CGAAAATAAT
GGTGCCATGC TAGCTCTGTT GGGAAGCCTA GAGCTTGTGA GCGGAGACTT GAAGGAGTTC
GTCACGTTTA GTCTTTTTAC TTACGTAATA GCTCACTCCC TCGCTAAGAC AGGGCTTTTT
CTTTCAACTG GATATGTTGA GGGAGAATCG CTGACAACTG CAAGCTCCTT TAGATATGGT
CTCTCAGTTC TAGGAGCAGT CCTGATGGCC ATGTCGTTAT CGGGGCTTTT GCCTACCATA
GGGGGAATCG CGACGTGGTC ATTGCTGGAG TCGATGTTCA TGGAAGCTAT AACACTACCG
CACTTCATCA ACATTGTCCC AATAGTGGCA GGTGTCATGA TAGGCATGGG AGAGGGATTT
GCCACCGGAT CCCTTGCGAA GTTCGTATCA TACACTCAAC TGACAAAACC GATCAAGGAC
AAACAGGGAC TCATCCTTGC AGTTTCTGGG ATCCTCGTCT TAGTTACTGT GGGCCTGGCG
TATCTTCTTT CTCCCTTCAG AACAGAGGTG TCCCAGCTTG GGGTTGGGCT AAATTCCCTT
ATCTCCTCGC AATATCAAAG AAGTTTTGGA GGGATTGATC CGCTTTATAT CTTAGTTTCA
TGGCCGATTA TCGCATTAAT AGTGTACCTG TCCCTAGGTA AGAGAAAAAT AAGAGTTGTA
GACCCTTGGG ATAATGGATC TGCTCAAGGA TTTAGATACA CCTCCTTTGG CATGGCAAAT
AACGTGAGAC TGATGTTAAG GGCTTTACTT AGAACCAAGA CTGGATCCCT GGAGACTAGC
GCTGACATCT TCTGGCAAGC CATGTTAGTT TTAATCAGAT GGTATCTCAA ATTCTCTAGA
ACCTTCTCCA GGAGCTTCAT GAATGGCTCC CTGAGATGGT ACATGGTTTA CATGATAATT
GCCATTGTCG TCATAATGGT GATCACGTTA TGA
 
Protein sequence
MILELDLIYI LSALSLVTSL FSNRISLVLL VLASGILAFY GVQELPMGIF YLVAGLVWVL 
VALHSLFHYS DKWLTMTLSG TVLGIIVVLT STNYIEFLAG WETMTLFSFV GIAIYRKDWK
PALTFLAFGE LSTALLLAGF ALAYSQTGSL VFERLSTQLP LIITSMGFIV KMGIFPFLVV
EWLPIAHGNA RSDLSAVLSA TVTMTGIYGI LKMESLSPVS TYLGIFLLAV GAFSNLFGAL
YSYVSDHVKG LLAFSTIENN GAMLALLGSL ELVSGDLKEF VTFSLFTYVI AHSLAKTGLF
LSTGYVEGES LTTASSFRYG LSVLGAVLMA MSLSGLLPTI GGIATWSLLE SMFMEAITLP
HFINIVPIVA GVMIGMGEGF ATGSLAKFVS YTQLTKPIKD KQGLILAVSG ILVLVTVGLA
YLLSPFRTEV SQLGVGLNSL ISSQYQRSFG GIDPLYILVS WPIIALIVYL SLGKRKIRVV
DPWDNGSAQG FRYTSFGMAN NVRLMLRALL RTKTGSLETS ADIFWQAMLV LIRWYLKFSR
TFSRSFMNGS LRWYMVYMII AIVVIMVITL
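
The sequence blocks above are printed in space-separated groups of ten, so they need cleaning before any downstream use. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names introduced here for illustration, and whether the record's GC content field was computed in exactly this way is an assumption. Only the first line of the gene sequence is used as input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence block, copied from the record above.
first_line = "GTGATTCTAG AACTCGACCT AATTTACATA CTATCTGCTC TTTCCTTAGT TACATCCCTC"

seq = clean_sequence(first_line)
print(len(seq))                    # number of bases in this line
print(seq[:3])                     # first codon of the gene
print(round(gc_fraction(seq), 3))  # GC fraction of this line only
```

Passing the whole gene sequence block to `clean_sequence` should give a string whose length matches the Gene Length field, provided the block was copied completely. The first codon is GTG, an alternative start codon permitted under translation table 11, which is why the protein sequence nevertheless begins with M.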