Gene Msed_0923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0923 
Symbol 
ID5104353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp853272 
End bp854813 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content45% 
IMG OID640506826 
Productnickel-dependent hydrogenase, large subunit 
Protein accessionYP_001191019 
Protein GI146303703 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAATGAGA AAGTAATAAG TCCACTCAAT AGGGTAGAAG GGGACTTGGA CCTTAAGGTA 
GTTTTTGAGG GAAAGAAAGT TGTAAAGGCC TTTCCCATGT CTAGGCTGTT CAGGGGAATA
GAGATAATCC TTAAGGGAAA ATTCCCCATG GATTCCTTGG TTATAACCCC GAGAATATGT
GGTATATGCG GGGGTTCACA TCTGCTTTCT GCGGCGAAGG CCCTAGAGAT GGCCTATGGG
GCGAGTGTTC CGCCCAACGC TGTGAGGTTG AGAAATGTGA TGACGTTAGC CGAAATGGGA
CAAAATGACG TCAGACATAC CTACCTCATG TTCCTTATAG ATACGGTCAA TCTGAAATAT
GAAAAGATGG GGTTTTACAG GGACATAGTG CTTAGATGGG CTCCATATCT AGGTCAGTCA
TACAAACAGG CAGTAGCCTG GTCTAAAAGA TACACCGAGA TCTATGCAAT ATTTGGTGGA
CAGTGGCCAC ATGGTTCAGC TATGGTTCCA GGAGGCGTTA CCACTGATCC GCTCAGCAAT
GACATAATAA AGGCCAAATC TATTCTCGCA TCTATTACTG CAGAGTTCCT GGAAAAAGTG
ATTCTGGGAG GTCCCCTAGA CCAGTTTCTC CAAGTGAAGA GCAAAAGGGA TCTGGACCAG
TGGGCCAAGG ACTATCCCAA TGGAGATATT TCCAAGATCT GGAACTACGG GCTAGAAATG
AAATGGGATA AGATAGGTAG TGGCTCACAG TACCTCATGA GTTACGGACA TGTGACGCTT
CCAGAACATT ATGACCCCGC ATCACACGTA GAAAAAAAGA GGTTTAGGGA GGGTTTACTG
GATCTTAGGA CTCGCGAGAT TCATCAAATT AAAGAGGAGA ACATAGTGGA GTTTGTTTCC
CACTCTTTTT ACTCCTATGA ACAGGGAGAT AAGGTTGGGC TTCATCCTTA TAATGGCGAA
ACAACGCCCC TACCACCGGA ATCCAAGGGA AAATACACTT TTACCAAGGC TTTTAGATAT
AAACTAGGTG GGAAGTACGT GGCTCCGGAA GTTGGAGCCT TAGCAATGAT GGTAGTAGCA
GGAGACCCAT TAATGACAGA TCTAGTTTAT AGGATTGGAA CCAGCGTTCT CGCTAGGGAA
ATAGCTAGAA TTGTTAGACT CGCCCGAATC CATGAGATCA TGAGAGAGGA GTTGGAAAGC
TACGAGTACG ATGAAATTAC GTACATAAAA CCTGAGGAAA AACTATCTGG AAGAGGTTAT
GGTCTGGTTG AGGCGTCCAG AGGTTCCCTA GGTCACTGGC TCGTGATAGA AGAGGGAAAA
ATTAAGAACT ACCAGGTCGT TACACCAACT CAGATTAACA TGGGGCCAGA AGATCCCTTC
GGTAACCCAA GCCATTTATC CATAGCACTT CAGGGAACAG AGGTGGAGAA TCCCAACAAT
CCCATAGAGG TAGCTCATAT TGTCAGGTCA CATGATGCAT GTATGGTATG CAACGTTCAT
GTCTTAGACG GCGGAAAAGA GATACTCTCG ATGAGACTAT GA
 
Protein sequence
MNEKVISPLN RVEGDLDLKV VFEGKKVVKA FPMSRLFRGI EIILKGKFPM DSLVITPRIC 
GICGGSHLLS AAKALEMAYG ASVPPNAVRL RNVMTLAEMG QNDVRHTYLM FLIDTVNLKY
EKMGFYRDIV LRWAPYLGQS YKQAVAWSKR YTEIYAIFGG QWPHGSAMVP GGVTTDPLSN
DIIKAKSILA SITAEFLEKV ILGGPLDQFL QVKSKRDLDQ WAKDYPNGDI SKIWNYGLEM
KWDKIGSGSQ YLMSYGHVTL PEHYDPASHV EKKRFREGLL DLRTREIHQI KEENIVEFVS
HSFYSYEQGD KVGLHPYNGE TTPLPPESKG KYTFTKAFRY KLGGKYVAPE VGALAMMVVA
GDPLMTDLVY RIGTSVLARE IARIVRLARI HEIMREELES YEYDEITYIK PEEKLSGRGY
GLVEASRGSL GHWLVIEEGK IKNYQVVTPT QINMGPEDPF GNPSHLSIAL QGTEVENPNN
PIEVAHIVRS HDACMVCNVH VLDGGKEILS MRL