Gene Msed_0944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0944 
Symbol 
ID5104374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp870431 
End bp871729 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content48% 
IMG OID640506847 
Productnickel-dependent hydrogenase small subunit 
Protein accessionYP_001191040 
Protein GI146303724 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCTAA CAAGAAGGGA CTTCCTAAAA GTTTCGGGCG TGACAGCCCT AGGTACAAGT 
CTAGCCCTAT CCGGCCTCTC ATACGAAGAA CTATTGTCTA AGGCTCAGGA AGCGGATCAA
CCAATAAACA TAGTATGGAC GGGTAATGGG TGGTGCGGAG GTAATACTAT AGCTTTCATA
GATGCAATGA ACCCATCAGT GGAGGATGTG GTAACAGGAG TGTACTTCAA CGGCAAGCCC
ATAACATTAA GGCAGCCAAG CATTCCGGAT CTGGGAATGA TAAACCTGGT TTATCACCCC
ATACTAATGC CTCAGGATAA CATGGCATAT GCAACCATGG AGATGGCATT AAATGGGGAG
TTAGATCCAT TCGTCCTGGT AGTCGAGGGG ACTATGTTTG ACGAATTCGG CGTTTACTAT
AAGAAGCCAA GTGGATCCTT CTGGTGTTCA GCGGGAAGGA AACCAGACGG AAGCGTTCTG
CTATGCGACG AGTTTGTCTT CAATCTTATG AAGAAAGCTG CGGCTGTAGT AGCAACTGGG
GCTTGCGCTA CTTATGGTGG AATTCCAGCT ACTACAAATG GGTTAGGGCA GAGAAGTCCC
ACATATGCCA TGGGAATGCT TGACGACCCG TACAGAGGGA TCTATGGCTT TCCATACTAT
GTTCACCAAG TTTATACACA GGATTTAGCA CCAGATATAA TCGACCAAGA CATATACTCA
GTCACGAATT CCCCAGTTAA CACTTCTTGG CCTGGGCCCA GCTATCACTG GCTTTCTACC
GCGGGTCTAC CAATAATTTC CATAGCTGCA GATCCTCCTG CCGGAGATTG GATTATGCGT
ACACTTGTCT CCGCAGTGCT TTATCTGAGA GGTTTAGGTC CTAATCCTGC CGATGATCTT
GACGTGTTCA ATAGACCAAA ATTCTTCTAT GGAAATGAGA CCCATCAGAA CTGCCCTAGG
GCGGGATTCT TCGCACAGGG CATTTTTGCC TATGAATTTG GAGACCCTCA ATGCACATAC
AGCCTTGGGT GTAAGGGTAC TGAGGCCAAT AGCCCAGCAC CTCTACTAGG ATGGGTGGGT
GGAGTTGGCG GATGTACTAG AGGCGGTGTA TGTATCGCAT GTACTGCTCC AGGATTTCCA
GATCTATATG AACCATTCTA TGCTCCACCA AACGCTCCCA CAATTCCTAG TACTACGTTG
TTCGCAGCTG CAGCAGCCGC AGGTATAATA GTAGGTGTGG GTAGTTATGC CTTCTCTAGG
AGGAAGAGGC TACCTCAGAT GCAAGGAGGC AAGAGGTGA
 
Protein sequence
MGLTRRDFLK VSGVTALGTS LALSGLSYEE LLSKAQEADQ PINIVWTGNG WCGGNTIAFI 
DAMNPSVEDV VTGVYFNGKP ITLRQPSIPD LGMINLVYHP ILMPQDNMAY ATMEMALNGE
LDPFVLVVEG TMFDEFGVYY KKPSGSFWCS AGRKPDGSVL LCDEFVFNLM KKAAAVVATG
ACATYGGIPA TTNGLGQRSP TYAMGMLDDP YRGIYGFPYY VHQVYTQDLA PDIIDQDIYS
VTNSPVNTSW PGPSYHWLST AGLPIISIAA DPPAGDWIMR TLVSAVLYLR GLGPNPADDL
DVFNRPKFFY GNETHQNCPR AGFFAQGIFA YEFGDPQCTY SLGCKGTEAN SPAPLLGWVG
GVGGCTRGGV CIACTAPGFP DLYEPFYAPP NAPTIPSTTL FAAAAAAGII VGVGSYAFSR
RKRLPQMQGG KR