Gene Information |
Locus tag | Msed_0944 |
Symbol | |
ID | 5104374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 870431 |
End bp | 871729 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506847 |
Product | nickel-dependent hydrogenase small subunit |
Protein accession | YP_001191040 |
Protein GI | 146303724 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA); [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
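
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene span includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length_bp = gene_length(870431, 871729)
print(length_bp)                  # 1299
print(protein_length(length_bp))  # 432
```

Under these assumptions the computed values agree with the listed gene length (1299 bp) and protein length (432 aa).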
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCTAA CAAGAAGGGA CTTCCTAAAA GTTTCGGGCG TGACAGCCCT AGGTACAAGT CTAGCCCTAT CCGGCCTCTC ATACGAAGAA CTATTGTCTA AGGCTCAGGA AGCGGATCAA CCAATAAACA TAGTATGGAC GGGTAATGGG TGGTGCGGAG GTAATACTAT AGCTTTCATA GATGCAATGA ACCCATCAGT GGAGGATGTG GTAACAGGAG TGTACTTCAA CGGCAAGCCC ATAACATTAA GGCAGCCAAG CATTCCGGAT CTGGGAATGA TAAACCTGGT TTATCACCCC ATACTAATGC CTCAGGATAA CATGGCATAT GCAACCATGG AGATGGCATT AAATGGGGAG TTAGATCCAT TCGTCCTGGT AGTCGAGGGG ACTATGTTTG ACGAATTCGG CGTTTACTAT AAGAAGCCAA GTGGATCCTT CTGGTGTTCA GCGGGAAGGA AACCAGACGG AAGCGTTCTG CTATGCGACG AGTTTGTCTT CAATCTTATG AAGAAAGCTG CGGCTGTAGT AGCAACTGGG GCTTGCGCTA CTTATGGTGG AATTCCAGCT ACTACAAATG GGTTAGGGCA GAGAAGTCCC ACATATGCCA TGGGAATGCT TGACGACCCG TACAGAGGGA TCTATGGCTT TCCATACTAT GTTCACCAAG TTTATACACA GGATTTAGCA CCAGATATAA TCGACCAAGA CATATACTCA GTCACGAATT CCCCAGTTAA CACTTCTTGG CCTGGGCCCA GCTATCACTG GCTTTCTACC GCGGGTCTAC CAATAATTTC CATAGCTGCA GATCCTCCTG CCGGAGATTG GATTATGCGT ACACTTGTCT CCGCAGTGCT TTATCTGAGA GGTTTAGGTC CTAATCCTGC CGATGATCTT GACGTGTTCA ATAGACCAAA ATTCTTCTAT GGAAATGAGA CCCATCAGAA CTGCCCTAGG GCGGGATTCT TCGCACAGGG CATTTTTGCC TATGAATTTG GAGACCCTCA ATGCACATAC AGCCTTGGGT GTAAGGGTAC TGAGGCCAAT AGCCCAGCAC CTCTACTAGG ATGGGTGGGT GGAGTTGGCG GATGTACTAG AGGCGGTGTA TGTATCGCAT GTACTGCTCC AGGATTTCCA GATCTATATG AACCATTCTA TGCTCCACCA AACGCTCCCA CAATTCCTAG TACTACGTTG TTCGCAGCTG CAGCAGCCGC AGGTATAATA GTAGGTGTGG GTAGTTATGC CTTCTCTAGG AGGAAGAGGC TACCTCAGAT GCAAGGAGGC AAGAGGTGA
|
Protein sequence | MGLTRRDFLK VSGVTALGTS LALSGLSYEE LLSKAQEADQ PINIVWTGNG WCGGNTIAFI DAMNPSVEDV VTGVYFNGKP ITLRQPSIPD LGMINLVYHP ILMPQDNMAY ATMEMALNGE LDPFVLVVEG TMFDEFGVYY KKPSGSFWCS AGRKPDGSVL LCDEFVFNLM KKAAAVVATG ACATYGGIPA TTNGLGQRSP TYAMGMLDDP YRGIYGFPYY VHQVYTQDLA PDIIDQDIYS VTNSPVNTSW PGPSYHWLST AGLPIISIAA DPPAGDWIMR TLVSAVLYLR GLGPNPADDL DVFNRPKFFY GNETHQNCPR AGFFAQGIFA YEFGDPQCTY SLGCKGTEAN SPAPLLGWVG GVGGCTRGGV CIACTAPGFP DLYEPFYAPP NAPTIPSTTL FAAAAAAGII VGVGSYAFSR RKRLPQMQGG KR
|
| |
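
The sequence fields above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned, checked for GC content, and translated follows. It assumes plain Python without external libraries; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons).

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon ordering; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between the ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGGGTCTAA CAAGAAGGGA CTTCCTAAAA"
print(translate(prefix))  # MGLTRRDFLK
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence field to `gc_fraction` and `translate` would allow the same comparison against the listed GC content and complete protein sequence.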