Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0501 |
Symbol | |
ID | 5103662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 455954 |
End bp | 456934 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640506406 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001190601 |
Protein GI | 146303285 |
COG category | [C] Energy production and conversion |
COG ID | [COG0723] Rieske Fe-S protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR03171] Rieske iron-sulfur protein SoxL2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.441209 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTTAA AAATTAGGTT AGGGAACGGC GAGAAGGGTG TCCTGAAGAC TTCAGATCTG TATTTCATAC AGAAATTACT TAGGACCATG AGGAATCCCA AGACTAGGTT CGATAGCAGG GAGTTTGTAG AGAAAGGCGA GGATTACCTG TTTAACTATG TGGGGAAAAA TGTGGGCGGA ATTGATGAGG GGAGAAGAAA CTTCCTAAAG GGAATAGTCA TAGGTGTAGC CGCCGCGACG GTGGTTGGAA TAATCCCCGG GCTAAGAGTT CTTGTTCCGC CAGTGGAGCA GGCTACTGGC TTCCCCAAGT CCCTTCTGGT AGATGTGTCA GGGAACCCGT TGAAGGCGTC CTCTATCCCA GTGAACAGCC CCATAATCAC TCTATACGAG TATCCTCTAA CAGGTGAACC CAACTTCCTC CTCAATCTGG GCGATAAGTC TGGTAATCCG GTCTCCATTG CGCCCGTGGA GGTGTCCGTT CCACAAACTG GGAAGACGTA CAAGTTCCCT GGAGGCGTTG GACCAAATAA CTCGATCGTA TCCTACTCTG CCATATGCCA GCATCTAGGT TGTACGCCTC CGTACATACA TTTCTATCCT CCTAACTACG TGGGACCCTC ACAGCTCACC GCTCCAGAGC CTAACACGCT GACCGCTCAG GCCTTGCTAG CCGCAAAGCA GGCCAATATC CCAGCATTGA TCCATTGTGA CTGCCACGGT TCCACGTACG ATCCCTATGG GGGAGGTGCT GTATTAACAG GCCCTACGCA AAGACCGTTA CCCGCCGTTA TCCTTGAGTA TGACAGCTCC ACAGACTACC TTTACGCGAT AGGGGCCATA GGTGTTGCCA CTTACCCAGA GGGATCAGAC GGCGTACCAT CACAAGATCC CACAAAGGAT CTGGATACCT CTCAGTACGG CAGTTCGGTG GGAAGTAAGA CACAGGTACA AGCTAGCACG AACCCGTTCT CGGGTTCATG A
|
Protein sequence | MPLKIRLGNG EKGVLKTSDL YFIQKLLRTM RNPKTRFDSR EFVEKGEDYL FNYVGKNVGG IDEGRRNFLK GIVIGVAAAT VVGIIPGLRV LVPPVEQATG FPKSLLVDVS GNPLKASSIP VNSPIITLYE YPLTGEPNFL LNLGDKSGNP VSIAPVEVSV PQTGKTYKFP GGVGPNNSIV SYSAICQHLG CTPPYIHFYP PNYVGPSQLT APEPNTLTAQ ALLAAKQANI PALIHCDCHG STYDPYGGGA VLTGPTQRPL PAVILEYDSS TDYLYAIGAI GVATYPEGSD GVPSQDPTKD LDTSQYGSSV GSKTQVQAST NPFSGS
|
| |