Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0500 |
Symbol | |
ID | 5103661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 454262 |
End bp | 455908 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506405 |
Product | cytochrome b/b6 domain-containing protein |
Protein accession | YP_001190600 |
Protein GI | 146303284 |
COG category | [C] Energy production and conversion |
COG ID | [COG1290] Cytochrome b subunit of the bc complex |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.847294 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTAA CAAAAAGGGT TTCAAACTGG TTTTCGGAGA GACTAGGATT AAATGACCTA CCTTTCTTCA AGACTCCAGA CTACATGTAT CACGTTAACG AATGGTTAGG CGCGTTGGTC GCTGCATCAT TTATCTACAC CGTGATATCT GGCCTAATAC TTCTGCTATA TTATAACGCA GCAGCAGGTT ACCAGTCAAC TGAGACCATA ATCAATCAAG TGCCCTATGG GTCCGTGGTG CTCTATAGTC ACCTATATGG CTCCTACGCC ATGATTATCC TGGCTTACGT TCACATGTTT AGGAACTATT TCGTGGGAGC ATATAAGAAG CCAAGGGAAC TCCTGTGGAT ACTCGGCGTT TTAATGCTCG TCCTCACCCT AGGAGCTTCC TTCCTGGGAT ATAGCCTTAT TGGGGACGCC CTCGCAGTTA GCGCTGTTGA CGTTGGTGCC GGAATCATCA CCTCAATTCC GCAACTGTCC TTCTTAATAC CCATCATCTT CGGGAATTAT GACACTGGAG ACTACACGAG AGTACTTGCG TTGCACATTA TCCTAGTGGC TTTAATAGGT CTACTTTTCG TGTTCCACTT CTTCCTTGCG GAGCAGTATG GAATGATGCC CTCAAGGAAG GTAAAGCCAA AGGCGCCTGC GGTTTACACC AAGGAGGAAT GGTCTAAGTT TAATCCATGG TGGCCTAGGA ACTTCGTTTA CATGAGCTCG CTGATTTTCA TGACTTGGGG TTTCATTCTA GCTATCCCGA ACGCGCTAGC CTACCTAAAC GGACTGCCAC AATACTTCGA TCCCTTCATG AACCCGAAGC CAGCTCCTCC ACCCAACAGC CCTGCTGCAG CGCATATTAC AACCTATCCA CCGTGGTTCT TCCTGTTCCT CTACAAGATC GCAGACTTCA CCAGTGACGT GGTAATATTT CTAATGATAG GTGTAATAAT ACCACTGGTA TACCTCATCA TCTTGCCATT CATAGATAGA AGCGACGAAC TTCACCCGCT CAAGAGGAAG GTATTCACTG GGATAGGTAT CCTCATGATT ACGTACCTTA TACAAACCTC CCTGTGGGGA GATCTTGCTC CCGGAGTTGA AGTAAGTATT AAGGAACAGG TTATCGCTTA TCTCATACCG GCAATTATAG TAGCTCTAGG TTTAACCTTC ATTAAGCCAA TGGACGTCAA GACGAAACAG GTCAGAGGAA TATCACCCCT TACATCTCTA CTCTTCGTGG TGGTGGCAAT GTTGTTCGCT GGTTCCGTCG TGGAGCTAAT CGACTATCCA AGCCTGTTTA CCTTTGCGGT CACGCTATTC ACGGGCTCGC TATTCTTTAT AGGAAGTAAG ACCATGGGCA AGGTAGTTCT GAGTAAGGGT TTTCCCAACA ATGCTGTACC TGAAGTCAGA ACAACCACTG AGACTCCCAC CTCCAATGAG ACCAAGAGGA AGTTCGCGGA GGTTCTCATG AGCATTCTCT TCATTCTAGT TGTGATAATA GTGGCGCAAA TGTGGACTAT CCCTCCAACT GGCTATGCCT CAAACCTGTT CGGAGTGGAT CTGGGACTGG TATTTCTAAT GCTGGGAGAG GTAATCTCTC TCTACCACTA CGTGGTCTAC AAGAAGCCCG TTGAGAAGGA AGAATAA
|
Protein sequence | MSLTKRVSNW FSERLGLNDL PFFKTPDYMY HVNEWLGALV AASFIYTVIS GLILLLYYNA AAGYQSTETI INQVPYGSVV LYSHLYGSYA MIILAYVHMF RNYFVGAYKK PRELLWILGV LMLVLTLGAS FLGYSLIGDA LAVSAVDVGA GIITSIPQLS FLIPIIFGNY DTGDYTRVLA LHIILVALIG LLFVFHFFLA EQYGMMPSRK VKPKAPAVYT KEEWSKFNPW WPRNFVYMSS LIFMTWGFIL AIPNALAYLN GLPQYFDPFM NPKPAPPPNS PAAAHITTYP PWFFLFLYKI ADFTSDVVIF LMIGVIIPLV YLIILPFIDR SDELHPLKRK VFTGIGILMI TYLIQTSLWG DLAPGVEVSI KEQVIAYLIP AIIVALGLTF IKPMDVKTKQ VRGISPLTSL LFVVVAMLFA GSVVELIDYP SLFTFAVTLF TGSLFFIGSK TMGKVVLSKG FPNNAVPEVR TTTETPTSNE TKRKFAEVLM SILFILVVII VAQMWTIPPT GYASNLFGVD LGLVFLMLGE VISLYHYVVY KKPVEKEE
|
| |