Gene Information |
Locus tag | Msed_0763 |
Symbol | |
ID | 5103452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 697760 |
End bp | 698755 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506668 |
Product | hypothetical protein |
Protein accession | YP_001190862 |
Protein GI | 146303546 |
COG category | [S] Function unknown |
COG ID | [COG1630] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
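
The coordinate and length fields above are consistent with one another, which a little arithmetic confirms. The sketch below assumes 1-based, inclusive coordinates (common for records of this kind, but an assumption here) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(697760, 698755)
print(length, protein_length_aa(length))  # prints: 996 331
```

Both results match the Gene Length (996 bp) and Protein Length (331 aa) fields of the record.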
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.413008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00447816 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGATAGAGC TCGTCTATCA GGAACTGATA ACTAAACGTT CAGAACTCGT GAGTAAGCTG AATATTGTTC AAGGAGATCT CGATAAAAAA CTGGAATCTT TAGTTAAAAA CAACTGGGAA GAGTTCACCC CCTCCTCTAC GATAAAGAAG AGAATAGTTG CAGTAGATGG TGGAGAATTT GTAAAAGAGC TGAGAACTGG GATAGTGTTT GTGTTAAATG CCGAGGCATT GATAACGGAG GGAGTTCAAA TACTGGACAC AGATCAGGAG GTCAAGGCTG GGGTCTTTAG ACCCGGGAAT AGGGCGAAGG AGAGGGTTGG TGAGCTCATG GCGATAATGG AGCTGAAGCT GGCACTTCAA AATGGGAGTA GGGGAGATTG GATCTTGATG GACGGTAGCC TAAAGAAGAA GCTCGGAGAG GTGAGGGCAC AGGATAGCAA CTTTGATTTC AGGGAAGGCG AGATCACTTC CTTGAGTCAG GAGGACGAGG ACATAATGTT ACTTCACATG ATATATGAAA AACAGGTTTA CCTTTCTGAA TTGCTCAAGA GGTACGGTTC CAGAACAGTG TGGATCTCAA AGGTAAGCAG GACTAGGGAT CTCTTTCACC ATGAGCTATC AGATATCACC CTTCTGGAGA CCTTCACTAG TTCTCCTGGT TTCTCCACCG TCAGGTGTAG GTCCCTGTTA AGGGAGGAGA TCCAGGAAAG CGATCTGAGA AGGCCTCTAG ATGGAATTGA GATGTGCAGT TTCTACGCTA GGCTCGATTA CAATGAGAAT GTACTGAGGA TAGATGTGAT TGGAAGGCCC ACTACCGAGT TTATCAAGGG CCTACTTAAC GATCTGTACT CGGTTTCGGT GAAGGGTTAT CCTTATCCCC TGGTCAGGGT CCATTATGAC GTTAAGGTTT CAGGGAATGA TAGGAGGAGG ATCATTGAGA TGTTAAACCT TAGGAAGAGA AGGAGCGTTG GATGGTGGCC CGGCCAATTT TATTGA
Protein sequence | MIELVYQELI TKRSELVSKL NIVQGDLDKK LESLVKNNWE EFTPSSTIKK RIVAVDGGEF VKELRTGIVF VLNAEALITE GVQILDTDQE VKAGVFRPGN RAKERVGELM AIMELKLALQ NGSRGDWILM DGSLKKKLGE VRAQDSNFDF REGEITSLSQ EDEDIMLLHM IYEKQVYLSE LLKRYGSRTV WISKVSRTRD LFHHELSDIT LLETFTSSPG FSTVRCRSLL REEIQESDLR RPLDGIEMCS FYARLDYNEN VLRIDVIGRP TTEFIKGLLN DLYSVSVKGY PYPLVRVHYD VKVSGNDRRR IIEMLNLRKR RSVGWWPGQF Y
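
The gene sequence begins with GTG rather than ATG, yet the protein sequence begins with M. Under translation table 11, GTG is a permitted start codon, and an initiating codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene only. The codon table is deliberately partial (just the codons in that prefix), and `translate_prefix` is a hypothetical helper name, not a library function.

```python
# Partial codon table: only the codons that occur in the first 30 bases.
CODONS = {
    "GTG": "V", "ATA": "I", "GAG": "E", "CTC": "L", "GTC": "V",
    "TAT": "Y", "CAG": "Q", "GAA": "E", "CTG": "L",
}

# Start codons permitted by translation table 11.
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_prefix(spaced_dna: str) -> str:
    """Translate a space-separated CDS prefix, reading the start codon as M."""
    dna = spaced_dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    if not codons or codons[0] not in TABLE_11_STARTS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    # The initiating codon is translated as methionine, whatever its usual meaning.
    return "M" + "".join(CODONS[c] for c in codons[1:])


print(translate_prefix("GTGATAGAGC TCGTCTATCA GGAACTGATA"))  # prints: MIELVYQELI
```

The output matches the first ten residues of the protein sequence in the record.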