Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0522 |
Symbol | |
ID | 5103682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 478226 |
End bp | 479185 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640506426 |
Product | hypothetical protein |
Protein accession | YP_001190621 |
Protein GI | 146303305 |
COG category | [S] Function unknown |
COG ID | [COG1814] Uncharacterized membrane protein |
TIGRFAM ID | [TIGR00267] conserved hypothetical protein TIGR00267 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.655797 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.060241 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGAGT TAGTTAACAG GAATTATAGG GCAGAGCTGT TCGGTGAGGA ACTTTACTCT GCTCTTGCCA GGGACGAAAA GTCCGAGAAG GTCAGGGAGG TACTACAGGA ACTCTCTGAG GGAGAGGGAA GACATGCTAA GTTCTGGGAG AGTATAGCCA GAAGCAGAGG GATTAAGCTG AGGGGATTGG GATTTCTGGA CAAATTGAAG CTCTGGATCT TGCGAAGACT CAGGAAAGTT CTGGGTCTAG CCCTAGTCCT AAAGATGGTT GAGGCGGGCG AGGAGAACGA TGCTGAGAAA TATTACCGCT TCTCCACCTC GCAAGAGTTT ACGGACACGG AGAGACAGGG CTTCAGGGAC ATTATGATGC AGGAAATGGT CCACGAAGAT ATGCTCATAC AGACGCAGGT GAACGTGGAT ACTGTTAGGG ATACCATCTA CGCTATTAGC GATGGCCTAA TCGAGGTACT AGCTTCGGTG TCAGGACTTG CAGGTATATT CTCGGTTCCC CTTTATGTAG CACTGGGAGG ACTCATCGTA GGAGTATCGG GAATGATATC CATGAGCATT GGGGCTTATC TTTCGGCGAA ATCTGAGGAG GACATAAGGA ACAATGCCCT CAGGAAGGCT AGGCTGAGGA GTCTTCTAGA AGGGGAGAAA CATGAGGAGG ACCAGGAGGT GTCCAGAACA GGGGAGAGTG TGAGAACCAC TGCGATCTCC TATATCATGG GAGCGATAAT TCCAATCCTA CCCTTTCTCC TGGGTTTAGG CGGTCTAGTT GGCTTGGTTA CCTCATATGC CGTGACTGGT GTCGCCACTT TCATAGTGGG TGCGCTCATA GGTGTACTCA GCGATGTGAG CCCCTGGAGA AAGGGGGCGG TAATGACTGG GTTGGCGCTG GGAGCAGCAC TAGTGACTCA CGTTCTGGGA CTATTGGCAC ATCTAGCGGG TTTTTCCTAA
|
Protein sequence | MEELVNRNYR AELFGEELYS ALARDEKSEK VREVLQELSE GEGRHAKFWE SIARSRGIKL RGLGFLDKLK LWILRRLRKV LGLALVLKMV EAGEENDAEK YYRFSTSQEF TDTERQGFRD IMMQEMVHED MLIQTQVNVD TVRDTIYAIS DGLIEVLASV SGLAGIFSVP LYVALGGLIV GVSGMISMSI GAYLSAKSEE DIRNNALRKA RLRSLLEGEK HEEDQEVSRT GESVRTTAIS YIMGAIIPIL PFLLGLGGLV GLVTSYAVTG VATFIVGALI GVLSDVSPWR KGAVMTGLAL GAALVTHVLG LLAHLAGFS
|
| |