Gene Information |
Locus tag | Msed_0958 |
Symbol | |
ID | 5104510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 884477 |
End bp | 885676 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640506860 |
Product | hypothetical protein |
Protein accession | YP_001191053 |
Protein GI | 146303737 |
COG category | [S] Function unknown |
COG ID | [COG1602] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
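The Start bp, End bp, Gene Length and Protein Length fields above are arithmetically related. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(884477, 885676)
print(length)                     # 1200
print(protein_length_aa(length))  # 399
```

Both values agree with the Gene Length (1200 bp) and Protein Length (399 aa) listed in the record.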
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0243215 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGTGG ATCCCCACCT ATGTATTGCC TGTAGGGGAG CCAAGTACCT TTGCGGTCTC AGTTATTGTC CTGTCCTAGT TAAGAACCTC TCCATGAAGG TAAAGGTGGG GAAGTTTGTC GAGGGAGATT CTCCTCCCTC CGTCTTCGTG GGAAGGTTTG GTTATCCCAA GATTACCGTA TATCCTTCGA CTCCCCCAGA GTTTGGAGAC ACTTCCATGT ACGAGGATCC TAGAGCCTGG TTAGCCATGG ACATCAACAG GTTCTTGGCC ATGAGGATGT CCGTGGTACG AGGTGGAATT CAGTTCAAGG TCAGTGAGGC TAGAGCTCCT GGAAGGGAGT TGTACGACGT TCAGGTGGCC TCCCTCTCTC CTAGGCCTGT GGAAATGGAG CTTGACCTGG AGTCCGTTCC AAGGGGGAGA GTTCTAAGCG AGACGGTTCC CCCTCTAGGT CCCTCAGCTC CCCTGAAGAG GCTTAGGCTA GGCGCTCTAC CCCCTCCTGA GAGGGTAGTG GAGAAAGTCT TCCAGGAGAG GGACATGAAG GCAGGAAAGG CAATAGAGAG GTTATACAGT GACGGTATTC CCGTGGAGAG GATAGCCCGC CTTCTCAGCG TGGGTAACCT AGGTGTGGAG AGGAAGCTTG TCCCCACCAG GTGGAGTATT ACGGCAGTGG ACAAGACTCT GTCGGACCTC CTCGTGAGGA AGATCAAGGA GTACCCCTCA ATTGACCAGA TAGAGGTATA CGTGAGGAAG TTCAGGCTTA ACACCTTCGT GGCAATCCTG GTTCCTGGTG AGTGGGCATT TGAGTGGGGA GAGGCGTGGT TCCCATCAAC GACGTGGAAC ATGTGGGGGA GCTCGCCTCA GGTAGAGGTT GACTACGAGG GATATTTTGG GAGGAGAACA TATCCTGATA TAGGTGGATG TTATTACTCC TCTAGGCTGG CCGTAGCTGA ACACCTGGAA AGGAGGAGGA GACAGGCAAT TCCGATCCTG TGGAGGGAGA TCTATCCAGG TTTCTACTTC CCTGTTGGAG TGTGGTTCGT CAGAGAAAAC GTCAGGGAAT TGCTGAGGGG TGAGAGCGTG AAGTTCGACA CACTGAGCGA AGCGTTGAAG TTCCTTGAGG GAGTACTCAA GGTCAGTCCT CACGAGTGGG CTAAACATTC TGGATTGATT CCCATGATAA GGTCGAGGTT ATTCCCATGA
|
Protein sequence | MYVDPHLCIA CRGAKYLCGL SYCPVLVKNL SMKVKVGKFV EGDSPPSVFV GRFGYPKITV YPSTPPEFGD TSMYEDPRAW LAMDINRFLA MRMSVVRGGI QFKVSEARAP GRELYDVQVA SLSPRPVEME LDLESVPRGR VLSETVPPLG PSAPLKRLRL GALPPPERVV EKVFQERDMK AGKAIERLYS DGIPVERIAR LLSVGNLGVE RKLVPTRWSI TAVDKTLSDL LVRKIKEYPS IDQIEVYVRK FRLNTFVAIL VPGEWAFEWG EAWFPSTTWN MWGSSPQVEV DYEGYFGRRT YPDIGGCYYS SRLAVAEHLE RRRRQAIPIL WREIYPGFYF PVGVWFVREN VRELLRGESV KFDTLSEALK FLEGVLKVSP HEWAKHSGLI PMIRSRLFP
|
| |
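The protein sequence above corresponds to the gene sequence read in frame from its first base. The following is a minimal sketch of that translation, assuming the standard codon assignments (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons). The function name `translate` and the short test fragments are illustrative; the fragments are copied from the first 21 and last 9 bases of the gene sequence in this record.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Drop the spaces used to group the sequence in blocks of ten.
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First seven codons of the gene sequence.
print(translate("ATGTATGTGG ATCCCCACCT A"))  # MYVDPHL
# Last three codons of the gene sequence, ending in the TGA stop.
print(translate("TTCCCATGA"))                # FP
```

The two fragments reproduce the first seven residues (MYVDPHL) and the last two residues (FP) of the listed protein sequence.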