Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0157 |
Symbol | |
ID | 5105010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 126507 |
End bp | 128312 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640506060 |
Product | hypothetical protein |
Protein accession | YP_001190258 |
Protein GI | 146302942 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0273927 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTATAA AGTTATCAAG AAAACTTTCA ATGAGCTATA TTGATCTAGT GGCCTGGTTA GCTGAGAATA GGGAGAGGTT GGTCGGTTGT AGAATAGATA ACATTTTTGC TACTAATTTA CCGCATCTGT ACATCTTCGT TATTCATTGC CACAACGGCG ATTCTCAGCT TGTGATCCAG CCCGGTAAAA GGGTACACTT TACAAAGTTT AACCATGAGA GACTGCTTGA TTCGAAGGCA AAGATGCTAC GTGAGTTAAT AAGGGGTGAA CTGATAGAAG ACGTTGATGT AGTTAATGGA GAAAGAATTT TAAGAATGAA ACTAAAGGAC AAAATTGTAT ATATAGAGCT TCTTCCAAAG GGTACGTTAA TTGTGACGGA TAACGATAAT AGAATCAAGT TTGCACTGGA ACAAAGGGAG TTCAAGGACA GAACACTGAA GCCTGGGGAG CTCTACGTTC TACCTCCGTC TCCAACAGAG CTAAAACCTA ACGAGATAGA AAGCTATCTA AAGAAGGGCG CCCTTTCCAG GGTTCTAGGG GTTCCACAGG AATTTCTTAA TATTCTCTCA ATAAATGCAA ATAATCTAGA TGAATTGGAG GAAGCTAAGA AAAAACTGGA AAAAGTTATG CAAGATATAC AACACGGCGT AATTCAGCCT TGCGTGGATT TGGAGAGAAC GGTGTGGCCA GTTCGCTTTC CGGGATGTAC CGAATTACCC AGTTATAATG AGGCCTTGGA CAACTATTTC ACCTCGCTGG AAAAGGCGGA GCTTGAGAAA CTGGTTGATG AGGGGGAAGA GAAGAAGCTT GAGGCCACAA TCTCCAAGCT CAAGGAGACT TTGACTAAGA TGGAGGAGGA AGCTGAGACT TTGAGGAAGA AAGGTAAGGC AATAATGAAT AATTATCTAG AGGTTGAGGA AAAGATTAAG GAGGGTGCGA AGGAAATAGA GATTGAAGGG TTAAAGATAG AGATAGATCC CAAGATTTCA GCTTCTAAAA ACGCATCTCA ATATTTTGAA AAGGCCAAGG AATTAGATGC AAAGATAAGG AGGACAAGGG AGACAATCGA GGAGTTGGAA AAGAAAAAAC AGGAAATTAA GGCTAAGTCT AAGGAGACCA TTGAAGGAAG CAAGATTCTG GTAAGAAAGA AGGAGTGGTA TGAAAGGTAT CATTGGACCA TCACATCTAA TGGTTTCATT GTGATAGCTG GGAGGGATAT TGACCAGAAT GAGAGTATTG TGAGGAAAAT GCTAGAGGAC AAGGATATCT TTTTGCATGC AGATATCCAG GGGGCTCCAG CCACTGTGAT TAAGAATCCA GTTGGTATAG GAGAGCAGGA TCTAATGGAC GCTGCAGTGT TGGCAGGTTG TTACTCTAAA GCGTGGAAAT TGGGGCTAGC TAGCATAGAT GTGTTTTGGG TTTACGGAGA GCAGGTCTCA AAGTCGCCAC CCTCAGGCGA ATATCTGCCC AAGGGATCCT TCATGATCTA TGGGAAAAAG AATTACATCA AGAACGTGAA ACTAGAGTTG ACAATTGGGG TAAACGTGGA AAGCGATTTC AGGATTGAGG TGGGTTCATT TGAAGCTATT TCCAAAAGAT GTAAGGTATT TGTCACCATA ACTCCAGGAG ATTCTGATCC AGAAAAACTA GGAGATAGAA TCAGCAGGAT ATTCGCTAGG GAACTGGGTG TGGATGGGGT TAAGGCTCTG AAGGATGAAA TAGTGAGGAT GATCCCAGGG AAATCCAAAA TTAAGGGCAC AACACACCAG CTGGCTAACT CAACCGGATT GAATCTTAAG GATTAA
|
Protein sequence | MSIKLSRKLS MSYIDLVAWL AENRERLVGC RIDNIFATNL PHLYIFVIHC HNGDSQLVIQ PGKRVHFTKF NHERLLDSKA KMLRELIRGE LIEDVDVVNG ERILRMKLKD KIVYIELLPK GTLIVTDNDN RIKFALEQRE FKDRTLKPGE LYVLPPSPTE LKPNEIESYL KKGALSRVLG VPQEFLNILS INANNLDELE EAKKKLEKVM QDIQHGVIQP CVDLERTVWP VRFPGCTELP SYNEALDNYF TSLEKAELEK LVDEGEEKKL EATISKLKET LTKMEEEAET LRKKGKAIMN NYLEVEEKIK EGAKEIEIEG LKIEIDPKIS ASKNASQYFE KAKELDAKIR RTRETIEELE KKKQEIKAKS KETIEGSKIL VRKKEWYERY HWTITSNGFI VIAGRDIDQN ESIVRKMLED KDIFLHADIQ GAPATVIKNP VGIGEQDLMD AAVLAGCYSK AWKLGLASID VFWVYGEQVS KSPPSGEYLP KGSFMIYGKK NYIKNVKLEL TIGVNVESDF RIEVGSFEAI SKRCKVFVTI TPGDSDPEKL GDRISRIFAR ELGVDGVKAL KDEIVRMIPG KSKIKGTTHQ LANSTGLNLK D
|
| |