Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1722 |
Symbol | |
ID | 5105085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1661163 |
End bp | 1662392 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640507617 |
Product | C/D box methylation guide ribonucleoprotein complex aNOP56 subunit |
Protein accession | YP_001191801 |
Protein GI | 146304485 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1498] Protein implicated in ribosomal biogenesis, Nop56p homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTT ATTTGGCTGA GCACACAATA GGGTCCTTTG CCTTTGATGA GTCAGGGAAC CTTTTGGATT ACGTCCTTAA TCCAAAGGAA CTGGGCAAGG TAGTTGATAT TCTAATAAAC GCCGAGAAAG GTGAACCAAT GCCCTCTACC ATGGAATTAA TTCAGAAACT GAAACCGTCG GAAGTAGTTG TGGAAAGCGA AACGGAGAGC TCAAGGATGC AGACCTTAGG GATCAAGGTT GTATCCAAGC CCCATCATGT GGGAGCCAGG GCTTTGAGGG GTTCTCTCGC TGAACTAGCA GTAAAAACAA AATTCGCTGA AAATCCGAGC GAAGTGTACA ATTTCCTTTA TCAAGTATCT CTAGAATATA CGAGAAGAAA GCTGAGAAAG GCCGCCCAGA AAAGGGACCT TCTCGCCATA CAGGCCATAA GGGCTATCGA CGATATTGAT AAGACCATTA ACCTTTTCTC CGAAAGATTA AGGGAGTGGT ATAGTATACA CTTCCCCGAA GCCGATAAAC TGGTTGAGGA CCATGAACAA TACGCCAAAA TAGTTTCCCT GGCTGGTTAT AGGGATAATG TAACGGTGGA GACGTTAACC GAGATAGGAC TTAATGAGCA AAGGGCTAAG AAGCTAGCCG ATGCTGCCAA GAAGAGTATA GGAGCAGACA TCTCAGATGC GGATATCAAC TCCATCAGGG ATCTGGCTAA CACGATTTTG TCTCTTTTCA AGCTAAGGAA CTCGCTTTAC GACTACTTGG ACTCAATTAT GAGGGAAGTA GCTCCCAACG TGACTGAACT AGTGGGTCCC ACCCTTGGTG CTAGGCTGTT AAGTCTGGCA GGGAGCCTTG AGGAACTTTC TAAGATGCCA GCTAGTACGA TTCAAGTGTT AGGGGCTGAG AAAGCCCTCT TTAGGGCACT TAAGAGCGGA AGCAGACCAC CCAAACATGG AATCATTTTC CAGTATCCAG CAATTCACGT CTCTCCCAGA TGGCAGAGAG GGAAGATTGC CAGGGCCCTA GCTGCCAAGC TAGCAATAGC ATCAAGGATA GACGCCTATA GCGGAAGATT TGTGGGAACA CAGCTTGTGG AACAGGTGAA TAAGAGAATC GAGGAGATAA AAACGAAATA TGCCCAGCCA CCACCCAAAA AACAACAACC AGCTAAGGAA GAGGGGAAGA GATTTGATAA AAGAGAGCAT AAAAAGGGGA AAAAGGGAAA GAGAAGGTAG
|
Protein sequence | MKIYLAEHTI GSFAFDESGN LLDYVLNPKE LGKVVDILIN AEKGEPMPST MELIQKLKPS EVVVESETES SRMQTLGIKV VSKPHHVGAR ALRGSLAELA VKTKFAENPS EVYNFLYQVS LEYTRRKLRK AAQKRDLLAI QAIRAIDDID KTINLFSERL REWYSIHFPE ADKLVEDHEQ YAKIVSLAGY RDNVTVETLT EIGLNEQRAK KLADAAKKSI GADISDADIN SIRDLANTIL SLFKLRNSLY DYLDSIMREV APNVTELVGP TLGARLLSLA GSLEELSKMP ASTIQVLGAE KALFRALKSG SRPPKHGIIF QYPAIHVSPR WQRGKIARAL AAKLAIASRI DAYSGRFVGT QLVEQVNKRI EEIKTKYAQP PPKKQQPAKE EGKRFDKREH KKGKKGKRR
|
| |