Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0376 |
Symbol | |
ID | 4600776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 341219 |
End bp | 342457 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639773137 |
Product | Pre-mRNA processing ribonucleoprotein, binding region |
Protein accession | YP_919788 |
Protein GI | 119719293 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1498] Protein implicated in ribosomal biogenesis, Nop56p homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.599925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATGA ATGTTTACTT GCACTTCACT CCTCTGGGAC CGGTGCTCGT AAACGAGGAA GGACAGATAC TCGCCAGCGA CATGATAACG CAGGATAAGG ATCCCGAGAA GATCGCCCGC TTCATGTACG AGCTGGAAAC TGGAAACGTT CCAGAAAGGG TTCTAGGTTT TCTAGCCTCT AGTCTCTCAA AGGAATACAC GCTGGCAGTA GAGGACGAGG AAGTGGCCAG AAAGATCTCC CAAAGCTTAA AGGAGGTGAA AGTAACCGTG CAACCCGGGA GTAAGGTCCA CAGAGCTCTC CGAGAGCAGC AACAAGCAAT CGTCGAGAAA GCATTCGGCA TATCATACTC GGACTATTAC AGGCTAGTAC GAGAGGCAAC GATCCTGCTG GCACGGTGGA AAGTGAAGGA AGTCGCCGAA AAAAGAGACC TCTACGTAGC GCAAGCAGTT AACGCGCTGG ACGATGTAAA CAAGACGATA AACCTCTTCG CTTCGAGAGT GAGAGAGTGG TACGGGCTCC ACTTCCCGGA GCTTAACGAT ATAGTCGAAG ACCACGAGGA CTACTTCAAA ATAGTGAGCA AACTGGGTTC TAGGAGCAAC ATTTCCCTGG AAAAACTCAA AGAGCTGGGC TTTAAGGATG ACCTCGCCCA GAAAATAGTC AAAGCAGCTT CCAACAGCAT GGGAGCCGAG CTAACAGAGT TCGACCTCAA CGCCATAAGG CTCCTATCCG ACGCTGGACT CCAGCTCTAC AGCATACGGA GAAACCTAGA GAAGTACATA GACGAGGCGA TGTACGACGT AGCTCCCAAC ATAAGGGGTC TCGTCGGGCC AACCCTGGGT GCTAGGCTGA TTTCGCTCGC CGGAGGCTTA GAAAAGCTTG CCAGGTTGCC CGCGAGCACG ATCCAGGTTC TGGGCGCCGA AAAAGCTCTC TTCAGAGCAC TCAGATTCGG CGCACGTCCT CCCAAGCACG GAGTGATCTT CCAGCACCCG TACATACATA AATCGCCGAA ATGGCAAAGA GGTAAGATTG CAAGGGCTCT TGCAGGCAAA CTCGCGATCG CTGCCAGGAT CGACGCGTTC ACGGGAGAGT ATAAGGCAGA CGAGCTACGA GAAGACCTGG AAAAGAGGAT AGAAGAAATA AAGACACTCT ATGCAAAGCC TCCCGCAAAG CAGGCTAAAA AAGAGCCTGC ACAGAAAAAG TTTAGGGGGC ACGGCAAGAG GAAGGGTGAG AGCAAATGA
|
Protein sequence | MQMNVYLHFT PLGPVLVNEE GQILASDMIT QDKDPEKIAR FMYELETGNV PERVLGFLAS SLSKEYTLAV EDEEVARKIS QSLKEVKVTV QPGSKVHRAL REQQQAIVEK AFGISYSDYY RLVREATILL ARWKVKEVAE KRDLYVAQAV NALDDVNKTI NLFASRVREW YGLHFPELND IVEDHEDYFK IVSKLGSRSN ISLEKLKELG FKDDLAQKIV KAASNSMGAE LTEFDLNAIR LLSDAGLQLY SIRRNLEKYI DEAMYDVAPN IRGLVGPTLG ARLISLAGGL EKLARLPAST IQVLGAEKAL FRALRFGARP PKHGVIFQHP YIHKSPKWQR GKIARALAGK LAIAARIDAF TGEYKADELR EDLEKRIEEI KTLYAKPPAK QAKKEPAQKK FRGHGKRKGE SK
|
| |