Gene Tpen_0376 details


Gene Information

Locus tag: Tpen_0376
Symbol:
ID: 4600776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 341219
End bp: 342457
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 52%
IMG OID: 639773137
Product: Pre-mRNA processing ribonucleoprotein, binding region
Protein accession: YP_919788
Protein GI: 119719293
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1498] Protein implicated in ribosomal biogenesis, Nop56p homolog
TIGRFAM ID:
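
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 341219
end_bp = 342457
gene_length_bp = 1239
protein_length_aa = 412

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not counted
# in the protein length.
codons = span_bp // 3
residues = codons - 1

print(span_bp, codons, residues)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.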


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.599925
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAATGA ATGTTTACTT GCACTTCACT CCTCTGGGAC CGGTGCTCGT AAACGAGGAA 
GGACAGATAC TCGCCAGCGA CATGATAACG CAGGATAAGG ATCCCGAGAA GATCGCCCGC
TTCATGTACG AGCTGGAAAC TGGAAACGTT CCAGAAAGGG TTCTAGGTTT TCTAGCCTCT
AGTCTCTCAA AGGAATACAC GCTGGCAGTA GAGGACGAGG AAGTGGCCAG AAAGATCTCC
CAAAGCTTAA AGGAGGTGAA AGTAACCGTG CAACCCGGGA GTAAGGTCCA CAGAGCTCTC
CGAGAGCAGC AACAAGCAAT CGTCGAGAAA GCATTCGGCA TATCATACTC GGACTATTAC
AGGCTAGTAC GAGAGGCAAC GATCCTGCTG GCACGGTGGA AAGTGAAGGA AGTCGCCGAA
AAAAGAGACC TCTACGTAGC GCAAGCAGTT AACGCGCTGG ACGATGTAAA CAAGACGATA
AACCTCTTCG CTTCGAGAGT GAGAGAGTGG TACGGGCTCC ACTTCCCGGA GCTTAACGAT
ATAGTCGAAG ACCACGAGGA CTACTTCAAA ATAGTGAGCA AACTGGGTTC TAGGAGCAAC
ATTTCCCTGG AAAAACTCAA AGAGCTGGGC TTTAAGGATG ACCTCGCCCA GAAAATAGTC
AAAGCAGCTT CCAACAGCAT GGGAGCCGAG CTAACAGAGT TCGACCTCAA CGCCATAAGG
CTCCTATCCG ACGCTGGACT CCAGCTCTAC AGCATACGGA GAAACCTAGA GAAGTACATA
GACGAGGCGA TGTACGACGT AGCTCCCAAC ATAAGGGGTC TCGTCGGGCC AACCCTGGGT
GCTAGGCTGA TTTCGCTCGC CGGAGGCTTA GAAAAGCTTG CCAGGTTGCC CGCGAGCACG
ATCCAGGTTC TGGGCGCCGA AAAAGCTCTC TTCAGAGCAC TCAGATTCGG CGCACGTCCT
CCCAAGCACG GAGTGATCTT CCAGCACCCG TACATACATA AATCGCCGAA ATGGCAAAGA
GGTAAGATTG CAAGGGCTCT TGCAGGCAAA CTCGCGATCG CTGCCAGGAT CGACGCGTTC
ACGGGAGAGT ATAAGGCAGA CGAGCTACGA GAAGACCTGG AAAAGAGGAT AGAAGAAATA
AAGACACTCT ATGCAAAGCC TCCCGCAAAG CAGGCTAAAA AAGAGCCTGC ACAGAAAAAG
TTTAGGGGGC ACGGCAAGAG GAAGGGTGAG AGCAAATGA
 
Protein sequence
MQMNVYLHFT PLGPVLVNEE GQILASDMIT QDKDPEKIAR FMYELETGNV PERVLGFLAS 
SLSKEYTLAV EDEEVARKIS QSLKEVKVTV QPGSKVHRAL REQQQAIVEK AFGISYSDYY
RLVREATILL ARWKVKEVAE KRDLYVAQAV NALDDVNKTI NLFASRVREW YGLHFPELND
IVEDHEDYFK IVSKLGSRSN ISLEKLKELG FKDDLAQKIV KAASNSMGAE LTEFDLNAIR
LLSDAGLQLY SIRRNLEKYI DEAMYDVAPN IRGLVGPTLG ARLISLAGGL EKLARLPAST
IQVLGAEKAL FRALRFGARP PKHGVIFQHP YIHKSPKWQR GKIARALAGK LAIAARIDAF
TGEYKADELR EDLEKRIEEI KTLYAKPPAK QAKKEPAQKK FRGHGKRKGE SK
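
The gene and protein sequences above are related by codon translation. The following is a minimal sketch of that step, assuming the standard amino-acid assignments (which translation table 11 shares with the standard code for non-start codons). The helper names `strip_blocks` and `translate` are hypothetical and not part of the record. The example input is the first line of the gene sequence.

```python
# Build the codon table in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def strip_blocks(seq: str) -> str:
    """Remove the spaces and line breaks used to print sequences in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = strip_blocks(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from this record (60 bases, 20 codons).
first_line = "ATGCAAATGA ATGTTTACTT GCACTTCACT CCTCTGGGAC CGGTGCTCGT AAACGAGGAA"
print(translate(first_line))
```

Passing the full gene sequence to `translate` in the same way should give a protein that can be compared against the listed protein sequence once its block spacing is stripped with `strip_blocks`.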