Gene Tpen_0461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0461 
Symbol 
ID4600988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp421558 
End bp422574 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content53% 
IMG OID639773228 
Productputative rRNA pseudouridine synthase 
Protein accessionYP_919873 
Protein GI119719378 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0130] Pseudouridine synthase 
TIGRFAM ID[TIGR00425] rRNA pseudouridine synthase, putative
[TIGR00431] tRNA pseudouridine 55 synthase
[TIGR00451] uncharacterized domain 2 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCCGCCA AGTACCTTGC AGCGACCCCT TCCCGCGAGA TATACGTGAG GTCCGAAGAC 
GAGACGGATT TTTCCTACGG GTGGCTCCCA GAAGAGCGAC CGCCGGAAGT ACTCCTCAAG
TACTCCGTCA TAAACCTAGA CAAACCCGTG GGACCCTCAA GCCACGAGGT TGTAGCGTGG
CTTAGGAAGC TCCTAGGGAT AGAGAGAATA GCCCACGCTG GCACGCTGGA TCCAAAGGTT
TCCGGGGTTC TTCCCATAAC GTTGAACAAC GCTGTACGCG TACTACCTGT GCTGCTAAAA
GAGGACAAGG AGTATGTGTG TGTAATGAGG CTGCACGGCG ACGTCGACCC GGAAAGGCTT
GAAAGAGTAG TCAGCATGTT TAAAGGCAGG ATTTACCAGA GGCCACCGCT AAGGTCTGCT
GTGAAGAGGG AGGTTCGAAT AAGGCAGATC TACGATATTA GGCTTTTAGA GTTTAACGAG
AGGACTGCCC TCCTCCATGT CTGGTGCGAA GCTGGAACCT ACATGCGTAA GCTGTGCCAC
GATATCGGAG AGATACTCGG TGTCGGGGCG CACATGCAGG AACTTAGAAG GATCAGGTCT
GGGAGTCTCT ATGAGGACAG GAACTGCTCT ACGATGCACG ACGTCGTGGA CGCGTACTAT
ATCTGGAAAG AGAGAGGGAT AGACCACTTT CTACGGCAAG TCTTTCTGCC TGTTGAAGCA
GCGATCCAGC ATCTTCCAAA AGTATGGATT AGAGACTCCG CGGTAGATGC TGTATGCCAT
GGAGCCCCTC TGGCTGTCCC CGGGATAGTA AAGCTAGAGG GAGGTATAAA GGTGAACAGC
ACAGTGGCTA TTCTCACGCT GAAAGGCGAG CTAGTAGCCA TCGGGAAAGC GCAAATGACC
ACGGAGAAGA TGCTTACAGA GTCTAGCGGG ATTGCCGTGA AAACGGAGCA CGTTGTCATG
GATCCGGGGA CCTACCCAAG GAAGTGGAAG TCCCATAAGG AGGCCTCAGG CCCCTAG
 
Protein sequence
MPAKYLAATP SREIYVRSED ETDFSYGWLP EERPPEVLLK YSVINLDKPV GPSSHEVVAW 
LRKLLGIERI AHAGTLDPKV SGVLPITLNN AVRVLPVLLK EDKEYVCVMR LHGDVDPERL
ERVVSMFKGR IYQRPPLRSA VKREVRIRQI YDIRLLEFNE RTALLHVWCE AGTYMRKLCH
DIGEILGVGA HMQELRRIRS GSLYEDRNCS TMHDVVDAYY IWKERGIDHF LRQVFLPVEA
AIQHLPKVWI RDSAVDAVCH GAPLAVPGIV KLEGGIKVNS TVAILTLKGE LVAIGKAQMT
TEKMLTESSG IAVKTEHVVM DPGTYPRKWK SHKEASGP