Gene Tpen_0354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0354 
Symbol 
ID4600892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp323597 
End bp324931 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content53% 
IMG OID639773115 
Productpseudouridylate synthase 
Protein accessionYP_919766 
Protein GI119719271 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1258] Predicted pseudouridylate synthase 
TIGRFAM ID[TIGR01213] conserved hypothetical protein TIGR01213 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTACAG GTACTAGCGC CAGCTACAAG AGGGATAATG CTTTCTACGC GCTAGACAAG 
GTAGAGAGGA TACTACTGGA CGGGTACTCT CTATGCGACG CCTGCACGGG TAGGCTTTTC
GGTCTGAGAG GCTACGGGCT CTCGAACACC GAAAGAGGAC GTGCCCTCAA GACACTCCTG
ATCATGAAGG CTTTTCAGGC TTCTCCTAGG CAGGCAGACC TGGAACTCTT ACGTGTACTG
GCAAGGACGG GGTTCGAGCC CGCCCGGGAA CTCTTAAAAA AGCTTTCAGG AGAAGATGTA
GAAGTCAAGG CGTGTAGTAT TTGCGAGGGG CTCACGGGCA GGTACTACGA GCTTGCGCTA
AGAGCTGTCG AAGAGGCGAA AAGCTACGAG TTTAACACGT TCGAGGTAGG CGTAAGGATC
GACGCCGAGG TGATTCGCCG GGAGGAAGAG CTCTGGCGCA GGTACGGCCT GGAGAGCGCG
GAGAGCATAC GGAACGAGGC GAGCAGGGAA GTAGGGAAAA TAATCTCTAA GCTGACCGGG
AAGGAGTATT CTAGGAACAA TAGCGAGTTG CTGATAATAG TCGACTTGTC GGCGGGCGCA
ATAGAGCTTC ACCCGGCCCC CGTGTTCGTG TACGGTAGGT ATAGGAAGTA CGCGAGAGGG
CTACCCCAAA ACCCCTGGCC GCAACCCGAC GAGAGGATAA AGTTCAACAC AAGTATAGAG
GAGCTGATAG TTAAACCCGC ACTTGAACTA TTCGAGGCCG AGAAAGCGAA GTTCCACGCC
GCCGGGAGAG AGGATATAGA CGTGCGCACG CTTGGAACTG GAAGACCCTT CGTACTTGAG
ATAAAGAAGC CGCGTAAGCG AAACATTGAC CTAAAGGTTC TCGCAGAGAA GATAAACTCG
GGTGCAGGGG GGTTAATAGA GGTTCTGGAC CTCGCGTACA CCGACCGGAA AACGATAAAG
AAGCTGAAGA GCCTGGCATC AATAGCTAAG AAAGCCTACG TAGCCAGGGT AAAGTTCGAG
AAACCTGTCG ACGACGAGAA ACTTGCAGAG ATCTCTAAGG TTTTCTCCAA CGCAGTTATT
AACCAGCGTA CGCCTACGAG GGTTCTCCAC CGCCGCGTAG ACAAGCTTAG GAAGAAGATT
GTCTACAGGT TGGAGGCGAG AAAAATCTCC CAAGACGAGG TAGAGTTCTA CCTAGAAACT
CAGGGAGGCT TTTACGTGAA GGAGTTCATA CACGGCGATA ACGGGAGGAC TACTCCGAGT
ATAGCGGAGT TCCTTGGAAA CAACGTCCTA AGCATAGAGC TCGACGTAGT CAGTATAGAA
GAAACTGCGG CCTAG
 
Protein sequence
MSTGTSASYK RDNAFYALDK VERILLDGYS LCDACTGRLF GLRGYGLSNT ERGRALKTLL 
IMKAFQASPR QADLELLRVL ARTGFEPARE LLKKLSGEDV EVKACSICEG LTGRYYELAL
RAVEEAKSYE FNTFEVGVRI DAEVIRREEE LWRRYGLESA ESIRNEASRE VGKIISKLTG
KEYSRNNSEL LIIVDLSAGA IELHPAPVFV YGRYRKYARG LPQNPWPQPD ERIKFNTSIE
ELIVKPALEL FEAEKAKFHA AGREDIDVRT LGTGRPFVLE IKKPRKRNID LKVLAEKINS
GAGGLIEVLD LAYTDRKTIK KLKSLASIAK KAYVARVKFE KPVDDEKLAE ISKVFSNAVI
NQRTPTRVLH RRVDKLRKKI VYRLEARKIS QDEVEFYLET QGGFYVKEFI HGDNGRTTPS
IAEFLGNNVL SIELDVVSIE ETAA