Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0354 |
Symbol | |
ID | 4600892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 323597 |
End bp | 324931 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639773115 |
Product | pseudouridylate synthase |
Protein accession | YP_919766 |
Protein GI | 119719271 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1258] Predicted pseudouridylate synthase |
TIGRFAM ID | [TIGR01213] conserved hypothetical protein TIGR01213 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTACAG GTACTAGCGC CAGCTACAAG AGGGATAATG CTTTCTACGC GCTAGACAAG GTAGAGAGGA TACTACTGGA CGGGTACTCT CTATGCGACG CCTGCACGGG TAGGCTTTTC GGTCTGAGAG GCTACGGGCT CTCGAACACC GAAAGAGGAC GTGCCCTCAA GACACTCCTG ATCATGAAGG CTTTTCAGGC TTCTCCTAGG CAGGCAGACC TGGAACTCTT ACGTGTACTG GCAAGGACGG GGTTCGAGCC CGCCCGGGAA CTCTTAAAAA AGCTTTCAGG AGAAGATGTA GAAGTCAAGG CGTGTAGTAT TTGCGAGGGG CTCACGGGCA GGTACTACGA GCTTGCGCTA AGAGCTGTCG AAGAGGCGAA AAGCTACGAG TTTAACACGT TCGAGGTAGG CGTAAGGATC GACGCCGAGG TGATTCGCCG GGAGGAAGAG CTCTGGCGCA GGTACGGCCT GGAGAGCGCG GAGAGCATAC GGAACGAGGC GAGCAGGGAA GTAGGGAAAA TAATCTCTAA GCTGACCGGG AAGGAGTATT CTAGGAACAA TAGCGAGTTG CTGATAATAG TCGACTTGTC GGCGGGCGCA ATAGAGCTTC ACCCGGCCCC CGTGTTCGTG TACGGTAGGT ATAGGAAGTA CGCGAGAGGG CTACCCCAAA ACCCCTGGCC GCAACCCGAC GAGAGGATAA AGTTCAACAC AAGTATAGAG GAGCTGATAG TTAAACCCGC ACTTGAACTA TTCGAGGCCG AGAAAGCGAA GTTCCACGCC GCCGGGAGAG AGGATATAGA CGTGCGCACG CTTGGAACTG GAAGACCCTT CGTACTTGAG ATAAAGAAGC CGCGTAAGCG AAACATTGAC CTAAAGGTTC TCGCAGAGAA GATAAACTCG GGTGCAGGGG GGTTAATAGA GGTTCTGGAC CTCGCGTACA CCGACCGGAA AACGATAAAG AAGCTGAAGA GCCTGGCATC AATAGCTAAG AAAGCCTACG TAGCCAGGGT AAAGTTCGAG AAACCTGTCG ACGACGAGAA ACTTGCAGAG ATCTCTAAGG TTTTCTCCAA CGCAGTTATT AACCAGCGTA CGCCTACGAG GGTTCTCCAC CGCCGCGTAG ACAAGCTTAG GAAGAAGATT GTCTACAGGT TGGAGGCGAG AAAAATCTCC CAAGACGAGG TAGAGTTCTA CCTAGAAACT CAGGGAGGCT TTTACGTGAA GGAGTTCATA CACGGCGATA ACGGGAGGAC TACTCCGAGT ATAGCGGAGT TCCTTGGAAA CAACGTCCTA AGCATAGAGC TCGACGTAGT CAGTATAGAA GAAACTGCGG CCTAG
|
Protein sequence | MSTGTSASYK RDNAFYALDK VERILLDGYS LCDACTGRLF GLRGYGLSNT ERGRALKTLL IMKAFQASPR QADLELLRVL ARTGFEPARE LLKKLSGEDV EVKACSICEG LTGRYYELAL RAVEEAKSYE FNTFEVGVRI DAEVIRREEE LWRRYGLESA ESIRNEASRE VGKIISKLTG KEYSRNNSEL LIIVDLSAGA IELHPAPVFV YGRYRKYARG LPQNPWPQPD ERIKFNTSIE ELIVKPALEL FEAEKAKFHA AGREDIDVRT LGTGRPFVLE IKKPRKRNID LKVLAEKINS GAGGLIEVLD LAYTDRKTIK KLKSLASIAK KAYVARVKFE KPVDDEKLAE ISKVFSNAVI NQRTPTRVLH RRVDKLRKKI VYRLEARKIS QDEVEFYLET QGGFYVKEFI HGDNGRTTPS IAEFLGNNVL SIELDVVSIE ETAA
|
| |