Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pcal_1782 |
Symbol | |
ID | 4908172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum calidifontis JCM 11548 |
Kingdom | Archaea |
Replicon accession | NC_009073 |
Strand | + |
Start bp | 1652801 |
End bp | 1654036 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640125531 |
Product | pseudouridylate synthase |
Protein accession | YP_001056665 |
Protein GI | 126460387 |
COG category | [S] Function unknown |
COG ID | [COG0585] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00094] tRNA pseudouridine synthase, TruD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00000171757 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGGCCC CGCCATTTGA CAAAAGCTTG GGGATGCAAT ACTACGCCAC GGACACTTGC CCGGCCGGGG GCCGTATCAA GGAAAGGGTG GAGGATTTCG TCGTGGAGGA GATCCTCAAA GACGGCACTG TTGTCTCTGT CAACGGCGTT GCGCTCACGC CGAGGGTTGG GAATTGGACG TGGATTCACG TCGTGAAGAG GGGCGTGGAC ACTCTTAAGT TTGTGTTAAG GTTGGCCAAG GCTCTTGGGG TAAATCCACG CGACATATCT ATCGGCGGCA TAAAGGACAC TAGGGCGGTC ACTTCGCAGA TAGTGTCTGT GAGGGGAACC GTCTCCAGCC TGCCCCAGAT CCCCGGGGTA GAGTTCTTAG GCATGTGGCC GATGGATAGG CCTATTACGC CGGCGGAGAT ATATGGGAAT AGATTTACGA TTGTTGTCAG AGGCGTGGAT AGGGCGGACT GCGCCGCTGA GGCGCTTGCC GCTTTGGCCA AGACGCCGAT TCCCAACTAC TACGGCTATC AGCGCTTCGG GACTGTGCGC CCCGTGGGCC ACCTCCTCGG CCTAGCCCTC TTGAAGAAGG ACGCCGAGGC GTTCTTCGAC GTCATGTTTT GTAAAATATT TCCGCGGGAG TCCGAGGCCG CCAAGAGAGC GAGGGAGTTG GCGTGTAGGG GGGAGTACGC CAAGGCGCTG GAGGCTTTTC CCAAGAGGTT TCTCGAGGAG AGGGCTTTTC TGAGGAAGCT GGTGGAGGGG GCGGACCTCT GGAACGCGGC GATGGCTATA CCGGGGCAGA TCCTCAAGAT CTACATAGAG GCTGCCCAGT CCTACGTCTT TAACCTCTTT CTCTCGAGGC GGATGGAGCT GGCGCCGCTG GAGCCCGTGG AGGGGGACTT AGTGGACGTG GGGGGCCAAG TGGCGTACTA CGTGGAGGGC CTCGCGGGGG ATCTCGTCTT GCCCGTGCCC GGCGCGGGGG TTAAGATGCC GAGGGGCAGA GTGGGCGAGG CTCTTGTAAG AGTGGTGAAG GACCTAGGCC TCGACCCAGC CCTCTTCTTG AAGATGCCGA GGGGCTTGAG GGCGTATGGT AGCTATAGGC GCGCCCGGCT GGACGTAGGC CAGCTGGAGT ACAAGGTCGC CGGCGGCGAG GTGGAGATAC GTATGACTCT GCCGAGGGGG AGCTACGCCA CTGTGGTATT GAGGGAGGTG GTGAAGCCGG TCGACCCCGC CGCGCATGGG TTCTAA
|
Protein sequence | MEAPPFDKSL GMQYYATDTC PAGGRIKERV EDFVVEEILK DGTVVSVNGV ALTPRVGNWT WIHVVKRGVD TLKFVLRLAK ALGVNPRDIS IGGIKDTRAV TSQIVSVRGT VSSLPQIPGV EFLGMWPMDR PITPAEIYGN RFTIVVRGVD RADCAAEALA ALAKTPIPNY YGYQRFGTVR PVGHLLGLAL LKKDAEAFFD VMFCKIFPRE SEAAKRAREL ACRGEYAKAL EAFPKRFLEE RAFLRKLVEG ADLWNAAMAI PGQILKIYIE AAQSYVFNLF LSRRMELAPL EPVEGDLVDV GGQVAYYVEG LAGDLVLPVP GAGVKMPRGR VGEALVRVVK DLGLDPALFL KMPRGLRAYG SYRRARLDVG QLEYKVAGGE VEIRMTLPRG SYATVVLREV VKPVDPAAHG F
|
| |