Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1684 |
Symbol | |
ID | 5054260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1519476 |
End bp | 1520717 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640469225 |
Product | pseudouridylate synthase |
Protein accession | YP_001153887 |
Protein GI | 145591885 |
COG category | [S] Function unknown |
COG ID | [COG0585] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00094] tRNA pseudouridine synthase, TruD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.667214 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGGAGG CTCCGCCGTT CGACAAGGCG CTTGGCATGT ACTACTATGT GACTGACACG TGCCCCTCGG GGGGCGTGAT TAAGAAGAGC CCAGAGGACT TCGTCGTGGA GGAGGTGCTG GCGGATGGGA CGGTGGTGGC CGTCGGCGGC GTGGAGCTGA GGCCGAGGGT CGGGGGCTGG ACGTGGATCC ACGTGGTGAA GCGCAATGTC GACACGATTA GGCTGATGAT ACGCCTCGCC AAGGCCCTCG GCGTAAGTCC CAGGGAAGTG TCTGTGGGAG GTATCAAGGA TACCCGGGCT GTGGCCTCCC ACATAATCTC GGTTAGGGGG GCCGTGAAAG GTTTGCCGGA GATCCCCGGC GTCAAGTTCC TCGGCATGTG GTCAATGGAT AGGCCTATGT CGCCGTCTGA GATATACGGC AACCGCTTCA CCATTGTGTT ACGCGACGTG GAGAGGGTGG ACTGCGCCGT GGAGGCTCTG GAGGCCTTGA AGAGCGCGGC GGTGCCCAAC TACTACGGCT ACCAGCGCTT CGGCACTATT AGGCCTGTGT CGCACCTCTT GGGCAGGGCG CTTTTGCGGA AAAGCCCCGA GGAGTTTTTC GACGCGATGT TCTGCAAGAT CTTCGAACAC GAATCGGCCG CCGCGAAGAA GGCCAGGGAG CTGGCGTGTA GGGGGGAGTA CCAGAAGGCC CTAGAGACCT TCCCCAGGCG ATTTGTCGAG GAGAGGGCCT TCCTCCGCAG GCTGGCTCAG GGCTATGACA TGTGGAACGC CATTATGGGG ATACCCCTCC AGATCTTGCG GATATACGTC GAGGCGGCCC AGTCCTACCT CTTCAACAGA TTCTTATCCG CCCGGCTGGA GCTAGGCCCC CTGGACAAGC CTCTAGAAGG CGACCTCGTG GAGGTGGGTG GGCAGGTGGC ATATTACGCC GAGGGCCTCG GGGGGGATGT TGTGTTGCCG GTGGCCGGCG CGGGGGTCAG GATGCCGCGG GGCAAGGTGG GGGAGGCGTT GCTGAAGGTG ATGAAGGAGG AGGGGGTTGA CCCCGCGGCT TTTTTGAAAA TGCCCAGAGG CCTAAAGGCC TACGGCTCGT ACCGCCGCGC CAGGCTGGAG GTGGGTGACT TCTCCTACGC TGTTCGGGGC AGAGACGTGG AGCTCCGGTT TGTCTTGCCC AGGGGGAGTT ACGCCACGGT GCTTCTGAGA GAGGCGGTGA AGCCGGCGGA GCCGTACAGA CATGGGTTTT AG
|
Protein sequence | MREAPPFDKA LGMYYYVTDT CPSGGVIKKS PEDFVVEEVL ADGTVVAVGG VELRPRVGGW TWIHVVKRNV DTIRLMIRLA KALGVSPREV SVGGIKDTRA VASHIISVRG AVKGLPEIPG VKFLGMWSMD RPMSPSEIYG NRFTIVLRDV ERVDCAVEAL EALKSAAVPN YYGYQRFGTI RPVSHLLGRA LLRKSPEEFF DAMFCKIFEH ESAAAKKARE LACRGEYQKA LETFPRRFVE ERAFLRRLAQ GYDMWNAIMG IPLQILRIYV EAAQSYLFNR FLSARLELGP LDKPLEGDLV EVGGQVAYYA EGLGGDVVLP VAGAGVRMPR GKVGEALLKV MKEEGVDPAA FLKMPRGLKA YGSYRRARLE VGDFSYAVRG RDVELRFVLP RGSYATVLLR EAVKPAEPYR HGF
|
| |