Gene Pars_1771 details

Gene Information

Locus tag: Pars_1771
Symbol:
ID: 5055373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1590984
End bp: 1592219
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 50%
IMG OID: 640469316
Product: putative pseudouridylate synthase
Protein accession: YP_001153974
Protein GI: 145591972
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1258] Predicted pseudouridylate synthase
TIGRFAM ID: [TIGR01213] conserved hypothetical protein TIGR01213
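
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminating stop codon


# Values copied from the record for Pars_1771.
length = gene_length(1590984, 1592219)
print(length, protein_length(length))  # prints: 1236 411
```

Both results agree with the Gene Length and Protein Length fields of the record.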


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.524292
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0084243
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGAGTTAA TATCCAAGGC CCTAGAGGCG GTGAGGAGAT ATCCCCTGTG CGACTCTTGT 
CTAGGTAGGC TTTTTGCCCT AATGGGATAC GGCATTGAAA ACCGGGAGCG TGGCCAAGCC
ATCAAGACCA TCCTGCACAT GGCCGCAGTC TCAGACTATA GAAAGGGGAA AGATGTTACG
GCAGACCTCA TCGCATTGGC CAAATGCCAC CTCCCTACGA GAAGGTTTCT GGCAGGCGTT
GGGATAAGAG TAGACGAGGA AAGGTGCTAT ATCTGCGGCG ATCTGATGGA GGGGGTTGAA
AAATACGCAG AGATGGCTGT TGAGCAACTA CGCGGCTTAG ACTTTGTAAG CTTCGCTGTG
GGTTCTACGC TTCCAGAAGA GTTGCTTGAG AAGGAGGCTG AGGTGGTAAA GAGTCTTCTT
GTCACTACCG GTGAGTCGGT AAAGCATGAA GTAAACCGAC GCATCGGTAA GGAGCTTCTG
AGACGCCTTT CTGACAAGAG AGTAGACAAG CTAAGACCTA ACGTGGTGGT GAATGTAGAC
TTGGTAAGCG GCCAAGTAAA GGTGGTAAGA AACCCCATTC TTATTGGAGG CCGTTATCTC
AAGCTTGGGC GGAAAATTGC CCAGGCAAAA AGATTTGGTA ATGTGCGAAC GACGTTGCTA
GAAAAATTGG CCTATCTACG TGATACGTTC GGCGGAGAGG ACCACGTAAT ACATGTATCG
GGTAGAGAAG ATAGCGATGC GCGTATGCTT GGTAGCGGCC GTCCACTAGT TGTCGAAGTA
AAGCAGCCGC TTAGATACAC CGCCCAGGTA GCCCCGTTTA GAGATAAAGA CGTAATTTTT
CTTCCTGTGG GGTTTACCGA CAGAAATGAG GTAAGAAGGC TAAAGGAGAA GGCCAAGACC
GATATTAAGC TTTACCGGGT GCTTGTACTT TCAGAATCTC CGCTTAAACA GGATGACCTA
AGTAAGCTGT CTGCTCTTTC AGGCGCCACT GTTACACAAT ATACCCCGAG GCGGATAAAA
AGACTTCATC CGAGAAAAAA GAGGGTAAGG ATGGTCTACG ACGTGGCGTG GCGCCTCGTG
TCGCCGCACG TCTTTGAGCT CTATATAAGG TGCCAAGGTG GGCTATACGT AAAGGAATTT
GTACACGGCG ACGGCGGCAG GACCGCCCCA AACGTCGCAG AGCTTTTAAA TACTAGGTTA
GAAGTCCTAG AACTAGACGT CTTGTACATC GAATAG
 
Protein sequence
MELISKALEA VRRYPLCDSC LGRLFALMGY GIENRERGQA IKTILHMAAV SDYRKGKDVT 
ADLIALAKCH LPTRRFLAGV GIRVDEERCY ICGDLMEGVE KYAEMAVEQL RGLDFVSFAV
GSTLPEELLE KEAEVVKSLL VTTGESVKHE VNRRIGKELL RRLSDKRVDK LRPNVVVNVD
LVSGQVKVVR NPILIGGRYL KLGRKIAQAK RFGNVRTTLL EKLAYLRDTF GGEDHVIHVS
GREDSDARML GSGRPLVVEV KQPLRYTAQV APFRDKDVIF LPVGFTDRNE VRRLKEKAKT
DIKLYRVLVL SESPLKQDDL SKLSALSGAT VTQYTPRRIK RLHPRKKRVR MVYDVAWRLV
SPHVFELYIR CQGGLYVKEF VHGDGGRTAP NVAELLNTRL EVLELDVLYI E
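
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a simplified sketch, not the method used to produce the record: it assumes that translation table 11 uses the standard codon assignments and that an alternative initiation codon (GTG or TTG) at the first position is read as methionine. The function name `translate` and the 21-nucleotide example (the first seven codons of the gene sequence above) are chosen only for illustration.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order (assumed to apply to table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplifying assumption: only these initiators are recognised at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nucleotides of the gene sequence above.
print(translate("GTGGAGTTAA TATCCAAGGC C"))  # prints: MELISKA
```

The output matches the first seven residues of the protein sequence, including the methionine encoded by the GTG start codon.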