Gene Pars_0089 details


Gene Information

Locus tag: Pars_0089
Symbol:
ID: 5054298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 81366
End bp: 82361
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 56%
IMG OID: 640467668
Product: H/ACA RNA-protein complex component Cbf5p
Protein accession: YP_001152356
Protein GI: 145590354
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0130] Pseudouridine synthase
TIGRFAM ID: [TIGR00425] rRNA pseudouridine synthase, putative
            [TIGR00431] tRNA pseudouridine 55 synthase
            [TIGR00451] uncharacterized domain 2
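
The start, end, and length values listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes one stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(81366, 82361)
print(length, protein_length(length))  # prints: 996 331
```

Under these assumptions the record is internally consistent: 996 bp corresponds to 332 codons, one of which is the stop codon.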


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGTGCA GTCAGAGAGA GGTTTTCGTC AAGAGGGAGG AGCCTACAAA TCCTGAGTGG 
GGTAAGCCGC CCTCCCAGAG AACTGCTGAC GAGTATATAC GGCACTCATT TGTGATTTTA
GACAAGCCTC GGGGGCCTAG TAGCCACGAG GTGGCGGCGT GGGTGAAGAA GATTCTGGGC
GTGGAGCGGG CCGGCCACGC GGGGACGCTG GACCCGAAGG TGTCGGGGGT GTTGCCAATT
GCAGTGGCTG AGGGGACTAA GGTGTTGATG GCCCTGTCTA GATCCGACAA GGTCTATGTG
GCTGTGGCTA AGTTCCACGG AGATGTGGAT GAGGAGAGGC TTAGGGCTGT GTTGCGGGAG
TTTCAGGGAG AGATATACCA AAAGCCGCCG CTCCGCTCTG CGGTGAAAAG GCAGTTGCGG
ACGCGTCGGG TTTTCTCGCT TGAGCTTCTA GAGTTGGAGG GGCGGTATGC CGTTATTAAG
ATGCATGTTG AGGCTGGGAC ATACGCCCGC AAGATTATAC ACGACATCGG CGAGGTTCTC
GGCGTAGGCG CCAATATGAG AGAGTTGAGG CGCGTGGCAG TCACCTGCTT TACTGAAGAC
GAGGCTGTTA CTTTGCAAGA CGTGGCCGAC GCGTATTATA TCTGGAAGAA ATACGGCGAC
GACACGTATC TAAGGAGCGT CCTGTTGCCT ATTGAGGAAA TTGCCAGGCA TTTGCCGAAG
ATTTGGGTAA GGGACAGCGC CGTAGACGCC GTGTGCCACG GCGCACCTCT AGCTGCGCCG
GGCATATCGA AGTTCGAGGT GCCGTTTTCC AAGGGGGACA TAGTCGCCAT GTTTACTCTG
AAAGGCGAGC TTGTAGGGAT TGGTAGGGCT CTGGTAGACT CGGAGGAGGT GAAGAAAATG
GAGAGGGGGG CCGTGGTTAG GACAGACAGG GTCGTCATGA GGCGGGGCAC ATATCCGGCT
ATGTGGAAGA AAGGCCAAAG AGCCGCAAAA ACTTAA
 
Protein sequence
MRCSQREVFV KREEPTNPEW GKPPSQRTAD EYIRHSFVIL DKPRGPSSHE VAAWVKKILG 
VERAGHAGTL DPKVSGVLPI AVAEGTKVLM ALSRSDKVYV AVAKFHGDVD EERLRAVLRE
FQGEIYQKPP LRSAVKRQLR TRRVFSLELL ELEGRYAVIK MHVEAGTYAR KIIHDIGEVL
GVGANMRELR RVAVTCFTED EAVTLQDVAD AYYIWKKYGD DTYLRSVLLP IEEIARHLPK
IWVRDSAVDA VCHGAPLAAP GISKFEVPFS KGDIVAMFTL KGELVGIGRA LVDSEEVKKM
ERGAVVRTDR VVMRRGTYPA MWKKGQRAAK T
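
Both sequences above are printed in space-separated blocks of ten characters, and the gene begins with GTG although the protein begins with M. The following sketch shows one way to strip the block formatting, compute GC fraction, and translate the coding sequence. It assumes the amino-acid assignments of the standard genetic code (which translation table 11 shares) and treats any table 11 start codon at the first position as methionine. All helper names are hypothetical, and this is not the pipeline the database itself uses.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a start codon in first position as M
    and stopping at the first stop codon."""
    seq = clean(seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    if not codons:
        return ""
    first = codons[0]
    protein = ["M" if first in START_CODONS else CODON_TABLE[first]]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate_cds("GTGAGGTGCA GTCAGAGAGA GGTTTTCGTC"))  # prints: MRCSQREVFV
```

Passing the full gene sequence to `translate_cds` and comparing the result with the cleaned protein sequence is a quick way to confirm that the two blocks belong together.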