Gene Pars_1684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1684 
Symbol 
ID5054260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1519476 
End bp1520717 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content62% 
IMG OID640469225 
Productpseudouridylate synthase 
Protein accessionYP_001153887 
Protein GI145591885 
COG category[S] Function unknown 
COG ID[COG0585] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00094] tRNA pseudouridine synthase, TruD family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.667214 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGGAGG CTCCGCCGTT CGACAAGGCG CTTGGCATGT ACTACTATGT GACTGACACG 
TGCCCCTCGG GGGGCGTGAT TAAGAAGAGC CCAGAGGACT TCGTCGTGGA GGAGGTGCTG
GCGGATGGGA CGGTGGTGGC CGTCGGCGGC GTGGAGCTGA GGCCGAGGGT CGGGGGCTGG
ACGTGGATCC ACGTGGTGAA GCGCAATGTC GACACGATTA GGCTGATGAT ACGCCTCGCC
AAGGCCCTCG GCGTAAGTCC CAGGGAAGTG TCTGTGGGAG GTATCAAGGA TACCCGGGCT
GTGGCCTCCC ACATAATCTC GGTTAGGGGG GCCGTGAAAG GTTTGCCGGA GATCCCCGGC
GTCAAGTTCC TCGGCATGTG GTCAATGGAT AGGCCTATGT CGCCGTCTGA GATATACGGC
AACCGCTTCA CCATTGTGTT ACGCGACGTG GAGAGGGTGG ACTGCGCCGT GGAGGCTCTG
GAGGCCTTGA AGAGCGCGGC GGTGCCCAAC TACTACGGCT ACCAGCGCTT CGGCACTATT
AGGCCTGTGT CGCACCTCTT GGGCAGGGCG CTTTTGCGGA AAAGCCCCGA GGAGTTTTTC
GACGCGATGT TCTGCAAGAT CTTCGAACAC GAATCGGCCG CCGCGAAGAA GGCCAGGGAG
CTGGCGTGTA GGGGGGAGTA CCAGAAGGCC CTAGAGACCT TCCCCAGGCG ATTTGTCGAG
GAGAGGGCCT TCCTCCGCAG GCTGGCTCAG GGCTATGACA TGTGGAACGC CATTATGGGG
ATACCCCTCC AGATCTTGCG GATATACGTC GAGGCGGCCC AGTCCTACCT CTTCAACAGA
TTCTTATCCG CCCGGCTGGA GCTAGGCCCC CTGGACAAGC CTCTAGAAGG CGACCTCGTG
GAGGTGGGTG GGCAGGTGGC ATATTACGCC GAGGGCCTCG GGGGGGATGT TGTGTTGCCG
GTGGCCGGCG CGGGGGTCAG GATGCCGCGG GGCAAGGTGG GGGAGGCGTT GCTGAAGGTG
ATGAAGGAGG AGGGGGTTGA CCCCGCGGCT TTTTTGAAAA TGCCCAGAGG CCTAAAGGCC
TACGGCTCGT ACCGCCGCGC CAGGCTGGAG GTGGGTGACT TCTCCTACGC TGTTCGGGGC
AGAGACGTGG AGCTCCGGTT TGTCTTGCCC AGGGGGAGTT ACGCCACGGT GCTTCTGAGA
GAGGCGGTGA AGCCGGCGGA GCCGTACAGA CATGGGTTTT AG
 
Protein sequence
MREAPPFDKA LGMYYYVTDT CPSGGVIKKS PEDFVVEEVL ADGTVVAVGG VELRPRVGGW 
TWIHVVKRNV DTIRLMIRLA KALGVSPREV SVGGIKDTRA VASHIISVRG AVKGLPEIPG
VKFLGMWSMD RPMSPSEIYG NRFTIVLRDV ERVDCAVEAL EALKSAAVPN YYGYQRFGTI
RPVSHLLGRA LLRKSPEEFF DAMFCKIFEH ESAAAKKARE LACRGEYQKA LETFPRRFVE
ERAFLRRLAQ GYDMWNAIMG IPLQILRIYV EAAQSYLFNR FLSARLELGP LDKPLEGDLV
EVGGQVAYYA EGLGGDVVLP VAGAGVRMPR GKVGEALLKV MKEEGVDPAA FLKMPRGLKA
YGSYRRARLE VGDFSYAVRG RDVELRFVLP RGSYATVLLR EAVKPAEPYR HGF