Gene Pars_0803 details

Gene Information

Locus tag: Pars_0803
Symbol:
ID: 5055342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 715217
End bp: 716683
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 57%
IMG OID: 640468364
Product: prolyl-tRNA synthetase
Protein accession: YP_001153041
Protein GI: 145591039
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00408] prolyl-tRNA synthetase, family I
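
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information entries above.
start_bp = 715217
end_bp = 716683
gene_length_bp = 1467
protein_length_aa = 488

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends in a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)
```

Under these assumptions the span equals the listed gene length, is divisible by 3, and yields the listed protein length.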


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0138093
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCTTA TCAGAGAGGC GAGACCCCAC GGCCGGGAGA AGCTTAGGGC AAATCTGATA 
GAGTGGTTCC ACTGGCTTCT TAGAGAGGCT GAGCTGTACG ACGTGAGGTA CCCCGTCAAA
GGGGCGTATG TGTGGAGGCC CTATGGGATG AGGCTGAGGC GCCATGTGGA GGAGCTGATT
AGGAGAAGCC ACGATGAGAC TGGGCACCAG GAGGTGTTGT TCCCCGTGTT TATTCCCTAC
GAGTTTTTTG GTAAGGAGTC CCAGCACATT AGGGGGTTTG AGAAGGAGGT GTTTTGGGTG
TCTAAGGGCG GCGAGGAGGG AGAGAGACTG GTGCTCCGCC CCACGTCAGA GACGGCGATT
ATGCCGATGG TGAAGCTGTG GGTTCATGAC TACAAGGACC TCCCGCTTAG GCTTTACCAA
ATTGTCAGCG TCTTCCGGGC TGAGACTAAG ATGACGCACC CCATGATTAG GCTTAGGGAG
ATTTCCATGT TTAAAGAGGC TCACACCGTG CACGTTGATA GGGAGGATGC AGAGCGGCAG
GTTCGTGAGG CTGTGGAGAT TTACAAGAAA ATTTTTGACG AGATGTGCCT GGCGTATATG
ATAAACAAGA GGCCTGACTG GGATAAATTC GCCGGGGCTG AGTACACAAT AGCCTTCGAC
ACGGTGCTCC CCGATGGGAG GACCCTGCAG ATAGGCACGG TGCACTACCT GGGGACGAAC
TTCACCAGGG TTTTCGAGGT GACGTACCTC GCCGCGGATG GTACGAGGAG GCTTGCCCAC
ACCACCTCCT ACGGGATTTC GGAAAGGAGC ATAGCCGCCA TGCTCATCAC GCACGGCGAT
GACGCGGGGA CAGTCCTACC GCCGAGGTTG GCGCCTATTC AAGTAGTTAT TGTCCCCATA
TTCTACGGCG AGGAGGAGGC TGCATCTGTG ATATCTTACG CCAGGGAAGT GGAAAAGGCT
CTGCGAGAGG CCGGCATGCG CGTACACATC GATGATAGGC CTGATAAGAC GCCTGGGTGG
AAGTTCTACT TCTGGGAGCT GAAGGGGGTG CCTCTGCGCG TCGAAGTTGG GAAGAGGGAT
TTGGAGAAGA GGCAGGTTGT GATTACGAGG AGGGATACCT TGGAGAAATA CGCCGTTGGG
CTGGGGGAGT TGGTGGACGC TGTGAGGGGG CTTATGAGAA CCGTGGAGGA GAATCTGCGG
AGGAGGGCGT GGGAGGAGTT GAGGAGTCGC ATCGTCCGGG CCGAGACGGT GGAGGCCGCC
AAGGCCGCCA TCCGTGAGGG GAAGGTGGTA GAGGTGCCGT GGAGTGGGGA TAACGACTGC
GGTATTAAGT TAAAAGACCT TGTCGGTGCC GATGCCTTGG GCGTCCCCCT AGACTCGGAC
GCGTCTGTGG GGGGCTTCGA CTTAAGGGAC TTGGCTTGCG GCGAAAAGCG GGCTGAGTTC
TGGCTGAGGC TTTCTGAGAG ATACTAA
 
Protein sequence
MELIREARPH GREKLRANLI EWFHWLLREA ELYDVRYPVK GAYVWRPYGM RLRRHVEELI 
RRSHDETGHQ EVLFPVFIPY EFFGKESQHI RGFEKEVFWV SKGGEEGERL VLRPTSETAI
MPMVKLWVHD YKDLPLRLYQ IVSVFRAETK MTHPMIRLRE ISMFKEAHTV HVDREDAERQ
VREAVEIYKK IFDEMCLAYM INKRPDWDKF AGAEYTIAFD TVLPDGRTLQ IGTVHYLGTN
FTRVFEVTYL AADGTRRLAH TTSYGISERS IAAMLITHGD DAGTVLPPRL APIQVVIVPI
FYGEEEAASV ISYAREVEKA LREAGMRVHI DDRPDKTPGW KFYFWELKGV PLRVEVGKRD
LEKRQVVITR RDTLEKYAVG LGELVDAVRG LMRTVEENLR RRAWEELRSR IVRAETVEAA
KAAIREGKVV EVPWSGDNDC GIKLKDLVGA DALGVPLDSD ASVGGFDLRD LACGEKRAEF
WLRLSERY
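
The gene and protein sequences above are printed in space-separated blocks of 10 characters. As a rough sketch of how the two relate, the code below strips the block formatting and translates DNA to protein. The helper names `clean_sequence` and `translate` are hypothetical. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acid to every codon and differs only in which start codons it allows; alternative start codons are not handled here.

```python
# Standard genetic code, written in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence above (both start in frame).
first_line = "ATGGAGCTTA TCAGAGAGGC GAGACCCCAC GGCCGGGAGA AGCTTAGGGC AAATCTGATA"
last_line = "TGGCTGAGGC TTTCTGAGAG ATACTAA"

print(translate(clean_sequence(first_line)))
print(translate(clean_sequence(last_line)))
```

The two printed strings correspond to the first 20 residues (MELIREARPH GREKLRANLI) and the last 8 residues (WLRLSERY) of the protein sequence listed above.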