Gene Pars_0658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0658 
SymbolhisS 
ID5056360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp586036 
End bp587295 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content55% 
IMG OID640468218 
Producthistidyl-tRNA synthetase 
Protein accessionYP_001152901 
Protein GI145590899 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGGC TTCCCGACCA GCTGAGGAGG CCTGTAAGGG GCATGCGAGA CTGGATGCCG 
CAACAACTCT ACGCACTGAG GCGTATGGAG GAGGTCTTAT CGTCTGTAGC CGAGCAGTAC
GGCTATAGAA GGGTCGAGAC GCCTGTAGTA GAACACTTCG AAGTTCTTGC AAAAAAGGCT
GGGCAGGAGG TTATTAACGA AATCTACTAC TTTAGGGACA AGGCGGGGCG GGAGCTGGGG
CTTAGATTCG ACATGACTGT GCCCATCGCC AGGGTCTTAT CCTACAACCT TGACCTCCCG
AGGCCAGTGC GGTGGTACTA CTTCAGCAAG GTTTTTAGAT ACGACGAGCC GCAACACGGG
AGGTACCGGG AGTTTTTCCA ATTCGGCGTA GAGCTAATCG GCTCAGCCTC ACCGAGGGCA
GACGCCGAGG TGGTCCAGCT CCTCGCGGCG TCGCTTGAGG CGGCTGGAGC GTCAAAATAT
GTCATAAGGA TAAACGATAG GAGGGCTGTT GACAAGTTGC TTGAGTCCCT AGGCGCGTTG
TCCCACAGAG ATGCTGTGTA CAGGGCGCTT GACAAGAAGC TAAAATTGCC CCGGGAGGAA
GTAATTGGGA TCATGACATC CGGCGGCCTG CCGAGAGATG CCGCGGAAAA GATCTACGAC
ACGGCCAGCG AGATGAGCTT AGACGAGGCC GTAGAGGTCC TAAGGAGGCT GGACGGAAGG
CTCGGCGAGG CCTACGCCAA GTTCGTAAAA TACCTCGAAG CCGCGGTGCC CCTGGAGAGG
TTTAAATTCG ATATGTCTAT TGTCAGAGGA CTCGACTACT ACACCGGCGT GGTTTTCGAG
GCCTTTGTGG GGGACTACTG GCTCGCCGTG GGCGGAGGCG GCCGCTACGA CGACTTGCTG
GAGCTGTACA GCGGAGTCAA AATCCCCGCC CTCGGCTTCG CCATAGGCGT AGAGAGGCTT
ATGGAAGCCG TCGGCTTGCA AAGCGTGGAG AAGCCCCTCG ACTACTACAT ATACATCTTC
GACGATGACG CGTACAAACA CGCCGTGGCC CTAGCCAATA GGCTACGCAA ACAGGGACAC
AGCGTAGTGG TTGAGTTAGG AGAAAAGAAC TTAAAGGACG TTTTTGAGTA CGTGTTGAAA
ATTGGTACCA GATACCTGGT ATTGATAGGC CGTAAGGAGC TTGAAAAAGG AGTGGTGAAG
ATAAGAGATT TGCAAAAAAG AGGGGAGGTC GAGGTGCCTC TCTCGGCTCT ACTATCTTAG
 
Protein sequence
MTGLPDQLRR PVRGMRDWMP QQLYALRRME EVLSSVAEQY GYRRVETPVV EHFEVLAKKA 
GQEVINEIYY FRDKAGRELG LRFDMTVPIA RVLSYNLDLP RPVRWYYFSK VFRYDEPQHG
RYREFFQFGV ELIGSASPRA DAEVVQLLAA SLEAAGASKY VIRINDRRAV DKLLESLGAL
SHRDAVYRAL DKKLKLPREE VIGIMTSGGL PRDAAEKIYD TASEMSLDEA VEVLRRLDGR
LGEAYAKFVK YLEAAVPLER FKFDMSIVRG LDYYTGVVFE AFVGDYWLAV GGGGRYDDLL
ELYSGVKIPA LGFAIGVERL MEAVGLQSVE KPLDYYIYIF DDDAYKHAVA LANRLRKQGH
SVVVELGEKN LKDVFEYVLK IGTRYLVLIG RKELEKGVVK IRDLQKRGEV EVPLSALLS