Gene Hlac_2219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2219 
SymbolhisS 
ID7401154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2202657 
End bp2203985 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content69% 
IMG OID643709291 
Producthistidyl-tRNA synthetase 
Protein accessionYP_002566866 
Protein GI222480629 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.911192 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.722289 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGACG GCCTCAAGGG ATTCCGCGAT TTCTACCCCG GCGAGCAGTC GGCCCGCCGC 
GAGGTGACGG ACGCGATCGA GGACGCCGCG AGTCGGTACG GCTTCCGAGA GATCGCCACC
CCCGCGCTCG AACGGACAGA AATGTACGTC GACAAGTCCG GCGAGGAGAT CGTCGAGGAG
CTGTACGCCT TCGAGGATAA GGGCGGCCGC GGCGTCTCGA TGACCCCGGA GCTCACGCCG
ACCGTCGCCC GGATGGTGGT CGCGAAGGGC CAAGAGCTCT CGAAGCCCAT CAAGTGGATG
TCCACCCGCC CGTTCTGGCG CTACGAACAG GTCCAACAGG GTCGGTTCCG CGAGTTCTAC
CAGACGAACA TCGACGTGTT CGGCTCGTCG GCGCCCGAGG CCGACGCCGA GGTGCTGGCG
GTGGCTGCCG ATGCGCTCAC GGATCTGGGG CTCACCAATG ACGACTTCGA GTTCCGCGTC
TCCCACCGCG ACATCCTCGG TGGGCTGGTT CGGGCGCTCG CGGCCGACCC CGACGCGGTC
GACACGAAGG CCGCGATCCG CGCGGTCGAC AAGCGCGCGA AGGTCGACGA CGGCGAGTAC
CTCGGGCTCC TCTCAGATGC CGGGCTGGAC CGCGCGACCG CCCAGGAGTT CGACGACCTC
ATCTCGGACG TGGAGACCGT CGACGACCTT GACGCGGTCG CCGAGGCCGG CGGCGAGGAT
GTCGAGGCGG CAGTCGAGAA CCTCCGGAAC GTGCTCGCCG CCGCCGACGA CTTCGGCGCC
GGAGCGTTCT GTGAGGTCTC GCTGACGACC GCCCGCGGGC TCGACTACTA CACCGGCGTC
GTCTTCGAAT GCTTCGACTC CACCGGCGAG GTGTCCCGCT CCGTCTTCGG CGGCGGGCGC
TACGACGACC TCATCGAGAG CTTCGGCGGC CAACCCACCC CCGCGGTCGG GGTCGCGCCC
GGTCACGCCC CCCTCTCGTT GCTCTGTCAG CGCGCCGGCG TGTGGCCCGA CGAGGAGCTG
ACGACCGACT ACTACGTGCT CAGCGTGGGC GACACGCGCT CGGAGGCGAC CGCGCTCGCA
CGCGATCTCC GCGCGCTCGG CGACGACGTG GTCGTCGAAC AGGACGTCTC CGGCCGGTCG
TTCGGCGCGC AGCTCGGTTA CGCCGACTCG ATCAACGCGG AGACGGTGGT CGTCGTCGGT
GAGCGCGACT TGGAGAACGG CGAGTACACC GTGAAGGACA TGGCGAGCGG CGACGAGACG
ACCGTTCCGG TCGAGGAGTT CCCGCCCGAA GGGGGAGAGG AGCTCCCGAC CTACGAGGAC
TACGAGTAG
 
Protein sequence
MYDGLKGFRD FYPGEQSARR EVTDAIEDAA SRYGFREIAT PALERTEMYV DKSGEEIVEE 
LYAFEDKGGR GVSMTPELTP TVARMVVAKG QELSKPIKWM STRPFWRYEQ VQQGRFREFY
QTNIDVFGSS APEADAEVLA VAADALTDLG LTNDDFEFRV SHRDILGGLV RALAADPDAV
DTKAAIRAVD KRAKVDDGEY LGLLSDAGLD RATAQEFDDL ISDVETVDDL DAVAEAGGED
VEAAVENLRN VLAAADDFGA GAFCEVSLTT ARGLDYYTGV VFECFDSTGE VSRSVFGGGR
YDDLIESFGG QPTPAVGVAP GHAPLSLLCQ RAGVWPDEEL TTDYYVLSVG DTRSEATALA
RDLRALGDDV VVEQDVSGRS FGAQLGYADS INAETVVVVG ERDLENGEYT VKDMASGDET
TVPVEEFPPE GGEELPTYED YE