Gene Hmuk_1302 details


Gene Information

Locus tag: Hmuk_1302
Symbol: hisS
ID: 8410822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1232611
End bp: 1233921
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 67%
IMG OID: 645019633
Product: histidyl-tRNA synthetase
Protein accession: YP_003177130
Protein GI: 257387357
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
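
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon, and the helper names span_length and coding_codons are hypothetical, not part of any database API.

```python
# Values copied from the Gene Information table for Hmuk_1302.
START_BP = 1232611
END_BP = 1233921
GENE_LENGTH_BP = 1311
PROTEIN_LENGTH_AA = 436


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Amino-acid count, assuming the CDS ends with exactly one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # Compare the computed values with the ones reported in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the coordinates give 1311 bp, and 1311 / 3 = 437 codons, of which one is the stop codon, leaving 436 amino acids.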


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.194766
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGACT CGCTGAAGGG GTTTCGAGAT GTCTACCCCG CCGAGATGGC CGCTTACCGC 
CAGGTCATCG ACGAGATCGA GACGACGGCC CGCCAGTACG GTTTCCGTGA GATCACGACG
CCGGCACTGG AAGCCACGGA GATGTACGTC GACAAGAGCG GCGAGGAGAT CGTCGAAGAG
CTGTACCACT TCGAGGACAA GGGCGGGCGC GACGTTGCGC TGACGCCGGA GCTGACTCCC
ACGGTGGCCC GGATGGTCGT CGCCAAGCAA CAGGAGCTCT CGAAGCCGAT CAAGTGGGTC
TCCACGCGCC CGTTCTGGCG CTACGAGCAG GTCCAGCAGG GCCGATTCCG CGAGTTCCAC
CAGACGAACA TCGACATCTT CGGGTCGAGC GAGCCGACGG CCGACGCCGA GATCCTGGCG
GTCGCGACCG ACATGCTGAC GGGACTGGGC CTCAGCGCCG ACGACTTCGA CTTCCGGGTC
TCACACCGTG ACATCCTCAC GGGACTGCTC GAATCCTTCG AGGCCGACGT GGACACCCAG
GACGCGATTC GCGTCGTCGA CAAGCGGGCG AAGATCGACC GCGACGAGTA CGTCGAGGGG
CTCACCGATG CGGGCCTCTC CCTCGACCAG GCCGAGCAGT TCGACGAGTG GCTCCGGGCC
GGCGACGACG ATCTGGACGC GCTCGCGGAG ATGAGCGGCT CCGAGCAGGT CGCAGACGCC
GTCGCGAACC TCGAAGCCGT CCTCGCGGCC GCGGAGGACT TCGGCGTCCG GGAGTACTGC
ACGATCTCGC TGACCACCGC CCGCGGGTTC GACTACTACA CCGGCGTCGT CTTCGAGTGT
TTCGACTCGA CCGGCGAGGT CTCCCGGGCG GTCTTCGGTG GCGGTCGGTA CGACGACCTG
ATCGAGGGCT TCGGCGGCGA GCCGACGCCA GCGGTGGGCT TCGCGCCGGG CGTCATGAAC
TCGACGCTCC CCCTCTTGCT CCAGCGAGCG GGCGTCTGGC CCGAGGAGGC GGTGTCGACG
GACTACTACG TCCTGCAGGT CGGTGACACT CGCCCCGTCG CGGCCCGCAT CGCGCGGGAA
CTCCGCGAGT CGGGCCACGT CGTCGAGGCC GACGTGTCCG ACCGGAGCTT CGGGGCGCAG
ATGGGCTACG CGGACTCGAT CAACGCCGAG ACGGTCGTCA TCGTCGGCGA GAACGACCTC
GAAAACGACG AGGTCACGGT CAAGGACATG GCCAGCGGCG AGCAGACGAC CGCGCCCGTC
GACGCATTCC CCGGCGATCA CGAGCGCCCG ACCTACGGCG ACTTCGCGTA A
 
Protein sequence
MYDSLKGFRD VYPAEMAAYR QVIDEIETTA RQYGFREITT PALEATEMYV DKSGEEIVEE 
LYHFEDKGGR DVALTPELTP TVARMVVAKQ QELSKPIKWV STRPFWRYEQ VQQGRFREFH
QTNIDIFGSS EPTADAEILA VATDMLTGLG LSADDFDFRV SHRDILTGLL ESFEADVDTQ
DAIRVVDKRA KIDRDEYVEG LTDAGLSLDQ AEQFDEWLRA GDDDLDALAE MSGSEQVADA
VANLEAVLAA AEDFGVREYC TISLTTARGF DYYTGVVFEC FDSTGEVSRA VFGGGRYDDL
IEGFGGEPTP AVGFAPGVMN STLPLLLQRA GVWPEEAVST DYYVLQVGDT RPVAARIARE
LRESGHVVEA DVSDRSFGAQ MGYADSINAE TVVIVGENDL ENDEVTVKDM ASGEQTTAPV
DAFPGDHERP TYGDFA