Gene Hore_12150 details


Gene Information

Locus tag: Hore_12150
Symbol:
ID: 7313916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1307761
End bp: 1309020
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 39%
IMG OID: 643611654
Product: histidyl-tRNA synthetase
Protein accession: YP_002508960
Protein GI: 220932052
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
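
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1307761
end_bp = 1309020
gene_length_bp = 1260
protein_length_aa = 419

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span // 3
expected_protein = codons - 1

print(span == gene_length_bp)                  # gene length agrees with the coordinates
print(span % 3 == 0)                           # length is a whole number of codons
print(expected_protein == protein_length_aa)   # protein length agrees with the gene length
```

Under these assumptions all three checks hold for this record: 1260 bp gives 420 codons, of which 419 are translated.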


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00000175223
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGTAA AGGCACCACG GGGGACAAAT GATATTTTGC CCCCCGTTTC TTTAAAATGG 
CAGTATATTG AAGATACAGC AAGACGTATT TTTCAAATGT ATAACTATAA AGAAATTAGG
ACTCCTATAT TTGAGTATAC AGAGTTATTT CAGCGGGGAA TTGGTGAAAC AACAGATATT
GTTGAAAAAG AGATGTATAC CTTTGAAGAT AAAGGTGGAC GGAGTATTAC TTTAAGGCCC
GAAGGAACAG CTTCAGTGGT CAGGGCTTTT TTAGAACATA AAATTTATGG ACAGGTTCAG
CCTACTAAGT ACTTTTATAT AGGTCCCATG TTCAGGTATG AGAGACCACA GGCCGGTAGA
TTTAGGCAGT TTCACCAACT GGGGGTTGAA GCCTTTGGTT CCAATGATCC TGCCCTTGAT
GCTGAAGTTA TTGCCCTGGG ACTCGATATT TTAAAACGGT TAGGCTTAAC AGATGTAGAA
GTCTTTATCA ATAGTATTGG TTGTCCAGAG TGTCGGGCAA GATATTCAGA TGAACTAAAG
CAATATTTAG AGTCACATCA GGACAGGCTC TGTAAAGATT GTAAAGCAAG ACTCAATAAA
AATCCCCTGC GTATCCTGGA TTGTAAAAAT GAAGAATGTT CACTGGTGAT TAAAAATGCC
CCTAAAATAC TGGATTATTT ATGTGATAAT TGCAGGGTTC ACTTTGAGGA TGTTCAGGAA
TATCTGGACT TACTGGGTAT TAAATACAGG GTTGATCCAA CCCTGGTCAG GGGACTGGAT
TACTATACCA ACACTGCCTT TGAAATTAAA TTTAAAGAAC TGGGTGCTCA GGATGCTATT
TTTGGTGGCG GTCGTTATAA TGGGTTAACA GAAGAAATAG GTAATAAGTC TATTCCGGGA
ATTGGTTTTG CTGTGGGAAT TGAAAGGCTT ATTCTTGCTC TTGATAAAAA GGGGATAAAG
TTACCTGTTA ATGACAGTAT TGATGTATAC CTGGTTACAA TTGGTGAACG AGCCAAGCGG
GCGGCTTTTA ACTATACATA TTTATTAAGA GAATCAGGTA TTACAGCAGA GATAGATTAT
CTGGGCAGAA GTATTAAAAG CCAGATGAAG TCTGCTGACA GGACAGGTGC CAGTTATACT
ATTATAATCG GTGATAGTGA ACTGGATTCA GGTAAAGCAA CTGTAAAGAA TATGAGGACC
GGTGAACAGG TTGAAATTAT GCTTGCCAAT CTTATAGAGG AAATGCAAAA GCTAGTATGA
 
Protein sequence
MDVKAPRGTN DILPPVSLKW QYIEDTARRI FQMYNYKEIR TPIFEYTELF QRGIGETTDI 
VEKEMYTFED KGGRSITLRP EGTASVVRAF LEHKIYGQVQ PTKYFYIGPM FRYERPQAGR
FRQFHQLGVE AFGSNDPALD AEVIALGLDI LKRLGLTDVE VFINSIGCPE CRARYSDELK
QYLESHQDRL CKDCKARLNK NPLRILDCKN EECSLVIKNA PKILDYLCDN CRVHFEDVQE
YLDLLGIKYR VDPTLVRGLD YYTNTAFEIK FKELGAQDAI FGGGRYNGLT EEIGNKSIPG
IGFAVGIERL ILALDKKGIK LPVNDSIDVY LVTIGERAKR AAFNYTYLLR ESGITAEIDY
LGRSIKSQMK SADRTGASYT IIIGDSELDS GKATVKNMRT GEQVEIMLAN LIEEMQKLV
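
The sequences above are displayed in 10-character blocks, so they need to be joined before any length or composition check. The following is a minimal sketch of that step; `normalize_sequence` and `gc_fraction` are hypothetical helper names introduced here for illustration, and only the first displayed line of the gene sequence is used as sample input.

```python
def normalize_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as displayed above (six 10-base blocks).
first_line = "ATGGATGTAA AGGCACCACG GGGGACAAAT GATATTTTGC CCCCCGTTTC TTTAAAATGG"
seq = normalize_sequence(first_line)

print(len(seq), seq[:3])
print(round(gc_fraction(seq), 2))
```

Applying the same two helpers to the full gene sequence would give values to compare against the Gene Length and GC content fields of the record.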