Gene Hhal_1788 details


Gene Information

Locus tag: Hhal_1788
Symbol: hisS
ID: 4710917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1961847
End bp: 1963118
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 68%
IMG OID: 639856258
Product: histidyl-tRNA synthetase
Protein accession: YP_001003354
Protein GI: 121998567
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
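
The coordinate and length fields above agree with one another, which a short sketch can confirm. The variable names are illustrative and not part of the record. The sketch assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 1961847
end_bp = 1963118

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1272 0 423
```

A remainder of 0 indicates the gene length is a whole number of codons, as expected for an unspliced CDS.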


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTCGAACA AGGGGATTCA AAGCATTCGC GGGTTCTCGG ATATCCTTCC GGAAGAAAGC 
CCGTTGTGGC AGTACGCCGA GTCGGCGATT CGTCGGGTCC TGGAGTCGTA CGGGTACCGC
GAGATCCGTC TGCCGGTGCT CGAACGCACC GAGCTCTTCA GCCGGTCCAT CGGCGAGGTG
ACCGACATCG TCGAGAAGGA GATGTACACC TTCGACGATC GCAATGGCGA CAGCGTTACG
CTGCGCCCTG AGGGCACGGC CGGCTGTGTC CGGGCCGGGA TCCAGTCCGG CTTGCTGCAC
AATGCCGAGC CGCGGCTGTG GTACGCCGGC CCGATGTTTC GCCATGAACG CCCGCAGAAG
GGGCGGCTGC GGCAGTTCCA CCAGGTCGGC GCCGAGGTCT TCGGCATCCC GGAGCCCGAG
CTGGATGCCG AGATGATCAT CATGACCGCG CGTATGCTGC GCGAGTTGGG GCTTTCCGAC
GTGCGGCTGC AGCTCAACTC GTTGGGTACC CCCGAGAGCC GTGCGGCGCA CCGCGAACAG
CTGGTGGCCT ACCTGCGCCG CCACGAGGAC CGCCTGGACG AGGATGCGCG GCGGCGTCTG
GAGACGAATC CGCTGCGGAT CTTCGACAGC AAGAACCCGC AAGTGCAGCA GGTCATGGCC
GATGCGCCCC GCCTGATGGA CTGTCTCGAC AGCGTCTCGG CGGAGCACTT CACCGTGGTC
CGGAACCTGC TGGAGCGGGC CGGCGTCGAA TACGAGGTGA ACCCGTCGCT GGTGCGGGGG
CTCGATTACT ATACGCGCAC GGTCTTCGAG TGGGTGACGG ATCGCCTGGG GGCGCAGGGT
ACCGTCTGTG CCGGTGGGCG TTTCGACGGT CTGGTCGAAC AGCTCGGCGG TCGGCCGACT
CCGGCGATCG GCTTCGCTCT GGGGCTGGAG CGGTTGGTCG CGCTGCTTGA GGATCAGGGT
ACCCCGGGGC AGGGTGGGGC GCCCCACGCC TACCTGGTGG TGGCTACCGA GGCCGGGGCC
GGGCTGGAGA TGGCCGAGGC CCTGCGGGAC GCGCTGCCGG CGTTGCGGGT ACAGATGCAC
GCGGGCGGAG GCGGTTTCAA AGCGCAGCTC AAGCGTGCCG ATCGCAGCGG CGCCCGCGTG
GCCCTGATCC TTGGCGATGA TGAGCAGGCG GCCGGGGCGC TGACGATCAA GGATCTGCGT
GGCGAGGATG GGCAGCAACG GCTGCCCTTG GACGACGCGG TGACGTATCT GCGAGGGCTG
ATCGGGGCGT AG
 
Protein sequence
MSNKGIQSIR GFSDILPEES PLWQYAESAI RRVLESYGYR EIRLPVLERT ELFSRSIGEV 
TDIVEKEMYT FDDRNGDSVT LRPEGTAGCV RAGIQSGLLH NAEPRLWYAG PMFRHERPQK
GRLRQFHQVG AEVFGIPEPE LDAEMIIMTA RMLRELGLSD VRLQLNSLGT PESRAAHREQ
LVAYLRRHED RLDEDARRRL ETNPLRIFDS KNPQVQQVMA DAPRLMDCLD SVSAEHFTVV
RNLLERAGVE YEVNPSLVRG LDYYTRTVFE WVTDRLGAQG TVCAGGRFDG LVEQLGGRPT
PAIGFALGLE RLVALLEDQG TPGQGGAPHA YLVVATEAGA GLEMAEALRD ALPALRVQMH
AGGGGFKAQL KRADRSGARV ALILGDDEQA AGALTIKDLR GEDGQQRLPL DDAVTYLRGL
IGA
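
The gene sequence begins with TTG while the protein sequence begins with M, which is consistent with translation table 11, where some non-ATG codons can act as start codons and are read as methionine at the first position. The following is a minimal sketch of that translation step, not the database's own pipeline. The function name `translate_cds` and the set `ALT_STARTS` are hypothetical, and `ALT_STARTS` lists only three commonly used bacterial start codons rather than every initiator permitted by table 11.

```python
import re

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11
# uses these same assignments and differs mainly in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a simplified subset of start codons for illustration.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading a recognised start codon as M."""
    # Remove the spaces and line breaks used in the 10-base display blocks.
    seq = re.sub(r"\s+", "", dna).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in ALT_STARTS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "TTGTCGAACA AGGGGATTCA AAGCATTCGC"
print(translate_cds(first_blocks))  # prints: MSNKGIQSIR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would allow the whole translation to be compared against the listed protein.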