Gene HS_0844 details

Gene Information

Locus tag: HS_0844
Symbol: thrS
ID: 4240336
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 921208
End bp: 923136
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 39%
IMG OID: 638104399
Product: threonyl-tRNA synthetase
Protein accession: YP_719054
Protein GI: 113460987
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0441] Threonyl-tRNA synthetase
TIGRFAM ID: [TIGR00418] threonyl-tRNA synthetase
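
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption; the record does not state its coordinate convention) and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 921208
end_bp = 923136
gene_length_bp = 1929
protein_length_aa = 642


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 923136 - 921208 + 1 gives 1929 bp, and 1929 / 3 - 1 gives 642 aa, which agrees with the listed gene and protein lengths.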


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAATTA TTACTTTACC TGACGGTTCA CAACGTCAAT TCGATAATCC CGTTTCTGTA 
TTGGAAGTTG CTCAATCAAT CGGTTCCGGA CTCGCTAAGG CTACTATTGC CGGTCGTGTG
AACGGTGAGC GTCGAGATGC CTCAGATATG ATTAGTGAGG ATGCCAATCT TGAAATTATT
ACCGCAAAAG ATGAAGACGG TTTAGAGATT ATCCGACACT CAACGGCACA TTTATTAGGG
CATGCTATCA AGCAACTTTT CCCAAATGTG AAGATGGCTA TCGGTCCAAC TATTGACAAT
GGTTTTTATT ACGACATTGA TTTGGACCGC TCTTTAACCC AAGAAGATAT TGATACCCTA
GAAAAACGTA TGCTGGAGCT GGCAAAAACG AATTATGACG TAATAAAAAA ACGTGTCAGT
TGGCAAGAAG CAAGAGATAC TTTTGAAAGT CGTGCTGAGC CTTACAAAAT AGCTATCTTA
GATGAAAATA TTGCAAAAGA CGACCAACCT GCACTTTATC ATCACGAAGA ATATATTGAT
ATGTGTCGTG GTCCGCACGT ACCCAATATG CGTTTCTGTC ACCATTTTAA ATTAATGAAA
GTAGCTGGTG CATATTGGCG TGGAAATAGT GATAATAAAA TGCTCCAACG CATTTATGGT
ACGGCTTGGG CAGATAAAAA ACAATTAGCC GATTATTTAC ATCGATTAGA AGAAGCTGCA
AAACGTGATC ATCGTAAAAT CGGTAAGGCA TTGGATTTAT ACCATATGCA AGAGGAAGCA
CCCGGTATGG TATTTTGGCA TAATGACGGT TGGACAATCT TCCGTGAGCT GGAAACATTT
GTGCGTACAA AGTTAAAAGA ATACGATTAT CAAGAAGTAA AAGGTCCATT TATGATGGAT
CGTGTGCTTT GGGAACGTAC AGGACACTGG CAAAATTATG CTGATTTAAT GTTTACTACA
CAATCTGAAA ATAGAGAATA TGCCATTAAG CCAATGAATT GTCCTGGACA TGTTCAAATT
TTCAATCAAG GCTTGAAATC TTATCGTGAT TTACCTATTC GTATGGCTGA ATTTGGTTCT
TGCCATCGTA ATGAACCATC AGGATCTTTA CATGGATTAA TGCGTGTGCG TGGTTTTACA
CAGGATGATG CACATATTTT CTGCACGGAA GATCAAATTG AATCGGAAGT AACGAGCTGT
ATTAAAATGG TTTATGACAT TTATAGTACG TTTGGCTTTA CCGATATTTT TGTCAAACTT
TCTACTCGTC CTGAAAAACG CATTGGTGAA GATGTCATGT GGGATCGTGC AGAGCAAGGC
TTGGCAAATG CACTTAAACA CAATGGTCTT GAATATGAAA TTCAAGAGGG GGAAGGGGCA
TTTTATGGGC CGAAAATTGA ATTTGCATTA AGAGATTGTT TAGATCGTGA ATGGCAATGC
GGTACAATCC AATTAGATTT TGCGTTACCC GGACGTCTAG ATGCTACCTA TGTTGCAGAA
GATAATGCCC GTCGCACACC TGTCATGATA CATCGTGCTA TCTTAGGTTC AATTGAACGT
TTTATCGGTA TTATCACTGA AGAATATGCA GGTTTTTTCC CTACGTGGTT AGCACCGATT
CAAGCTGTTG TGATGAATAT TACTGATAGT CAAGCGGATT ATGTGCAAAA AGTAGTTAAG
CAATTTTCTG AAGCGGGATT ACGTGTAAAA GCGGATATAC GCAATGAAAA AGTCGGTTTC
AAAATTCGTG AACATACCTT ACGTCGAGTA CCTTATATGC TAGTTTGCGG CGATAAAGAA
ATTGTGGAAA ACAAGATTGC AGTTCGTACC CGTAAAGGGA CGGATTTAGG CACGTTTAGT
GTTGAAGAAT TTGTTGAGAT TTTAAAACAA CAAGTGAGAA AACGTGAGTT AACCTTGTTA
GGTGAATAA
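
The gene sequence above is laid out in 10-base blocks, so whitespace has to be removed before any computation on it. The sketch below shows one way to do that and to compute a GC percentage comparable to the "GC content" field. It is an illustration, not the database's own method; the helper names are hypothetical, and only the first sequence line is used as a small demo.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace so the 10-base blocks join into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence in this record, used only as a demo input.
first_line = "ATGCCAATTA TTACTTTACC TGACGGTTCA CAACGTCAAT TCGATAATCC CGTTTCTGTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq) * 100))
```

Passing the full gene sequence instead of `first_line` would give the value to compare against the GC content reported for the gene.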
 
Protein sequence
MPIITLPDGS QRQFDNPVSV LEVAQSIGSG LAKATIAGRV NGERRDASDM ISEDANLEII 
TAKDEDGLEI IRHSTAHLLG HAIKQLFPNV KMAIGPTIDN GFYYDIDLDR SLTQEDIDTL
EKRMLELAKT NYDVIKKRVS WQEARDTFES RAEPYKIAIL DENIAKDDQP ALYHHEEYID
MCRGPHVPNM RFCHHFKLMK VAGAYWRGNS DNKMLQRIYG TAWADKKQLA DYLHRLEEAA
KRDHRKIGKA LDLYHMQEEA PGMVFWHNDG WTIFRELETF VRTKLKEYDY QEVKGPFMMD
RVLWERTGHW QNYADLMFTT QSENREYAIK PMNCPGHVQI FNQGLKSYRD LPIRMAEFGS
CHRNEPSGSL HGLMRVRGFT QDDAHIFCTE DQIESEVTSC IKMVYDIYST FGFTDIFVKL
STRPEKRIGE DVMWDRAEQG LANALKHNGL EYEIQEGEGA FYGPKIEFAL RDCLDREWQC
GTIQLDFALP GRLDATYVAE DNARRTPVMI HRAILGSIER FIGIITEEYA GFFPTWLAPI
QAVVMNITDS QADYVQKVVK QFSEAGLRVK ADIRNEKVGF KIREHTLRRV PYMLVCGDKE
IVENKIAVRT RKGTDLGTFS VEEFVEILKQ QVRKRELTLL GE
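
The protein sequence can be checked against the gene sequence by translating the CDS codon by codon. The sketch below builds the codon table from the standard NCBI ordering. Translation table 11 assigns the same amino acids to codons as the standard table and differs in which start codons it permits, which does not matter here because this gene begins with ATG. The function name `translate` and the choice to drop the terminal stop are assumptions made for illustration.

```python
from itertools import product

# NCBI ordering: first base varies slowest, each position cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace allowed) and drop one terminal stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCCAATTA TTACTTTACC TGACGGTTCA"))  # MPIITLPDGS
print(translate("GGTGAATAA"))                          # GE
```

The two printed fragments match the start (MPIITLPDGS) and end (GE) of the protein sequence listed above. Translating the full gene sequence the same way allows a complete comparison.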