Gene Hore_19570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_19570 
Symbol 
ID7312772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2101720 
End bp2102913 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content48% 
IMG OID643612403 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_002509699 
Protein GI220932791 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase
[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA AGACAAAAAA GCAGTATGGA TTTAATACCC TGGCTTTACA CCATGGTTAT 
GACCCGGTCC AGGAAGGGAG CAAATCCAGG GCAGTTCCCA TTTACCAGAC AACATCCTAT
ATGTTTGATA GTGCTGAACA TGCTGCTGGC CTGTTTGCCG AAGAAGAAGA AGGGTATATT
TATACCAGAA TTGGGAACCC GACAACTAAA GTTTTTGAAG AAAGGATGGC CGTCCTCGAG
GGAGGGGAGG CCGGGCTGGC AACCTCATCG GGGCAGTCTG CTATTACCCT GACTATACTG
ACATTGGTCA GCCAGGGGGA AGAGGTGGTA TCATCAAGCT ATATTTACGG AGGGACCTAT
CATCTTCTGG CTGAGAGTCT CCCCCGGTAT GGGGTTAAGA CCAGATTTGT CAAACCTGAT
GATATAAATG ACTGGGAGCA GGCTATAACA GATAAAACCC GGGTTTTTTA CCTGGAATCA
CCGGGCAATC CCCGGCTTAA TATTGTTGAT ATTGAGGCTG TATCCAGCCT GGCCCATCAA
TACGGTATAA CTGTAGTTGT TGATAATACC TTTAATACTC CCTATTTAAG CCAGCCCCTT
AAATTGGGGG CTGACATTGT AGTCCATTCT ACTACCAAGT ATATCGGGGG TCATGGTAAT
TCAATTGGGG GGGTTATTGT TGGAACCCGT GATTTTATCC ATAAAGTCCG GACTGAGCTT
TACCGTGATA CTGGTCCTGC CATAAGCCCC TTTAATGCCT GGCTTTTCAT CCAGGGGTTA
GAGACCCTTT CATTGAGAAT GGAAAAACAC TGTAGTAATG CCATGGAGGT TGCCCGGTGG
CTCTCCGGAG ATGAAAGGGT TGAATGGGTG ACTTACCCTG GCCTTCCTGA CCATCCCCGG
CATGAACTGG CCAAAAAGCA GCAGCGGGGG TTTGGTGGAA TGATTTGTTT CGGGGTTAAA
GGTGGTTATT CAGCGGCCCG GAACCTTATC AACAGGGTGG AACTGTGTTC TCTACTTGCC
AATATAGGTG ATACCCGCAC CCTTATTATT CACCCTGCCT CTACCACCCA TGAGCAGTTG
AGTAGAGAGG AGCAGGAAAA GGCAGGGGTT ACCCCTGATT TAATCAGACT ATCGGTAGGA
ATAGAGGATG TATGGGATAT AATTGATGAC CTGGATCAGG CCCTGGGGGG GTAG
 
Protein sequence
MNKKTKKQYG FNTLALHHGY DPVQEGSKSR AVPIYQTTSY MFDSAEHAAG LFAEEEEGYI 
YTRIGNPTTK VFEERMAVLE GGEAGLATSS GQSAITLTIL TLVSQGEEVV SSSYIYGGTY
HLLAESLPRY GVKTRFVKPD DINDWEQAIT DKTRVFYLES PGNPRLNIVD IEAVSSLAHQ
YGITVVVDNT FNTPYLSQPL KLGADIVVHS TTKYIGGHGN SIGGVIVGTR DFIHKVRTEL
YRDTGPAISP FNAWLFIQGL ETLSLRMEKH CSNAMEVARW LSGDERVEWV TYPGLPDHPR
HELAKKQQRG FGGMICFGVK GGYSAARNLI NRVELCSLLA NIGDTRTLII HPASTTHEQL
SREEQEKAGV TPDLIRLSVG IEDVWDIIDD LDQALGG