Gene Hlac_2551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2551 
Symbol 
ID7399776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2527223 
End bp2528521 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content69% 
IMG OID643709623 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_002567193 
Protein GI222480956 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGTG GGTTCAACAC CCGGAGTCTC CACGCCGGCG CCGAGGCCGA CCCGGCGACC 
GGGTCGCGCG CGACCCCGAT CCACCAGACG ACCTCGTTCG TCTTCGACGA CGCGGAGACG
GCGGCGGAGA TGTACGCGCT CCGGGCGGAG GGCCACATCT ACTCCCGGCT CTCCAACCCG
ACCGTGAGCG TCCTCGAAGA CCGGATCGCC GACCTGTCGG GCGGCTCCGA CGCGGTCGCG
ACCGGCTCGG GGATGGCCGC GTTCGACGCG ATAACGACCG TGCTCGCGAG CGCGGGCGAC
AACGTCGTCG CCAGTTCCGA GATGTACGGC GGCACGGCCG CGTACCTCAC CAGCATCGCG
AACCGCCGCG GAGTCGAGGC CCGACTCGTC GACACGCTCG ACGACGAGGC GTACGCGGAC
GCGATCGACG ACGACACCGC GTTCGTCCAC GTCGAAACGG TCGCGAACCC TTCGCTCGTC
ACGCCCGACT TCGAGCGGCT CGCGGAGATC GCCCACGAGA ATGCGGTCCC GCTCGTGGTC
GACAACACGT TCGCGACCCC CTACCTCTGT CGGCCGTTCG AGCACGGCGC CGACATCGTT
TGGGAGTCGA CGACGAAGTG GATCACGGGC AACGGGACGA CCGTCGGCGG CATCGTCGTC
GACGGCGGCC AGTTCCCGTG GGACCACCCC GACGCCGACT ACGACGAACT CGACGGGCAG
TCCCCCGCCT ACCCGATCGA CTTCGTCGAG CGGTTCGGCG ACGCCGCCTT CGGCAACGTC
GCCCGGCAGC GCGGGGTGCG GCCGACCGGC GGCCAGCAGT CGCCGTTCGA CGCGTGGCAG
ACGATTCAGG GGCTCAACAC GCTCCCGCTC CGGATGGAGC GCCACTGCGA GAACGCCCGG
ACCGTCGCCG AGTTCCTCCA AGACGACGAC CGGGTCGATT GGGTGACGTA CCCCGGCTTC
GAGGACCACC AGAGCCACGA CAACGCCGCC AAATACCTCG ACGGCTACGG CGGGATGGTC
ACCTTCGGCG TCGACGGCGG CTACGAGGCC GCCAAGACCT TCTGCGAGGC CGTCGACCTG
ACGAGCTTCC TCGCGAACAT CGGGGACGCG AAGACGCTGG TCATCCACCC GGCCTCGACC
ACGCATGCGC AGATGGACGA AACGCAACAG CGGCTCGCCG GGGTCTACCC GGAGATGCTC
CGGCTCTCCG TCGGAATCGA GGACGCAGAC GACGTGATCG CCGACCTCGA TCAGGGGCTC
ACCGCCGGCG AACGCGCCGC GACCGACACG GAGGTGTGA
 
Protein sequence
MTRGFNTRSL HAGAEADPAT GSRATPIHQT TSFVFDDAET AAEMYALRAE GHIYSRLSNP 
TVSVLEDRIA DLSGGSDAVA TGSGMAAFDA ITTVLASAGD NVVASSEMYG GTAAYLTSIA
NRRGVEARLV DTLDDEAYAD AIDDDTAFVH VETVANPSLV TPDFERLAEI AHENAVPLVV
DNTFATPYLC RPFEHGADIV WESTTKWITG NGTTVGGIVV DGGQFPWDHP DADYDELDGQ
SPAYPIDFVE RFGDAAFGNV ARQRGVRPTG GQQSPFDAWQ TIQGLNTLPL RMERHCENAR
TVAEFLQDDD RVDWVTYPGF EDHQSHDNAA KYLDGYGGMV TFGVDGGYEA AKTFCEAVDL
TSFLANIGDA KTLVIHPAST THAQMDETQQ RLAGVYPEML RLSVGIEDAD DVIADLDQGL
TAGERAATDT EV