Gene Hhal_2153 details


Gene Information

Locus tag: Hhal_2153
Symbol:
ID: 4709698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2362230
End bp: 2363498
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 67%
IMG OID: 639856628
Product: O-acetylhomoserine/O-acetylserine sulfhydrylase
Protein accession: YP_001003719
Protein GI: 121998932
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
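
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2362230
END_BP = 2363498


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1269
print(protein_length(gene_length(START_BP, END_BP)))  # 422
```

Both results agree with the Gene Length (1269 bp) and Protein Length (422 aa) reported in the record.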


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.271138
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAACTCG AGACCCAGGC CGTCCACGCC GGCTACAGCC CCGATCCGAC CACCCGAGCG 
GTGGCAGTGC CCATCTATCA GACCACCTCC TACGCCTTCG ATGACACGCA GCACGGGGCG
GATCTCTTCG ACCTCAAGGT CGAGGGCAAT ATCTATACAC GGATCATGAA CCCCACCAAC
GCCGTGCTCG AGCAGCGCGT GGCGGCTATG GAGGGGGGCA TCGGTGGGCT GGCGGTGGCC
TCCGGCATGG CGGCGATCAC CTACGCCATC CAGGCGATCA CCCGCGCCGG CGACAACATC
GTCTCCACCT CGCGGCTCTA CGGCGGCACC TACAACCTCT TCGCCCACTC GCTGCCCATC
CACGGCATCG AGGTTCGCTT CGCCGCCCAA GACGACTACG AGCGCCTCGA GGCGCAGATC
GACGAGCGCA CCAAGGCGCT CTTCTGCGAG AGTGTCGGCA ACCCCTCTGG TGAGGTGGTG
GACGTCGAGC GGCTGGCCGA GATCGCCCAC CGACACGGCA TCCCGCTGAT GGTCGACAAC
ACCGTGCCCA CGCCCTATCT GTGGCGCCCC ATCGACAACG GCGCGGACAT CGTCATCCAC
TCGCTGACCA AGTACATGGG TGGGCACGGC ACCACGGTGG GCGGGGTGAT CGTCGACTCC
GGGAAGTTCC CCTGGGCCGA CCACGGAGAC CGCTTCCCGA TGATGGTCGA GCCGGACCCC
TCCTACCATG GGGTGGTTTA CACCGAGGAT ATGGGCGAGG CGGCCTACAT CACCCGCTGC
CGGGTGGTGC CGTTGCGCAA CATGGGGGCC GCGCTGTCGC CGTTCAACGC CTTCCAGCTG
CTGCAGGGGA TCGAGACCCT GCCGGTGCGT ATGGACCGCC ACTGCGACAA CGCCCAGCGG
GTTGCCGAGT ACCTGCAGGC CCATCCGCGG GTAAGCTGGG TGAAGTTCGC CGGCCTCGAG
GACAGCCCCT ACAAGCCCCT GGCTGATCGC TACATGGGCG GCCGGGCGGC CAGCATCCTG
AGTTTCGGGG TGCAGGGCGG CTTCGAGGCG GGGGCACGCT TCATCGATGC CCTGCAGCTG
ATCACCCGGC TGGTGAACAT CGGTGATGCC AAGTCCCTGG CTTGCCACCC GGCCAGCACC
ACGCACCGCC AGCTCAACGA TGAGGAGCTG GAGAGCGCAG GGGTATCGCG GGATCTGGTG
CGCCTGTCCA TCGGCCTGGA GCACGTCGAC GACATCCTGG CGGATATCGA CCAGGCCTTG
CAGGCGTAA
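
The gene sequence above is laid out in blocks of 10 bases, 60 per line, so the whitespace has to be removed before the sequence can be measured. The following is a minimal sketch of that step, with a GC-fraction helper that could be compared against the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "TTGAAACTCG AGACCCAGGC CGTCCACGCC GGCTACAGCC CCGATCCGAC CACCCGAGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the complete sequence through `clean_sequence` should give a string whose length equals the reported gene length if the layout assumption holds.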
 
Protein sequence
MKLETQAVHA GYSPDPTTRA VAVPIYQTTS YAFDDTQHGA DLFDLKVEGN IYTRIMNPTN 
AVLEQRVAAM EGGIGGLAVA SGMAAITYAI QAITRAGDNI VSTSRLYGGT YNLFAHSLPI
HGIEVRFAAQ DDYERLEAQI DERTKALFCE SVGNPSGEVV DVERLAEIAH RHGIPLMVDN
TVPTPYLWRP IDNGADIVIH SLTKYMGGHG TTVGGVIVDS GKFPWADHGD RFPMMVEPDP
SYHGVVYTED MGEAAYITRC RVVPLRNMGA ALSPFNAFQL LQGIETLPVR MDRHCDNAQR
VAEYLQAHPR VSWVKFAGLE DSPYKPLADR YMGGRAASIL SFGVQGGFEA GARFIDALQL
ITRLVNIGDA KSLACHPAST THRQLNDEEL ESAGVSRDLV RLSIGLEHVD DILADIDQAL
QA
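
The gene sequence starts with TTG while the protein starts with M. Translation table 11 accepts TTG as an initiation codon, and an initiation codon is read as methionine. The sketch below illustrates this using the first 30 bases of the gene. It is a simplified stand-in for a full translation tool, and the names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon-to-residue assignments of translation table 11 (the same
# assignments as the standard code), listed in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons accepted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq):
    """Translate a CDS, reading the initiation codon as M and stopping at a stop codon."""
    seq = "".join(seq.split()).upper()
    if seq[:3] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]
    for i in range(3, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate_cds("TTGAAACTCG AGACCCAGGC CGTCCACGCC"))  # MKLETQAVHA
```

The output matches the first 10 residues of the protein sequence in the record.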