Gene Hhal_0949 details

Gene Information

Locus tag: Hhal_0949
Symbol: metX
ID: 4709393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1024649
End bp: 1025797
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 68%
IMG OID: 639855418
Product: homoserine O-acetyltransferase
Protein accession: YP_001002527
Protein GI: 121997740
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2021] Homoserine acetyltransferase
TIGRFAM ID: [TIGR01392] homoserine O-acetyltransferase
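
The length fields in this record are mutually consistent, and the arithmetic can be checked directly. The following is a minimal Python sketch, assuming the Start bp and End bp coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1024649, 1025797)
print(length_bp)                  # 1149, matches Gene Length
print(protein_length(length_bp))  # 382, matches Protein Length
```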


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCAGAA GCCCGCCAAC CGACTCTGTC GGACTGGTGA CCCAGCACAA GGCCACCTTC 
GAGGAGCCCC TGCCGCTCGT CTGCGGGAGG GAGCTGCCCC GTTATGAGCT GGTCTACGAG
ACCTACGGCG AGCTCAATCG CGAGGGCACC AACGCCATCC TGGTCTGCCA CGCCCTCTCC
GGCAATCACC ACGCCGCCGG TTACCACTCC GAGCACGATC GCAAACCGGG GTGGTGGGAG
ACGTGTATCG GCCCGGGCAA GCCCCTGGAC ACCAATCGCT TCTTCGTCGT CTGCAGCAAT
AACCTGGGCG GCTGCCACGG CTCCACCGGA CCGGCGAGCA TCAACCCGGA GACCGGCAAA
CCCTACGGCG ACCAGTTCCC CATCGTCACC GTGCGCGACT GGGTGCGCAG CCAGGCGCGC
CTGGCCGACG AGCTGGGTAT CCGTCAGTGG GCGGCGGTGG CCGGCGGCAG CCTGGGCGGC
ATGCAGGCGA TGCAGTGGGC CATCGACTAC CCCGAGCGCC TGCGCCACGC CATCGTCATC
GCCGCCGCTC CGCGGCTGTC GGCCCAGAAC ATCGGCTTCA ACGAGGTCGC CCGGCAGGCG
ATTATGAGCG ACCCGGAGTT CCACGGCGGG CGCTACTACG ACTACGGCGT CTCGCCCCGG
CGGGGGCTGG CGGTGGCGCG CATGCTCGGC CACATCACCT ACCTCTCGGA CGACGCCATG
CGCGCGAAGT TCGGCCGCGA CCTGCGTGGC GACATGAGCT TCGACTTCGA GCAGGTGGAT
TTCGAGGTCG AGAGCTACCT GCGCTACCAG GGGCAGCGCT TCGTGCAGGA CTTCGACGCC
AACACCTACC TGCTGATGAC CAAGGCCCTC GACTACTTCG ACCCGGCCGC CGACCACGAT
GACGACTTCT CGGCAGCCCT GGCCCACATC CAGTGCTCGA CGCTGCTGCT CTCCTTCTCC
AGCGACTGGC GCTTCGCCCC GGCGCGCTCG CGCGAGATCC TCCGCGCGCT GCTGGAGCAC
AACAAGCCAG TCAGCTACAT GGAGATCGAG GCCACCCAGG GCCACGACGC CTTCCTGATG
CCCATCCAGC GCTACCTGGA GGCCTTCTCC GCCTACATGG GCAACGTCGC CCGGGAGGTG
GGGGCGTGA
 
Protein sequence
MVRSPPTDSV GLVTQHKATF EEPLPLVCGR ELPRYELVYE TYGELNREGT NAILVCHALS 
GNHHAAGYHS EHDRKPGWWE TCIGPGKPLD TNRFFVVCSN NLGGCHGSTG PASINPETGK
PYGDQFPIVT VRDWVRSQAR LADELGIRQW AAVAGGSLGG MQAMQWAIDY PERLRHAIVI
AAAPRLSAQN IGFNEVARQA IMSDPEFHGG RYYDYGVSPR RGLAVARMLG HITYLSDDAM
RAKFGRDLRG DMSFDFEQVD FEVESYLRYQ GQRFVQDFDA NTYLLMTKAL DYFDPAADHD
DDFSAALAHI QCSTLLLSFS SDWRFAPARS REILRALLEH NKPVSYMEIE ATQGHDAFLM
PIQRYLEAFS AYMGNVAREV GA
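
Both sequences are printed in space-separated blocks of 10 residues, so the whitespace has to be stripped before they are compared. The following is a minimal Python sketch of how the gene sequence could be translated and its GC content computed. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are illustrative.

```python
from itertools import product

# Codon table in the NCBI layout (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Note: an alternative start codon (e.g. GTG) would need to be read
    as M at position 1; this gene starts with ATG, so that is not handled.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGGTCAGAA GCCCGCCAAC CGACTCTGTC"
print(translate(first_blocks))  # MVRSPPTDSV, the first block of the protein
```

Passing the full gene sequence to `translate` and `gc_percent` should reproduce the protein sequence and the reported GC content, up to rounding.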