Gene Hlac_2550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2550 
Symbol 
ID7399775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2525917 
End bp2527218 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content71% 
IMG OID643709622 
Producthomoserine O-acetyltransferase 
Protein accessionYP_002567192 
Protein GI222480955 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCG TTCCGACCGA CCACGGCGTC GCCGCTCTCG GGGAGTTCGT CTTCGAGTGC 
GGCCAGTCGG TCCCCGATCT GGAGGTCGCC TACGAGACCC ACGGCGAGTT CGACGGCGAC
AACGTGGTGT TGGTCTGCCA CGCGCTCACC GGTAGCCAGA ACGTCGCCCG GTCGCCCGCG
CCGGAGCGCA ACGAGGGGAC CCGCGGAGCC GGGCAGGCCG GACAGGCCCG CGCGTGGTGG
GACGACATCG TCGGCCCGGG GAAGGCGATA GACACCACGA AGTACTACGT CGTCTGCGCG
AACGTTCCCG GTTCCTGTTA CGGCACCACG GGGCCGGCGA GCGAGCGCCC AGCCGACCTC
GACCTCCCCG AGGAACCCGA TCACGACCGG TGGGGGACCG CCTTCCCGCC GGTGCAGGTC
GAGGACTGGG CGCGCTCGCA GCGCCGTCTG CTGGACCACC TCGGCGTGGG CCGGCTCCGA
GCCGTCGTCG GCGGGAGCGT CGGCGGGATG AACGTCTTGG AGTGGGCGAA GCGCTACCCC
GACGACGTCG ACCGCGTGGT CGCCATCGCG ACCGCCGGTC GCCTCGACGC GCAGTGTCTC
GCGCTCGACG CGGTCGCCCG GCGGGCGATC CGCGCGGACC CGAACTGGAA CGGGGGCAAC
TACTACGGCG AGGGCCGCCC CTCGCCGGAC GAAGGGCTCG CCTTGGCCCG TCAGATCGGG
CACATCATGT ACCTCTCGAA GGCGTCGATG GAGCGGAAGT TCGGTCGTCG CTCGGCGGGC
CGCGACTCGC TGACCCGCGA GGAGGGAGAT TTGGGTCTCC CGCCGGAGCC AACGGCGGCC
TTCTTCCCGT ACCGCGAGGT GGAGTCGTAC CTCGACTATC AGGCGGAGGG GTTCAGCGAG
CGGTTCGACG CCAACAGCTA CCTCTACCTC ACGCGCGCGA TGGACGAGTA CGACCTCTCC
GCCGGCCACG GCACCGACGC CGACGCGCTC GCCGCCTTCG AGGGCGAGGC GCTGTTGATG
AGCTTTACCG CCGACTGGCA CTTCACCGTC GAGCAGTCGT CGTCGCTTGC GGCCGCCTTC
CGCGATCGGG ATGTCCCCGT CGCCCACCAC GTGATCGACT CCGATCACGG CCACGACGCG
TTCCTCGTCG AGCCCGAACA TGTCGGCCCG CCGCTGCGTG ACTTCCTCGT GGAGGGGGTC
GGAGGTCGGG CGGTCTCCGA TGACGGCGGC GGTGGGGGTA ACGACTCCGC GCGGCCCGAG
CGGGACCACG CGCCGGTTCA CGCGAGCCTT TTTAAAGGGT AG
 
Protein sequence
MSTVPTDHGV AALGEFVFEC GQSVPDLEVA YETHGEFDGD NVVLVCHALT GSQNVARSPA 
PERNEGTRGA GQAGQARAWW DDIVGPGKAI DTTKYYVVCA NVPGSCYGTT GPASERPADL
DLPEEPDHDR WGTAFPPVQV EDWARSQRRL LDHLGVGRLR AVVGGSVGGM NVLEWAKRYP
DDVDRVVAIA TAGRLDAQCL ALDAVARRAI RADPNWNGGN YYGEGRPSPD EGLALARQIG
HIMYLSKASM ERKFGRRSAG RDSLTREEGD LGLPPEPTAA FFPYREVESY LDYQAEGFSE
RFDANSYLYL TRAMDEYDLS AGHGTDADAL AAFEGEALLM SFTADWHFTV EQSSSLAAAF
RDRDVPVAHH VIDSDHGHDA FLVEPEHVGP PLRDFLVEGV GGRAVSDDGG GGGNDSARPE
RDHAPVHASL FKG