Gene Hhal_1685 details

Gene Information

Locus tag: Hhal_1685
Symbol:
ID: 4710165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1839722
End bp: 1840906
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 70%
IMG OID: 639856152
Product: acetyl-CoA acetyltransferases
Protein accession: YP_001003251
Protein GI: 121998464
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.813003
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCAGC GCCGAGTCGT CATCTGCTCA CCCACACGCA CGCCCATCGG GACGTTTGGC 
GGGTCCCTCA AGACCGTACC GGCCCCCGAC CTCGGTGCCA CCGCCATCCG CGGCACCCTG
GAGCGCTCCG GCCTCGATCC GGCAGCCGTG GATACCGTGG TCATGGGCAA CGTGATCCAG
GCCGGCTGCA AGATGAACCC GGCCCGCCAG GCCTCGATCA ACGGCGGCGT CCCCGCGGAA
ACGCCGGCAC TGACCGTCAA CCGGGTCTGC GGCTCTGGCG CCCAGGCCAT CGCCAACGCC
TTCCAGGAGG TGGCCCTCGG CTTTGCCGAC ACCGCTGTTG CCGGCGGCAT GGAGAACATG
GACCGGGCGC CGTACTTGCT CATGCAGGGG CGCTACGGGT ACCGCCTCGG CCACCAGCAG
ATCCTCGACG CCGTACTCAG CGACGGCCTC AATGACGCCT TCTCCGACCA GCACTCCGGC
TGGCATACCG AGGATCTGGC CGAGCAGTAT CAGATCAGCC GCCAGGAACA GGACGAGTGG
GCACTCCGTT CCCAGCAACG GTTCAGCGCC GCCCAGTCCA CCGGCCACTT CGAGGCGGAG
ATCACTCCGG TGGACGTGCC CGGCCGCAAG GGGCCCACGC GCTTCGAGGC CGATGAGCAT
AACCGCGGCG ACACCACGCT GGAGGGGCTG CAGAAGCTGC GCCCGGCATT CCGCAAGGAG
GGCACCATCA CCGCCGGCAA CGCCCCCGGG GTCAACACGG GTGCAGCCGC CATGGTGGTC
GCCGAGGAAG AGGCCGCCCG CAAGCACGGG CTCACGCCGG CAGCGCGCCT GGTCGCCTAC
GGCGTCGGCG GTGTCGAGCC GGGGATGTTC GGGATCGGCC CCGTGCCCGC GGTCAGGCAG
TGCCTCCAGC GCGCCGGCTG GTCCGCCGAC GAGGTGGGGC GCTGGGAGAT CAACGAGGCG
TTTGCAGCGA TCGCCATCGC CGTCACCCGC GACTTGGGCC TGGACCCGGA GCGGGTCAAC
GTCGAGGGGG GCGCTGTCGC CCACGGCCAC CCGATCGGCG CAACCGGCGC AGTGCTCACC
ACCCGGCTAA TCCACGCCAT GCAGCGAGAC GGCGTGGACA AGGGCGTGGT CACCATGTGC
ATCGGCGGCG GCCAGGGCGT GGCCCTGGCG ATCGAACGGG TCTAA
 
Protein sequence
MSQRRVVICS PTRTPIGTFG GSLKTVPAPD LGATAIRGTL ERSGLDPAAV DTVVMGNVIQ 
AGCKMNPARQ ASINGGVPAE TPALTVNRVC GSGAQAIANA FQEVALGFAD TAVAGGMENM
DRAPYLLMQG RYGYRLGHQQ ILDAVLSDGL NDAFSDQHSG WHTEDLAEQY QISRQEQDEW
ALRSQQRFSA AQSTGHFEAE ITPVDVPGRK GPTRFEADEH NRGDTTLEGL QKLRPAFRKE
GTITAGNAPG VNTGAAAMVV AEEEAARKHG LTPAARLVAY GVGGVEPGMF GIGPVPAVRQ
CLQRAGWSAD EVGRWEINEA FAAIAIAVTR DLGLDPERVN VEGGAVAHGH PIGATGAVLT
TRLIHAMQRD GVDKGVVTMC IGGGQGVALA IERV
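
The fields in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, that assumes the start and end coordinates are 1-based and inclusive and that the codon assignments of translation table 11 match the standard NCBI ordering. The helper names (clean, span_length, gc_fraction, translate) are hypothetical and chosen only for illustration.

```python
# Codon order used by the NCBI genetic code tables: each base cycles through T, C, A, G.
BASES = "TCAG"
# Amino acids for translation table 11 in that codon order ('*' marks a stop codon).
# Assumption: table 11 shares these codon assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def span_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First row of the gene sequence, copied from the record above.
    first_row = "ATGTCCCAGC GCCGAGTCGT CATCTGCTCA CCCACACGCA CGCCCATCGG GACGTTTGGC"
    print(span_length(1839722, 1840906))  # compare with the Gene Length field
    print(translate(first_row))           # compare with the start of the protein sequence
    print(gc_fraction(first_row))         # GC fraction of this row only, not the whole gene
```

Under these assumptions, the coordinate span comes to 1185 bp, which agrees with the Gene Length field and with 394 codons plus one stop codon (3 x 395 = 1185). Translating the first row gives MSQRRVVICS PTRTPIGTFG, the first 20 residues of the protein sequence. Passing the full gene sequence to gc_fraction would allow the same kind of check against the GC content field.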