Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1685 |
Symbol | |
ID | 4710165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1839722 |
End bp | 1840906 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639856152 |
Product | acetyl-CoA acetyltransferases |
Protein accession | YP_001003251 |
Protein GI | 121998464 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | [TIGR01930] acetyl-CoA acetyltransferases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.813003 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCAGC GCCGAGTCGT CATCTGCTCA CCCACACGCA CGCCCATCGG GACGTTTGGC GGGTCCCTCA AGACCGTACC GGCCCCCGAC CTCGGTGCCA CCGCCATCCG CGGCACCCTG GAGCGCTCCG GCCTCGATCC GGCAGCCGTG GATACCGTGG TCATGGGCAA CGTGATCCAG GCCGGCTGCA AGATGAACCC GGCCCGCCAG GCCTCGATCA ACGGCGGCGT CCCCGCGGAA ACGCCGGCAC TGACCGTCAA CCGGGTCTGC GGCTCTGGCG CCCAGGCCAT CGCCAACGCC TTCCAGGAGG TGGCCCTCGG CTTTGCCGAC ACCGCTGTTG CCGGCGGCAT GGAGAACATG GACCGGGCGC CGTACTTGCT CATGCAGGGG CGCTACGGGT ACCGCCTCGG CCACCAGCAG ATCCTCGACG CCGTACTCAG CGACGGCCTC AATGACGCCT TCTCCGACCA GCACTCCGGC TGGCATACCG AGGATCTGGC CGAGCAGTAT CAGATCAGCC GCCAGGAACA GGACGAGTGG GCACTCCGTT CCCAGCAACG GTTCAGCGCC GCCCAGTCCA CCGGCCACTT CGAGGCGGAG ATCACTCCGG TGGACGTGCC CGGCCGCAAG GGGCCCACGC GCTTCGAGGC CGATGAGCAT AACCGCGGCG ACACCACGCT GGAGGGGCTG CAGAAGCTGC GCCCGGCATT CCGCAAGGAG GGCACCATCA CCGCCGGCAA CGCCCCCGGG GTCAACACGG GTGCAGCCGC CATGGTGGTC GCCGAGGAAG AGGCCGCCCG CAAGCACGGG CTCACGCCGG CAGCGCGCCT GGTCGCCTAC GGCGTCGGCG GTGTCGAGCC GGGGATGTTC GGGATCGGCC CCGTGCCCGC GGTCAGGCAG TGCCTCCAGC GCGCCGGCTG GTCCGCCGAC GAGGTGGGGC GCTGGGAGAT CAACGAGGCG TTTGCAGCGA TCGCCATCGC CGTCACCCGC GACTTGGGCC TGGACCCGGA GCGGGTCAAC GTCGAGGGGG GCGCTGTCGC CCACGGCCAC CCGATCGGCG CAACCGGCGC AGTGCTCACC ACCCGGCTAA TCCACGCCAT GCAGCGAGAC GGCGTGGACA AGGGCGTGGT CACCATGTGC ATCGGCGGCG GCCAGGGCGT GGCCCTGGCG ATCGAACGGG TCTAA
|
Protein sequence | MSQRRVVICS PTRTPIGTFG GSLKTVPAPD LGATAIRGTL ERSGLDPAAV DTVVMGNVIQ AGCKMNPARQ ASINGGVPAE TPALTVNRVC GSGAQAIANA FQEVALGFAD TAVAGGMENM DRAPYLLMQG RYGYRLGHQQ ILDAVLSDGL NDAFSDQHSG WHTEDLAEQY QISRQEQDEW ALRSQQRFSA AQSTGHFEAE ITPVDVPGRK GPTRFEADEH NRGDTTLEGL QKLRPAFRKE GTITAGNAPG VNTGAAAMVV AEEEAARKHG LTPAARLVAY GVGGVEPGMF GIGPVPAVRQ CLQRAGWSAD EVGRWEINEA FAAIAIAVTR DLGLDPERVN VEGGAVAHGH PIGATGAVLT TRLIHAMQRD GVDKGVVTMC IGGGQGVALA IERV
|
| |