Gene Information |
Locus tag | GWCH70_1230 |
Symbol | |
ID | 7977700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1283798 |
End bp | 1284946 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644798176 |
Product | acetyl-CoA acetyltransferase |
Protein accession | YP_002949349 |
Protein GI | 239826725 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | [TIGR01930] acetyl-CoA acetyltransferases |
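The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though the numbers agree with it) and that the last codon of the CDS is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1283798
end_bp = 1284946
gene_length_bp = 1149
protein_length_aa = 382

# Assumption: coordinates are inclusive at both ends, so add 1.
span = end_bp - start_bp + 1

# One codon is three bases; the final codon is assumed to be a stop codon.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # 1149 383 382
```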
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00138705 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGAGAAG TTGTCATTGT CGAAGCCGTG CGCACGCCAG TCGGCAAGCG AAACGGCGTG TTCCGCAACG TACATCCCGT TCATTTAGCG TCAACGGTGC TCAATGAAGT CGTAAAAAGA GCGGGAATCG AAAAACGGCT TGTCGAAGAT ATTGTGATGG GATGTGTCAC GCCCATTGCA GAGCAAGGAT ACAATATTGG GCGGCTTGCT GCGCTGGAGG CGGGATTTCC AATCGAAGTG CCAGCCGTGC AAATCAATCG GATGTGCGGG TCAGGGCAGC AGGCAATTCA TTTTGCTGCC CAAGAAATTC GCTCTGGCGA TATGGATATT ACGATTGCTG CCGGCGTCGA AAGCATGACG AAGGTGCCGA TTTTAAGCGA TGGAAACGAA AAAACGATTC CGCCGTCGCT GCATGAAAAA TATGAATTTG TTCATCAAGG CATTTCCGCG GAATTAATTG CCGAAAAGTA CGGGCTGACG CGCGAGCAGC TGGACGCATA TGCATACGAA AGCCATCAGC GCGCGATTCG GGCGCAAGAA CAAGGGATAT TTGATCGAGA AATTGTGCCT GTGGAAGGTT TGGATAAGGA CGGAAACGTG ATAATAGTAA CAAGTGATGA AGGGCCGCGT CGGGATACAT CGCCAGAAGC GCTCGCTTCA CTGCAGCCTG TCTTTCGTGA AAACGGAAAA ATTACAGCTG GCAATGCGAG CCAAATGAGC GATGGGGCAG CTGCGGTGCT ATTAATGGAA AAAGAAACAG CGTGGAAGCT TGGCGTAACG CCAAAAGCGA AGATCATTGC GCAAACAGTC GTCGGCTCCG ATCCGACATA TATGCTTGAT GGTGTGATTC CAGCAACAAA AAAGGTGCTT CAAAAAGCAG GATTAACGAT TGATGATATC GACTTAATCG AAATTAACGA AGCGTTTGCC CCTGTTGTAT TAGCGTGGCA AAAAGAAATC GGCGCTCCGT TTTCCAAAGT GAATGTAAAC GGTGGCGCCA TCGCGCTCGG CCACCCGTTA GGAGCGACCG GTGCGAAATT AATGACATCG CTCGTTTACG AACTCAAACG GCGAAATGGC AAATATGGAT TGCTGACGAT TTGTATCGGT CACGGGATGG CTACCGCAAC CATCATCGAA CGGCTGTAA
Protein sequence | MREVVIVEAV RTPVGKRNGV FRNVHPVHLA STVLNEVVKR AGIEKRLVED IVMGCVTPIA EQGYNIGRLA ALEAGFPIEV PAVQINRMCG SGQQAIHFAA QEIRSGDMDI TIAAGVESMT KVPILSDGNE KTIPPSLHEK YEFVHQGISA ELIAEKYGLT REQLDAYAYE SHQRAIRAQE QGIFDREIVP VEGLDKDGNV IIVTSDEGPR RDTSPEALAS LQPVFRENGK ITAGNASQMS DGAAAVLLME KETAWKLGVT PKAKIIAQTV VGSDPTYMLD GVIPATKKVL QKAGLTIDDI DLIEINEAFA PVVLAWQKEI GAPFSKVNVN GGAIALGHPL GATGAKLMTS LVYELKRRNG KYGLLTICIG HGMATATIIE RL
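As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first 30 bases and the last 9 bases of the gene sequence as listed. The codon table is deliberately partial: it holds only the codons found in these two excerpts, whose assignments are the same in the standard code and in translation table 11. The `translate` helper is hypothetical and is not part of any database tool.

```python
# Partial codon table: only the codons that occur in the two excerpts below.
# Not a complete genetic code; an unknown codon would raise KeyError.
CODONS = {
    "ATG": "M", "AGA": "R", "GAA": "E", "GTT": "V", "GTC": "V",
    "ATT": "I", "GCC": "A", "GTG": "V", "CGG": "R", "CTG": "L",
    "TAA": "*",  # stop codon
}

def translate(grouped_dna):
    """Translate a space-grouped DNA string, stopping at the first stop codon."""
    dna = grouped_dna.replace(" ", "")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODONS[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

first_30 = "ATGAGAGAAG TTGTCATTGT CGAAGCCGTG"  # start of the gene sequence
last_9 = "CGGCTGTAA"                            # end of the gene sequence

print(translate(first_30))  # MREVVIVEAV
print(translate(last_9))    # RL
```

The two results match the first ten residues and the final two residues of the protein sequence in the record.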