Gene Information |
Locus tag | PICST_67034 |
Symbol | THIA |
ID | 4837570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1858711 |
End bp | 1860017 |
Gene Length | 1307 bp |
Protein Length | 392 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640388885 |
Product | Acetyl-CoA acetyltransferase IA (Peroxisomal acetoacetyl-CoA thiolase) (Thiolase IA) |
Protein accession | XP_001382583 |
Protein GI | 126132116 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | [TIGR01930] acetyl-CoA acetyltransferases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.16531 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGTAT ATATCGTAGC CTCCATCCGG ACTCCCATCG GCTCGTTCCA GGGCTCGCTC TCGCCTTTAA GCTCGGTAGA TTTGGGTGCA AAAGCGGTGC ACGAAGCACT TAAACAAGTG CCCTCTTTGC CTGCTTCAGC TGTCGAAGAG ATTTTCTTTG GCTCTGCTCT CCAGGCTAAC TTGGGACAAA ACCCAGCTCG TCAGGTGGCA CTCTCCGCCG GTTTGCCAGA AGCCGTGGTC GCTACTTCCG TTAATAAGGT ATGTGCTTCG GGGTTAAAGG CCATTATCTC TGGAGCCCAG ACCATCCTCA CCAACACCGC CGATGTAGTT GTAGTTGGCG GCTCTGAGTC GATGTCGAAT GTCCCATTCT ATGCTCCTAT CAGATCTGGA GTCAGATACG GAGACGCTTC GCTTGTGGAC GGGATTCAAA ATGACGGGCT TAAGGACGTC TACTCCCAGA AGCTCATGGG CCATGCTGGA GAAAAAGTTG CCAGCGACTT AAACATCACA AGAGCTGAGC AAGATGAATA CGCGATTGGC AGTTACGCCA AAGCTATTAA TGCTCATGAG ACGGGAAAGT TTGAGAACGA AATCACTCCA ATTACAATCA AAACGAGAGC AGGTACCAAG ACGGTCTCCA AAGATGAGGA TATCTCTAAG TACAACCCAG AGAAGTTGAA GACAATGAAA TCTGCTTTTA TTGACAATGG TACTGTGACA GCTGGTAATT CACCATCTCT TAATGATGGA GGTGCTGCTC TTATCCTTGT CTCGGAAGCT GCACTCAACA AGTATGGACT CAAACCATTG GCCAAAATCA GAAGCTGGGG TGAGGCCGCT AGAGCTCCTA TGGACTTCAC GATTGCACCA AGTTTGGCCA TTCCAAAAAC GTTGGAAAGA GCTGGAGTTT CGATCAATGA TGTCGATTAC TTTGAGCTTA ATGAAGCATT TTCAGTTGTT GGACTTGCCA ACTCCAAATT GTTGGATATC CCATTGGAAA AATTGAACGT CTATGGAGGT GCTGTAGCTA TTGGACATCC ATTGGGTTGC TCTGGGGCCA GAATTGTGGT CACATTGTTA AGTGTATTGA AGCAGGAGAA GACGAATTCC AAGTTGGGTG TTGCTGCTGT ATGTAACGGA GGCGGTGGAG CTTCTTCGAT CCTCATTGAA GCCTTGTGAT TTATAGAATT TATTGTCTAC TTAGCCACAT CCATAGAACT CATCATTGCT ATATAGATTA CTTCACAATG GTTTGGAAAA GTTGTAGCTA GTTAACACTA AAGTTAATAC AAGTTTGAAG TTCTTTC
|
Protein sequence | MSVYIVASIR TPIGSFQGSL SPLSSVDLGA KAVHEALKQV PSLPASAVEE IFFGSALQAN LGQNPARQVA LSAGLPEAVV ATSVNKVCAS GLKAIISGAQ TILTNTADVV VVGGSESMSN VPFYAPIRSG VRYGDASLVD GIQNDGLKDV YSQKLMGHAG EKVASDLNIT RAEQDEYAIG SYAKAINAHE TGKFENEITP ITIKTRAGTK TVSKDEDISK YNPEKLKTMK SAFIDNGTVT AGNSPSLNDG GAALILVSEA ALNKYGLKPL AKIRSWGEAA RAPMDFTIAP SLAIPKTLER AGVSINDVDY FELNEAFSVV GLANSKLLDI PLEKLNVYGG AVAIGHPLGC SGARIVVTLL SVLKQEKTNS KLGVAAVCNG GGGASSILIE AL
|
| |
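The record lists translation table 12, which in the NCBI genetic code tables is the alternative yeast nuclear code: it differs from the standard code in reading CTG as serine rather than leucine. The following is a minimal sketch, not the database's own method, of how the gene sequence above could be translated under that table. The names `CODON_TABLE_12` and `translate_table12` are hypothetical, and the sketch assumes the coding region starts at the first base and ends at the first in-frame stop codon.

```python
from itertools import product

# Codons enumerated in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"

# Amino acids for NCBI translation table 12 in that codon order.
# Position 19 (codon CTG) is 'S' here; in the standard code it is 'L'.
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Hypothetical helper name: maps each 3-letter codon to its amino acid.
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate a DNA string from its first base to the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Assumes the sequence contains only A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE_12[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_table12("ATGTCTGTAT ATATCGTAGC CTCCATCCGG"))  # MSVYIVASIR
```

The printed peptide matches the first ten residues of the protein sequence in the record. Because the loop stops at the first in-frame stop codon, any bases after it in the gene sequence are not translated.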