Gene PICST_67034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_67034 
SymbolTHIA 
ID4837570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1858711 
End bp1860017 
Gene Length1307 bp 
Protein Length392 aa 
Translation table12 
GC content46% 
IMG OID640388885 
ProductAcetyl-CoA acetyltransferase IA (Peroxisomal acetoacetyl-CoA thiolase) (Thiolase IA) 
Protein accessionXP_001382583 
Protein GI126132116 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.16531 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTAT ATATCGTAGC CTCCATCCGG ACTCCCATCG GCTCGTTCCA GGGCTCGCTC 
TCGCCTTTAA GCTCGGTAGA TTTGGGTGCA AAAGCGGTGC ACGAAGCACT TAAACAAGTG
CCCTCTTTGC CTGCTTCAGC TGTCGAAGAG ATTTTCTTTG GCTCTGCTCT CCAGGCTAAC
TTGGGACAAA ACCCAGCTCG TCAGGTGGCA CTCTCCGCCG GTTTGCCAGA AGCCGTGGTC
GCTACTTCCG TTAATAAGGT ATGTGCTTCG GGGTTAAAGG CCATTATCTC TGGAGCCCAG
ACCATCCTCA CCAACACCGC CGATGTAGTT GTAGTTGGCG GCTCTGAGTC GATGTCGAAT
GTCCCATTCT ATGCTCCTAT CAGATCTGGA GTCAGATACG GAGACGCTTC GCTTGTGGAC
GGGATTCAAA ATGACGGGCT TAAGGACGTC TACTCCCAGA AGCTCATGGG CCATGCTGGA
GAAAAAGTTG CCAGCGACTT AAACATCACA AGAGCTGAGC AAGATGAATA CGCGATTGGC
AGTTACGCCA AAGCTATTAA TGCTCATGAG ACGGGAAAGT TTGAGAACGA AATCACTCCA
ATTACAATCA AAACGAGAGC AGGTACCAAG ACGGTCTCCA AAGATGAGGA TATCTCTAAG
TACAACCCAG AGAAGTTGAA GACAATGAAA TCTGCTTTTA TTGACAATGG TACTGTGACA
GCTGGTAATT CACCATCTCT TAATGATGGA GGTGCTGCTC TTATCCTTGT CTCGGAAGCT
GCACTCAACA AGTATGGACT CAAACCATTG GCCAAAATCA GAAGCTGGGG TGAGGCCGCT
AGAGCTCCTA TGGACTTCAC GATTGCACCA AGTTTGGCCA TTCCAAAAAC GTTGGAAAGA
GCTGGAGTTT CGATCAATGA TGTCGATTAC TTTGAGCTTA ATGAAGCATT TTCAGTTGTT
GGACTTGCCA ACTCCAAATT GTTGGATATC CCATTGGAAA AATTGAACGT CTATGGAGGT
GCTGTAGCTA TTGGACATCC ATTGGGTTGC TCTGGGGCCA GAATTGTGGT CACATTGTTA
AGTGTATTGA AGCAGGAGAA GACGAATTCC AAGTTGGGTG TTGCTGCTGT ATGTAACGGA
GGCGGTGGAG CTTCTTCGAT CCTCATTGAA GCCTTGTGAT TTATAGAATT TATTGTCTAC
TTAGCCACAT CCATAGAACT CATCATTGCT ATATAGATTA CTTCACAATG GTTTGGAAAA
GTTGTAGCTA GTTAACACTA AAGTTAATAC AAGTTTGAAG TTCTTTC
 
Protein sequence
MSVYIVASIR TPIGSFQGSL SPLSSVDLGA KAVHEALKQV PSLPASAVEE IFFGSALQAN 
LGQNPARQVA LSAGLPEAVV ATSVNKVCAS GLKAIISGAQ TILTNTADVV VVGGSESMSN
VPFYAPIRSG VRYGDASLVD GIQNDGLKDV YSQKLMGHAG EKVASDLNIT RAEQDEYAIG
SYAKAINAHE TGKFENEITP ITIKTRAGTK TVSKDEDISK YNPEKLKTMK SAFIDNGTVT
AGNSPSLNDG GAALILVSEA ALNKYGLKPL AKIRSWGEAA RAPMDFTIAP SLAIPKTLER
AGVSINDVDY FELNEAFSVV GLANSKLLDI PLEKLNVYGG AVAIGHPLGC SGARIVVTLL
SVLKQEKTNS KLGVAAVCNG GGGASSILIE AL