Gene PICST_31267 details

Gene Information

Locus tag: PICST_31267
Symbol:
ID: 4838812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 368804
End bp: 369868
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 12
GC content: 44%
IMG OID: 640390127
Product: predicted protein
Protein accession: XP_001384025
Protein GI: 150864987
COG category: [I] Lipid transport and metabolism
COG ID: [COG1946] Acyl-CoA thioesterase
TIGRFAM ID: [TIGR00189] acyl-CoA thioesterase II
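
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(368804, 369868)
print(length, protein_length(length))  # prints: 1065 354
```

Both values agree with the Gene Length and Protein Length fields listed above.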


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0263815
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.226493
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTACCT TAGAAGAGCT TCAGAGAAAC GTCTACGATA AGGACAACAT TTCCAAACTT 
GAAGCCAAGT TCGAGTTGAT TGAACAAACC AGCGATTCCC GAGTTTCAAT CTACAACGGT
AGATACCCAT TACAGCCTTT CAGGGACGAC CAAAGAGGAG TATACGGAGG TGAGTTCGTG
AGTCAGGGTG TCTTAGCTGC CTGGAAGACA TTGTCAGACC CCGAACTCAC TCCACATTCG
TTACATGGCT ACTTCGTCAA AGCCGGGTCG AATAACTCTG TCGTCAGATG GGAAGTTGAA
AATGTCAGTG ACGGAAGAAA CTTTGCTAAC CGTTTGCTCA GGGCATTTCA AACACATACA
GATGTTTTAG TATTCACCCT TCAAGTGTCT TTTACCAAGA ATAACGACGG TGTCAAGAGA
AGGGAGGTGT ATGAAGAACA GCTTGCTAAA GGTGTAGAAA ACATCAGGTC CATTCCATTT
CTGTTTCAGA AGGTTCCCAA CCCTCTTTTC TACCAGTTCA AGGACAATAT TGATTCTTTG
CCATCCATCG AACATACCCA TGAATTCATG ACTCATGCAT TTACTCCAGA TGCTTTCCGT
CTGCCAAAAG TTTTGAACCA TGAAACTATC GGGAGCAGGC AATTAGGGTT ATTTGCCAAG
ATCAACGAGG ACCCTTCTCT CGCTACAGAC AAGATCAAGA GCAAGTACAC TGCTGCACTC
TATTTGAGTG ATTCCTTGTT TATCACCTTG GTTATGTCAG CAGTTGGTGT GGCCATATCT
GAGGAAGAAA AGAATTTTTT CAGAGTGAGT CTTGACCACG CTGTGTATTT CCACGATCTG
AATTTCGATG CGAGGGATTG GTTGTTCATC GACTTCAAGT TTCCATCCAT GGGCAACGAC
AGAGCGTTGG TGTTGTGTAA CTTTTATACC TTGGATGGGC GTTTGGTTTT CAGTGTTAAC
CAGGAGTTCT TGTGTTTCTT CCCCAAGAAG ATCATCGACA AGTCCAACTC ATTGCATGAG
AAATATCTTG CTGCCCAGAA TAGCCAGGAG TCAGCCAAGT TGTAA
 
Protein sequence
MVTLEELQRN VYDKDNISKL EAKFELIEQT SDSRVSIYNG RYPLQPFRDD QRGVYGGEFV 
SQGVLAAWKT LSDPELTPHS LHGYFVKAGS NNSVVRWEVE NVSDGRNFAN RLLRAFQTHT
DVLVFTLQVS FTKNNDGVKR REVYEEQLAK GVENIRSIPF SFQKVPNPLF YQFKDNIDSL
PSIEHTHEFM THAFTPDAFR SPKVLNHETI GSRQLGLFAK INEDPSLATD KIKSKYTAAL
YLSDSLFITL VMSAVGVAIS EEEKNFFRVS LDHAVYFHDS NFDARDWLFI DFKFPSMGND
RALVLCNFYT LDGRLVFSVN QEFLCFFPKK IIDKSNSLHE KYLAAQNSQE SAKL
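
The record lists translation table 12 (the alternative yeast nuclear code), which differs from the standard code in reading CTG as serine rather than leucine. That is why, for example, the codons CCA TTT CTG TTT CAG in the gene sequence correspond to PFSFQ in the protein sequence. The following is a minimal sketch of such a translation. The helper name `translate_table12` is hypothetical, and the code handles only unambiguous A/C/G/T input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 12, with codons in TCAG order.
# Identical to the standard code except CTG (index 19), which is Ser, not Leu.
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence into blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_table12("ATGGTTACCT TAGAAGAGCT TCAGAGAAAC"))  # MVTLEELQRN
# A stretch containing CTG, which table 12 reads as Ser.
print(translate_table12("CCATTTCTGTTTCAG"))  # PFSFQ
```

Translating the same stretch with the standard code would give PFLFQ instead, so using the wrong table would introduce a leucine at every CTG codon.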