Gene PICST_42666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_42666 
Symbol 
ID4837007 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp356719 
End bp357906 
Gene Length1188 bp 
Protein Length372 aa 
Translation table12 
GC content43% 
IMG OID640388322 
Productpredicted protein 
Protein accessionXP_001382300 
Protein GI150863733 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATC TACACAAGGC ATTTGTCAAG AGGATAAAAG ACCAGGAGAG GTCACTAGGG 
CTCACCCTGT TGTCAGAAAA GGATGAAGAA GAGCAAGAAG AGGAAGAAGA TGAAGATGGG
GATGTTCTTT CTGGGATGAC TAAGTTTGGT TCTAGGCCCA CGATCCCTAC TGTACATCCA
CTACATCCGC TACCATCTTC TACACGAGCC AGCACAGCTG ACCAAACTAC AGTAGACAAC
TATGAAAATT TCAAACGAGT CCATTTCAAA GTGACAGAGA AATATACAAG GGCAGACGGA
ATTTTGTCCA TAAGTACATA CTACAAACCT CCATCTTCTG CGAAAGATCC AATCTGTATC
TTCCTTCATG GAGCTGGATC TACTGCTATG ACGTTTGCCT TCTTGTCACA GCTGATTGGT
GAGATATCGG AAAATACTGG GTTGTTTTTG TTTGACCTTA GGGGACATGG AAACTCGTCT
TTAACTAAGG ACTTTTCGTT AGACGCGCTT GTAGAAGACC TCCATTTTGT TCTCAATGAG
TTTATTGGAA AACACAACAT CAATTCCAAT CCAATATACT TAGTGGGACA TTCTCTTGGA
GGAGCCATAT CATCCAGCTT CCTACGTAAG TATCAAATGC AATATCCCCA AGTGAAAGGA
CTGGTGGTTT TAGATATAGT AGAAGAAACA GCCGTCAAAT CCCTCGGGGC CATGCCAAAC
TTCATTTCTA ATAGACCGAA GTCGTTTTCC AGTTTGACAA ATGCCATTAA GTGGCATATG
GGCTTCTTGT TGTATAACAA GGCTTCTGCA GAAGTAAGTG TTCCTGACTT GTTCGATCTC
GAAAGCTTGA CCTGGAAGAC AGACTTAGCT TTGACACAGC CATTCTGGTC AACCTGGTTC
GACAAGTTGT CAGAAAACTT TCTCGGATTT AGAGGTCCCA AGCTATTAAT TCTCTCAGCG
CATGAGACAC TCGATAAGAA TTTGATGATA GGGCAGATGC AGGGGAAATA TCAACTTGTA
GTATTCAGCA ACCAACAGAA ATATGGTCAC TTCATTCATG AAGACTTGCC CCGACAGGTG
GCTACATGCT TGTTGGATTT CATAAAGAGG AACGAAGATC CCGACAAGTT CATGAAGGAG
GAGCTAGGAA TTATTCCTAA ATGGGGTGGC AAGATTCACC ACACATAG
 
Protein sequence
MSDLHKAFVK RIKDQERSLG LTSLSEKDEE EQEEEEDEDG DVLSGMTNTA DQTTVDNYEN 
FKRVHFKVTE KYTRADGILS ISTYYKPPSS AKDPICIFLH GAGSTAMTFA FLSQSIGEIS
ENTGLFLFDL RGHGNSSLTK DFSLDALVED LHFVLNEFIG KHNINSNPIY LVGHSLGGAI
SSSFLRKYQM QYPQVKGSVV LDIVEETAVK SLGAMPNFIS NRPKSFSSLT NAIKWHMGFL
LYNKASAEVS VPDLFDLESL TWKTDLALTQ PFWSTWFDKL SENFLGFRGP KLLILSAHET
LDKNLMIGQM QGKYQLVVFS NQQKYGHFIH EDLPRQVATC LLDFIKRNED PDKFMKEELG
IIPKWGGKIH HT