Gene PICST_38202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_38202 
SymbolSEH1 
ID4850798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp82511 
End bp83476 
Gene Length966 bp 
Protein Length321 aa 
Translation table 
GC content44% 
IMG OID640392506 
Productepoxide hydrolase, soluble (sEH) 
Protein accessionXP_001387255 
Protein GI126273542 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.927116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.566303 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAAA GATTCGTTAT CAAACTTACC CACGGTTCCA GAAGTTTCAC CACCTTCTCG 
AACTACAGCG AACAAGATGT TTTCAAAGGT GCCGGAACCA AATGGACAAG AGTGATCTTT
CTCTTGCATG GGTTTCCTGA TGAAAATTCG TCCTATGATG AAGCCTGGCC GCATTTAGCA
CAAGGGTTTC CTAATGAAAA GGGTCTTTTG TTGCTAGCAC CATTATTGAG AGGCTACGAA
GAGCTGAGTT TGGGGCCAGA CGAATATAGT ACTCATGATG TCGCTGGAGA CGTCGGTGCC
TGGATCAAGC AGATTAACCC CAGCAACAAG GTTCCAGTTC ACATTTTGGG CCACGATTGG
GGTGCTATAA CTGCCTTCAA AACTGCTTCA AGGTTTCCAG AGTTGGTTAC TTCAATTGTG
ACTTTGGCAA TTCCTTATTT GACCAATGTG GTTCCCTGGA AGTTGGCTTG GAATGTTCCT
GAACAGTTGT ACTATTCGTC GTATATGGTG ACGATGCAGT TATCGTTCTT GTACAGATCC
AGATTCGAAC AAACAGGCAG AGATTCGTAC TTAGATTCGC TCTGGAAGTA CTGGTCTCCT
ACCTGGAAGT ATACCGAAAA AGATATTAGT AAGACCAGAG CCAGATTGAG TGATCACAGA
ATCATGGATG CTACCACAGC CTATTACAGA GCCATCTTCA ACCCGATTAA CCTTATTAAC
GGCAAGTCTA AATGGCCCGT TGACTTCAGC CAAGTTCCCA CATATTTTAT AGGTGGAGCC
CAAGACGGTT GTATGACCAG CAAGTTGTAT GAATGGGAAA GAGAGTTGTT GAAGGACGAA
CCCAATGTCA AGACCACTAT TTTGCCCAAC CTGGGCCATT TCTTACATCG AGAAGAACCC
CAAAAAGTTG CTGAGTTAGC GATTGAGTTC TTCGAAAAGT ACTCTTCCAA GGCTACCAGT
AGTTAG
 
Protein sequence
MTERFVIKLT HGSRSFTTFS NYSEQDVFKG AGTKWTRVIF LLHGFPDENS SYDEAWPHLA 
QGFPNEKGLL LLAPLLRGYE ELSLGPDEYS THDVAGDVGA WIKQINPSNK VPVHILGHDW
GAITAFKTAS RFPELVTSIV TLAIPYLTNV VPWKLAWNVP EQLYYSSYMV TMQLSFLYRS
RFEQTGRDSY LDSLWKYWSP TWKYTEKDIS KTRARLSDHR IMDATTAYYR AIFNPINLIN
GKSKWPVDFS QVPTYFIGGA QDGCMTSKLY EWERELLKDE PNVKTTILPN LGHFLHREEP
QKVAELAIEF FEKYSSKATS S