Gene PICST_38391 details

Gene Information

Locus tag: PICST_38391
Symbol: IFD4
ID: 4851277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1391164
End bp: 1392222
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table:
GC content: 41%
IMG OID: 640392985
Product: aryl-alcohol dehydrogenase (AAD4)
Protein accession: XP_001387911
Protein GI: 126274263
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
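
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(1391164, 1392222)
    print(length)                  # 1059
    print(protein_length(length))  # 352
```

Under these assumptions the values agree with the Gene Length and Protein Length fields listed in the record.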


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.142072
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.861944
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTCTT CAATTGAATA CAAAAAGCTT GGTGCCTCTG GTTTGGCTAT TTCTCCTATC 
ATTGTGGGAT GCATGTCCTA CGGTAAGAAA GTTTGGGCCG ACTGGGTAAT GGAAGATGAA
GAACAGATCT TCAAAATCTT GAAAAAGTGC TACGACTCTG GTATTAGAAC TTTTGATACT
GCTGACTTGT ACTCCAATGG TCAATCGGAA GTTATCTTGG GTAAGTTTTT GAAGAAGTAC
AATATTCAAA GAGAGAAAGT GGTAATTTTA ACGAAGTGCT TCTGTCTAAT TGACACAAGT
ATCCCTGATT TAAACCCTGT AACACAATAC GATTATCCAT CCTATGAGTT TGTCCATAAC
CAAGGTTTGT CGAGAAAGCA TATTTTCGAT GCCGTCAAAG GTTCAGTTGA AAGATTGGGA
ACCTACATTG ATGTCTTGCA AATTCACAGA TTGGATAAGT CGACTCCAAA GGCTGAAATC
ATGAAAGCTT TGAACGACGT AGTTTCTAAT GGTGATGTCA GGTATATCGG TGCTTCTTCT
ATGAGAGCCG CTGATTTCGT TGAATTGCAA TTCATTGCTG ATAAGAATGG CTGGACTAAG
TTCATCAGTA TGCAAAACTT CTACAACTTA ATCTACCGTG AGGAAGAGAG AGAAATGATT
CCTTTCTGTA ACGATAACTC CCTTGGTAAG GTTGGCTTGA TTCCATGGTC TCCAATTGCC
AGAGGTCTTT TGGCTAGACC TCTTGGTGTA GAATCTGACC ATAACAGATC TGTCGACACT
GACTTGGCAA TAGAGTTCTT TGGTTTGGCA AACTTGACTG AGGCCGACAA GGAAATTATC
AAGAGAGTTG AAGAAGTTGC CAAAAAGCAT GAGGTTAGTA TGGCTGTAAT CTCCTCTGCT
TGGGTTTTGA GCAAGGGTGC CTTCCCTATC ATCGGTCTCA ACTCTGAAGC AAGAGTTGAC
GATGCAATCA AGTCTCTCGC TGTTAAGCTA ACTGATGAAG AAGTCGCATA CTTGGAAGAA
CCTTACAAAC CTAAGCCAGT ATACGGTTTG CTTGATTAG
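
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any per-base statistic such as GC content is computed. The sketch below is one way to do that, assuming the sequence contains only whitespace and the letters A, C, G, T; the helper names are hypothetical.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Paste the full block-formatted gene sequence here; only the first
    # two blocks of the record are used as a short illustration.
    raw = "ATGTCTTCTT CAATTGAATA"
    seq = clean_sequence(raw)
    print(len(seq), round(gc_percent(seq), 1))
```

Applied to the complete sequence, the rounded result can be compared with the GC content field of the record.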
 
Protein sequence
MSSSIEYKKL GASGLAISPI IVGCMSYGKK VWADWVMEDE EQIFKILKKC YDSGIRTFDT 
ADLYSNGQSE VILGKFLKKY NIQREKVVIL TKCFCLIDTS IPDLNPVTQY DYPSYEFVHN
QGLSRKHIFD AVKGSVERLG TYIDVLQIHR LDKSTPKAEI MKALNDVVSN GDVRYIGASS
MRAADFVELQ FIADKNGWTK FISMQNFYNL IYREEEREMI PFCNDNSLGK VGLIPWSPIA
RGLLARPLGV ESDHNRSVDT DLAIEFFGLA NLTEADKEII KRVEEVAKKH EVSMAVISSA
WVLSKGAFPI IGLNSEARVD DAIKSLAVKL TDEEVAYLEE PYKPKPVYGL LD