Gene PICST_47116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_47116 
SymbolAAD5 
ID4839667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1268123 
End bp1269184 
Gene Length1062 bp 
Protein Length353 aa 
Translation table12 
GC content42% 
IMG OID640390982 
Productaryl-alcohol dehydrogenases 
Protein accessionXP_001384930 
Protein GI126136813 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCCT CTCCTGTTCA AAACACAAAA CTCGGTGCTT CTGGATTGTC CATTTCTCCG 
CTTATTGTTG GTTGCATGAC ATTTGGTAAA AAGAAATGGG CCGACTGGGT TATTGAAGAT
GAAGAAAAGG TCTTCAGCAT TTTGAAGAAG TGCTATGATT CTGGTCTCAG AACATTTGAT
ACTGCTGATG TCTACTCCAA CGGTCATTCT GAGATCCTCT TGGGGAAGTT CTTGAAGAAG
TACAACATCC CCAGAGAAAA GGTTGTTATC ATGACCAAGG TGTTTGGTAC CATTGATACC
AGCTATGAAG ACTTTACTTT CTTCACTGAA ATGGAGAAGC CAGCTTTCGA ATTTGCCAAC
AACAAGGGAT TGTCTAGAAA ACACGTTTTG GATGCTGTCA AGGGCTCAGT TGAGAGATTA
GGAACATTCA TCGATGTTTT GCAAATCCAC AGATTGGACA AGGAAACCCC CAAGGCTGAA
ATTATGAAGT CTTTGAATGA CGTTGTTGTT TCTGGAGATG TCAGATATAT TGGTGCATCT
TCCATGAAAG CCAGTGAGTT CTGTGAGTTA CAGTACATTG CTGACAAAAA TGGATGGACC
AAATTCATTA GTATGCAAAA CTTCTACAAC TTGCTTTACC GTGAAGAGGA GCGTGAAATG
ATTCCATTCT GTAAAAACAA CGATTTGGCT GAAGTTGGAA TAATCCCATG GTCCCCTATT
GCTACTGGAA TTTTGGCCAG ACCTCTTGGT GCCAAATCTG CAAAGAGTAC TAGAGCCGAT
ACCGATTGGG CCAAACAATT CACTGGTTTG GACAAGTTAA CTGAAGCTGA CGAGACCATT
GTTAACAGAG TAGAAGAAAT CGCCAAAAAG CATGATACCA GCATGGCATC TGTTGCTTCT
GCCTGGGTTT TGAGTAAGGG AGCTCATCCT ATTCTTGGAA TCAACTCTGT TGAGAGAGTT
GATGATGCCT TGAAGTCGCT CACCTTCAAG TTAACTGCTG AAGAAACTGC CTACTTGGAA
GAACCATACA AGCCAAAGAA AGTCTACGGT TTGTTTGATT AG
 
Protein sequence
MSASPVQNTK LGASGLSISP LIVGCMTFGK KKWADWVIED EEKVFSILKK CYDSGLRTFD 
TADVYSNGHS EILLGKFLKK YNIPREKVVI MTKVFGTIDT SYEDFTFFTE MEKPAFEFAN
NKGLSRKHVL DAVKGSVERL GTFIDVLQIH RLDKETPKAE IMKSLNDVVV SGDVRYIGAS
SMKASEFCEL QYIADKNGWT KFISMQNFYN LLYREEEREM IPFCKNNDLA EVGIIPWSPI
ATGILARPLG AKSAKSTRAD TDWAKQFTGL DKLTEADETI VNRVEEIAKK HDTSMASVAS
AWVLSKGAHP ILGINSVERV DDALKSLTFK LTAEETAYLE EPYKPKKVYG LFD