Gene PICST_31957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31957 
SymbolOYE2.9 
ID4839512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp313251 
End bp314468 
Gene Length1218 bp 
Protein Length405 aa 
Translation table12 
GC content47% 
IMG OID640390827 
ProductNADPH dehydrogenase 
Protein accessionXP_001385076 
Protein GI150865740 
COG category[C] Energy production and conversion 
COG ID[COG1902] NADH:flavin oxidoreductases, Old Yellow Enzyme family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.113274 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAGT CTGGTGTCCC CTTGAAGAAC ACTAATGTGT TTAAGCCCGT TGCAGTGGGT 
CAAAACTCAG TTTCCAACCG TATCGTGTAC GTTCCTACTA CTAGAATGAG AGCCACAGCA
GACTTCGTTC CTTCAGACTT GGAGTTGAAG TACTATGAAG ACAGAGCTCA GTACCCAGGC
ACATTGTTGA TTACAGAGGC TACGTATGTT TCCGAGAGAG CAGGTTTGTA TGACCGAGTT
CCTGGGATCT GGAACGAGAA GCAAACCAAG GCATGGAAGC AGATCACTGA TGCCATTCAC
AAAAAGGGTT CCTTTGTCAG TTGTCAGTTG TGGTTTTTGG GCCGTGTAGG CGATCCAGCT
TTGTTGAAGA AGTATGGCCA CGATTTGGTG GGTGCTTCGG CCGTATACCC TTCAGGCGGC
TACCAGAAGA AAGCTGAAAA GGTAGGTAAC CCACTCAGAG CTTTGACTAG AGCTGAGTTG
AAGGATATCA TAGTCAATGA CTATATCAAT GCTGCTAAGA ATGCCTTTGC TGCTGGCTTC
GACTACATTG AACTCCACGG TGCTCACGGT TACTTTTTAG ACACTTTCTT GCATCCCCTG
TCCAACCAGA GAACAGACGA CTACGGTGGT TCCATCGAGA AGAGAGCCAG GTTCGTTTTG
GAGGTAATTG ACGAGTTGAT TAAGGCTGTA GGTGCCAACA GAGTGGCCTT GAGAATTTCT
CCCTGGGCCA TGGTTCAAGG TGTGGGAGCT CAGTACGAAG AAGTTCATCC TATCACTACT
TTCAGCTACT TGTTGAACGA GTTGCAGAAG AGAGCTAACC AGGGTAACGA GTTAGCATAC
ATATCTGTAG TCGAGCCCCG TGTTCAAGGT ACCGTCACAG TTGATGAATG GGTTGGTGAC
AATAGTTTCG TTCGTCACAT CTGGAAGGGA GTCATTGTCA AAGCTGGTAA TTACACCTAT
GATGCTCCCT CCTTCAAGAC TATTCAGGAG GAAACTAATG ACGATAAAGT ACTTGTGGGA
TTCTCGAGAT ACTTCACCTC TAACCCCGAT TTGGTCTCGA GATTGGCTGA AGGCAAGCCT
TTGACTAAGT ACGAAAGACC TACTTTCTAC ACTCCTGATA ACTGGGGCTA CAACACCTGG
AGCAACCATG ACGATAAAAG TAAATATGAC AAGAATGACG AGAAGAAGGT CTTTCCTAGA
GCTCTTGCCA GACTTTAG
 
Protein sequence
MSQSGVPLKN TNVFKPVAVG QNSVSNRIVY VPTTRMRATA DFVPSDLELK YYEDRAQYPG 
TLLITEATYV SERAGLYDRV PGIWNEKQTK AWKQITDAIH KKGSFVSCQL WFLGRVGDPA
LLKKYGHDLV GASAVYPSGG YQKKAEKVGN PLRALTRAEL KDIIVNDYIN AAKNAFAAGF
DYIELHGAHG YFLDTFLHPS SNQRTDDYGG SIEKRARFVL EVIDELIKAV GANRVALRIS
PWAMVQGVGA QYEEVHPITT FSYLLNELQK RANQGNELAY ISVVEPRVQG TVTVDEWVGD
NSFVRHIWKG VIVKAGNYTY DAPSFKTIQE ETNDDKVLVG FSRYFTSNPD LVSRLAEGKP
LTKYERPTFY TPDNWGYNTW SNHDDKSKYD KNDEKKVFPR ALARL