Gene PICST_90594 details


Gene Information

Locus tag: PICST_90594
Symbol: MSC7
ID: 4840166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 1577837
End bp: 1579955
Gene Length: 2119 bp
Protein Length: 616 aa
Translation table: 12
GC content: 42%
IMG OID: 640391481
Product: Meiotic Sister-Chromatid recombination aldehyde dehydrogenase
Protein accession: XP_001386003
Protein GI: 150866411
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
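
The length fields in this record can be cross-checked against its coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the stop codon; both are assumptions that happen to agree with the values listed, not statements made by the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length_bp(protein_aa: int) -> int:
    """Bases needed to encode a protein of this length plus one stop codon."""
    return (protein_aa + 1) * 3


# Values taken from the record above.
span = gene_length(1577837, 1579955)   # 2119, matching "Gene Length"
coding = coding_length_bp(616)         # 1851
non_coding = span - coding             # bases of the listed span outside the coding region
print(span, coding, non_coding)
```

Under these assumptions the listed span is longer than the coding region alone, which suggests that the gene sequence shown on this page includes flanking bases on either side of the open reading frame.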


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.308604
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TAAGTTGTCG TCTACCAATT GAGTAAATAG GAATAGGCGA AACTTCAATT CTTTCTGATA 
GCTGCTTATA TCTTCTAAAC CATTTTGTTA TATTTCTGTA GATCTTCACG AAGTTACTAC
TACAACCTAA AACCTCGATC GATAGCAACC TCAAACATGA TCTTGTTAGA CTTCAAGTTC
CACGAATGGC AATGGCAGTA CCAGATTTCG ACAACTTTCT TTGTGTTTGG AATTGTTCCC
TTTGTCTTCT GGGTTTATGC CCGCTACATC ACAGCTTCAC CTAACAAGTA CAACAAACTT
GAAGAACCAG TCAAGCTTTC GGTTCCTATT CCTGCCGAGG CCAAACCCCA CTGGAAGGGC
AAGAGATTGT ACCCTCCAAA CTTGACTATC AGAGCAGCTA ATGAGCCAAC AAAGATTCAG
AGTTACTGTC CTGCAACTGG CCAGTACTTG GGAACTTTCA CAGCTACAAC CAGAGACGAA
ATGAACCAAC AAATAGCCAA CGCTAAAGTG GCGCAGAAGG AGTGGAAAGC CTCCAGCTTC
TCGTTGAGAA GACAAGTGTT GAAAACATTA AGCAGATTCA TTCTCGACAA CCAAGAAGAC
ATCGCAAGAA TTGCATGTCG AGACAGTGGA AAAACGAAAC TCGACGCTCT GATGGGTGAA
ATTATGGTAA CCTTAGAAAA GCTCAAATGG ATCATTGCCC ATGGTGAAAG AGTTTTGAGG
CCTTCGCAAC GTCCAGGACC TTCAAATTTA TTGATCGGAA TGATGAAAAA TGGAGAAGTC
AGATACGAAC CATTAGGGGT TGTAGCTGCT CTTGTTTCAT GGAATTATCC CTTTCACAAT
CTCATGGGTC CGATCATCGC GGCCTTGTTT ACTGGAAATG CAATCATAGT TAAATGTTCT
GAGCAGGTGA TTTGGTCTTC GACATGGTAC ATTGATTTGG TTAGACTTGT GTTGAAGCTG
CTTGAGATCG ACCCCAATTT GGTACAATTG TGCTGTTGCT ATGCTGAGGA TGCTGACCAT
TTTACCTCTC ATCCGGGCTT GTCGCATATC ACCTTCATTG GCTCTAAACC TGTGGCCCAT
AAAGTTGTAG AAAGTGCTTC AAAGGAGCTT ACCCCAGTAG TTGTGGAGCT TGGTGGAAAA
GATTCGCTTA TTGTTTTGGA TGATGTCAAG GATATCGAGT CATTGTCATC TGTGATATTG
AGAGGAACTT TCCAGAGCGC AGGTCAGAAC TGTATTGGTG TCGAAAGAGT AATTTGTCTT
CCAAAGTCGT ACGAGAAATT GGTTGAGATT TTCACCGAGA GAATCAAGGA GTTCCGCTTG
GGCTCCGATA TCGACCAGCT AGACGAAATC GACATGGGTG CAATGATTTC AGACAATAGA
TTTAAACAAT TAGAAGCATT GGTGGAAGAT GCTGTCAGTA AAGGAGCGAG ACTAATACAT
GGTGGGAAAC CATACCAGCA TCCGAACTAT CCTCAGGGCC ACTACTTCGA ACCTACGTTG
ATTGTGGACG TAGATCCCAG CATGAGAATC TTCCAAGAAG AAGTGTTTGG ACCAGTTCTC
ACCATGATCA AAGCCAATGA CGTAGACGAT GCCGTCAACT TGGCTAACGG AACCGAATAT
GGATTGGGTA ACTCTGTCTT TGGCAGCAAC TTCAGGCAAA TCAACGAGAT TGCTAACAGA
CTTGATAGTG GCAATGTTGC CATAAATGAC TTTGCCACTT TCTATGTAGC ACAGCTTCCA
TTTGGAGGAA TCAAGAAGTC CGGCTATGGT AAGTTTGGAG GAGAAGAAGG TCTCTTGGGC
TTATGTGTAG CTAAGTCTGT TGTAATGGAT AAGCCAATCA TGAGACTATT TGGAGTAGCA
ACAAGCATTC CACCTCCAAT TGATTATCCT ATTAAGGATG ACAAGAGGGC ATGGAAATTT
GTCCTGAGCT TGAACACTGC TGGTTACGAT ACCAGAGTGT GGAACATCAT CAAAGCATTC
AAAAAACTCG CAAAGGGGGG AGCATGATAA AGTAAGGAAT ACTATATTTT GATTCATAAA
TACTTATACA CCACAAAAAC TTCATAATAG ATAGATACAT GGTAAATTGT TATAGATACC
TATAAAAAAG ATCGTACAG
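
The gene sequence above is laid out in blocks of ten bases, so it has to be stripped of whitespace before any calculation. The following is a minimal sketch of how a GC percentage like the one in the Gene Information section could be recomputed; the helper names are hypothetical, and the exact method used by the database (for example, how ambiguous bases are handled) is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Illustration on the first two blocks of the gene sequence only.
fragment = clean_sequence("TAAGTTGTCG TCTACCAATT")
print(len(fragment), gc_percent(fragment))
```

Pasting the full sequence into `clean_sequence` and passing the result to `gc_percent` gives the value for the whole gene; the length of the cleaned string should equal the reported gene length.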
 
Protein sequence
MILLDFKFHE WQWQYQISTT FFVFGIVPFV FWVYARYITA SPNKYNKLEE PVKLSVPIPA 
EAKPHWKGKR LYPPNLTIRA ANEPTKIQSY CPATGQYLGT FTATTRDEMN QQIANAKVAQ
KEWKASSFSL RRQVLKTLSR FILDNQEDIA RIACRDSGKT KLDASMGEIM VTLEKLKWII
AHGERVLRPS QRPGPSNLLI GMMKNGEVRY EPLGVVAALV SWNYPFHNLM GPIIAALFTG
NAIIVKCSEQ VIWSSTWYID LVRLVLKSLE IDPNLVQLCC CYAEDADHFT SHPGLSHITF
IGSKPVAHKV VESASKELTP VVVELGGKDS LIVLDDVKDI ESLSSVILRG TFQSAGQNCI
GVERVICLPK SYEKLVEIFT ERIKEFRLGS DIDQLDEIDM GAMISDNRFK QLEALVEDAV
SKGARLIHGG KPYQHPNYPQ GHYFEPTLIV DVDPSMRIFQ EEVFGPVLTM IKANDVDDAV
NLANGTEYGL GNSVFGSNFR QINEIANRLD SGNVAINDFA TFYVAQLPFG GIKKSGYGKF
GGEEGLLGLC VAKSVVMDKP IMRLFGVATS IPPPIDYPIK DDKRAWKFVS SLNTAGYDTR
VWNIIKAFKK LAKGGA
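
This record uses translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The sketch below shows the effect on two short fragments copied from the gene sequence above. It builds the standard codon table by hand and overrides the single codon; start-codon differences between the tables are ignored, and the function and variable names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order (first base, then second, then third).
STANDARD_AA = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_table(overrides=None):
    """Return a codon -> amino acid dict, optionally with reassigned codons."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    table.update(overrides or {})
    return table


STANDARD = build_table()
ALT_YEAST = build_table({"CTG": "S"})  # table 12: CTG encodes Ser


def translate(dna: str, table: dict) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the open reading frame, as it appears in the gene sequence.
print(translate("ATGATCTTGT TAGACTTCAA GTTC", ALT_YEAST))   # MILLDFKF
# An internal fragment containing a CTG codon.
print(translate("GACGCTCTGATGGGT", ALT_YEAST))              # DASMG
print(translate("GACGCTCTGATGGGT", STANDARD))               # DALMG
```

The second fragment corresponds to the DASMG stretch of the protein sequence, which is reproduced only when CTG is read as serine.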