Gene PICST_60847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_60847 
SymbolALD3 
ID4839190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp26625 
End bp28058 
Gene Length1434 bp 
Protein Length477 aa 
Translation table12 
GC content42% 
IMG OID640390505 
Productmitochondrial aldehyde dehydrogenase 
Protein accessionXP_001384658 
Protein GI126136269 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.858717 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCTA TTAGTAGAAC ATTCCCCAGT ATTATCGAGG GAAAGGATTT CCACAGCAAT 
GAAAAACATC CAGTATACTC CCATGTCACC CAGAAAGAAG CAATCCACTA TTTCTCCTAC
TTAACTGATA TCAAGAAGGC AGTTTCGAAA ATCGCTGCTG ATGCTGACGA AGGGTTTGAA
GAATGGTCGT CCATGGCCTA TCAAGAAAGA GTGAAGATTT TCGAGAAGGC TGCTGCTTTA
CTTGCTGAAA GAAGAGAAGA GTTGATAGCT TCTCACAAAA ATATCGGAGG TCCTACCTGG
TTTTCCCATG TGAATGCTGA TGAAATCATC TCGCAATTGA AGGAATATAC TTCACTTCTC
TCTAGACCTA CTGGTTTAGT AGCCCAATCT GCTCATTCTG ATCTCGCACT CGTCGTCAAG
CAGCCACTAG GTCCTGTCCT CGCCATTGCT CCCTGGAACG CTCCTGTTCT TTTGGCAGGT
AGAGCCATAG TGGCTCCGTT GGCTGCGGGC TGTTCGGTCA TCCTAAAAGC TTCTGAAAAG
GCTCCAGAAT CCGCATACCT TGTTGTGAAG ACCTTCATTG ATGCTGGTAT CCCGTCAAAA
GCATTGCAAT TGGTCTTCAT CAAACCAGAT GACAATCCAG AATTCATCAA CTCCATCTTG
GACACTGGTT TGATCAAAAA GGTCAACTTC ACTGGCTCTA CAATCGTGGG CAAGAAGATC
GCTGAAGCTG CCAGTAAACA CTTAGTACCA TATCTTATGG AATTAGGCGG AAAGAATGTG
TCTATTGTTG AAAAGGATGC TGACTTGGTG AGGGCCGTTG AAACTATAAT CTGGAGTTCG
TGGTCGCACA AAGGTCAAAT ATGCATGAGT ACTGACAAGG TCTTCGTTGA TGAAAGCATC
TACGACAAAT TCGTTGCTCA ATTAAAAGTA TCCGCCAATG AGATCGTCAA GGACCCCGAC
TACGCAATTT CTCAAAGAGA TATTACATTC AAGAGAAACC TTGTTAAGTT GGTTAAGAAT
GCATTAGATT TGGGTGCAAA TTTGATATTT GGTAAATTAA ATGACCATTT GGACAGCGGT
TCCTTCAGTC CATTGATCTT GGAAAATGTC ACTTCAAACA TGTTGCTTGA TTCTACCGAA
TCATTCGGAC CTTTGTTCGC TGTATATAAG TATTCAGATA CAATCAAACT TGTCAAGGAA
TTAAACAGAG CTGATTATGG ATTGAAGGCC TCAATTTGGT CCCAAAATGT TTTGCAAGCA
TTGGAAACAG CTAAAAAAAT CCACGTAGGT GGTGTACATA TCAATAGTTC TACGATTCAC
GACGAAGCGA CTCTACCACA TGGCGGTGTT AAGTCAAGTG GTGCTGGAAG ATTCAACTCC
ATATGGGGTA TTGACGATTT TTCCATTACC AAGACAATTA CTCTTAGTCA GTAA
 
Protein sequence
MSAISRTFPS IIEGKDFHSN EKHPVYSHVT QKEAIHYFSY LTDIKKAVSK IAADADEGFE 
EWSSMAYQER VKIFEKAAAL LAERREELIA SHKNIGGPTW FSHVNADEII SQLKEYTSLL
SRPTGLVAQS AHSDLALVVK QPLGPVLAIA PWNAPVLLAG RAIVAPLAAG CSVILKASEK
APESAYLVVK TFIDAGIPSK ALQLVFIKPD DNPEFINSIL DTGLIKKVNF TGSTIVGKKI
AEAASKHLVP YLMELGGKNV SIVEKDADLV RAVETIIWSS WSHKGQICMS TDKVFVDESI
YDKFVAQLKV SANEIVKDPD YAISQRDITF KRNLVKLVKN ALDLGANLIF GKLNDHLDSG
SFSPLILENV TSNMLLDSTE SFGPLFAVYK YSDTIKLVKE LNRADYGLKA SIWSQNVLQA
LETAKKIHVG GVHINSSTIH DEATLPHGGV KSSGAGRFNS IWGIDDFSIT KTITLSQ