Gene PICST_63844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_63844 
SymbolALD2 
ID4841070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp393627 
End bp395132 
Gene Length1506 bp 
Protein Length501 aa 
Translation table12 
GC content48% 
IMG OID640392385 
Productmitochondrial aldehyde dehydrogenase 
Protein accessionXP_001386665 
Protein GI150866913 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.704992 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCTCC CATTAGAATA CCTGGTTACC CTTCCAGACG GCAACACATT CACACAGCCT 
ACTGGTTTGT TCATCAACAA CGAGTTTGTC AAGTCTGTTT CCGGAAAGAC GTTGGACTCC
ATCAACCCAT CTACCAGTGA AGTCAATGGT ACCGTTTACT GTGCCGAGGA AGAAGATGTC
GATATCGCCG TCAAGGCTGC CAGAGCCGCA TTTAAGGACT GGAAGAAGGT CACCGGTGTG
GACAGAGGAA TCTTGTTGAA CAAGGTCGCT GATGCCTTTG AAGCCCAAAG AGACTTGATT
GGTGCCATTG AAGCCTGGGA CTCTGGTAAG ACCAAGGAAC AAAACGCCGT ATACGATATT
GACGAATGTA TCAGTTGTTT CAGATACTTT GCTGGCTGGG CTGACAAAAT CCAGGGTAAG
GTGATCCAGA ACGATCCAAA GAAGTTGGCC TACACCATCC ATGAACCACT TGGTGTTTGT
GGTCAGATCA TCCCATGGAA CTACCCATTG GCTATGGCAG CCTGGAAGTT GGCTCCAGCT
TTGGCTGCTG GTAACGTAGT TGTGTTGAAA ACTTCAGAAA TTACACCATT GTCTCTCTTG
TACGTAGCTC GCCTCTTCAA GGACGCTGGT TTCCCAGCCG GTGTGGTTAA CATCATCTCC
GGCTTCGGTG CTGTTGCCGG TAAGGCTCTT TCATCCCATT TAGATGTCGA CAAGATCGCT
TTCACTGGTT CTACTGCTAC TGGTAAGCTT ATACAACAGG CTGCTGCTTC TAACTTGAAG
GCTGTGACCT TAGAGTGTGG TGGTAAGTCG CCTTTGATCA TCCGTGAAGA CGCTGACTTG
GAACAAGCTG TCAAGTGGGC TGCCATCGGT ATCATGAGTA ACCAAGGTCA GATCTGTACG
TCCACTTCCA GAGTGTACGT TCACGAATCT GTCTACGACA AGTTCTTGGA GGAATACACT
GCACACGTGA AAGAAGCCTA CAAACAGGGT AGTATGTTCG ATTCTGAAGC AGTCGTTGGT
CCACAAGTTT CGAAGGTTCA GCGTGATAAA GTGTTGAGCT ACATCGAAAT CGGTAAGAAA
GAAGGTGCCC GCTTGCTTTT GGGAGGTGAA AAGAACTCGG AGGGTGAATT ATCCAAAGGA
TTCTACATCA AGCCAACTAT TTTTGCTGAC ATCAAGCCTG AGATGAGAAT CGTCAACGAG
GAAATATTTG GCCCTGTTGT AGTGGTGGGT AAGTTCTCGT CAGACGAAGA AGTCATCACT
TACGCCAACC AGACCCAATA CGGTTTGGGT GCTGCCATCT TCACCAAGGA CATCACCGTG
GCCCACACTA TGGCTGCTGA AATACAAGCT GGTATGGTGT GGATCAACTC TTCCAACGAC
TCTGATGTCC ACATTCCATT CGGTGGTGTC AAGATGTCTG GTGTAGGTAG AGAATTGGGT
GAATACGGAT TGTCCATCTA CACCCAGGCA AAGGCCATTC ACGTTAACTT GGGCAACAAG
TTGTAG
 
Protein sequence
MSLPLEYSVT LPDGNTFTQP TGLFINNEFV KSVSGKTLDS INPSTSEVNG TVYCAEEEDV 
DIAVKAARAA FKDWKKVTGV DRGILLNKVA DAFEAQRDLI GAIEAWDSGK TKEQNAVYDI
DECISCFRYF AGWADKIQGK VIQNDPKKLA YTIHEPLGVC GQIIPWNYPL AMAAWKLAPA
LAAGNVVVLK TSEITPLSLL YVARLFKDAG FPAGVVNIIS GFGAVAGKAL SSHLDVDKIA
FTGSTATGKL IQQAAASNLK AVTLECGGKS PLIIREDADL EQAVKWAAIG IMSNQGQICT
STSRVYVHES VYDKFLEEYT AHVKEAYKQG SMFDSEAVVG PQVSKVQRDK VLSYIEIGKK
EGARLLLGGE KNSEGELSKG FYIKPTIFAD IKPEMRIVNE EIFGPVVVVG KFSSDEEVIT
YANQTQYGLG AAIFTKDITV AHTMAAEIQA GMVWINSSND SDVHIPFGGV KMSGVGRELG
EYGLSIYTQA KAIHVNLGNK L