Gene PICST_80168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_80168 
SymbolALD6 
ID4850920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp446792 
End bp448676 
Gene Length1885 bp 
Protein Length543 aa 
Translation table 
GC content46% 
IMG OID640392628 
Productmitochondrial aldehyde dehydrogenase 
Protein accessionXP_001387323 
Protein GI126273881 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AAAACCAGTA GTTTGAGTCA GAGCTACCCA GAACTGCTAG AGAAGTCGTC AGAAACACCC 
TGTCGGTACA ATTTCAAAAA CGCGATTTTC CCTATATCAT AAACTCGAGG TCTGAGATTC
TGATTGCACT CTATTCCTGT GATTTGAAAT TTCACTTCCA ATTTTTCACT TTACACATTT
GTTCTTCACA ATACAATGTT AACCAGAGCC CTTCGTAGGT CAGCCTTGAC AGCTGCTGCT
GCCAAACGCT TCAAATCCGT GTTAACGTTG TCATCCACCG TTTCTTCATA CCCCAAGAGC
CATACCGAGG CAGGTGCTGA GCCCTACTTG ACACCATCTT TCGTGAACAA CAAGCTCATT
AAGTCCGACT CGACCGAATG GTTCGATATT CACGACCCTG CAACCAACAG CGTCGTACTG
AAGGTACCCC AACTGACCCC TGAGGAATTG GAAGAAGCCA TTGCTGCTGC CGAAGCTGCC
TTCCCAATCT GGAAAGAGTA CTCCATCATC AAAAGACAGG GTATCGCCTT CAAGTTTGTT
CAATTATTGA GAGAAAATAT GGATAGAATC GCTTCTGTCA TTGTCTTGGA ACAAGGAAAG
ACATTTGTTG ATGCCCAGGG TGACGTCTTG AGAGGCTTAC AAGTCGCTGA AGCTGCCTGT
AACATCACCA ATGACTTGAA AGGGGAAACT TTGGAAGTAG CTACTGATAT GGAAACCAAA
ATGATCCGGG AACCTCTTGG AGTTATAGGA TCCATCTGTC CTTTCAACTT CCCAGCCATG
GTTCCCTTGT GGTCACTTCC ATTGGTCTTA GTTACTGGAA ACACTGCTGT AGTTAAGCCT
TCGGAGCGTG TTCCTGGTGC TGCAATGATC ATTGCCGAGT TGGTGGCTGA AGCTGGTGTT
CCTCCTGGTG TTCTTAATAT TGTCCATGGT AAGCACGCCA CAGTGAACAA GTTGATTGAA
GACCCAAGAA TCAAGGCTTT GACATTTGTA GGCGGTGACA AGGCCGGCAA GTACATCTAC
GAAAAGGGAA CGTCCTTGGG AAAGAGAGTA CAGGCCAACT TGGGTGCTAA GAACCACTTG
GTGGTACTTC CTGATGCCAA CAAAGAGCAG TTCGTCAATG CTGTCAACGG TGCTGCTTTT
GGTGCTGCAG GTCAGAGATG CATGGCTATT TCCGTATTGG TAACCGTAGG AAAGACCAAG
GAATGGTTAG CTGACGTAGC CAGAGATGCC AAGTTGTTGA ATGTAGGCAG TGGATTCGAC
CCCAAGAGCG ATTTGGGTCC TTTGATCAAC CCTGGTAGTT TGGAAAAGGC TCACGATATT
CTTGACGAAT CTGTAAAGCA AGGGGCCAAA ATTCTTTTGG ATGGCAGAGG CTACAAGCCA
GCCGATCCAA AGTTTGCTAA GGGTAACTTC TTGGCTCCTA CCATCATCAC TAACGTGGGT
CCTGGTATCA GAGCCTACGA TGAAGAAATC TTCGCTCCTG TATTGGCTGT AGTCAACGTA
GAAACCATCG ACGAAGCTAT TGAGTTGATC AACAAAAACA AGTACGGAAA CGGTGTTTCC
ATCTTCACAT CGTCCGGTTC ATCTGCTCAA TACTTCACCA AAAGAATTGA CGTCGGACAG
GTTGGTGTCA ATGTGCCCAT TCCAGTGCCT TTGCCCATGT TCTCATTCAC AGGCTCTAGA
GGCTCGTTCT TGGGAGACTT GAACTTCTAC GGTAAGGCCG GTGTCACTTT CTTGACCAAG
CCAAAGACCA TTACCAGCTC GTGGAAGTCC AACACCATCG ACAACCAGAT CTTGAAGCCT
TCTACTTCCA TGCCAGTGCA GCAGTAATCG TTATTTAATC AGGTTCATAA AATCTTTTGT
AGAGCTAATA CAAAGTTTCA GGTTC
 
Protein sequence
MLTRALRRSA LTAAAAKRFK SVLTLSSTVS SYPKSHTEAG AEPYLTPSFV NNKLIKSDST 
EWFDIHDPAT NSVVLKVPQL TPEELEEAIA AAEAAFPIWK EYSIIKRQGI AFKFVQLLRE
NMDRIASVIV LEQGKTFVDA QGDVLRGLQV AEAACNITND LKGETLEVAT DMETKMIREP
LGVIGSICPF NFPAMVPLWS LPLVLVTGNT AVVKPSERVP GAAMIIAELV AEAGVPPGVL
NIVHGKHATV NKLIEDPRIK ALTFVGGDKA GKYIYEKGTS LGKRVQANLG AKNHLVVLPD
ANKEQFVNAV NGAAFGAAGQ RCMAISVLVT VGKTKEWLAD VARDAKLLNV GSGFDPKSDL
GPLINPGSLE KAHDILDESV KQGAKILLDG RGYKPADPKF AKGNFLAPTI ITNVGPGIRA
YDEEIFAPVL AVVNVETIDE AIELINKNKY GNGVSIFTSS GSSAQYFTKR IDVGQVGVNV
PIPVPLPMFS FTGSRGSFLG DLNFYGKAGV TFLTKPKTIT SSWKSNTIDN QILKPSTSMP
VQQ