Gene PICST_74130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_74130 
SymbolGLY1 
ID4841071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp396387 
End bp397591 
Gene Length1205 bp 
Protein Length369 aa 
Translation table12 
GC content43% 
IMG OID640392386 
Productthreonine aldolase 
Protein accessionXP_001386466 
Protein GI126139888 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TAACATACCC TAATGGACTT CTCAACTTAT ACCGCCCAGA GTCCGGCCCA TAACGAATTC 
CGTAGTGACA CTTTCACCAC GCCAACGGCT TCCATGATCC AGGCATTGGC CAATGCAACC
TTGGGAGACG CCGTCTATAA CGAAGACGAA TCTACCATTG CCTTGGAGAA AAAGGTAGCC
GATTTGGCTG GAAAAGAAGC CGGCTTGTAT TGTGTTAGTG GAACTCTCTC CAACCAAATT
GCCCTTAGAA CAAACCTCAT CCAGCCTCCA TTCAGCATCT TGTGTGACCA CAGGGGCCAT
GTCTATGTCC ACGAAGCGGG TGGATTGTCC ACCTTATCAC AGGCTATGGT TCAGCCGATC
GTGGCCAAGA ATGGACATCA TTTGACGTTG GAAGATGACA TTTTGCCCAA CTTCATTCCT
GACGACGGAG AAATCCATGG AGCTCCAACG AAGGTCATTT CTTTGGAAAA CACCTTACAT
GGTATGATTT TCCCCTTGGA TGAAATCAAG AAAATCTCTA ACTTCTGCAA GAAGAACGAC
GTAAAATTGC ATCTTGACGG TGCCAGATTG TGGAATGCGT CTGTTGCCAC TGGAATTTCT
CTCAAAGAAT ACTGTTCATA TTTCGACAGT GTCTCATTAT GTCTTTCCAA GACTTTGGGT
GCTCCTGTTG GCTCTGTTCT AGTGTCAACC AGAAAATTTG TAAACAAAGC TAACCACTTT
AAGAAACAAA ACGGTGGTGG AATCAGACAA AGTGGTTTGT TGGCTGTAAT GGCCATCACA
GCCATCGACG AGAACTTGCC TAAATTGCAA AAGACCCATG AAAGAGCCAA AGAGTTAGGT
GAATTGTGTG ACAAGAATGG AATCTACTTG GAACATCCCG TCGAAACAAA CTTCGTGTTT
ATCGACACGA AGAAAAACAA ATGGAACCCG GAGTCCATAA AGACATTAGC AGAGAAGCAC
GGAATTAAGT TTTACGGAGG AAGAATATCT TTCCACTATC AAGTTTCTGA CGAAAGCTTT
GAAGCCGTTA AGAAATTTGT ATTGGAAACG CAGGAAGATG CTAAGAAAAA CCCATACGAC
GGCGGCGATC AGGTCAGATT TTACAGTAAT ATCGAAGAGT GATTGCAAAG TAAGATGATT
TACATACCCT ATAATATGTA TAATTAACTA AGTTATATAG ACATTATATA CGTATATTAA
TATGT
 
Protein sequence
MDFSTYTAQS PAHNEFRSDT FTTPTASMIQ ALANATLGDA VYNEDESTIA LEKKVADLAG 
KEAGLYCVSG TLSNQIALRT NLIQPPFSIL CDHRGHVYVH EAGGLSTLSQ AMVQPIVAKN
GHHLTLEDDI LPNFIPDDGE IHGAPTKVIS LENTLHGMIF PLDEIKKISN FCKKNDVKLH
LDGARLWNAS VATGISLKEY CSYFDSVSLC LSKTLGAPVG SVLVSTRKFV NKANHFKKQN
GGGIRQSGLL AVMAITAIDE NLPKLQKTHE RAKELGELCD KNGIYLEHPV ETNFVFIDTK
KNKWNPESIK TLAEKHGIKF YGGRISFHYQ VSDESFEAVK KFVLETQEDA KKNPYDGGDQ
VRFYSNIEE