Gene PICST_71203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_71203 
SymbolGLY2 
ID4838059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1101225 
End bp1102525 
Gene Length1301 bp 
Protein Length373 aa 
Translation table12 
GC content45% 
IMG OID640389374 
Productthreonine aldolase 
Protein accessionXP_001383841 
Protein GI126134633 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.200501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AACGGCGACA CCTTTTCACA TCTTTCTTGA TTTCCGAATA TTGTCACTTG TCATCGTAGC 
AAGTGATAGT TGCTTATATA GCGCAGTACA AAAGATAGCA TTTCCATGAT GACCATCGAC
GACGAAGAAA CGTTTGTTAC CCACAACGAG TTCAGATCCG ACACGTTCAC GGTTCCCACC
CGGGCCATGG TCGAAGCCAG TTTCGCCAAC AGCACCTACG GAGACTCCGT TTACAAGGAA
GACGCTGTCA CTTTGACTTT GGAAGAGAAG ATGTGTACCT TGACCGGAAA GCCGGCTGCT
CTCTTCTGTG TCAGCGGAAC CATGTCCAAC CAGATCGGTT TGAGAGCCAA CTTGGTTCAA
CCTCCTTATA GCGTCTTGTG TGACCACCGT GCCCATGTCT TCTTGCACGA GGCAGCTGGT
TTGGCTATGT TGTCACAGGC AATGGTTCAT CCTGTGACTC CCAGCAACGG AAACTACTTG
ACATTTGAAG ACGTAGTAGA GAACGTCACT TTCGACGATG GTGATATCCA TGCGGCTCCT
ACCAAGGTGA TTTCGCTTGA AAACACCTTA CATGGGATCA TCATGCCCAT TGAAGAGATT
CGCAAGATTT CTCAATTTTG TAGAGAAAAT GACATCAAGT TGCATTTGGA CGGGGCCAGA
TTGTGGAATG CCTCAGCAGC CACTGGAATT TCTATCGAAG AGTACTGCTC TTATTTCGAC
AGTGTCTCCT TATGTTTGTC CAAGTCGTTG GGTGCACCCA TTGGTTCAAT TCTTGTAGGA
GAACAGAAGT TCATTGACAA GGCCAACCAT TTCAAGAAAC AATGTGGTGG TGGTATAAGG
CAGGCTGGTA TCGTGACATC AATGGCTATA AATGCCATTG AGCAGAACTT CCCCAAGTTG
GTGAAGTCCC ACCAGTACGC CAAGCAGGTC GGAGACTTCT GCGACCAACA CGGCATTGCA
TTAGAAAGTC CAGTGGACAC CAATTTTGTC TTCTTGGACT TAAAGGCTAA CCACATGGAC
GACAGGCTCT TGATTGAGTT GGGTAAGAAG AACAACATCA AGTTGATGGG AGGAAGAATC
GCATTTCACT TCCAGTTGAG CCAGCAGAGT GTGGACGCTG TCAAGAGGGT GATCTTGGAG
TGCTACCAGT ACAACCAGAA GAATCCCTAC AAATATAACA TCAAAAATAA CAAGAAGATG
TACAACTATG ATTCGATCAA GGCATGATTT ATGAAGCACT ACTAATAGCA AGAAACTATA
TATACGCTAT ATACTAATTT TAATGCAAAC TAATAGATAA G
 
Protein sequence
MMTIDDEETF VTHNEFRSDT FTVPTRAMVE ASFANSTYGD SVYKEDAVTL TLEEKMCTLT 
GKPAALFCVS GTMSNQIGLR ANLVQPPYSV LCDHRAHVFL HEAAGLAMLS QAMVHPVTPS
NGNYLTFEDV VENVTFDDGD IHAAPTKVIS LENTLHGIIM PIEEIRKISQ FCRENDIKLH
LDGARLWNAS AATGISIEEY CSYFDSVSLC LSKSLGAPIG SILVGEQKFI DKANHFKKQC
GGGIRQAGIV TSMAINAIEQ NFPKLVKSHQ YAKQVGDFCD QHGIALESPV DTNFVFLDLK
ANHMDDRLLI ELGKKNNIKL MGGRIAFHFQ LSQQSVDAVK RVILECYQYN QKNPYKYNIK
NNKKMYNYDS IKA