Gene PICST_74289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_74289 
SymbolTAL1 
ID4841165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp649536 
End bp650616 
Gene Length1081 bp 
Protein Length323 aa 
Translation table12 
GC content47% 
IMG OID640392480 
Producttransaldolase 
Protein accessionXP_001386719 
Protein GI126140394 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0176] Transaldolase 
TIGRFAM ID[TIGR00874] transaldolase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.318947 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCTCAATTCC ACATCACAAT GTCCTCCAAC TCCCTTGAAC AATTGAAAGC CACAGGTACC 
GTCATCGTCA CCGACACCGG TGAATTCGAC TCGATTGCCA AGTACACTCC ACAAGATGCC
ACCACCAACC CATCGTTGAT TTTGGCTGCT GCTAAGAAGC CTGAATACGC CAAGGTCATT
GACGTCGCCA TTGAATACGC CAAGGACAAG GGTTCCTCCA AGAAGGAAAA GGCTGAAATC
GCCTTGGACC GTTTGTTGAT TGAATTCGGT AAGAACATCT TGGCCATTGT TCCAGGAAGA
GTGTCTACCG AAGTCGACGC CAGATTGTCT TTCGACAAAG AGGCCACCAT CAAGAAGGCT
CTTGAATTGA TTGCCTTGTA CGAATCCCAA GGTATCTCCA AGGACAGAAT CTTGATCAAG
ATCGCCTCCA CTTGGGAAGG TATCCAAGCT GCCAGAGAAT TGGAAGCCAA GCACGGTATC
CACTGTAACT TGACTTTGTT GTTCTCTTTC GTTCAGGCAG TTGCCTGTGC TGAAGCCAAG
GTCACCTTGA TCTCGCCATT CGTCGGCAGA ATCTTGGACT GGTACAAGGC TTCTACCGGA
AAGACCTACG AAGGTGACGA AGACCCAGGT GTGATTTCTG TCAGAGCCAT CTACAACTAC
TACAAGAAGT ACGGCTACAA AACTATTGTC ATGGGTGCCT CTTTCAGAAA CACCGGTGAA
ATCAAGGCTT TGGCTGGTTG CGACTACTTA ACTGTTGCTC CTAAGTTGTT GGAAGAATTG
TTGAACTCCA CTGAACCAGT TCCACAAGTG TTGGACGCTG CTTCTGCCTC TGCTACTGAT
GTCGAAAAGG TTTCTTACGT CGATGACGAA GCTACCTTCA GATACTTGTT CAACGAAGAC
GCCATGGCTA CCGAAAAGTT GGCCCAAGGT ATCAGAGCTT TCGGCAAGGA CGCTGTCACC
TTGTTGGAAC AATTGGAAGC CAGATTCTAA GTATTGTGCT TCGAGTCCTA GATGGATCTC
TGGTATTTAC ATATTTCGCT TCTATTAATA ATTTCACAAA AACAATATAA TACGATCAAT
G
 
Protein sequence
MSSNSLEQLK ATGTVIVTDT GEFDSIAKYT PQDATTNPSL ILAAAKKPEY AKVIDVAIEY 
AKDKGSSKKE KAEIALDRLL IEFGKNILAI VPGRVSTEVD ARLSFDKEAT IKKALELIAL
YESQGISKDR ILIKIASTWE GIQAARELEA KHGIHCNLTL LFSFVQAVAC AEAKVTLISP
FVGRILDWYK ASTGKTYEGD EDPGVISVRA IYNYYKKYGY KTIVMGASFR NTGEIKALAG
CDYLTVAPKL LEELLNSTEP VPQVLDAASA SATDVEKVSY VDDEATFRYL FNEDAMATEK
LAQGIRAFGK DAVTLLEQLE ARF