Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_74289 |
Symbol | TAL1 |
ID | 4841165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | - |
Start bp | 649536 |
End bp | 650616 |
Gene Length | 1081 bp |
Protein Length | 323 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640392480 |
Product | transaldolase |
Protein accession | XP_001386719 |
Protein GI | 126140394 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0176] Transaldolase |
TIGRFAM ID | [TIGR00874] transaldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.318947 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCTCAATTCC ACATCACAAT GTCCTCCAAC TCCCTTGAAC AATTGAAAGC CACAGGTACC GTCATCGTCA CCGACACCGG TGAATTCGAC TCGATTGCCA AGTACACTCC ACAAGATGCC ACCACCAACC CATCGTTGAT TTTGGCTGCT GCTAAGAAGC CTGAATACGC CAAGGTCATT GACGTCGCCA TTGAATACGC CAAGGACAAG GGTTCCTCCA AGAAGGAAAA GGCTGAAATC GCCTTGGACC GTTTGTTGAT TGAATTCGGT AAGAACATCT TGGCCATTGT TCCAGGAAGA GTGTCTACCG AAGTCGACGC CAGATTGTCT TTCGACAAAG AGGCCACCAT CAAGAAGGCT CTTGAATTGA TTGCCTTGTA CGAATCCCAA GGTATCTCCA AGGACAGAAT CTTGATCAAG ATCGCCTCCA CTTGGGAAGG TATCCAAGCT GCCAGAGAAT TGGAAGCCAA GCACGGTATC CACTGTAACT TGACTTTGTT GTTCTCTTTC GTTCAGGCAG TTGCCTGTGC TGAAGCCAAG GTCACCTTGA TCTCGCCATT CGTCGGCAGA ATCTTGGACT GGTACAAGGC TTCTACCGGA AAGACCTACG AAGGTGACGA AGACCCAGGT GTGATTTCTG TCAGAGCCAT CTACAACTAC TACAAGAAGT ACGGCTACAA AACTATTGTC ATGGGTGCCT CTTTCAGAAA CACCGGTGAA ATCAAGGCTT TGGCTGGTTG CGACTACTTA ACTGTTGCTC CTAAGTTGTT GGAAGAATTG TTGAACTCCA CTGAACCAGT TCCACAAGTG TTGGACGCTG CTTCTGCCTC TGCTACTGAT GTCGAAAAGG TTTCTTACGT CGATGACGAA GCTACCTTCA GATACTTGTT CAACGAAGAC GCCATGGCTA CCGAAAAGTT GGCCCAAGGT ATCAGAGCTT TCGGCAAGGA CGCTGTCACC TTGTTGGAAC AATTGGAAGC CAGATTCTAA GTATTGTGCT TCGAGTCCTA GATGGATCTC TGGTATTTAC ATATTTCGCT TCTATTAATA ATTTCACAAA AACAATATAA TACGATCAAT G
|
Protein sequence | MSSNSLEQLK ATGTVIVTDT GEFDSIAKYT PQDATTNPSL ILAAAKKPEY AKVIDVAIEY AKDKGSSKKE KAEIALDRLL IEFGKNILAI VPGRVSTEVD ARLSFDKEAT IKKALELIAL YESQGISKDR ILIKIASTWE GIQAARELEA KHGIHCNLTL LFSFVQAVAC AEAKVTLISP FVGRILDWYK ASTGKTYEGD EDPGVISVRA IYNYYKKYGY KTIVMGASFR NTGEIKALAG CDYLTVAPKL LEELLNSTEP VPQVLDAASA SATDVEKVSY VDDEATFRYL FNEDAMATEK LAQGIRAFGK DAVTLLEQLE ARF
|
| |