Gene PICST_69669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_69669 
SymbolTRP5 
ID4851612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2334859 
End bp2337079 
Gene Length2221 bp 
Protein Length700 aa 
Translation table 
GC content44% 
IMG OID640393320 
Producttryptophan synthetase 
Protein accessionXP_001387018 
Protein GI126275035 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain
[COG0159] Tryptophan synthase alpha chain 
TIGRFAM ID[TIGR00262] tryptophan synthase, alpha subunit
[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.390453 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.887683 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCTT TATTGAAAGA AACTTTTGCT AGGTGTAAAA AAGAGGGACG AAATGCTTTG 
GTCAACTTCA TTACTGCTGG TTACCCTACT ATTGAGGACA CAGTTCCAAT CTTGAAGAGC
ATGCAAGATG CCGGTGTAGA CATCATTGAA TTGGGTATCC CATTCTCCGA CCCAATTGCT
GATGGTCCAA CCATCCAAGC TGCTAATAAT GTCGCGTTGG ACAACGGAAT CACCGTCCCA
AAGTGCTTGG ACTTGGTCAA GCAAGCAAGA GAACAAGGTG TCACTGTTCC AATCATATTG
ATGGGTTACT ACAATCCAAT CTTAAAGTAC GGTGAAATCA AGTTGATTGA GGACTCTGCA
AGGGTAGGTG CTAACGGTTT CATTGTCGTC GACTTGCCTC CTGAGGAAGC CATCAAGTTC
AGATCGTCTT GTGCACGTTA TGGATTGTCT TATGTTCCTT TGGTTGCCCC TGCCACTACT
GATGAAAGAT TGAAGGTCTT GGGAGAAATC GCAGATTCCT TTATCTACGT AGTTTCTAAG
ATGGGTACCA CTGGTGCTTC CAAATCTGTT TCTTCCGGTA TCACTGAGTT GTGTGCTAGA
GTTAGAAAAT TTGCCGGCTC TACCCCAATT GCCGTAGGTT TCGGTGTGTC TACAAGAGAG
CATTTCTTAA CTGTTGGTGA GAGCGCTGAT GGTGTGGTTA TTGGTTCTAG AATCGTGACC
TTAATTGGTG AATCTAAGCC AGGTGAAAGA GGTGTGACCG CTTACAAGTA TGTCAAATCC
ATTTTAGGTG AAGGCTTTTC CGTCAATGCG CCAACTTCTT TCTCTCGGGC TGTAGTTATA
GATGGAACTG AAACTAAGCC AGTTCTTGAA GAGGACCACA AGTTCAACCC AAAATTTGGT
GAATTTGGTG GTCAATATGT TCCTGAAGCA TTGCACACTT GTTTGGCTGA ATTGGAGAAA
GGATTTGAAA GCGCAGTTGC TGATCCCGAG TTCTGGAAGG AATTCAAGGA CTTGTACTCT
TACATTGGAA GACCATCTTC TTTGCATAGG GCCGAAAGAT TGACTGAATA TGCCGGAGGC
GCTCAGATCT GGTTGAAGAG AGAAGATTTG AACCATACTG GTTCTCACAA GATCAACAAT
GCCTTGGCTC AAGTGTTAAT TGCTAAGAGA TTAGGTAAGA AGAAGATTAT TGCTGAGACT
GGTGCTGGTC AGCATGGTGT TGCTACTGCT ACTGCATGTG CTAAGTTTGG ATTGGAATGT
ACCGTTTTCA TGGGAGCCGA AGATGTAAGA CGTCAAGCCT TGAATGTGTT CAGAATGAGA
ATCTTGGGTG CAAAGGTTGT TGCTGTCACT AATGGTACCC AAACATTGAG AGATGCGACT
TCTGAAGCCT TCAGATTCTG GGTATCAAAC TTAGAGTCAA CGCACTATGT TGTCGGTTCA
GCAATTGGAC CACATCCATA CCCAACCTTG GTTAGAACCT TCCAAAGTGT TATTGGTCAA
GAAACCAAAG AGCAGTTTAA GACTTTAAAC GGCGGTAAGT TACCAAACGC CGTTGTTGCT
TGTGTCGGCG GTGGTTCGAA CTCCACTGGT ATGTTCTCTC CTTTTGAACA CGATACTGAA
GTCAAAATGT TAGGTGTCGA AGCTGGTGGT GACGGCTTAG ACACTGATCG CCATTCTGCA
ACTTTGACGG CAGGTATTCC AGGTGTGTTC CATGGTGTCA AAACTTACGT TCTTCAGGAC
AGTGATGGAC AGGTTCATGA CACTCATTCA GTTTCTGCGG GTTTAGACTA TCCTGGCGTA
GGTCCAGAAT TGGCATTTTG GAAGAGCACT GGTCGTGCTG ACTTCGTTGC TGCTACAGAT
GCTCAGGCAT TGATCGGATT TAAATTATTG TCCCAATTGG AGGGTATAAT TCCAGCTTTG
GAGTCTTCTC ACGCTATTTA TGGTGGTGTT GAGTTGGCTA AGACTATGCC AAAGGATCAA
CACATTGTTA TCAATGTTTC AGGACGTGGT GACAAGGATG TGCAAAGTGT TGCCGAAGTT
TTACCGAAGT TAGGCGAGCA GATTGGCTGG GACTTGAGAT TCGAAGCCGA TCCTACGAAG
TAAGTTTAGT TAATACTAAG AAATTCTATA TTTTTTATAA CCTCTAATAA CAATGCTTGG
ATGATTTTAC AATTCTTCAG CCATGTTCAT GATTAACAAG ATTGTTGATG TCCAACCCAA
G
 
Protein sequence
MSALLKETFA RCKKEGRNAL VNFITAGYPT IEDTVPILKS MQDAGVDIIE LGIPFSDPIA 
DGPTIQAANN VALDNGITVP KCLDLVKQAR EQGVTVPIIL MGYYNPILKY GEIKLIEDSA
RVGANGFIVV DLPPEEAIKF RSSCARYGLS YVPLVAPATT DERLKVLGEI ADSFIYVVSK
MGTTGASKSV SSGITELCAR VRKFAGSTPI AVGFGVSTRE HFLTVGESAD GVVIGSRIVT
LIGESKPGER GVTAYKYVKS ILGEGFSVNA PTSFSRAVVI DGTETKPVLE EDHKFNPKFG
EFGGQYVPEA LHTCLAELEK GFESAVADPE FWKEFKDLYS YIGRPSSLHR AERLTEYAGG
AQIWLKREDL NHTGSHKINN ALAQVLIAKR LGKKKIIAET GAGQHGVATA TACAKFGLEC
TVFMGAEDVR RQALNVFRMR ILGAKVVAVT NGTQTLRDAT SEAFRFWVSN LESTHYVVGS
AIGPHPYPTL VRTFQSVIGQ ETKEQFKTLN GGKLPNAVVA CVGGGSNSTG MFSPFEHDTE
VKMLGVEAGG DGLDTDRHSA TLTAGIPGVF HGVKTYVLQD SDGQVHDTHS VSAGLDYPGV
GPELAFWKST GRADFVAATD AQALIGFKLL SQLEGIIPAL ESSHAIYGGV ELAKTMPKDQ
HIVINVSGRG DKDVQSVAEV LPKLGEQIGW DLRFEADPTK