Gene PICST_41330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_41330 
SymbolTPS1 
ID4837586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1448341 
End bp1449759 
Gene Length1419 bp 
Protein Length472 aa 
Translation table12 
GC content46% 
IMG OID640388901 
ProductTrehalose-6-phosphate synthase 
Protein accessionXP_001382500 
Protein GI126131950 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID[TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.336184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.497082 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGTCG GCAAGGTCCT TGTAGTCTCC AACCGACTTC CAGTAACCAT CAAGCGTTCT 
GACTCCGGCT CCTACGACTA CTCGATGTCT TCCGGGGGTC TAGTCACCGC ACTTCAAGGG
TTGAAGAAGT CAACAGAATT TCAGTGGCTA GGCTGGCCTG GTTTGGAAGT ACCTGCGGAC
GAACAGGAGA GAGTCAATTC CGACTTGAAG TCGAAGTTTA ACTGTACAGC CATCTATTTA
AGTGACGTTA TAGCTGATTT GCACTATAAT GGCTTCTCCA ATTCAATCCT TTGGCCGCTT
TTCCATTACC ATCCGGGTGA AATGAACTTT GACGAAAACG CCTGGGCTGC TTATATCGAA
GCCAACCGCC AGTTTGCCGT AGAAATAGCA GGCCAGGTCA ATGACAACGA TATGGTATGG
GTGCACGATT ACCACTTGAT GCTCTTGCCT CAGATGTTGC GGGAAGAAAT CGGCAACAGA
AAGAAGAATA TCCGTATCGG TTTCTTCTTA CACACGCCGT TTCCATCGTC AGAAATATAT
AGAATTTTGC CCGTAAGAAA AGAGATCTTG GAAGGTGTTT TGAGCTGTGA CTTGATCGGC
TTCCACACCT ATGACTATGC CAGACACTTC TTGTCTTCAG TATCGCGTAT TGTAGCCGAC
GTGACTACTT TACCCAATGG AATTGAGTTC CAGGGAAGAT CTATCAGTAT TGGGGCTTTT
CCCATCGGTA TCGACGTCGA CAAGTTCACT GAGGGCTTGA CCAAACAGTC GGTTATCGAC
AGAATCAAGC AGTTGAAGTC CCGCTTTGGT GACACCAAGA TTATCGTGGG GGTAGATCGC
TTGGATTACA TCAAGGGTGT CCCCCAGAAA CTCCACGCAT TCGAGGTTTT TTTGGAAGAA
AACCCAGAAT GGATCGGCAA AGTAGTCTTG GTCCAAGTTG CAGTGCCTTC TAGAGGCGAC
GTAGAGGAGT ACCAATCACT CAGAGCTACT GTTAACGAGT TGGTAGGTAG GATAAATGGG
AAGTTTGGAA CCGTGGAATT TGTACCTATC CATTATATGC ATAAGTCCGT GCCCTTTGAC
GAGTTGATAA GCTTGTACCG TGTGTCTGAT GTCTGTCTTG TCAGTTCTAC AAGAGACGGA
ATGAACTTGG TTTCTTACGA ATACATCGCT TGTCAGCAGG AAAACAACGG GGTATTGATA
TTGTCTGAGT TCGCTGGTGC TGCGCAATCG TTGAATGGAG CTATCATTGT CAATCCATGG
AATACAGAAG ACTTGAGCAT TTCTATCAAG GAAAGCTTGA CGTTACCAGA AGAAAAGAAA
GCTATCAACT TCAACAAGCT CTTCACTTAT ATCTCCAAGT ATACTTCCGG CTTCTGGGGT
GAAAGCTTCG TCAAAGAATT GTACAAATGC ACATCTTGA
 
Protein sequence
MVVGKVLVVS NRLPVTIKRS DSGSYDYSMS SGGLVTALQG LKKSTEFQWL GWPGLEVPAD 
EQERVNSDLK SKFNCTAIYL SDVIADLHYN GFSNSILWPL FHYHPGEMNF DENAWAAYIE
ANRQFAVEIA GQVNDNDMVW VHDYHLMLLP QMLREEIGNR KKNIRIGFFL HTPFPSSEIY
RILPVRKEIL EGVLSCDLIG FHTYDYARHF LSSVSRIVAD VTTLPNGIEF QGRSISIGAF
PIGIDVDKFT EGLTKQSVID RIKQLKSRFG DTKIIVGVDR LDYIKGVPQK LHAFEVFLEE
NPEWIGKVVL VQVAVPSRGD VEEYQSLRAT VNELVGRING KFGTVEFVPI HYMHKSVPFD
ELISLYRVSD VCLVSSTRDG MNLVSYEYIA CQQENNGVLI LSEFAGAAQS LNGAIIVNPW
NTEDLSISIK ESLTLPEEKK AINFNKLFTY ISKYTSGFWG ESFVKELYKC TS