Gene PICST_39206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_39206 
SymbolPDT1 
ID4851197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1175297 
End bp1176388 
Gene Length1092 bp 
Protein Length363 aa 
Translation table 
GC content45% 
IMG OID640392905 
Productpredicted phosphatidyl synthase 
Protein accessionXP_001387869 
Protein GI126274183 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0647] Predicted sugar phosphatases of the HAD superfamily 
TIGRFAM ID[TIGR01456] HAD-superfamily class IIA hydrolase, TIGR01456, CECR5
[TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.116854 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0363344 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAAT ATACTACTGG AAACAGCAAT TTGGCATTTG TATTTGACAT TGACGGTGTG 
CTCATCCGGG GAGAAAAGGC AATTCCCGGG GCTGGACCCA CACTTGAGCT TTTGAACGAA
CACAAGGTTC CATTCATCTT GTTGACAAAT GGCGGCGGAG TTCTGGAGAA GGAAAGAGTG
CAGTTCATAT CTGAAACCGT GCAAGTTCCC ATTTCTCCTT TGCAGATTGT TCAGAGCCAT
ACCCCAATGA AGGCATTGGC CCATAAACAT GCTTACGACC GGGTCTTGGT AGTTGGTGGT
CCCGGAGATA AGGCTAGGCA CTGTGCCATT GGTTATGGAT TCCACGATGT AATAATGCCT
ATAGACATTG TTAGAGCCAA TCCGGCCGTA TCGCCTCATC ACAGATACAC AGTCGAAGAC
TTTGACCGTT ACTCCCGGGA AGTCGATTTA AAGAAACCCA TTGAGGCCAT CTTGGTGTTT
AATGACCCCA GAGACATGAC AACTGATATT CAGATTGTTT CAGATTTGCT CAATTCAGAT
CACGGAGTTA TAGGAACGAA GCGCTCTATC ACGAAGTTGA AACATCGTGA AGACCCGTCT
ATCCCCATCA TATTCAGTAA CAATGACTTC CTCTGGGCCA ATGACTATGC GTTGCCACGT
TTTGGTCAAG GTGCATTTAG AATAATCGTA GAGAACTTGT ATCGTGAAGT TAACCAATTG
AAAGACAGCC AACATTTGCA CTCTATAATT ATGGGCAAGC CGTTCAAGAT TCAGTACGAC
TTCGCCCACC ATGTGCTTAT TGACTGGCGC AACAAGCTTT TGGCAAACGA TACAAGCTCA
CAATCGCAAT TCTTGCCTAA CTTAGGTAGT GAACCCAAGA ATTCGCCGTT TAAGAGCATT
TTCATGGTGG GTGACAATCC GGCCTCTGAC ATTAAGGGTG CTAACGACAA TGGGTGGGAG
TCCATTCTCG TCAGAACAGG TGTCTACGAC AATGAGGATT TAAGCACGAT CATCGCCCAG
CCTACTGTGG GAGTATTTGA CGATGTCTAT GCGTCTGTCG AAGCAGTCTT GAAATCTCAA
AAGATTCTCT AG
 
Protein sequence
MRKYTTGNSN LAFVFDIDGV LIRGEKAIPG AGPTLELLNE HKVPFILLTN GGGVLEKERV 
QFISETVQVP ISPLQIVQSH TPMKALAHKH AYDRVLVVGG PGDKARHCAI GYGFHDVIMP
IDIVRANPAV SPHHRYTVED FDRYSREVDL KKPIEAILVF NDPRDMTTDI QIVSDLLNSD
HGVIGTKRSI TKLKHREDPS IPIIFSNNDF LWANDYALPR FGQGAFRIIV ENLYREVNQL
KDSQHLHSII MGKPFKIQYD FAHHVLIDWR NKLLANDTSS QSQFLPNLGS EPKNSPFKSI
FMVGDNPASD IKGANDNGWE SILVRTGVYD NEDLSTIIAQ PTVGVFDDVY ASVEAVLKSQ
KIL