Gene PICST_33690 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33690 
SymbolPHA2 
ID4840877 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp224035 
End bp225137 
Gene Length1103 bp 
Protein Length306 aa 
Translation table12 
GC content43% 
IMG OID640392192 
ProductPrephenate dehydratase (PDT) 
Protein accessionXP_001386638 
Protein GI150866894 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.748096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCACCA AAGTCGCATT CTTGGGTCCA GAAGGGACAT ATACACATCA AGTACGTTTT 
TTTTTAGAGG AAAAGGAGCC AAAACAAGAT CTGACTCATT CTCAGACCAC CGTTCTACTT
TTCAGACATG ATCTGGTTTA CTAACTCTTC TAAAGGCAGT TATCCAGCAG TTCGGCAACA
AAGACAATGT TCTGATATAT CCAGTCAAGA CAATTTCAGA CTGTTTCAAA GAGATCCACT
CCAAAAATGT AGATTTCGCT GTGGTTCCAT TAGAAAACTC TATCAATGGT GGTGTAGTTT
TCACTTTTGA CCTCATCAGA GATTGGTTCA TACCGTCTTT GCAGAACAAT AGTAGACAAA
ATGACGACAG TGGCTTCTTA AGTCCACCCC CTTCTTCTAA AACGACGACA TCTTCAACGT
CTTCCAAGCC AACTTTCAGA ATCGTAGCTG AGCAGTTTGT GTCTATTCAC CATAACTTCT
TAACTAGGGC AGAAGACGTC TCCAAGATCA CATGTATATA TTCTCATCCC CAGGTGTGGA
CCCAGGTCAC AGGCTTCTTG TCAACTATCC CGGCTAGCAT TCCCAGAATA GACAGTACGT
CTACTTCAAA GGCTGCTGAG TTGGTTAATG GTGACGAATC CAATACCTCA GCCTGTATCT
CATCTCAGAT GAGTTCAGAC TTGTACCAAT TGCCTATAAG AAATGCCAAC ATAGAGGACA
ATCCTAACAA CACCACCAGA TTCTTAGTTT TGGGATACGA GAAACCACCC GCTCCATCTC
CATCCCCAGC TCCAGAAGTT GGAGAGCCAG AAAGACCAGA TTCTCGTATT ACTTCTATCA
TCTTCACTTT GAATCACAAT GATCCAGGTG CACTTTGTGA CGTATTGTAC GAGTTCAAGA
AGAATGGAGT CAACTTGACT TCGATCACAT CTAGACCATC CCATTTGAAA CAATGGCAGT
ATGTTTTCTT CGCAGAGGTC ATTGGCGATC TGAGCAGTGA CGCTAATATT GCTAAAGGTA
TAGAGCTGGC TAGTAGTATT TGTCTGGAAT TGGTAGTGCT CGGTTCCTTT GACAGAAGCT
GGAGGTACTG GAAATCATCG TAG
 
Protein sequence
MVTKVAFLGP EGTYTHQAVI QQFGNKDNVS IYPVKTISDC FKEIHSKNVD FAVVPLENSI 
NGGVVFTFDL IRDWFIPSLQ NNSRQNDDTE QFVSIHHNFL TRAEDVSKIT CIYSHPQVWT
QVTGFLSTIP ASIPRIDSTS TSKAAELVNG DESNTSACIS SQMSSDLYQL PIRNANIEDN
PNNTTRFLVL GYEKPPAPSP SPAPEVGEPE RPDSRITSII FTLNHNDPGA LCDVLYEFKK
NGVNLTSITS RPSHLKQWQY VFFAEVIGDS SSDANIAKGI ESASSICSEL VVLGSFDRSW
RYWKSS