Gene PICST_46345 details


Gene Information

Locus tag: PICST_46345
Symbol: HPD1
ID: 4839301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 1188147
End bp: 1189739
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 12
GC content: 38%
IMG OID: 640390616
Product: 4-hydroxyphenylpyruvate dioxygenase (4HPPD) (HPD) (HPPDase)
Protein accession: XP_001384908
Protein GI: 150865616
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
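
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length in bp."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon (assumed present) does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1188147, 1189739)
print(length, "bp")                      # 1593 bp
print(protein_length_aa(length), "aa")   # 530 aa
```

Under these assumptions, the computed values agree with the Gene Length and Protein Length fields.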


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.693171
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.28283
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTACTTAAAG AACTTCCATT TTTGCCAACT TCTCTGGACC CCATAACTGA ACCCGATATT 
GATGAATTAC TTCTGGATGG TCATGTGAAT TCTAAATATC CAACTGATGG GTTTATCAAG
TTTTTCTCGC TTAAGATCTG CAGTTCCAAC GCAAAACAGA TGTCCAAATA TCTACAACTT
GCAATGGATT TCAAGGAGAT CGCCTACAAA GGTTTAGAGA ATGATTCTCG TTTGGTGGGA
GCCCACGTCA TTAGAAACGG GGATGTCACT CTTGAAATTG TCAATACTTT GGAAACTGTG
GAAGATGACA ATGTTTTAAA ATTCCCCTAT TTTGAAAAGG ACTTGAAGCA ATTTCCCCAG
CTTAATGAAC TGAAATATTT GAGAGATTTC AAAATCACCA CCAACGACCT AGTATTTGAT
TTTGTCAATA GCAGAATTGA AAGTTTTTCT GTTAGTCCAA ATGCTCATTA CTTCAGAAGA
AAGCTTTACA ATAAAATTGT CTCTTCTCGT GCTTTTAGGA ATAATATGTT TGACTACAAC
AACCTTATCC TCAATGTCAT CAACAATTCG GAGGTAATTT ACAATGACAT AATGGAATGT
ACATTGATTC AGAAGTTCCT TAAAACTCAT GGTGAAGGAG TCATGGATAT TAGTTTTCTT
GTTGAAGATG TCATAACCAT TTTCGATAAG GCAGTTGCCG CTGGAGCTGG TATTATTCGG
TTGCCAAAGA TCATTAGCGA TTGTAACGGT TCCGTTAGGT TGGGAACAAT CAGTATTCCC
AAGACTGATA TTCAGCACAC CTTGATAGAG AATATCGATT ATACGGGACC ATTCTTACCT
AATTACTCTG AGTCAGTGAC CCAATACAAC TCCAAATACT ATGATCAGAT GCAAAATATT
CCAACAGTTA GTTTCCAATG CATCGATCAC TGTGTAGAGA ACTATTCTTG GAACCAAATG
ATGGCACAAG CAAAACTCTA TGCATCTCTA TTTGGATTTC ACAAGTATTG GTCTGCTGAT
GATCATGATA TAGCTACTGA CAACACTGCT CTCAGATCAA TCGTCATGGC ATCCGGAAAC
GGGAAGATAA AGATGCCTAT CAATGAGCCC GTGAAGTCAA AAATGAGAGG TCAAATTGAA
GAATTCCATG ATTTCAATGG CGGCCCAGGT GTTCAACATA TTGCATTGAG AACCAATGAT
ATAATTGATA CCGTGTGTGC CTTATTGGCT AGAGGAATTG AATTTAATAC TGCTTCGGAC
AAATATTACA CGAATTTGGA ACGCCTTCTT AGAGAAGATG ACGTTGCATT ATTTGAGGAT
TTTGATACTC TCAGAAAGTT GAATATCTTG GTTGATTATG ACATTTCTAC CAGAAATAAA
AAGACAGGGA TTTGTAACTA CCTACTACAG ATCTTTACAA AACCATTACA CGATCGGCCA
ACGCTTTTCA TTGAAATTAT TCAGAGACAT CATCACAATG GATTTGGAAA GGGAACTTTT
AAGGGTCTTT TCGAAACTAT CGAAGAGCAA CAGCGAATCA GAGGAACTCT TGTACAAGTA
GACGAAGATG ATGATTCGCA ACAAAGTACA TAG
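
The GC content field can be recomputed from a sequence printed in the 10-base blocks used above. This is a minimal sketch: `gc_content` is a hypothetical helper name, and it assumes the sequence contains only A, C, G and T once whitespace is removed.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a sequence; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First block of the gene sequence: 2 C + 1 G out of 10 bases.
print(f"{gc_content('CTACTTAAAG'):.0%}")  # 30%
```

Passing the full gene sequence as a single string (line breaks and spaces included) gives a value that can be compared with the GC content field.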
 
Protein sequence
LLKELPFLPT SSDPITEPDI DELLSDGHVN SKYPTDGFIK FFSLKICSSN AKQMSKYLQL 
AMDFKEIAYK GLENDSRLVG AHVIRNGDVT LEIVNTLETV EDDNVLKFPY FEKDLKQFPQ
LNESKYLRDF KITTNDLVFD FVNSRIESFS VSPNAHYFRR KLYNKIVSSR AFRNNMFDYN
NLILNVINNS EVIYNDIMEC TLIQKFLKTH GEGVMDISFL VEDVITIFDK AVAAGAGIIR
LPKIISDCNG SVRLGTISIP KTDIQHTLIE NIDYTGPFLP NYSESVTQYN SKYYDQMQNI
PTVSFQCIDH CVENYSWNQM MAQAKLYASL FGFHKYWSAD DHDIATDNTA LRSIVMASGN
GKIKMPINEP VKSKMRGQIE EFHDFNGGPG VQHIALRTND IIDTVCALLA RGIEFNTASD
KYYTNLERLL REDDVALFED FDTLRKLNIL VDYDISTRNK KTGICNYLLQ IFTKPLHDRP
TLFIEIIQRH HHNGFGKGTF KGLFETIEEQ QRIRGTLVQV DEDDDSQQST
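
The protein sequence can be reproduced from the gene sequence with translation table 12 (the alternative yeast nuclear code), which differs from the standard code in reading CTG as serine rather than leucine. The sketch below builds the codon table from the NCBI amino acid string for table 12; `translate` is an illustrative helper, not part of any library, and it assumes the input is already in coding orientation and in frame.

```python
from itertools import product

# NCBI codon order: each position cycles through T, C, A, G, last base fastest.
BASES = "TCAG"
# Amino acid string for NCBI translation table 12 (CTG -> S instead of L).
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), TABLE_12)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TO_AA[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "CTACTTAAAG AACTTCCATT TTTGCCAACT TCTCTGGACC CCATAACTGA ACCCGATATT"
print(translate(first_line))  # LLKELPFLPTSSDPITEPDI
```

The twelfth codon of the gene is CTG, so the twelfth residue is S here; under the standard code it would be L.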