Gene PICST_89228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_89228 
SymbolHIS2 
ID4838598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp79214 
End bp81930 
Gene Length2717 bp 
Protein Length867 aa 
Translation table12 
GC content43% 
IMG OID640389913 
ProductHistidine biosynthesis trifunctional protein 
Protein accessionXP_001384313 
Protein GI150865196 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0140] Phosphoribosyl-ATP pyrophosphohydrolase
[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase
[TIGR03188] phosphoribosyl-ATP pyrophosphohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.145236 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TAGAAATCTG TGATAATCCG CGCAAAATGA CATTCCCAGT GTTGCCCTTG GTCTCGGGTC 
CTTCTGCGCA CGAAGAAATA GCTGCTTTTG CTGTTGTTGG ACAGATATTG TTACCTTTCA
AAAGTGTGCA AATCTCAAAG ACGTTTTTGA ACCAATTCCC ATACGAGCTT CTCGTTGATG
TTGATGTGTC AGCCGACTCT GTTACTGTTG ACGATGTCGT TCTCTTATTG AACAGCGGTG
TCAGACAAGT AATTGTGTCT GAAAAGCAGG CACCAGAGTT GGTTTTTGAG AGTGGATTGC
CTACTTCCAG ATTTACAATT GAATTGCAAG GTTCAGAATT GTCAGCTCAA ATTGCTAAAT
CTTCAGCTTC TGTAGTATTG AATTCTGCAG TATCCAAAGA TGCTTTTAAG AGTATCAGTT
CCAGCGGAAA CAGAACGGTT TACTATAGAA ATGGCGCCAT ATCACAAGAT TTAGCAGAAA
CTTTAGCTCA AGACGGCTAC ACTCTTATCA TCCCTGCTGA AAAGTTGACT GATAAGACTA
GTGAGTCTGG CAAAATCTCG ATCTCGACTA TTTTCACCAG CACTTTGACA ACAGACCGTC
CAGATGGTCT TTTCACCACT TTAATTACTG CTCCAGCTCC ATCTTATACT GCCTTGGGAG
TGGTATATTC ATCCAAGGCC TCCATTGAAG CAGCCATTGC CGAAAAAGTT GGAGTGTATC
AATCACGTAA ACGTACTCAA GAGTTGTGGT ATAAAGGTAA GACCTCTGGT GCTACCCAAA
AGTTGTTGAA GTTGGAAAGA GACTGTGATT CTGATGTCGT TAAATTTGTA GTAGACTCCA
GAGACGGCTA CGGTTTCTGC CACTTGGACA ATAACTACAC CTGTTTTGGA GATGGACAAT
TAAAGCAGAA GTCTGAGGCT GTTGGTACGG GATTGGCCAA ATTGGACAGT ACTTTGGCTC
TGAGATTCCA GTCTGCCCCT GAAGGTTCCT ACACCAAGCG ACTTTTCAGC GACGACACCC
TCTTAATTGC AAAGTTAAAA GAAGAATTAG ACGAGTTGAT TGAGGCAGGA CAGAATAAGG
AGAAAGATGC GTCAGATGTT GCCTTTGAAT GTGCCGATTT GTTCTACTTC GCATTGGTGT
GGTGTACCAA GAATGGAGTT AAGTTGGCAG ATGTCGAGAA AAACTTGGAT ATCAAGGCTG
GTAAGGTCAC CAGAAGAAAG GGCGACGCCA AGCAACAATA CTTGGAAACT AAGGAACAAT
CCAAGGAAGA GTCGAAAGAA GATCCAAAGG AAAAACAATT GTATAAGATG GAAACCATCA
ACACCAAGGA TGCTGCTTCT GCTTCTCTGA TCCAGAGAGC CTTGACGAGA CCAGTGCAAA
AGACTTCGGA CATCATGAAG TTGGTTCTCC CCATTGTTCA GAAGGTTCAA AAAGAAGGCG
ATAAGGCTTT GATCGAGCTC ACTGAGAAGT TCGATGGTGT CAAGTTAGAT TCGCCGGTAT
TGAAAGCTCC TTTCCCACAA GATCTCATGA ATATCTCTGA GGACATGAAA AAAGCTATTG
ACTTGTCGAT TTCCAACATC GAAAAGTTTC ATGCTGCTCA GCTTCCAAAG GAAAAGGTGA
TGACAGTTGA AACTTCTCCT GGGGTATATT GTTCTCGTTT CGCCAAACCT ATAGAAAACG
TTGGTTTGTA CGTTCCAGGT GGTACTGCGG TTTTGCCTTC TACTGCCATG ATGTTGGGAG
TTCCTGCTAA AGTTGCCGGT TGTTCCAATA TCGTATTAGC GTCTCCTCCA GCCAGAGCCA
CTGGTAAGTT GACTCCAGAA GTCGTTTATG TTGCCCACAA GATCGGTGCC AAGTGCATTG
TTATGGCTGG GGGTGCTCAG GCCGTCACTG CCATGGCCTA CGGTACTGAA AGTGTTCTCA
AGTGTGACAA GATCCTCGGT CCAGGGAACC AGTTTGTCAC TGCAGCAAAA ATGTATATAC
AAAACGATAC TCAAGCTCTT TGCTCCATCG ACATGCCTGC CGGTCCTTCT GAAGTCCTAG
TCATGGCTGA CGAAAATGCC GACGCTGACT TTGTTGCCAG TGACTTGTTG TCTCAAGCCG
AACATGGTGT GGATTCTCAG GTCATCTTGA TTGGTGTTAA CTTATCAGAC AAGAAAGTAC
GAGAATTTGA AGAAGCTGTT AGAAAACAAG CTGAGGTACT TCCAAGAAAG GAAATCGTCG
CTAAGTGCTT GGCTCACTCG TTCATTCTCT TGGTTGACAA TTATGACGAA GCGTTTGACT
TGTCCAACAA GTACGCCCCA GAACATTTGA TCCTTCAAAT CGATAATGCT TCCTCGTTTG
TTCCGGACTA TATAGAAAAT GCTGGTTCCG TTTTCGTGGG TGCCTTGTCG CCAGAGTCAT
GTGGTGACTA CTCCTCAGGT ACCAATCACA CATTGCCTAC CTACGGTTAT GCCAGACAGT
ATTCTGGTGT GAATACTGCT ACATTCCAGA AGTTCATTAC TTCGCAAGAT GTTAGTGAAG
AAGGGTTGAA GAGTATTGGT AAGGCTGTTA TGACCTTGGC TGCTGTAGAA GGATTGGAAG
CTCACAGAAA TGCCGTCCAA GTCAGAATGG ACAAATTGGG ATTGTTGTAA TAAGTGTCAA
AGAATAATTT TAGACAAATT TTATTAAGAC TATAATATGT TTGTAGTTAT CACATATTTT
CATATACATT AACTTTG
 
Protein sequence
MTFPVLPLVS GPSAHEEIAA FAVVGQILLP FKSVQISKTF LNQFPYELLV DVDVSADSVT 
VDDVVLLLNS GVRQVIVSEK QAPELVFESG LPTSRFTIEL QGSELSAQIA KSSASVVLNS
AVSKDAFKSI SSSGNRTVYY RNGAISQDLA ETLAQDGYTL IIPAEKLTDK TSESGKISIS
TIFTSTLTTD RPDGLFTTLI TAPAPSYTAL GVVYSSKASI EAAIAEKVGV YQSRKRTQEL
WYKGKTSGAT QKLLKLERDC DSDVVKFVVD SRDGYGFCHL DNNYTCFGDG QLKQKSEAVG
TGLAKLDSTL ASRFQSAPEG SYTKRLFSDD TLLIAKLKEE LDELIEAGQN KEKDASDVAF
ECADLFYFAL VWCTKNGVKL ADVEKNLDIK AGKVTRRKGD AKQQYLETKE QSKEESKEDP
KEKQLYKMET INTKDAASAS SIQRALTRPV QKTSDIMKLV LPIVQKVQKE GDKALIELTE
KFDGVKLDSP VLKAPFPQDL MNISEDMKKA IDLSISNIEK FHAAQLPKEK VMTVETSPGV
YCSRFAKPIE NVGLYVPGGT AVLPSTAMML GVPAKVAGCS NIVLASPPAR ATGKLTPEVV
YVAHKIGAKC IVMAGGAQAV TAMAYGTESV LKCDKILGPG NQFVTAAKMY IQNDTQALCS
IDMPAGPSEV LVMADENADA DFVASDLLSQ AEHGVDSQVI LIGVNLSDKK VREFEEAVRK
QAEVLPRKEI VAKCLAHSFI LLVDNYDEAF DLSNKYAPEH LILQIDNASS FVPDYIENAG
SVFVGALSPE SCGDYSSGTN HTLPTYGYAR QYSGVNTATF QKFITSQDVS EEGLKSIGKA
VMTLAAVEGL EAHRNAVQVR MDKLGLL