Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_89228 |
Symbol | HIS2 |
ID | 4838598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 79214 |
End bp | 81930 |
Gene Length | 2717 bp |
Protein Length | 867 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389913 |
Product | Histidine biosynthesis trifunctional protein |
Protein accession | XP_001384313 |
Protein GI | 150865196 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0140] Phosphoribosyl-ATP pyrophosphohydrolase [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase [TIGR03188] phosphoribosyl-ATP pyrophosphohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.145236 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TAGAAATCTG TGATAATCCG CGCAAAATGA CATTCCCAGT GTTGCCCTTG GTCTCGGGTC CTTCTGCGCA CGAAGAAATA GCTGCTTTTG CTGTTGTTGG ACAGATATTG TTACCTTTCA AAAGTGTGCA AATCTCAAAG ACGTTTTTGA ACCAATTCCC ATACGAGCTT CTCGTTGATG TTGATGTGTC AGCCGACTCT GTTACTGTTG ACGATGTCGT TCTCTTATTG AACAGCGGTG TCAGACAAGT AATTGTGTCT GAAAAGCAGG CACCAGAGTT GGTTTTTGAG AGTGGATTGC CTACTTCCAG ATTTACAATT GAATTGCAAG GTTCAGAATT GTCAGCTCAA ATTGCTAAAT CTTCAGCTTC TGTAGTATTG AATTCTGCAG TATCCAAAGA TGCTTTTAAG AGTATCAGTT CCAGCGGAAA CAGAACGGTT TACTATAGAA ATGGCGCCAT ATCACAAGAT TTAGCAGAAA CTTTAGCTCA AGACGGCTAC ACTCTTATCA TCCCTGCTGA AAAGTTGACT GATAAGACTA GTGAGTCTGG CAAAATCTCG ATCTCGACTA TTTTCACCAG CACTTTGACA ACAGACCGTC CAGATGGTCT TTTCACCACT TTAATTACTG CTCCAGCTCC ATCTTATACT GCCTTGGGAG TGGTATATTC ATCCAAGGCC TCCATTGAAG CAGCCATTGC CGAAAAAGTT GGAGTGTATC AATCACGTAA ACGTACTCAA GAGTTGTGGT ATAAAGGTAA GACCTCTGGT GCTACCCAAA AGTTGTTGAA GTTGGAAAGA GACTGTGATT CTGATGTCGT TAAATTTGTA GTAGACTCCA GAGACGGCTA CGGTTTCTGC CACTTGGACA ATAACTACAC CTGTTTTGGA GATGGACAAT TAAAGCAGAA GTCTGAGGCT GTTGGTACGG GATTGGCCAA ATTGGACAGT ACTTTGGCTC TGAGATTCCA GTCTGCCCCT GAAGGTTCCT ACACCAAGCG ACTTTTCAGC GACGACACCC TCTTAATTGC AAAGTTAAAA GAAGAATTAG ACGAGTTGAT TGAGGCAGGA CAGAATAAGG AGAAAGATGC GTCAGATGTT GCCTTTGAAT GTGCCGATTT GTTCTACTTC GCATTGGTGT GGTGTACCAA GAATGGAGTT AAGTTGGCAG ATGTCGAGAA AAACTTGGAT ATCAAGGCTG GTAAGGTCAC CAGAAGAAAG GGCGACGCCA AGCAACAATA CTTGGAAACT AAGGAACAAT CCAAGGAAGA GTCGAAAGAA GATCCAAAGG AAAAACAATT GTATAAGATG GAAACCATCA ACACCAAGGA TGCTGCTTCT GCTTCTCTGA TCCAGAGAGC CTTGACGAGA CCAGTGCAAA AGACTTCGGA CATCATGAAG TTGGTTCTCC CCATTGTTCA GAAGGTTCAA AAAGAAGGCG ATAAGGCTTT GATCGAGCTC ACTGAGAAGT TCGATGGTGT CAAGTTAGAT TCGCCGGTAT TGAAAGCTCC TTTCCCACAA GATCTCATGA ATATCTCTGA GGACATGAAA AAAGCTATTG ACTTGTCGAT TTCCAACATC GAAAAGTTTC ATGCTGCTCA GCTTCCAAAG GAAAAGGTGA TGACAGTTGA AACTTCTCCT GGGGTATATT GTTCTCGTTT CGCCAAACCT ATAGAAAACG TTGGTTTGTA CGTTCCAGGT GGTACTGCGG TTTTGCCTTC TACTGCCATG ATGTTGGGAG TTCCTGCTAA AGTTGCCGGT TGTTCCAATA TCGTATTAGC GTCTCCTCCA GCCAGAGCCA CTGGTAAGTT GACTCCAGAA GTCGTTTATG TTGCCCACAA GATCGGTGCC AAGTGCATTG TTATGGCTGG GGGTGCTCAG GCCGTCACTG CCATGGCCTA CGGTACTGAA AGTGTTCTCA AGTGTGACAA GATCCTCGGT CCAGGGAACC AGTTTGTCAC TGCAGCAAAA ATGTATATAC AAAACGATAC TCAAGCTCTT TGCTCCATCG ACATGCCTGC CGGTCCTTCT GAAGTCCTAG TCATGGCTGA CGAAAATGCC GACGCTGACT TTGTTGCCAG TGACTTGTTG TCTCAAGCCG AACATGGTGT GGATTCTCAG GTCATCTTGA TTGGTGTTAA CTTATCAGAC AAGAAAGTAC GAGAATTTGA AGAAGCTGTT AGAAAACAAG CTGAGGTACT TCCAAGAAAG GAAATCGTCG CTAAGTGCTT GGCTCACTCG TTCATTCTCT TGGTTGACAA TTATGACGAA GCGTTTGACT TGTCCAACAA GTACGCCCCA GAACATTTGA TCCTTCAAAT CGATAATGCT TCCTCGTTTG TTCCGGACTA TATAGAAAAT GCTGGTTCCG TTTTCGTGGG TGCCTTGTCG CCAGAGTCAT GTGGTGACTA CTCCTCAGGT ACCAATCACA CATTGCCTAC CTACGGTTAT GCCAGACAGT ATTCTGGTGT GAATACTGCT ACATTCCAGA AGTTCATTAC TTCGCAAGAT GTTAGTGAAG AAGGGTTGAA GAGTATTGGT AAGGCTGTTA TGACCTTGGC TGCTGTAGAA GGATTGGAAG CTCACAGAAA TGCCGTCCAA GTCAGAATGG ACAAATTGGG ATTGTTGTAA TAAGTGTCAA AGAATAATTT TAGACAAATT TTATTAAGAC TATAATATGT TTGTAGTTAT CACATATTTT CATATACATT AACTTTG
|
Protein sequence | MTFPVLPLVS GPSAHEEIAA FAVVGQILLP FKSVQISKTF LNQFPYELLV DVDVSADSVT VDDVVLLLNS GVRQVIVSEK QAPELVFESG LPTSRFTIEL QGSELSAQIA KSSASVVLNS AVSKDAFKSI SSSGNRTVYY RNGAISQDLA ETLAQDGYTL IIPAEKLTDK TSESGKISIS TIFTSTLTTD RPDGLFTTLI TAPAPSYTAL GVVYSSKASI EAAIAEKVGV YQSRKRTQEL WYKGKTSGAT QKLLKLERDC DSDVVKFVVD SRDGYGFCHL DNNYTCFGDG QLKQKSEAVG TGLAKLDSTL ASRFQSAPEG SYTKRLFSDD TLLIAKLKEE LDELIEAGQN KEKDASDVAF ECADLFYFAL VWCTKNGVKL ADVEKNLDIK AGKVTRRKGD AKQQYLETKE QSKEESKEDP KEKQLYKMET INTKDAASAS SIQRALTRPV QKTSDIMKLV LPIVQKVQKE GDKALIELTE KFDGVKLDSP VLKAPFPQDL MNISEDMKKA IDLSISNIEK FHAAQLPKEK VMTVETSPGV YCSRFAKPIE NVGLYVPGGT AVLPSTAMML GVPAKVAGCS NIVLASPPAR ATGKLTPEVV YVAHKIGAKC IVMAGGAQAV TAMAYGTESV LKCDKILGPG NQFVTAAKMY IQNDTQALCS IDMPAGPSEV LVMADENADA DFVASDLLSQ AEHGVDSQVI LIGVNLSDKK VREFEEAVRK QAEVLPRKEI VAKCLAHSFI LLVDNYDEAF DLSNKYAPEH LILQIDNASS FVPDYIENAG SVFVGALSPE SCGDYSSGTN HTLPTYGYAR QYSGVNTATF QKFITSQDVS EEGLKSIGKA VMTLAAVEGL EAHRNAVQVR MDKLGLL
|
| |