Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_67165 |
Symbol | HIS1 |
ID | 4837608 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 2660226 |
End bp | 2661416 |
Gene Length | 1191 bp |
Protein Length | 304 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640388923 |
Product | ATP phosphoribosyltransferase (ATP-PRTase) (ATP-PRT) |
Protein accession | XP_001383262 |
Protein GI | 150864445 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0040] ATP phosphoribosyltransferase |
TIGRFAM ID | [TIGR00070] ATP phosphoribosyltransferase [TIGR03455] ATP phosphoribosyltransferase, C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AAAATTGCAA GTTGAAAAAT TACCAGATTT TCTTTTTGTA GTTCTCCAGC CAATTCTAAC TATACACAAT GGATTTAGTT AACCACCTCC CAGACCGTTT GTTGTTTGCT GTGCCCAAGA GTATGTGATC TTGTCTATCG AAATTTTACA GATTTTTCAG ATTTTTCAGT CCGATAAACG ATTTCAGATT TCGTTCTCCG ATTTAAAAAT AAATACCTAG CTATAAGTTA TCATTGTCAT ATTGTTACTA ACTATATTCA GAGGGTCGTT TATATGAAAA GTGTTGCAAC TTGTTGCAGG GTGCTGACAT CCAGTTCAGA CGTTCCAACA GATTGGATAT TGCCCTTTCC ACCAACTTGC CTGTTGCCTT GATTTTCTTG CCTGCTGCCG ACATTCCTAT TTTTGTTGGA GAAGGTAACT GTGATTTGGG TATAACCGGT TTAGACCAGA TCCAAGAAGC TCTGATGCTT GACCATACTG AGGACTTGTT GGACTTGAAC TTTGGCTCTT GTAAGTTACA GATTCAGGTT CCAGCCGAAG GAGACATCAC TACTCCTGAG CAATTAGTCG GTAAGAAGAT TGTTTCGTCG TTTACCAAGT TGTCTACCAA CTACTTTAAG TCATTGGAAA AGGTTTCGGA TGAATCACAA TTAACAACTA GCATCAGATA CGTTGGTGGA TCTGTCGAAG CTTCATGTGC CTTAGGAGTT GCTGATGCAA TTGTCGATTT GGTAGAAAGT GGTGAAACCA TGAAGGCTGC CGGATTGAAG CCTATAGAAA CTATTTTACA GACCTCGGCC CATTTGATCT CGTCTAAGAA CCCCAAGTTT CCCGAGTTAG TTGAAATCAT CCACCAAAGA TTCGAAGGTA TCTTGGCTGC TCAGAAGTAC GTATTGTGTA ACTACAATGC TCCAAGAAGA TTGTTGCACG ACGTGTTAAG TATTACTCCT GGAAGAAGAG CTGCCACCGT GTCACCTTTA GAAAAACACA ACCCAGAAGA CGAAGACTGG GTGGCCATAT CGTCAATGGT GGAGAGAAAG GCTATCGGAG ACAAGATGGA CTTGTTGAAG AAGAGTGGAG CCTCCGACAT CTTGGTGTTT GAAATCAGCA ACTGTAGAGT TTAGACCAAA ATTACAAAAC AATTTACAAA TAATTACATA TACATAATTA ATAGACTTCA TTTATTCTGT C
|
Protein sequence | MDLVNHLPDR LLFAVPKKGR LYEKCCNLLQ GADIQFRRSN RLDIALSTNL PVALIFLPAA DIPIFVGEGN CDLGITGLDQ IQEASMLDHT EDLLDLNFGS CKLQIQVPAE GDITTPEQLV GKKIVSSFTK LSTNYFKSLE KVSDESQLTT SIRYVGGSVE ASCALGVADA IVDLVESGET MKAAGLKPIE TILQTSAHLI SSKNPKFPEL VEIIHQRFEG ILAAQKYVLC NYNAPRRLLH DVLSITPGRR AATVSPLEKH NPEDEDWVAI SSMVERKAIG DKMDLLKKSG ASDILVFEIS NCRV
|
| |