Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_65481 |
Symbol | HIS7 |
ID | 4838325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1475977 |
End bp | 1477792 |
Gene Length | 1816 bp |
Protein Length | 588 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640389640 |
Product | imidazole glycerol phosphate synthase |
Protein accession | XP_001383902 |
Protein GI | 150864898 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0107] Imidazoleglycerol-phosphate synthase [COG0118] Glutamine amidotransferase |
TIGRFAM ID | [TIGR00735] imidazoleglycerol phosphate synthase, cyclase subunit [TIGR01855] imidazole glycerol phosphate synthase, glutamine amidotransferase subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.361476 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.151399 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTACATTAAA AAAATGGTAA AGTCAGTTTA TGTAATTGAC GTAGAGAGTG GAAACTTGCA GTCATTGGCC AATGCCATAA AGCGTATTGG GGAATACGAC GTCAAATTCA TTCGTAATGC GGATGATTTC GAACAGTATG ACAATGACAT TGAAAAGCTC ATCTTTCCAG GGGTCGGAAA CTATGGACAC TTCGTGAGGG AAATGCATGC GAGAAACTTA ATTAAACCAA TCAACAAGTA CATTGATAGT GGAAGATCAT TGATGGGTAT TTGTGTTGGG TTGCAAGCAT TCTTTGACAG CTCAGAAGAA AGCCCAGGAG TGGATTTTCG TGGATTGGGT TTTTTAAAAT TGAGATTAGC CAAGTTTAAC ATTCATGATC CAATATTCGA AGAGAAAAAA TTAAAGAAGT CTGTTCCACA TATCGGTTGG AACAGTATAA CCGATATCAA GATTGGTTGC AACTCTTTAG AGAGGTCTAA ATCGTTATAC CATATCAACA CATTTAACAA ATATTATTTT GTTCATTCCT ACGCAGCTAT TATAAATGAT GAGAACAAAC ATATTTTGGA AAAGGCTTCT AAAGAGGGCT GGAACTTTGC AATTGCAAGA TATGGTTCAG AAAAATTCCT AGCAGCAATT AACTACAAGA ATTTCTTTGC AACTCAATTT CACCCGGAAA AGTCTGGTTT GGCTGGCTTA AGGGTTATAA AATCTTTCTT AGAAAGTATT CAGTTTGCCG ATGTTGATAA ATCTATCATT CAAGAAGTTG TTGGAGTCGA GCAATCCTTA GGTGGAACCA CCAGAAGAAT CATAGCATGT TTAGACGTCA GATCTAACGA TGAGGGCGAT TTGGTTGTCA CAAAAGGTGA TCAATACAAC GTCCGCGAAA CTGCCCTGAG TGAAAGCAAA GTTAGAAATC TTGGAAAACC AGTTGAATTA GCGACCAGAT ATTACAATCA AGGTGCTGAC GAAGTTACCT TTCTAAACAT TACCTCTTTC CGTAACTCTC CGTTAAAAGA CTTGCCCATG CTTCAAGTTT TGAGCAAAGC CGCTGAAACC ATTTTTGTCC CTTTAACAGT TGGTGGTGGT ATTAAGGATA TGACAGACCC AGAAACCGGC AAATTAGTGC CTGCGGTTAA GGTTGCTGAT TTGTATTTCA GATCTGGAGC AGACAAAGTT AGTATTGGAA GTGATGCTGT TACTATTGCA GAAGAGTATT ATGCAAATGG AAAGCAAAAA ACTGGCAAAA CATCTATCGA AAGCATTTCT GCAACTTTTG GTGCGCAAGC TGTGGTTATA TCCGTTGATC CAAAGAGAAA GTATGCTGCT AGTCCGATGG AAACCAAGAT GCAAACCATA AAAATTGTAG ACCCAGCCAA GTTTGGACCT AATGGCGAAC AGTACTGCTA CTACCAAGTT ACTTCACAAG GAGGGAGAAA GGTCCACGAG TTGGGCGCTC TTGAATTATG TACCGCTTGC GAGGAATTGG GTGCAGGTGA AATATTATTG AACTCGATCG ATCATGATGG GTCCAACAAG GGATACAATC TCGAATTATT GACTCAAATC AAGAGCAACG TTTCCATCCC AGTAATTGCA AGTTCTGGTG CTGGTAATCC GCAACATTTC CAAGACGCTT TCGAATTGGA ATGTGGAATT GACGCTGCAT TGGGAGCAGG AATGTTTCAC AGAGGTGAAT ACGAAGTCAA TGACGTTAAA AAGTATCTTC AGACCAATGG CAAGATGGAC GTTCGATTAG ATGAAGAAGT AGAATTATAA ATCATATAAA TTTTGTATAT AGTGCAGTCA TTATCG
|
Protein sequence | MVKSVYVIDV ESGNLQSLAN AIKRIGEYDV KFIRNADDFE QYDNDIEKLI FPGVGNYGHF VREMHARNLI KPINKYIDSG RSLMGICVGL QAFFDSSEES PGVDFRGLGF LKLRLAKFNI HDPIFEEKKL KKSVPHIGWN SITDIKIGCN SLERSKSLYH INTFNKYYFV HSYAAIINDE NKHILEKASK EGWNFAIARY GSEKFLAAIN YKNFFATQFH PEKSGLAGLR VIKSFLESIQ FADVDKSIIQ EVVGVEQSLG GTTRRIIACL DVRSNDEGDL VVTKGDQYNV RETASSESKV RNLGKPVELA TRYYNQGADE VTFLNITSFR NSPLKDLPML QVLSKAAETI FVPLTVGGGI KDMTDPETGK LVPAVKVADL YFRSGADKVS IGSDAVTIAE EYYANGKQKT GKTSIESISA TFGAQAVVIS VDPKRKYAAS PMETKMQTIK IVDPAKFGPN GEQYCYYQVT SQGGRKVHEL GALELCTACE ELGAGEILLN SIDHDGSNKG YNLELLTQIK SNVSIPVIAS SGAGNPQHFQ DAFELECGID AALGAGMFHR GEYEVNDVKK YLQTNGKMDV RLDEEVEL
|
| |