Gene Information |
Locus tag | PICST_70077 |
Symbol | THI20 |
ID | 4837639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 427321 |
End bp | 429280 |
Gene Length | 1960 bp |
Protein Length | 583 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640388954 |
Product | Phosphomethylpyrimidine kinase THI21 (HMP-phosphate kinase) (HMP-P kinase) |
Protein accession | XP_001382844 |
Protein GI | 126132638 |
COG category | [H] Coenzyme transport and metabolism; [K] Transcription |
COG ID | [COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase; [COG0819] Putative transcription activator |
TIGRFAM ID | [TIGR00097] phosphomethylpyrimidine kinase |
|
|
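The Start bp, End bp, and Gene Length fields above are related by simple inclusive coordinate arithmetic. The sketch below is only an illustrative consistency check, assuming 1-based inclusive coordinates; the function names `feature_length` and `coding_length_nt` are hypothetical and not part of the record or of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length_nt(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein of the given length.

    Each amino acid takes one codon (3 nt); the stop codon adds 3 more.
    """
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Gene Information table.
span = feature_length(427321, 429280)
coding = coding_length_nt(583)
print(f"annotated span: {span} bp; coding portion incl. stop: {coding} nt")
```

With the values in this record, the computed span agrees with the stated Gene Length of 1960 bp. The coding length for 583 aa plus a stop codon is smaller than that span, so the annotated span and the displayed sequence appear to cover more than the coding region alone.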
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.583202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTCTTTTCCA TTGTTCTTAC CATTCAATAC AGGCATCGTG TTGAAAATGA CAACATTTTC TGTGGTGAAA TTGAGAACTC CCACTGTCAA GTCCAAACCA GTCCTACCAG CTGTATTAAC TATAGCAGGA TCCGACAACT CAGGTGGAGC CGGCATTGAG GCCGACTTGA AGACATTCAG TGCCCACAAG GTTTACGGGT TGACTTGTAT TGCTGCTTTG ACGGCCCAAA ATACACAATT GGTGAAAACA TTTGAAAAAA CTCCAAAAGA ATTGGTTAAA AACATCTTGC AGCTCAACTT TGACGATTTC CTCTACGGAT ATGAAGACAG TACACAGCCA TTGAAGGTCA TCAAGACTGG GATGCTCACA GAAGAAGCTG TTCATGTTAT CCAGGACTTT CTACCAGACA TCAAGAAACA CAATGTGAAA CTAATAGTCG ACCCAGTGAT GATCAGTACT TCTGGATCCA GTCTTTTTGA TAGCGAAGGC ATGAAGCTTT GTGTGAATAC CTTGATCAGT GGAGCCTATT TGATCACACC TAATTTTGTA GAAGCCAGAG CACTTTGGGA GATTGCTTGT GGAGAAAGTG CAGCTATCGA GAAGCTCACT ATCAACTCTT TGGATGACTT TATAGACTTT GTCAAGCAAT TGCAGAAGAC CCTCAAGTCA CAGAATGTTT TAGTAAAGGG TGGACATATT CCTTGGGATT CCCGAACGGG CAAGCCATTT GTTGGAACCA ATCTTGCTGA TGTTGAAGAC AGTATTGTTG TTTTGGATGT GTTATATGAA TCTGAAATTG ATAAGGCGAC TGTGTTTGAG TCCAAGTACA TAAATACCAA GGATTCACAT GGAACCGGTT GCACTCTTGC TTCAGCCATA TCTGCAAATG TAGCCAAGGG AAAGAACTTG AAAGAAAGTA TCTCTTTATC CATAGACTAT ATCCACAAGG GAATGTTGAG TGTAGGCAAG AAATTGGGAT ATGGAAATGG ACCCTTGAAC CACAATGTGG AACCCGAAGA AAATCTAAGC AATGTCATCA TTGGAAACAG CACAGACACA TACATGAGCG TTAAGAAAGG GAACCAGTCT TTCTTTGAGT ACTTCAAGAC CCATCCTAGT GTCAAGGAAA GCTGGAAGCT CTACACTGAG CATAGATTTA TCCAGCAATT GGCTCAGGAC AAGTTGCCAT TCCAGCGCTT CCTTTACTTC TTGAAGCAAG ATTACTACTA CTTGATCAAC TATGCACAGA TGCACGGGTT AGCGGCTTCA GTCGCACCAA CATACCATCA AACCCATGCC GAAGCACTTA TCATAGGAGA AGTCATCACT GAAATTGAGA AGCACAAGGA AAAGCTTTCC AAGAAATACG ACATTGATTA TGAAAGAGAT ATTGATTTCG ATATCGAGTT GCAACCTGGA AAGGCATGCG TGGACTACTG CAACTATCTC TTGGAAATTG GAAATAGGGA GAATTTCTTG GGTATCAAGG TAGCTTTGGC TCCTTGTTTG CATGGATACG CTGAGGCTGG GTTGTATGGT AAGAGCATCA GAGAGAGCTA TGACAAGAGT ACCTCCAGCT TGGATAAGGT ACTTTCTGAA ACCTACGACA CGTGGTTAGG AGACTATAGT TCCGAATGGT ATTTGAACGC TCATAAAGAA GGAGAGGCTA CGCTTCAGGA GTTGATGGAA TCGAACGACG TTTCAAATGA GAGAATGGAC GAACTTGTTG AGATTTTCAG GAAGGTGACA GAGTTGGAAG TGCACTTTTG GGACGAGGTC TTGGATGTAT TACCATAATT 
GTTGGTGAAC ATCATGAGTT GTAAGTTCTG ATGCAATAAC ATAGACTATG AAGTGTTCAC GAGTCTCTCT AAGAAAGTAT TTCGATAGAG AAGATTGAAG TGCAAAGAAA ATAGAAAATG TATAGAACTG TTTAACAAAT TGAATATAGA AAAAGCGTGG
|
Protein sequence | MTTFSVVKLR TPTVKSKPVL PAVLTIAGSD NSGGAGIEAD LKTFSAHKVY GLTCIAALTA QNTQLVKTFE KTPKELVKNI LQLNFDDFLY GYEDSTQPLK VIKTGMLTEE AVHVIQDFLP DIKKHNVKLI VDPVMISTSG SSLFDSEGMK LCVNTLISGA YLITPNFVEA RALWEIACGE SAAIEKLTIN SLDDFIDFVK QLQKTLKSQN VLVKGGHIPW DSRTGKPFVG TNLADVEDSI VVLDVLYESE IDKATVFESK YINTKDSHGT GCTLASAISA NVAKGKNLKE SISLSIDYIH KGMLSVGKKL GYGNGPLNHN VEPEENLSNV IIGNSTDTYM SVKKGNQSFF EYFKTHPSVK ESWKLYTEHR FIQQLAQDKL PFQRFLYFLK QDYYYLINYA QMHGLAASVA PTYHQTHAEA LIIGEVITEI EKHKEKLSKK YDIDYERDID FDIELQPGKA CVDYCNYLLE IGNRENFLGI KVALAPCLHG YAEAGLYGKS IRESYDKSTS SLDKVLSETY DTWLGDYSSE WYLNAHKEGE ATLQELMESN DVSNERMDEL VEIFRKVTEL EVHFWDEVLD VLP
|
| |
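The record lists translation table 12 (the alternative yeast nuclear code), whose amino acid assignments differ from the standard code only in that CTG encodes serine rather than leucine. The following is a minimal sketch of how the first codons of the gene sequence map to the start of the protein sequence. The function name `translate_table12` is hypothetical, the start position of the reading frame is supplied by hand, and a maintained library would normally be preferable for real work.

```python
BASES = "TCAG"

# Amino acids for all 64 codons in TCAG order (first base varies slowest).
# This is the table 12 assignment: identical to the standard code except
# that CTG (index 19) is Ser instead of Leu.
TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate_table12(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Only the unambiguous bases A, C, G, T are supported; anything else
    raises ValueError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = TABLE_12[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 coding nucleotides of the gene sequence, starting at the ATG.
cds_start = "ATGA CAACATTTTC TGTGGTGAAA TTGAGA"
print(translate_table12(cds_start))
```

Run on those 30 nucleotides, the sketch yields the first ten residues of the protein sequence in the record (MTTFSVVKLR).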