Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_85224 |
Symbol | THR1 |
ID | 4840689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 669886 |
End bp | 671307 |
Gene Length | 1422 bp |
Protein Length | 353 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640392004 |
Product | homoserine kinase |
Protein accession | XP_001386141 |
Protein GI | 126139237 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0083] Homoserine kinase |
TIGRFAM ID | [TIGR00191] homoserine kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.481244 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.37391 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATAAATAGTG CCCACTGCCG CTTCCGGATT TCCTCAGATT CAGAGTGTTG AGAAGCTAGA CAAATTTCTA ATTCACAGAG ATAAAAACCA TACGGTTGCC AATATAAAGT CATACTAGAT AGTATTCCCG TACTATATGT ATGAAGAACC ATTGAAGGTC ATAAGAAGTT ACCAGTCTTG ACCTCGGACC ACCGAAAAGC CGTAATATAG AAAACAAAGT AATCGCTAAA CACATAAAAG TAACAACCAT CTAAAAGAGT ACAACTTCCA AGTTATTACT ATCATTTACT CCACTATTAG TACATAGAAT CATACTGTCA TGACCATCAG ATCGTTTGAA GTCAAAGTTC CAGCATCGTC TGCCAATATC GGCCCAGGAT TTGACGTTCT TGGGGTCGGA CTCCAGTTGT ACTTGCAAAT CAAAGTCACT ATTGATTCGT CCAAGGATAC CAGCCATGAC CCATACCACG TCAAGTTGAG TTACGAGGGA GATTTGGCTG AAAAAGTGCC ACTCACCTCC GACAAGAACT TGATCACCCA GACTGCTCTA TACATTTTGC GAGTAAATGG TATGGACTCG TTCCCTCAGG GTACCCATAT CCATGTGATC AACCCTGTTC CATTGGGTAG AGGATTGGGT TCGTCAGCTT CTGCTATTGT TGGAGGCATT GTTTTAGGTA ACGAGATCGG AGAGTTCAAG TTTTCTAAGA CCAGATTGAT GGATTACTGT TTGATGATCG AAAGACATCC AGACAACATT GCTGCTGCTA TGTTGGGTGG GTTTGTGGGC TCGTACTTGC ATGACTTATC GCCAGAAGAC ATGGCTGCGA AAAACGTTCC CTTGGACTAC ATCTTGCCCA AGCCAGACAC TCCTAAAGAA AAAATCGTGT CATCGCAACC ACCCACCAAT ATCGGCGAGT ACTTGCAATA CAATTGGTGC CACAAAATTA AGTGTGTTGC AATCGTGCCG AACTTTGAAG TTTCGACTGA CTCCTCCAGA GCCGTATTGC CAGAAAAGTA CGATAGACAA GACATTGTCT TCAACTTGCA AAGATTAGCC ATTTTGACGA ATGCTTTGAC ACAGGAAACA CCAAACAACA AGCTTATCTA TGAGTCCATG AAAGACAAGA TCCACCAGCC ATACAGATCG GGGTTGATTC CTGGTTTGCA GAAGGTATTG GCTTCTGTGA CTCCCGATAC CCACCCTGGA TTGTGTGGTA TCTGTTTGTC TGGAGCTGGA CCTACTATCT TGTGTTTGGC TACCGGAGGT TACGACGCCA TTGCTGAGAC GGTCATAGGA ATCTTCAACA AGGCTGGCGT AGAATGCAGC TGGAAGTTGT TAGAATTGGC TTACGACGGT GCCACTGTTG AAATAAAGTA ATAGATATAA ATAGACCTTG TATAATCAAA AGTTATTGCT TT
|
Protein sequence | MTIRSFEVKV PASSANIGPG FDVLGVGLQL YLQIKVTIDS SKDTSHDPYH VKLSYEGDLA EKVPLTSDKN LITQTALYIL RVNGMDSFPQ GTHIHVINPV PLGRGLGSSA SAIVGGIVLG NEIGEFKFSK TRLMDYCLMI ERHPDNIAAA MLGGFVGSYL HDLSPEDMAA KNVPLDYILP KPDTPKEKIV SSQPPTNIGE YLQYNWCHKI KCVAIVPNFE VSTDSSRAVL PEKYDRQDIV FNLQRLAILT NALTQETPNN KLIYESMKDK IHQPYRSGLI PGLQKVLASV TPDTHPGLCG ICLSGAGPTI LCLATGGYDA IAETVIGIFN KAGVECSWKL LELAYDGATV EIK
|
| |