Gene PICST_85224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_85224 
SymbolTHR1 
ID4840689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp669886 
End bp671307 
Gene Length1422 bp 
Protein Length353 aa 
Translation table12 
GC content43% 
IMG OID640392004 
Producthomoserine kinase 
Protein accessionXP_001386141 
Protein GI126139237 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0083] Homoserine kinase 
TIGRFAM ID[TIGR00191] homoserine kinase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.481244 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.37391 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATAAATAGTG CCCACTGCCG CTTCCGGATT TCCTCAGATT CAGAGTGTTG AGAAGCTAGA 
CAAATTTCTA ATTCACAGAG ATAAAAACCA TACGGTTGCC AATATAAAGT CATACTAGAT
AGTATTCCCG TACTATATGT ATGAAGAACC ATTGAAGGTC ATAAGAAGTT ACCAGTCTTG
ACCTCGGACC ACCGAAAAGC CGTAATATAG AAAACAAAGT AATCGCTAAA CACATAAAAG
TAACAACCAT CTAAAAGAGT ACAACTTCCA AGTTATTACT ATCATTTACT CCACTATTAG
TACATAGAAT CATACTGTCA TGACCATCAG ATCGTTTGAA GTCAAAGTTC CAGCATCGTC
TGCCAATATC GGCCCAGGAT TTGACGTTCT TGGGGTCGGA CTCCAGTTGT ACTTGCAAAT
CAAAGTCACT ATTGATTCGT CCAAGGATAC CAGCCATGAC CCATACCACG TCAAGTTGAG
TTACGAGGGA GATTTGGCTG AAAAAGTGCC ACTCACCTCC GACAAGAACT TGATCACCCA
GACTGCTCTA TACATTTTGC GAGTAAATGG TATGGACTCG TTCCCTCAGG GTACCCATAT
CCATGTGATC AACCCTGTTC CATTGGGTAG AGGATTGGGT TCGTCAGCTT CTGCTATTGT
TGGAGGCATT GTTTTAGGTA ACGAGATCGG AGAGTTCAAG TTTTCTAAGA CCAGATTGAT
GGATTACTGT TTGATGATCG AAAGACATCC AGACAACATT GCTGCTGCTA TGTTGGGTGG
GTTTGTGGGC TCGTACTTGC ATGACTTATC GCCAGAAGAC ATGGCTGCGA AAAACGTTCC
CTTGGACTAC ATCTTGCCCA AGCCAGACAC TCCTAAAGAA AAAATCGTGT CATCGCAACC
ACCCACCAAT ATCGGCGAGT ACTTGCAATA CAATTGGTGC CACAAAATTA AGTGTGTTGC
AATCGTGCCG AACTTTGAAG TTTCGACTGA CTCCTCCAGA GCCGTATTGC CAGAAAAGTA
CGATAGACAA GACATTGTCT TCAACTTGCA AAGATTAGCC ATTTTGACGA ATGCTTTGAC
ACAGGAAACA CCAAACAACA AGCTTATCTA TGAGTCCATG AAAGACAAGA TCCACCAGCC
ATACAGATCG GGGTTGATTC CTGGTTTGCA GAAGGTATTG GCTTCTGTGA CTCCCGATAC
CCACCCTGGA TTGTGTGGTA TCTGTTTGTC TGGAGCTGGA CCTACTATCT TGTGTTTGGC
TACCGGAGGT TACGACGCCA TTGCTGAGAC GGTCATAGGA ATCTTCAACA AGGCTGGCGT
AGAATGCAGC TGGAAGTTGT TAGAATTGGC TTACGACGGT GCCACTGTTG AAATAAAGTA
ATAGATATAA ATAGACCTTG TATAATCAAA AGTTATTGCT TT
 
Protein sequence
MTIRSFEVKV PASSANIGPG FDVLGVGLQL YLQIKVTIDS SKDTSHDPYH VKLSYEGDLA 
EKVPLTSDKN LITQTALYIL RVNGMDSFPQ GTHIHVINPV PLGRGLGSSA SAIVGGIVLG
NEIGEFKFSK TRLMDYCLMI ERHPDNIAAA MLGGFVGSYL HDLSPEDMAA KNVPLDYILP
KPDTPKEKIV SSQPPTNIGE YLQYNWCHKI KCVAIVPNFE VSTDSSRAVL PEKYDRQDIV
FNLQRLAILT NALTQETPNN KLIYESMKDK IHQPYRSGLI PGLQKVLASV TPDTHPGLCG
ICLSGAGPTI LCLATGGYDA IAETVIGIFN KAGVECSWKL LELAYDGATV EIK