Gene PICST_65754 details

Gene Information

Locus tag: PICST_65754
Symbol: THR4
ID: 4839718
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 612899
End bp: 614434
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 12
GC content: 44%
IMG OID: 640391033
Product: threonine synthase
Protein accession: XP_001385124
Protein GI: 150865777
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0498] Threonine synthase
TIGRFAM ID: [TIGR00260] threonine synthase
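
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 612899
END_BP = 614434


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


gene_len = span_length(START_BP, END_BP)
print(gene_len)                  # 1536
print(protein_length(gene_len))  # 511
```

Both results agree with the Gene Length (1536 bp) and Protein Length (511 aa) fields.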


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.148721
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCAAA AGTATAGATC ATCGCGTTCT GCGGAGCCCC AGGCTTTGTC CTTTGAGGAT 
GTGGTCATGA CCGGTTTGGC CAACGATGGA GGTTTGTTCC TTCCCTCACA AGTTCCCCAG
CTTCCAGCTT CATTCTTGCA AGACTGGGCG GATTTGTCTT TCCAAGAATT GGCTTTCAAT
GTATTGAGAT TGTACATCAA CGCTGCTGAA ATCCCTGACC AAGACTTAAG AGACTTAATC
TCCAAATCTT ACTCCACTTT CAGATCGGAA GAAGTCACTC CATTAAAGAA GATCGACGAC
AAGTTGTACT TGCTTGAATT GTTCCATGGT CCTACCTATG CCTTCAAGGA TGTTGCCTTG
CAGTTTGTCG GCAACCTCTT TGAGTACTTC TTGACCAGAA GAAATGCCAA GAAGGTTGAA
GGCGAAGCTC GTGATGTTAT CACTGTCGTT GGAGCTACTT CTGGTGATAC TGGTTCTGCT
GCTATCTACG GCTTAAGAGG TAAGAAGGAT GTGTCTGTGT TCATTCTCTA TCCAACAGGC
AGAATTTCTC CTATTCAAGA AGAGCAAATG ACCACAGTAG AGGATGCCAA TGTGCACACA
TTGTCGGTTA ACGGTACCTT CGATGACTGT CAGGACATCG TGAAGCTGAT CTTTGGAGAC
CGCGAGTTCA ATGATAAGTA CCATGTTGGA GCTGTCAACT CCATTAACTG GGCAAGAATT
TTGGCTCAAC AAACCTACTA CTTTTACTCA TACTTCCAAT TGCAGAAGAA GTTAAATGAC
ACATCTGCGA AGGTCAGATT CGTCGTTCCT TCTGGTAACT TCGGCGATAT ATTGGCTGGT
TACTATGCAT ACAAGATGGG CTTGCCAGTG GACAAGTTGA TCATTGCCAC TAATGAAAAC
GACATTTTGG ATAGATTCAT GAAGACTGGT CGTTACGAAA AGAAAGCTGA AAAGGACGCC
TCTGCGGCTG TCAAAGCCAC ATTCTCGCCA GCTATGGATA TCTTGATATC ATCCAACTTT
GAAAGGTTGT TGTGGTACTT GATCAGAGAC TCCGTTGCCA ACGGTAGTGA CGAAGTTGCT
GGTAAGACTT TGAACTCCTG GATGCAACAA TTGAAAGAGA CTGGTTCTGT TGTGGCTGAC
CCAGAAGTTC TCGCTGGAGC CAGATCCATT TTCGATTCTG AAAGAGTTGA TGATGCTGAA
ACTGTTGCTA CCATAAAAGA AGTTTACTCT GCTCACCCAG AAAGCTACGT GTTGGATCCA
CACAGTTCTG TCGGTGTTAC GACTTCCTAC AGATTCATCA AGAAGGACGA CAAGAAGGAC
AACATCAAGT ACATATCTTT GTCTACCGCC CATCCAGCCA AGTTTTCTGA AGTTGTCAAC
AAGGCTTTGG ACTCGATCGC AGGGTATTCT TTCGAGAAGG ATGTATTGCC AGCTGAATTG
AAGGCTTTGA GCACCAAGCG CAAGAGAATT AACTTGATTG ATGAAGCATC CATAGAAAAG
GTCAAGGATG CCATCAAGAA GGAATTGAAC TTTTAG
 
Protein sequence
MSQKYRSSRS AEPQALSFED VVMTGLANDG GLFLPSQVPQ LPASFLQDWA DLSFQELAFN 
VLRLYINAAE IPDQDLRDLI SKSYSTFRSE EVTPLKKIDD KLYLLELFHG PTYAFKDVAL
QFVGNLFEYF LTRRNAKKVE GEARDVITVV GATSGDTGSA AIYGLRGKKD VSVFILYPTG
RISPIQEEQM TTVEDANVHT LSVNGTFDDC QDIVKSIFGD REFNDKYHVG AVNSINWARI
LAQQTYYFYS YFQLQKKLND TSAKVRFVVP SGNFGDILAG YYAYKMGLPV DKLIIATNEN
DILDRFMKTG RYEKKAEKDA SAAVKATFSP AMDILISSNF ERLLWYLIRD SVANGSDEVA
GKTLNSWMQQ LKETGSVVAD PEVLAGARSI FDSERVDDAE TVATIKEVYS AHPESYVLDP
HSSVGVTTSY RFIKKDDKKD NIKYISLSTA HPAKFSEVVN KALDSIAGYS FEKDVLPAEL
KALSTKRKRI NLIDEASIEK VKDAIKKELN F
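
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates how that affects this gene. It builds the codon tables by hand from the standard NCBI amino-acid string instead of relying on a library, and the helper names (`build_table`, `translate`) are hypothetical. The two input fragments are copied from the gene sequence above: the first 30 bases, and the eleventh 60-base row, which contains a CTG codon.

```python
from itertools import product

BASES = "TCAG"
# NCBI translation table 1 (standard code); codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(overrides=None):
    """Map each codon to an amino acid, applying optional codon reassignments."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    table.update(overrides or {})
    return table


TABLE_1 = build_table()
TABLE_12 = build_table({"CTG": "S"})  # table 12 reassigns CTG from Leu to Ser


def translate(dna, table):
    """Translate DNA in frame 1, ignoring whitespace, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = "ATGTCCCAAA AGTATAGATC ATCGCGTTCT"
row_11 = "TTGTCGGTTA ACGGTACCTT CGATGACTGT CAGGACATCG TGAAGCTGAT CTTTGGAGAC"

print(translate(head, TABLE_12))    # MSQKYRSSRS
print(translate(row_11, TABLE_12))  # LSVNGTFDDCQDIVKSIFGD
print(translate(row_11, TABLE_1))   # LSVNGTFDDCQDIVKLIFGD
```

Only the table 12 output matches the protein sequence listed above (`QDIVKSIFGD`); translating with the standard code would put a leucine where the record has a serine.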