Gene PICST_30795 details


Gene Information

Locus tag: PICST_30795
Symbol: MET23
ID: 4837883
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 956715
End bp: 958079
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 12
GC content: 45%
IMG OID: 640389198
Product: homoserine O-acetyltransferase
Protein accession: XP_001383814
Protein GI: 150864832
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2021] Homoserine acetyltransferase
TIGRFAM ID: [TIGR01392] homoserine O-acetyltransferase
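
The coordinates and lengths listed above agree with one another, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(956715, 958079)
print(length)                  # 1365
print(protein_length(length))  # 454
```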


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATTCC CATGTCTTGA TAAGTTGGAG AAGAAGACCC AGGATCTCTC CAATTCATGT 
AACATAGAAG ATGACAATAT CAGCTCATTA GATGCCGGTC CAGAACCTTC CTATTCTTAT
GTTCAGACTG GGTTCAAGTT GTACAAGTCA GACAAGCCGA TTTTCCTTGA TAACGGAGGC
TATTTACCCG AGTACGAAGT GGCGTATGAA ACTTGGGGAC AATTGAATGC AGCCAAAGAC
AACTTAGTGT TGATCCATAC TGGCTTATCA GCATCTTCGC ATGCCAAATC ACAACCAGAC
AATACCAAGC CGGGATGGTG GGAAGATTTC ATTGGTTCTG GAAAATACAT TGACACTGAT
AAATACTTCG TTGTATGTAC CAATGTGCTT GGAGGTTGTT ATGGATCAAC CGGGCCCTCC
CTGAGAGACC CAGCCAATGG AGAAATCTAT GCTACAAGAT TCCCCATCAT AACTGTTAAC
GATATGGTGA GAGCTCAGCG AGAGTTGATC AGAAATGTTT TCGAAGTATC CAAGATTCAT
GCACTGGTAG GTGCGTCTAT GGGTGGCATG CAATCGTTAG CATATGCGTG GGAATTCCCT
GACGAAGTCA ACAAGATCGT ATCTATCAGT GGCTGTGCCA GATCTCATCC ATACTCTATT
GCATTAAGAC ATACTCAAAG ACAGGTATTG ATGAGCGATC CTAATTGGAA GAGAGGATTC
TACTACGGAG GTGTGCCTCC TCATGTAGGG ATGAAGTTGG CCAGAGAAAT CGCTACCGTC
ACCTACCGTT CTGGACCTGA ATGGGAGTAC AGATTCGGCA GAAACAGAGC TGATGACTCT
CGTCCTCCAG CATTGTGTCC GGATTTCCTC GTCGAAACGT ATTTGGACCA TGCTGGCGAG
AAGTTTTGTT TGGAATACGA TGCCAACTCA CTTTTGTATG TGTCGAAAGC TATGGACTTG
TTTGACTTGA GCTCTTCCAA CCGTTCTAAA GCTCTTCAAA AGAGAACTGC TACTGAACAA
AGTTGGAATT CGCAGAACAA TATCTCCGAA TCTCACTCGG TTCCTCCAGA GCCATATAAG
GAGAAGGTGG GTTCAGCAAC TGTGACTCCT GAGGAGTCTT TAGAAGATTT GAGAAACGGA
ATCAGCAAGA TCTCGCACAA AGATGTGCTA GTCATTGGTG TTGAGTCAGA TATCTTGTTT
CCAGTGTGGC AACAGCGAGA AATTGCCGAT GTCTTAATGC AAAACAATAC TACTGGGGAT
ATCCGGTACT TTGAGTTGGG CACCAACATC AGTAACTACG GCCACGACAC TTTCTTGTTG
AGCTTGGACC ATATTGGACC GCCAGTTCGC GATTTCTTGG GTTAG
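
A GC content figure like the one in the record can be recomputed from a nucleotide sequence with a short sketch. The function name `gc_content` is hypothetical, and the sketch assumes the spaces separating the 10-base blocks should be ignored. Only the first line of the gene sequence is used here as sample input, so the printed value describes that fragment, not the whole gene.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence."""
    # Keep only nucleotide letters; this drops spaces and line breaks.
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, used only as sample input.
first_line = (
    "ATGGCATTCC CATGTCTTGA TAAGTTGGAG AAGAAGACCC AGGATCTCTC CAATTCATGT"
)
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```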
 
Protein sequence
MAFPCLDKLE KKTQDLSNSC NIEDDNISSL DAGPEPSYSY VQTGFKLYKS DKPIFLDNGG 
YLPEYEVAYE TWGQLNAAKD NLVLIHTGLS ASSHAKSQPD NTKPGWWEDF IGSGKYIDTD
KYFVVCTNVL GGCYGSTGPS SRDPANGEIY ATRFPIITVN DMVRAQRELI RNVFEVSKIH
ASVGASMGGM QSLAYAWEFP DEVNKIVSIS GCARSHPYSI ALRHTQRQVL MSDPNWKRGF
YYGGVPPHVG MKLAREIATV TYRSGPEWEY RFGRNRADDS RPPALCPDFL VETYLDHAGE
KFCLEYDANS LLYVSKAMDL FDLSSSNRSK ALQKRTATEQ SWNSQNNISE SHSVPPEPYK
EKVGSATVTP EESLEDLRNG ISKISHKDVL VIGVESDILF PVWQQREIAD VLMQNNTTGD
IRYFELGTNI SNYGHDTFLL SLDHIGPPVR DFLG