Gene PICST_90814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_90814 
SymbolMLS1.2 
ID4840685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp444915 
End bp446612 
Gene Length1698 bp 
Protein Length565 aa 
Translation table12 
GC content46% 
IMG OID640392000 
ProductMalate synthase 1, glyoxysomal (MAS) (DAL7) 
Protein accessionXP_001386287 
Protein GI150866625 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01344] malate synthase A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0778949 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.869225 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATCC AGGCCCAACC ACTCTTCAGC TCGCTTGCTG GTGTCCAGGT ATTGGCTCCA 
GTCTCCAAGA CTCCAGAATA TGAGCCCTCT ACCACTACCC AGGCAGACAT TTTGACCAAA
TCAGCCTTGT CTTTTGTAGT TCTTCTTCAC CGTTCTTTCA ATTCCACTAG GAAACAGCTC
TTGGAAAACA GACAACTTGT TCAGGAACAA CTTGACCAGG GCATTCCACT TCTGTTTCTC
CAGGACGAAA CCATGACAAA GGTCAGAAAT GACCCAACCT GGCAGGGAGC CTTTCCTGCT
CCAGGTTTGA CCGATAGAAG AACAGAAATC ACCGGTCCTC CGGAGCGTAA GATGATAGTC
AATGCTCTCA ACACACCTGT CAAGACCTAC ATGAGTGACT TCGAAGACTC ATCTGCTCCT
ACTTGGGCCA ATGTCATCAC TGGCCAAGTC AACTTGTACG ATGCTGTCAG ACACCAGATT
GACTTCGTTA GTAAGGACAA TGGTAAAGCA TATAGAGTCA ACCAGTCCCA ACAGTTTCTG
ACTCCCACAC TTCTTGTACG TCCCAGAGGC TGGCACATGG TTGATAAGCA CATCTTAGTA
GATGGAGAGC CTGTGAGTGC CTCGATTTTG GATTTTGGCT TATACTTCTT CCATAATGCT
CACGAGCTCA TCAACCAGGG TAGGGGTCCA TACTTCTACT TGCCGAAGAT GGAGCACCAT
TTGGAAGCTA AGCTCTGGAA CGACGTCTTC AACGTAGCCC AGGACTCATT GGCCGTTTCT
AGGGGTACCA TCAGAGCTAC CGTTCTCATT GAAACATTAC CTGCTGCCTA TCAGATGGAA
GAAATCATTT TCCAATTGAG AAACCATTCG GCAGGATTGA ACTGTGGAAG ATGGGACTAC
ATATTTTCAA CTATCAAGAG ATTACGTAAC GACCCCTCTA AAATCTTACC CGACAGAGAC
CAGGTTACAA TGACGGTTCC TTTTATGAAA GCCTACTGTG AGCGTTTGAT AAACATCTGC
CACCGTAGAC AAGTCCATGC TATGGGAGGC ATGGCAGCTC AGATCCCTAT AAAGAATGAT
CCTGAAGCCA ACAAAGTTGC TATTTCCAAA GTCAAAAATG ATAAGCTCAG AGAAGCTACA
ATGAATTACG ACGGAACTTG GGTGGCTCAT CCTGCTTTGG CACCAATTGC CAACGACGTC
TTCAACGAGC ATATGCCCAC TCCTAACCAG ATCCACATTG TTCCCGATGA AGATGTCTCG
GAAGCCGATT TGTCCAACAC AGCCATTGCT GGAGGCAAGA TCACAACCGA GGGTATTCGT
AAGAATCTCT TCATTGCCCT AAGCTACATT GAATCGTGGC TCAGAGGTGT GGGATGTGTT
CCTATTAACA ACTTAATGGA AGACGCTGCT ACAGCTGAAG TGTCTCGTTT GCAATTATAT
TCGTGGGTGT TGCACCTGGT AAAGATGGAA GACTCTAACA AGACTGTGAC CCCCGACTTA
ATGAGCCTGA TATTAGAGGA AGAAGTCGAA AAACTCACTG AACAATTTGG CTCTAAGGGC
CGCAAGTTTA AAGAGGCAGC CAGATACCTT GAGCCAGAAA TCACTGGCAA GTCTGTGTCG
GAGTTCTTAA CAACCTTGAT CTATGACTCT GTAGTTACTG TTGGCAAGCC AATCGACTTG
GAAGCCTTGA AGGACTAG
 
Protein sequence
MTIQAQPLFS SLAGVQVLAP VSKTPEYEPS TTTQADILTK SALSFVVLLH RSFNSTRKQL 
LENRQLVQEQ LDQGIPLSFL QDETMTKVRN DPTWQGAFPA PGLTDRRTEI TGPPERKMIV
NALNTPVKTY MSDFEDSSAP TWANVITGQV NLYDAVRHQI DFVSKDNGKA YRVNQSQQFS
TPTLLVRPRG WHMVDKHILV DGEPVSASIL DFGLYFFHNA HELINQGRGP YFYLPKMEHH
LEAKLWNDVF NVAQDSLAVS RGTIRATVLI ETLPAAYQME EIIFQLRNHS AGLNCGRWDY
IFSTIKRLRN DPSKILPDRD QVTMTVPFMK AYCERLINIC HRRQVHAMGG MAAQIPIKND
PEANKVAISK VKNDKLREAT MNYDGTWVAH PALAPIANDV FNEHMPTPNQ IHIVPDEDVS
EADLSNTAIA GGKITTEGIR KNLFIALSYI ESWLRGVGCV PINNLMEDAA TAEVSRLQLY
SWVLHSVKME DSNKTVTPDL MSSILEEEVE KLTEQFGSKG RKFKEAARYL EPEITGKSVS
EFLTTLIYDS VVTVGKPIDL EALKD