Gene PICST_28701 details

Gene Information

Locus tag: PICST_28701
Symbol: LYS21
ID: 4851460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1865914
End bp: 1867269
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table:
GC content: 46%
IMG OID: 640393168
Product: homocitrate synthase
Protein accession: XP_001387991
Protein GI: 126274588
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR02146] homocitrate synthase
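
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS includes its stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(1865914, 1867269)
print(length_bp)                           # 1356
print(expected_protein_length(length_bp))  # 451
```

Both values match the Gene Length and Protein Length fields of this record.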


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.453642
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAATGG ACTCGGTAAT CGACGGGTTT CTCAAGTTTG ATCTGGAATA TACGGAAATA 
ACGTACAATA ATCCGTATGG TCCCAATCCT GCTGATTATC TTTCAAATGT ATCACATTTC
CAGGTTATTG AGTCAACATT ACGAGAAGGG GAGCAATTTG CAAATGCTTT TTTTCTGACG
GACACAAAGA TAGCCATTGC AAAGGCGCTC GACGACTTTG GTGTCGATTA TATTGAGTTG
ACATCTCCTG TAGCCTCTGA GCAGTCTAGG AGAGACTGTG AGGCCATATG CAAGTTGGGT
TTGAAAGCGA AAATATTGAC CCATATACGG TGTCATATGG ATGATGCACG AGTAGCAGTT
GAGACGGGTG TGGATGGTGT TGATGTTGTT ATAGGAACAT CTCAGTTTTT GCGAGAGTAT
TCGCACGGGA AGGATATGAC ACACATCACC CAGAGTGCCC TAGAGGTGAT CGAGTATGTG
AAGTCTCATG GAATAGAGAT TCGTTTTTCT TCAGAGGATT CGTTTCGGTC GGAGTTGACT
GACTTGCTTA GCATTTACCG GGCTGTAGAC AAAGTCGGTG TAAATCGTGT AGGTATAGCA
GATACTGTTG GATGTGCCAA TCCCCGGCAA GTTTATGAGT TGGTAAGAAC TTTAAAGGGT
GTAGTGAGTT GTGATATCGA GTGTCATTTC CACAATGATA CTGGTTGTGC TATTGCCAAT
GCGTACACTG CTTTGGAAGG TGGAGCCAAG TTGATCGATG TGTCGGTATT GGGCATTGGC
GAGAGAAACG GCATTACTCC TTTGGGAGGA CTAATGGCTC GGATGATCGC AGCTGATCGT
GACTACGTCC TCTCCAAATA CAAGTTGCAC AAGCTACGAG ACATTGAAAC GCTTGTGGCC
GAGTCTGTGA GAGTCAACAT CCCGTTCAAC AACCCTGTGA CTGGCTTTTG TGCTTTTACT
CACAAGGCTG GAGTTCATGC CAAGTCGATT TTGGCTCCTC CTTCGGAGTA CGAGATATTG
AGTCCCTCGG ACTTTGGTTT GACCAGGTAC ATCCACTTTG CCAACCGGTT GACGGGTTGG
AATGCCATCA AGTCACGAGT GGACCAATTG AATTTAGATC TCAGTGACGA ACAGTGCCAA
GAAGTAACGA TGAAAATCAA GAAACTCGGC GATGTACGTC CCTTGAACAT CGACGATGTG
GATTCCATCA TCAAAGATTT CCATGCCAAT GTGACCACAC CTGTTGTACG TCCCGTGGGA
ATCAACAGTG ATACAGCTCC GCGAGTACCA CATAATCTCG AGAGATTGGA TGGGAATGGC
GTAGTAGCCC GGAAACTATT GGGACGTCGT CGTTGA
 
Protein sequence
MEMDSVIDGF LKFDLEYTEI TYNNPYGPNP ADYLSNVSHF QVIESTLREG EQFANAFFLT 
DTKIAIAKAL DDFGVDYIEL TSPVASEQSR RDCEAICKLG LKAKILTHIR CHMDDARVAV
ETGVDGVDVV IGTSQFLREY SHGKDMTHIT QSALEVIEYV KSHGIEIRFS SEDSFRSELT
DLLSIYRAVD KVGVNRVGIA DTVGCANPRQ VYELVRTLKG VVSCDIECHF HNDTGCAIAN
AYTALEGGAK LIDVSVLGIG ERNGITPLGG LMARMIAADR DYVLSKYKLH KLRDIETLVA
ESVRVNIPFN NPVTGFCAFT HKAGVHAKSI LAPPSEYEIL SPSDFGLTRY IHFANRLTGW
NAIKSRVDQL NLDLSDEQCQ EVTMKIKKLG DVRPLNIDDV DSIIKDFHAN VTTPVVRPVG
INSDTAPRVP HNLERLDGNG VVARKLLGRR R
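
The sequence blocks above are printed in space-separated groups of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, with a simple GC-fraction helper of the kind that could be compared against the GC content field. The helper names are hypothetical, and the example runs on the first line of the gene sequence only rather than the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as printed in this record.
first_line = (
    "ATGGAAATGG ACTCGGTAAT CGACGGGTTT CTCAAGTTTG ATCTGGAATA TACGGAAATA"
)
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence block through `clean_sequence` in the same way would give a string whose length can be compared with the Gene Length field.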