Gene PICST_62151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_62151 
SymbolGSH1 
ID4840141 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp213608 
End bp215911 
Gene Length2304 bp 
Protein Length738 aa 
Translation table12 
GC content45% 
IMG OID640391456 
ProductGlutamate--cysteine ligase 
Protein accessionXP_001385377 
Protein GI150865952 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.146555 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTGT TATCTTTGGG AACTCCCCTT GATTGGCACG AGTCCAAGAA ACACAACGAA 
CATGTTCGGG AGAACGGAAT CACCCAGTTG ATCAACATCT TCCGCCAGCA TGCCTCACGC
TCGAACGACA AGTTCCTCTG GGGGGATGAG GTTGAGTACA TGCTTGTAGA CATAGACAGC
AAGAACAAGA CAGCCAGATT GTCCATCGAC AAGGACTACA TCCTCGACGA CTTGAACGAC
CCTGACAAGT CGTTGCACAA GCTGGTGGAT AACAATATTC TGTTCCATCC AGAGTACGGT
CGGTTTATGC TTGAAAGTAC CCCTGCTTCA CCTTATAATG GTACTTGTTT GAAAGACTAC
GTTTATGTAG AGGAGAATAT GATCGTCCGT AGAAAGATCA GCGAGTCTGA GTTGCCTCCC
CACATCAAGC CTTTGACACT TACTTCGTTT CCACGTATGG GATGTAACAA CTTCACGTCT
CCTCCAGCCA AAGCCATTGG CCCTGCTTCG CAGTCGTTGT TTCTCCCCGA CGAGATCATT
AATCGTCATG TGAGATTCCC CACGTTAACA GCCAACATCA GAAAAAGAAG AGGATCCAAA
GTGGCTATCA ATTTGCCCAT GTACCCTGAC GTCAATACGA AGTTGATTGA CGACTCCATT
CCTAGAGACA GAGACTTGTT TGTCAGCGAT AAAGAGCCCT GGCTCGGTGC TGCTAAACCC
GGACATGTCT ATATGGACTC GATGGGGTTT GGTATGGGCT CCAGCTGTTT GCAGATCACA
ATGCAAGCCC AGGACATCAG CGAAGCCAGA TATCTCTACG ACACTTTGGC ACCCATTACG
CCTATCATGT TGAGTTTGTC AGCTGCTTCC CCTATATTCA AAGGCTATCT CGTCAATCAA
GACGTGAGAT GGAACGTAAT CAGCGGTGCT GTCGACGACA GAACATTTGT AGAACGTGAC
GTAGAGCCAT ATAATGGCTA CGATCTATTT GGAGGCATGA ATGTAGACAA AAGGCAGCAC
TTACCACTGC CAAAGCAGTC AGTAAACAAG TATGGAGACA TCAACAACTT GACTACCAAA
GATGGGAAGC CGATCCAGAA GATTCCGAAA TCGAGATACG ACTCCATTGA CAACTACTTA
AACGATCAAA ACTACTCGAC TAACTACTTC AAAGACGAAT ACAATGATAT CTACTCGCCC
ATCAACGAAA AAGTCTACAA ACGTTTGTTG GAACAGAACT CCGACTTGTT TGACGAGTAT
ACAGCAAAGC ATTTCGCGCA TTTGTTCATC AGAGATCCTT TGGTATTGTT CAGTGAGCGG
ATAGATCAGG ACAACAGTAC GGAGAACGAC CATTTCGAGA ATCTCCAGTC CACAAACTGG
CAGACCCTCA GATTCAAGCC TCCAGCATTA TATTCTTCTG ACACGGATCT CAGCTCAAAG
CCCGGTTGGA GAGTTGAATT CCGTCCTATG GAGATCCAGT TGACCGACTT CGAAAATGCA
GCATACTCTA ACTTCATTTC TTTGCTTTCG AAAGCCATCA TCAAGTTCAA ACCTAATCTC
TACATTCCGA TCCTGAAGAT CGAAACCAAC ATGAGATTGG CTCACAATGT TGATTCCGTG
CTTAACGAGA AGTTCTGGTT CAAGTCCTTA GACCAATGGA ACTTGGATAA CAGCGACTTT
GTTGGCTATG ACCTCTCATG GTTCGACAGA TTCTTAAACA AGGGCAATGA CGAATTGGGC
TACGATGAAG TGTACGTCAA TGGCTACTCC ATTAACGGTA CTAATACTAC CGGTGAGTCT
GAGGAAGACT TGTTGTTGAA CGGTACAATC AATGGTTCCT ACGGACACAA GGGTTTCTAT
AGAGGGAGAA GAAGAACCAT TGAGGCCGTC AACGATATCG ATGACCATGG AATCGACGAA
AGATATTCTA TCAATGAGAT CATCAATGGG AATGACAAGT TCCCAGGTTT GATCAGATTG
GTCGTCAAGG TTATTGCTAC AGATTTGCTT CCTGAAGGAT CTCACCATTG TGAGAACTCT
GACTTGGCTA AGAATTTGAC TAGAATTCAG AAATACTTGC AATTGATTTC TCAAAGAGCA
AGTGGCAAGG TTCCCACGAC AGCCAACTGG TTGAGAAACA GTGTGTTAAG ATCTGACTTC
TACAAGCGAG ACTCGAAGGT AAGCGAAGCT TTGAACTACG TTCTTGTTGA GAAAGCTGCT
GCCATCACTG ACTTGAGGGA CAAGGACTTG TTGATAGACT TGTTGGGAGA GACAACAACC
GAATATTTGT TGAACAGCAT TTAG
 
Protein sequence
MGLLSLGTPL DWHESKKHNE HVRENGITQL INIFRQHASR SNDKFLWGDE VEYMLVDIDS 
KNKTARLSID KDYILDDLND PDKSLHKSVD NNISFHPEYG RFMLESTPAS PYNGTCLKDY
VYVEENMIVR RKISESELPP HIKPLTLTSF PRMGCNNFTS PPAKAIGPAS QSLFLPDEII
NRHVRFPTLT ANIRKRRGSK VAINLPMYPD VNTKLIDDSI PRDRDLFVSD KEPWLGAAKP
GHVYMDSMGF GMGSSCLQIT MQAQDISEAR YLYDTLAPIT PIMLSLSAAS PIFKGYLVNQ
DVRWNVISGA VDDRTFVERD VEPYNGYDLF GGMNVDKRQH LPSPKQSVNK YGDINNLTTK
DGKPIQKIPK SRYDSIDNYL NDQNYSTNYF KDEYNDIYSP INEKVYKRLL EQNSDLFDEY
TAKHFAHLFI RDPLVLFSER IDQDNSTEND HFENLQSTNW QTLRFKPPAL YSSDTDLSSK
PGWRVEFRPM EIQLTDFENA AYSNFISLLS KAIIKFKPNL YIPISKIETN MRLAHNVDSV
LNEKFWFKSL DQWNLDNSDF VGYDLSWFDR FLNKGNDELG YDEVYVNGYS INGRRRTIEA
VNDIDDHGID ERYSINEIIN GNDKFPGLIR LVVKVIATDL LPEGSHHCEN SDLAKNLTRI
QKYLQLISQR ASGKVPTTAN WLRNSVLRSD FYKRDSKVSE ALNYVLVEKA AAITDLRDKD
LLIDLLGETT TEYLLNSI