Gene PICST_32201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32201 
Symbol 
ID4839587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp931295 
End bp933544 
Gene Length2250 bp 
Protein Length749 aa 
Translation table12 
GC content40% 
IMG OID640390902 
Productpredicted protein 
Protein accessionXP_001384846 
Protein GI150865575 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.214118 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTTGC GAGGCAGAAG TCGTCTTGTA CGACATTTGG ATCCACAAAA GTCGCAGCAT 
GTATTTTTCA GTGTGTTTCC AGCACTTTCC AGTCTGAAAA ATGTGATTCC TACCATTCCA
GTGCCTGAAC TCCAGGATTC TTCCACGATT AGAACTTTTC AGCCGCAGAG CTCTAGACCC
CCAAGCAGTA GGAAGCAAAT CTTCCTTGAG CCTCGAGAAA CCTTTAATCG AAATTTCAAC
TCGAGGATCG AAGACATCAA AAGTTCGGGA CTAGAGAACC CCATCGAATC CAAGTCTATG
ACTTTTATTG AATCTGAAGT CGATATTGAC GAAGCATCTC CAGAAAGATT AATTGGAATA
ATAGACAAGT TCAAGCCAGC TGTAGCGGAG GTTAACCGAA TGAGATTTCG TAGCATTCAA
CAGGAATTAG AAAAAGCCTT TTCTCAGCAG CAATTGGCTT CTTATTTAGA ACAGAACTAT
TCCAAAGTTG AATTTGCAAA TAGTAGGAGA TCCACCAGAC GAGTTTCAAA GAAGAAGCTT
GCGCAGCGTA TTATCAATGA CATTTGGGGT ATAAAAGTGA ACTCGAAATT GACGGAAACG
AGTGATTTGA TAACAAAAAA GACTGTACCC CTTTCAGATC TTGATATCTT TTTGATCAGA
CTGAATCGAA GACTTTTGCA TTGGCTCTCT TCCAATGGAT GTAGAATTAA ACTTGTGAAG
GACAAGAAGG AGTTGGAGAT TTCTGGGACA ACCTCCCAAG TCAGTAATGT AGAAGTCAAT
TTGATTGAGT TCTTGAGACA ATCTGAAAAA CAGGAACTCA ACTTGACCGA CATAAAGCAT
TTGTTTCAAG AAAAATACGG CCGATTTTCC CTTGAAAAAG TCGCTGGATA TACCGATGTT
TTTTTCAAAG ATGTAGGAAA TGATATTTAC GAGGTTCATT CACAGACTCC ACACCAAGTA
GAGCGCTTGA AAAGGTTGCT TCTTTGGTCC TTAAACTACA ATAAGCATTC TAACGAAGTA
ATTACACTTC CAGAAGAGCC AGAAAAGAAG CAGCTCTTTC CATTTAAGGA CGAATTGTCA
TTGTCGTGGA ACAACAGGCG TCAGCCACTC TTCACGTTGA AAGATTGTGG TTCTTCCAGA
TCAGAAACGA ACCATCGTTT ACTTTCTGAT TTGCAAAGAT ACGACGATGC CAATATGAAC
ATAGATGCAG AGATACCTAA TAAAGAAAAT GATAGTGAAG TTGATACCAT CATAAAAGAT
GAAGACGCCC CTATACTTTC TCAGAATAAG ATCAACGACA TCTACGATCA GTTATACGAC
TTCGACTACA GAAAGAATTT GATCAGTTTA GAAACTACGC TGTCTCCTGT CTTTACTATA
TCGCTTGGCA ATATTCTTTT CGAAGGAGAA CAGCAAGAAA CAAAGAGTGC AATGAGCAAA
CTTTTCCCTA CACCACCAGA ACTTTCTGAA ACAACCAAGT TTAAGTTCAA CTCTAACGTT
CAGTTGGTAG CTGACAAAGC ACTTTCTCTT CCTATCCACA GTACCAGTCT GACATCAGGT
TCAGATGTAT TGGATGACAT TTTCAGAGAT GAACGTTACA AGAATTCAGT CCAAATCACG
TTCTTGCCTT CTTTATTTGT AGAAGACAAC AAGGATATCA AGATGGAAGA CTTAACCAAG
TTTCAACCTG TGGAGTTGTG GGTTGATCTT AAGCACAACA TGACTCCAGA CATGGACACT
TTACAACTTG TAACAGTAGA AGGCGAGAAT ACCAATTACG TTTCGCTACC ATCTTTTAAG
TCAGACTTGA AGGTGAGCTG CCAATTATCA GGAAATCTTC TACAAGAGAA AGGCGAATTG
GCTCAAGACT TTGAGCTCAG TATTGACGAT ATCTTGAACT CACAAGCCGA CAGGTATACA
CGGTTCAAGT CTCAACCTGG ACTTGGTGGC TTTCTCCTGA GATCGAAGCT TGATTTCCAG
GAAGAGACTT CAATTTCGCC ATACATAGAC TTGAACATCA ATGGCGATAT TGTCAGGTAT
AGATTCATTA GTATGGAATT CCGCAAGAAG TTGAGTTTTG ATTTCAACGG TCGCGAAGTC
CAGTACAACA TGGTAGAGGG AGGTAATCTT GGAGGACGTA AGGTCGAGGT ACTTTTTGTA
GGAGACATGG CATTGGATGC GTCTGGTGAA GAGAGGAAGA GGTTCGGACA ATTGATGAAC
GATGCTGTTC TGTTTGTGAG CGAGCTCTAA
 
Protein sequence
MLLRGRSRLV RHLDPQKSQH VFFSVFPALS SSKNVIPTIP VPELQDSSTI RTFQPQSSRP 
PSSRKQIFLE PRETFNRNFN SRIEDIKSSG LENPIESKSM TFIESEVDID EASPERLIGI
IDKFKPAVAE VNRMRFRSIQ QELEKAFSQQ QLASYLEQNY SKVEFANSRR STRRVSKKKL
AQRIINDIWG IKVNSKLTET SDLITKKTVP LSDLDIFLIR SNRRLLHWLS SNGCRIKLVK
DKKELEISGT TSQVSNVEVN LIEFLRQSEK QELNLTDIKH LFQEKYGRFS LEKVAGYTDV
FFKDVGNDIY EVHSQTPHQV ERLKRLLLWS LNYNKHSNEV ITLPEEPEKK QLFPFKDELS
LSWNNRRQPL FTLKDCGSSR SETNHRLLSD LQRYDDANMN IDAEIPNKEN DSEVDTIIKD
EDAPILSQNK INDIYDQLYD FDYRKNLISL ETTSSPVFTI SLGNILFEGE QQETKSAMSK
LFPTPPELSE TTKFKFNSNV QLVADKALSL PIHSTSSTSG SDVLDDIFRD ERYKNSVQIT
FLPSLFVEDN KDIKMEDLTK FQPVELWVDL KHNMTPDMDT LQLVTVEGEN TNYVSLPSFK
SDLKVSCQLS GNLLQEKGEL AQDFELSIDD ILNSQADRYT RFKSQPGLGG FLSRSKLDFQ
EETSISPYID LNINGDIVRY RFISMEFRKK LSFDFNGREV QYNMVEGGNL GGRKVEVLFV
GDMALDASGE ERKRFGQLMN DAVSFVSEL