Gene PICST_41012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_41012 
Symbol 
ID4837605 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp105792 
End bp108782 
Gene Length2991 bp 
Protein Length847 aa 
Translation table12 
GC content41% 
IMG OID640388920 
Productpredicted protein 
Protein accessionXP_001382783 
Protein GI150864087 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.164038 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AAGACGACAA AGTCCAGAAA TGGCTGTATT ACATGTAAGA GAAAACGCTT GAAATGTGAC 
GAAACCAAAC CAGCCTGTCT CAACTGTAAG AAAAGGAATA TTGAATGTGG AGGCTATGCC
ACTAACTTCA AATGGAGATC TTTTGGTGAA ACCGACTCCA ATAGTACTGT AACCAGTTTC
ACAACATCTA CAAAGGACAA AAGAACATCC ATTTCCGTCA TCTCTACGAG CCCAACTTCG
ACGTCTGCAT CAATATCTTC AACCGCATCT GTACCTTCAT CAAACTTGAC ATCTGCTTCT
TCTGAGCAAA AGTCAAATAG TCTCAAGCGT CATCTAGAGC TTGCCTCGCT TTCTGTCACT
GGAAGGACCA TTGATGATAT CAAGATCGAA AACGACTTAA TCTCGAAAGG AATAAATCCA
CACAGTTATA AAAGGAAAAA GCACCATTCC GATTTCAGTA GCCAACACAT GATCCATGCT
GCCTCTGTAG ATTCTATACC GAGAAGAAGC AATAGCATGT CTAAGGATAT GTCTCCAGTA
GCCAGAGAAT CGCTTGTTAG ATCCTTCAGC ACGAACTCGA CTATGGAATC TGGGAATTTG
ATAACTTCAT TAAGGGAAGA ATACTCTCGT GATACTTCCG CCCGTCTTGA TTCTCTAGCT
GATGCTGCAG TAGACGAAAT GAATCGTTCT CCATCGTCTA TAAAGAAAGA ATCTATAGCC
GAAAGTCCGG CTCTGCAACT TCCTTTTTCA CCTAACTTCG GTGATTTTCT TACTGTTCGT
TCTCCGGCTG GTTCCGTAGA CGAAGTGTCA AGTAGTCATA CAGATAAGGC ACAAGCTGCC
CCAACGAGTT TGCTACTCTC CAGAGAGGCA GAGATCTTAA ACGAAATAAA CCTCACACCT
TCGCTCTCTG CTATAATCAA TTTTGCTTTC AGTGTAGACG ATCCAAAGGA TGCCTTCATT
GATGGAAAGA CCAAGTTTCC CTTTGAAGGT CCATTCAGTC CTTTGACGTT GACATTTCCA
TATGATAACC ACGAAAATGG AACTGAGTCA TCCACAGCAA ACAAACAAGA TTCACCTTTG
TCATTTGGAA AAATGGCCCT TAATATGAGA GATATATCTT CTCCTGTAGC ATCTGTAATC
ACACCATTGT CGACCATACC CGATAGCCAG CTTAACAGGT CCTTAGTTAA GACATCAGAA
CAAGAGCAAA TCTTGCATTT ATACTCTCGG TACACTTGTT GCATCATGTC TATAAAGAAT
GGAGCCAACG AAAATCCCTG GCGAAACTTG ATTGTGCCTC TTGCTACTAA GTATTCGTGT
TTATTCAACT CGATTGCTTC TATGACATTA TTTCATTTAG CAGGTAACAG CGATTTGAAA
GGAAATGGAG CTGATATGAG AGCTAAGGGG TACATCTACT TAAGGAGATG CATCCTTGAG
TTGGCCAGTG GACTTTCCAA AGTCAATGGC GAGGGTGAAG ACAACGAAGA ACTTCCAGCC
GATATTGCAC TTGCCACTTG CCTAAATTTG GCCGTTTGTG AATCTTGGGA TATACAAACT
TCTAGCGGAA TAGCTCATCT TAAAGGTGCC AAAAGTATGA TCCAGAAGGT CTTGACACTC
TTGAGAGACC AACAGCAGTC TTTATCAAAA AAGAGGAAGG AGTTGGAACT TGTTTCAAGT
CCAGCAGAAA GCGAGCATCA GTATAATGAA CTTGTTATTT CTAAAAAGAA AGACTTGGAA
AAGAAATTAG TATTAGTAGA AGACAACGAG TGGGAATGTT TATTTGAAGA TGCTGGTAGC
CAAGAAGAAC CATCAGGACA GGGATTTGCC AATAGAGTAG ACAAGAGTAG CATTCAAATT
CCCAAGAGCT TACAATTCCT CTTCAATATC TGGATATATT TTGAGGTATT AGCTCAAATG
ACTACAGACA CCAATTATGA CGACAAGGGA GTGGATTTAG TGGCTACAAT TACCACTATG
TTGCAGACAT CTCAAAAGAA CCACAGGAAA AAGAGTGGTG GCAGTATTCT GCTGCTAGCA
AGCGAACACG GTGACGTCGA TGCTGACACT GGTTCACACA AATCAGATTC AGGAGAACTG
AGCGTCACTG AATCGATTAC TAGAAATGGA TTTAGTCTTT TCGAGAACTT TGACTCCTTT
AGCTACAACA CTGAGTATGT AGATCCATTA TTGGGATGTG CTCAATCACT CTTTCTGATC
ATGGGTAAAG TTGCAAACTT GATTGCCAAG ATTAGGAAGT CTAGAAGAAA GGAAACAGAG
AGTCAAAACT GGAAAGGAGC AAAACCAAGA AATAGTCTCA AAACCATAAC TCTTGCTACC
GAATTGAAGC AGCAACTTGT TAACTGGAAA CTGACTATAA CAGCTTCCAT GATTAACCAA
GCCAATTTGT ATGACGACAA CAATGGAAAC ACTTGGGATC TTCCCTCGTG TATTGCAACA
GCAGAAGCTT ATAGATTTGC TACAATTATC TATTTGCATC AGGCAGTTCC TGAAATTCCA
TGTCCTTCTA CACACTCATT AGCAGAGAAG ATATTTATAC TCTTTGCATC AATTCCAACA
AATTCCGATT TGCATGTCAT TCACATCTTT CCCCTTCTAG TTGCATCTTG TGAGGCAGAA
CCGGGAGATG AAAGAGAATG GTGTGAGACG AGATGGAAGT TGCTATCTGA AAAGATGTGG
ATTGGTAACA TTGACCGTGC ATTGCAGGTT GTCAAAGAGG TATGGAAGAG AAAGGATGAT
TACAAGAAGA AGAGGAGAAG AGGAGAGTTT GACGAAGCTA CGATGAAAGG TGGTGCCAAT
AAAGAAGATG AGGACTCGTT GAGAAATATT TCTGCGCAAA TCAGCGGACT TATGTCTGTT
ATCAATGACC TGAATGGCTC TTCGCTGGAA GATATTCGTG GTGGAATTGG TTCAAAACTC
CATTGGAGCA CCGTGATGAA AGAATGGGGG TGGGAAGTCT TGTTAGGTTA G
 
Protein sequence
KTTKSRNGCI TCKRKRLKCD ETKPACLNCK KRNIECGGYA TNFKWRSFGE TDSNSTQKSN 
SLKRHLELAS LSVTGRTIDD IKIENDLISK GINPHSYKRK KHHSDFSSQH MIHAASVDSI
PRRSNSMSKD MSPVARESLV RSFSTNSTME SGNLITSLRE EYSRDTSARL DSLADAAVDE
MNRSPSSIKK ESIAESPASQ LPFSPNFGDF LTVRSPAGSV DEVSKILNEI NLTPSLSAII
NFAFNISSPV ASVITPLSTI PDSQLNRSLV KTSEQEQILH LYSRYTCCIM SIKNGANENP
WRNLIVPLAT KYSCLFNSIA SMTLFHLAGN SDLKGNGADM RAKGYIYLRR CILELASGLS
KVNGEGEDNE ELPADIALAT CLNLAVCESW DIQTSSGIAH LKGAKSMIQK VLTLLRDQQQ
SLSKKRKELE LVSSPAESEH QYNELVISKK KDLEKKLVLV EDNEWECLFE DAGSQEEPSG
QGFANRVDKS SIQIPKSLQF LFNIWIYFEV LAQMTTDTNY DDKGVDLVAT ITTMLQTSQK
NHRKKSDSGE SSVTESITRN GFSLFENFDS FSYNTEYVDP LLGCAQSLFS IMGKVANLIA
KIRKSRRKET ESQNWKGAKP RNSLKTITLA TELKQQLVNW KSTITASMIN QANLYDDNNG
NTWDLPSCIA TAEAYRFATI IYLHQAVPEI PCPSTHSLAE KIFILFASIP TNSDLHVIHI
FPLLVASCEA EPGDEREWCE TRWKLLSEKM WIGNIDRALQ VVKEVWKRKD DYKKKRRRGE
FDEATMKGGA NKEDEDSLRN ISAQISGLMS VINDSNGSSS EDIRGGIGSK LHWSTVMKEW
GWEVLLG