Gene PICST_76282 details

Gene Information

Locus tag: PICST_76282
Symbol:
ID: 4837585
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 2076311
End bp: 2079516
Gene Length: 3206 bp
Protein Length: 1046 aa
Translation table: 12
GC content: 41%
IMG OID: 640388900
Product: predicted protein
Protein accession: XP_001382625
Protein GI: 150863965
COG category: [R] General function prediction only
COG ID: [COG1026] Predicted Zn-dependent peptidases, insulinase-like
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
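
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon (an assumption; the record does not state its coordinate convention). The function name span_length is hypothetical. It also compares the gene length with the coding length implied by a 1046 aa protein plus one stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields.
gene_length = span_length(2076311, 2079516)

# Coding length implied by the protein: 1046 codons plus one stop codon.
cds_length = (1046 + 1) * 3

# Bases in the gene span that are not accounted for by the coding region.
extra_bases = gene_length - cds_length

print(gene_length, cds_length, extra_bases)
```

Under that assumption the span is 3206 bp, which matches the Gene Length field. It exceeds the 3141 bp implied by the protein by 65 bp, which suggests that the listed gene sequence includes some flanking bases around the coding region.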


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.967871
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.769643
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CCATATTTAG CAGCCGCAAA CCATTATGTT AGCCGCTCGT TCCCGAAGCA TAGTCAACGC 
AGCAAGACGG AGCTTTGCTA CGGCAGCTTC TCCCTCTGCA GCAACTTCCG CCATTCTCTC
CAAATACCCC ATTGGCCTCA ACCTCTATGG GTTTGTCATA GACAATGTTC AGCCCATTCC
TGAGTTTTCG CTTGTGGCTG TCCATCTAAA GCATGAAAGA AGCGGAGCAG CACACCTTCA
CTTGGACTCA CCTACAGATA ATAATAACGT GTTTCTGATT GCGTTCAAAA CCAATCCTCC
AGATGCTACT GGTGTACCTC ATATCTTGGA ACATACCACC TTGTGTGGTT CGTATAAGTA
CCCTGTACGA GACCCATTCT TCAAGATGAC AAACAGGTCG CTTTCCAACT TCATGAATGC
CATGACTGGC CACGACTTCA CCTTCTATCC GTTTGCCACG ACTAACGCAA AGGACTTTGA
CAACTTGATG GATGTCTATC TTTCATCGGT ATTCGAGCCT CTTTTGTCAT ACAATGATTT
CATCCAAGAA GGATGGAGGT TGGAAAATGA GGATATAAAT GACCCTGAGA GCAAGCTCGA
ATTGAAGGGG GTAGTATACA ACGAGATGAA GGGCCAGAAT TCCAACACGT CGTACTACTT
CTACATCAAG TTCTTGGAAT CCATATATCC TTCTTTAAAC AATGCTGGTG GGGATCCAGC
AAAAATCCCT GACTTGCAGT ACGAAGACTT AGTCGATTTC CACCACAGAA ATTACCATCC
TTCCAATGCC AGAACTTTCA CGTATGGTAA TTTGCCGTTG ATGAACCATT TGAAACATTT
GAGCGACTAC TTCCAGACGT TTGGCGTCCG TCCTCAATCC AAGGATTTGA AATTGCCAAT
CTTCTCCAAC ACGACTTCGC CATCGTCAAC CACCGTTGTC AAGGTTCCTG GTCCTGTAGA
TACCATGTCG TCGAAACCAG CTGAACAACA GTACAAGGCT TCGATCACAT GGAACTTAGG
AAATGCTCTC GAAGAAGCCA ACCAGTACGA GTTGTTCAAA TGGAAAATCT TGAGTTCTCT
TCTCTGTGAC GGCCATAATG CTCCATTTTA TCAAGAGCTT ATTGAAACTG AGTTTGGTGA
TGACTTTTCA GCAAACTCAG GAATAGATGC CACGACTGCT TTGATATCGT TCACTATTGG
GTTGAACAAT TTGACTGTAG AAAAAGTTGG CCAGTTGGAA TCCAAAGTGT TGGATATTAT
TAGAAACAAG GTGCTACCCG AGTTCGAAAA TCCGGAAAGC TCCTACAAAA CTAGAATTGA
AGCTATTTTG CATCAGATAG AATTGAACTT GAAAAAACAT AAGCCAGATT TCGGTTTGAG
CTTGTTGAAT GTGATCGTTC CTACATGGGT GAATGGCTTG GACCCAATCA AATCGTTAAG
GGTTGAACCG ATTTTGAACC AGTTCAAGTC TGACTTTGAG TGCAAGGGAT TACTTGTTTT
CAAGGAACTC CTTGATAGTT CAATTTTGAA TCCACAATGT GAAAAGTTCT CGTTTGTCAT
GGAACCCCAG AATGAATTCA ACAAGAATTT AACCACAGTT GAAGCTGAGA GAGTAAAGAC
GATGGTACAA AGCTTATCTG AAGAAGATAA AAAGATTATA AACGAAAGAG GCCAAGAGCT
AGCTAGAAAA CAGACAGAAG AACAGGACGG AGAAGTATTG CCTACTTTGA CCATTAAGGA
TATACCAGAG AAGGGAGATT TCCATCCACT TTTGTACTCG CAAATTGGCT CCAATACATT
GCAAAAGAGA ATTGTAGATA CAAATGGTTT GGTGTACACT GCTGCTTTAA AGAATATCAA
ATATTTGCCA AAGGAGTACT ATAAATACCT TTCTTTGTTC AGTTCCTGTC TCACCAATTT
AGCAGGGACT TCTCATACAT CCGTTACCGA TCTTGAGACT AAGATAAGCA GATTGACAGG
TGGTATCTCC TTTAGCCACA GAATATCAAC CAATCCCTAT GATATCAGAG AAGTTGACTT
GTACTTTCAG ATGTCGGGAA TGGCTCTAAA AGAGAACTCT GAACATATCT ACAACTTGTG
GCATGAAGTT CTTACTGATA CGCAATTTGA TTCCAATGAC GAATTGGTCG TTGACAAGTT
ATTCACTTTA ATCAAGAATA TGAGCCAGAA CCAGCTAAAT GTAATAGCAG ACAGGGGTCA
TTCATATGCG AGTGGATATT CCAATGCCAA GTTGACTCCT GCTAGGTATA TTTCTGATAT
TACCTCAGGA ATTAGCCAAG TGGAATTTAT AATGGAGTTA AATTCAAATA TCGAAAAGAA
GGGTAAGCAG TATCTTGCTG ACGAAATTTT GCCAATTTTA AAGGAAATCC AGTTATTAAT
CATTGACAAT GCCAAGGGCG AATTCAAGTA TAGATTGGTA GGTGACAAGG ACATTATTGG
AGAGAATGAG AAGTTGGTAC AAGAATTTAA CGACAAAATT TCACATTTCA GCAACACTTC
TAGTCAAGGA AATGCGTCTG AATTGAAGAA TTTGGTCAAT GCTTTCAATA ACAACAATTT
GGGAATAAAT GCAAACCAAA ATACCTTGCT TAACTTACCT TTTCAAGTCA GCTATGCTTC
TCTCGGGAAA CTTGGTGCAG AGTATGCTTG CAAAGATGGA GCTAGTTTAC AGATATTGGC
CCAGTTATAC ACTTTCAAGC ATTTGCACTC TGTAATTAGA GAATCTAATG GAGCTTATGG
TGGAGGCTTG CTTTATGATG GATTGGGTGG TACTTTGAAC TTCTATTCCT ACAGAGATCC
TAATCCGTTG AAGTCAGTAG AATCTTTTGA GAAATCGTTT GATTTTGGGC TTCTGGTAAA
CTGGGAGCCT AAGGACTTGC AAGAAGCCAA ATTGAGAATA TTCCAGAGTG TTGACGCTCC
TACCAATATT GCTAGTCAGG GTTCTACTGA ATTTTACGAA GGAATCACAG ACGTGATGAG
ACAGGAAAGA AGAGAGAATT TCTTGAGCGT GAACAACCAG GACTTGACGA ACGTCATCCA
GAAGTATTTG GTTGACCAGA AGGATAACCT GGTTACAGTT ATCGGTGATA ACACAACTTT
AAATGTTGGA AAGGACTGGA CTGTCCAAGA GTTGAATGTT AACTAAACCT GTAAATAGAT
AATAGACGTT ATGTATATAT TAATAT
 
Protein sequence
MLAARSRSIV NAARRSFATA ASPSAATSAI LSKYPIGLNL YGFVIDNVQP IPEFSLVAVH 
LKHERSGAAH LHLDSPTDNN NVFSIAFKTN PPDATGVPHI LEHTTLCGSY KYPVRDPFFK
MTNRSLSNFM NAMTGHDFTF YPFATTNAKD FDNLMDVYLS SVFEPLLSYN DFIQEGWRLE
NEDINDPESK LELKGVVYNE MKGQNSNTSY YFYIKFLESI YPSLNNAGGD PAKIPDLQYE
DLVDFHHRNY HPSNARTFTY GNLPLMNHLK HLSDYFQTFG VRPQSKDLKL PIFSNTTSPS
STTVVKVPGP VDTMSSKPAE QQYKASITWN LGNALEEANQ YELFKWKILS SLLCDGHNAP
FYQELIETEF GDDFSANSGI DATTALISFT IGLNNLTVEK VGQLESKVLD IIRNKVLPEF
ENPESSYKTR IEAILHQIEL NLKKHKPDFG LSLLNVIVPT WVNGLDPIKS LRVEPILNQF
KSDFECKGLL VFKELLDSSI LNPQCEKFSF VMEPQNEFNK NLTTVEAERV KTMVQSLSEE
DKKIINERGQ ELARKQTEEQ DGEVLPTLTI KDIPEKGDFH PLLYSQIGSN TLQKRIVDTN
GLVYTAALKN IKYLPKEYYK YLSLFSSCLT NLAGTSHTSV TDLETKISRL TGGISFSHRI
STNPYDIREV DLYFQMSGMA LKENSEHIYN LWHEVLTDTQ FDSNDELVVD KLFTLIKNMS
QNQLNVIADR GHSYASGYSN AKLTPARYIS DITSGISQVE FIMELNSNIE KKGKQYLADE
ILPILKEIQL LIIDNAKGEF KYRLVGDKDI IGENEKLVQE FNDKISHFSN TSSQGNASEL
KNLVNAFNNN NLGINANQNT LLNLPFQVSY ASLGKLGAEY ACKDGASLQI LAQLYTFKHL
HSVIRESNGA YGGGLLYDGL GGTLNFYSYR DPNPLKSVES FEKSFDFGLS VNWEPKDLQE
AKLRIFQSVD APTNIASQGS TEFYEGITDV MRQERRENFL SVNNQDLTNV IQKYLVDQKD
NSVTVIGDNT TLNVGKDWTV QELNVN
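
The record lists translation table 12. In NCBI's numbering this is the alternative yeast nuclear code, which differs from the standard code in reading CTG as serine rather than leucine. The Python sketch below illustrates the effect on two short fragments copied from the gene sequence above: the first begins at the ATG in the first sequence line, the second contains a CTG codon. The table string, the CODONS dictionary and the translate helper are written here for illustration and are not part of the record.

```python
BASES = "TCAG"

# Amino acids for all 64 codons in TCAG order (first, second, third base).
# This is the standard code with one change: CTG encodes S instead of L.
AA_TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODONS = {
    a + b + c: AA_TABLE_12[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 and stop at the first stop codon.

    Whitespace is ignored, so the space-separated blocks of ten bases
    used in the record can be pasted in directly. Codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODONS.get(dna[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the coding region, from the first line of the gene sequence.
cds_start = translate("ATGTT AGCCGCTCGT TCCCGAAGC")

# Fragment containing a CTG codon, from the fifth line of the gene sequence.
ctg_fragment = translate("GT GTTTCTGATT GCG")

print(cds_start, ctg_fragment)
```

The first fragment gives MLAARSRS, the opening residues of the protein sequence. The second gives VFSIA, matching the NVFSIAFKTN stretch in the protein, whereas the standard code would give VFLIA.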