Gene PICST_35572 details

Gene Information

Locus tag: PICST_35572
Symbol:
ID: 4837870
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1070233
End bp: 1073619
Gene Length: 3387 bp
Protein Length: 1069 aa
Translation table: 12
GC content: 42%
IMG OID: 640389185
Product: predicted protein
Protein accession: XP_001383832
Protein GI: 150864847
COG category:
    [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
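
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the protein is followed by a single stop codon. The function names are illustrative only and are not part of the record. The leftover base count is an inference from the "Is gene spliced: Yes" field, not a value stated in the record.

```python
# Field values copied from the Gene Information block.
START_BP = 1070233
END_BP = 1073619
GENE_LENGTH_BP = 3387
PROTEIN_LENGTH_AA = 1069


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding bases needed for a protein, assuming one trailing stop codon."""
    return (protein_aa + 1) * 3


span = span_length(START_BP, END_BP)          # bases between Start bp and End bp
coding = coding_length(PROTEIN_LENGTH_AA)     # bases accounted for by the protein
noncoding = span - coding                     # bases left over within the span

print(span, coding, noncoding)
```

Under these assumptions the span matches the listed Gene Length, and the leftover bases are consistent with the gene being marked as spliced.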


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCACC AACGCCCGTC GGTTTCTGGC GGAACGTCCT CCACCGCTGG AACCAACGGC 
AACTCGGTGA CTACACCATC GTCTAGAAGA ACCAATTCCT CTTCTTCTTC ACTATTGTCA
TCTACTAAAC ACACACCATA TGAAGATCCC AGCATCAGTC TCATGACTCC CAATCACAAA
ACAACAAATA CAACAGCAGG ACAAACCAGT AACAGACCCA CTAGCCATCC TCGTAAAATG
TCACGAGAGT CGTCATTGGC CCAAACCCCA TCGATCTATA TCAACAACAA TCTGCTAGAG
TACTTCACCA ATAACCATTT TTCCCGGTCG TTCAACAGCT TGATCTTGGA CGATATCAAA
GACGAGCTCA ATCCACATCC CAGCGGGGGC TCTTCTCGTA AAAAGATCAA CAAGTCACCT
CCATTCAATG TAAAACAAGA TTACTATGGA GACAAAATCG ACGCCTATCT TTTCGAAGAT
GCCACAGAAA TAGACGGAAT GAATGATGAC AACGACAACG ATATAGAAAT TGGTCAAGAC
GATCTCGACG AAGAGGACGA ATTTGATGAT GACGATGATG AGTATAAGGA CGAAGATAAC
GATTACGAAG GACCCTTCTC TTCTACATTA AGCATCATCA AACCCAAACG AGAAAAAGTA
CCAAAGAAAC TGATTCCCCA AAACTTGGTA ACGAAATCTA AACCGATAGA TGAAACCTTA
TTGCATTCCG ATGAAGAATC AGAAAAGCTT AAATCGTCAC AACAGTCTTC TATAAAGAGA
TCTTCCAAGT TCCTCAACTT GTCTATAGAT TCGAATTTCA AAGCGTTGAG CAATAGAGTC
ATGGAAGACA TAGACTCCAT AAGCGACTTG AACGAAATCG ATTCCTCCAT TATCGGAGCC
GTAGCCCAGT CACTGAAACT ACACACCAAA GAACATACCC CTATTAGCTT GAGTCCCGTG
GCAACAAGAA CACCTTTGCA TCTTAACAAC ACCAACAAGT TCAAGCGACC CCATAAGTTG
GTCAGCCAGT CTCCATCACC ATCTTCCAAT AAAGGTTCAT CTACCACACT CTACCAACTT
AGTCGCTCTT CACCTGAAAG AGTGCTACGT CACAAAAGCA GCGCACTCGA CCAAGCCAAT
CTCTATTCGC CCTCGAAGTT AGGAATGAAA GGTTTCAAGA TGTTCAAAAA CGCCAATCGA
GACGCCATAA TATCACCTAA TAGGTCTACT CCAGAGAAAA AGATATCAAC CATTTTTGAT
ACCAAGAATG ATCACTCTAC TTCAAAGTTA AGAAAGACCT CATTAAACTA CAACAAGTCT
CCTTTCAACA AGCCAAACTC GTCACCTCCA ATACCCTCAC TAGAATACTA CAATGTAGAC
GACTTAGATG TTGATAGTCC TTCCAGAAGC AGGAAGTTTT CTAATTCTTC GAGCAACTCA
ATTATAATAT ATCAAGATGC AAATGAGCAA CATAAGAAGA AGACTCTTTC AACCTTCCAC
ACCACATTGC CTCCAACGGC ACCTTTGCGT CCAGACTATG ATGATAAAGA GAATAGAAAC
TCATATCGTT TTGTCAAACC TCTTCAAACA GCTTTCAAAT CAACTGGATT GGTGAAGAAG
AACTGTGCAA ATACAGAATC ACGAAAATTG CCTCCTGAAA CTCCAGTCAA GAGGAATCCA
CTTATGATAT TAAACACCAA CAAGCCACTA AGCTCACATA GTCTCACCAA CAATACTATG
GGCTTGGAAG GACTTCACGA GGAAAACCTG GAAGAGTCTA TTGAAATAGG TAGAAATAAC
TTGTCGTATA ATTCAGGACT CAATGAGTCT CAAACAAGCT TCTTTCGGAT CCCTTCAAGT
TCAGCTCCAA ACAAGGGACA TATCTTGACC AAGGAAGAAA TAGATATGGA TTTGGGCTCA
GATGCTGAGC TTGGTATACC AGAGACTCCA ACCAAGTCAG TCAAGAAGTC ACATTCCACG
CCAGCCAATT TCAGTCCTTT GCATTTGCTT CATCCTCTTA AACCACCATC GCTCAAGTTG
TCAAACGAAG AACCTTCTAC GCCTACTAAT TTGGTATTTC TTGGTAAGGG CGTAAAAGTA
GACGCTAAGA CCGAAGATCG AACTATAATG CAGATAATGA ACCCTGATGA TGATACAATC
ACGAGTTTCA ATCAACTGCA AGAAGTTATT AAGCCGTCCA GAATAGACGA TCATTTGGTG
AATAAGTTCG GAATGAAAAA CATCAAGTAT GTGGGCAGCG GGGAATTTTC TATTGCTTAC
GAGTGTCTTT TCCAGGACCA AAAGTTTGCT ATTAAGAGAT CTAAGAAACC CATAGTCGGC
AAATTGGAGC GTAAAACAAT TATGAGAGAA GTCGAAGCTT TGAGAGTGTT GACTTCTGTC
AAAGATAATG AGAAATTGAA CTTACAGGAA CAAGAAGAAG GGAAGGAGTA CCTTGTATAC
TTCATTGAAG CGTGGGAATT CAACAATTAC TATTACATCA TGACTGAATT CTGTGACAAC
GGCACTCTCT TTGATTTCTT GGAAGAGAAC AAGAATTATA AGATAGACGA GTTCAGAATC
TGGAAGATAT TGATAGAAAT TCTCAATGGG TTGAAGTTCA TTCATCTGAA GAACTATCTT
CATCTAGACT TGAAGCCGGC TAACATTTTT GTTACATTTG AAGGATCATT GAAAATTGGA
GACTTTGGTT TGGCCACCAA GTTGCCGATT TTAGAAAAAG ACTTTGACTT GGAGGGAGAC
CGCAACTACA TTGCTCCCGA ATTGATCAAC GACAAGATAT ATACACCATT TGCCGATATC
TTTAGCGTTG GGTTGATGAT CTTGGAAATT GCTGCCAACA TCATCTTACC GGACAATGGT
ACGCCATGGC GGAAGCTACG TAGTGGAGAT TTAAGCGATG CAGGGAAGTT GTCCAGTGAC
AATATTAGCA TTTTTTTGCA ACATCAAAAC TTTTCATCTA CGACGTCATA CAATACGAAC
GTTAATTTCT CGTCCAACCA GTCGCTTAAT TTTGAAGGAC CGGCCAACTT CCAGCTTCCA
CTGTCTGGGA GTTCAGCAAA CAGCAAGGAA AAGAATTTGC TCACCCCGAA TGTACCTACA
GTTGCAACCA GTGGTTGTAG TACTTCGACC ATACGCAATA TTCGCGACTT GATTCCAGCG
TGGGCTCCGG AATTTCTAAT CAGTGGAGAT CTGATGATCT TGGACAAGTT GGTCAACAAG
ATGTTGCGCC CCAACCCATT TGATCGTCCC AACGCTACAA TGATTTTAGA GATGGACGAG
TGTGTAACTA TTGAAAATCT TCGCAAGGCT GGAGCAACTA TCTTCGAAGG AGAGTTTGGT
CCCAACCCCA ACGACGACGA AGAATAG
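
The GC content field can be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming the sequence is given as text in blocks of 10 bases separated by spaces and line breaks. `gc_content` is a hypothetical helper name, not something provided by the database.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring all whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above; pass the full listing to get the
# figure for the whole gene.
first_line = "ATGAACCACC AACGCCCGTC GGTTTCTGGC GGAACGTCCT CCACCGCTGG AACCAACGGC"
print(round(gc_content(first_line), 1))
```

A single line is not representative of the whole gene, so the value for this fragment should not be expected to match the 42% reported for the full sequence.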
 
Protein sequence
MNHQRPSVSG GTSSTAGTNG NSVTTPSSRR TNSSSSSLLS STKHTPYEDP SISLMTPNHK 
TTNTTAGQTK YFTNNHFSRS FNSLILDDIK DELNPHPSGG SSRKKINKSP PFNVKQDYYG
DKIDAYLFED ATEIDGMNDD NDNDIEIGQD DLDEEDEFDD DDDEYKDEDN DYEGPFSSTL
SIIKPKREKV PKKSIPQNLV TKSKPIDETL LHSDEESEKL KSSQQSSIKR SSKFLNLSID
SNFKALSNRV MEDIDSISDL NEIDSSIIGA VAQSSKLHTK EHTPISLSPV ATRTPLHLNN
TNKFKRPHKL VSQSPSPSSN KGSSTTLYQL SRSSPERVLR HKSSALDQAN LYSPSKLGMK
GFKMFKNANR DAIISPNRST PEKKISTIFD TKNDHSTSKL RKTSLNYNKS PFNKPNSSPP
IPSLEYYNVD DLDVDSPSRS RKFSNSSSNS IIIYQDANEQ HKKKTLSTFH TTLPPTAPLR
PDYDDKENRN SYRFVKPLQT AFKSTGLVKK NCANTESRKL PPETPVKRNP LMILNTNKPL
SSHSLTNNTM GLEGLHEENS EESIEIGRNN LSYNSGLNES QTSFFRIPSS SAPNKGHILT
KEEIDMDLGS DAELGIPETP TKSVKKSHST PANFSPLHLL HPLKPPSLKL SNEEPSTPTN
LVFLGKGVKV DAKTEDRTIM QIMNPDDDTI TSFNQSQEVI KPSRIDDHLV NKFGMKNIKY
VGSGEFSIAY ECLFQDQKFA IKRSKKPIVG KLERKTIMRE VEALRVLTSV KDNEKLNLQE
QEEGKEYLVY FIEAWEFNNY YYIMTEFCDN GTLFDFLEEN KNYKIDEFRI WKILIEILNG
LKFIHSKNYL HLDLKPANIF VTFEGSLKIG DFGLATKLPI LEKDFDLEGD RNYIAPELIN
DKIYTPFADI FSVGLMILEI AANIILPDNG TPWRKLRSGD LSDAGKLSSD NISIFLQHQN
FSSTTSYNTN EKNLLTPNVP TVATSGCSTS TIRNIRDLIP AWAPEFLISG DSMILDKLVN
KMLRPNPFDR PNATMILEMD ECVTIENLRK AGATIFEGEF GPNPNDDEE
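
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below shows how the start of the gene sequence maps to the start of the protein sequence under that table. It is written without external libraries; `translate` is a hypothetical helper and handles only unambiguous A/C/G/T codons. Because the record marks the gene as spliced, translating the full gene listing end to end is not expected to reproduce the protein, so only the first printed line is used here.

```python
from itertools import product

# NCBI translation table 12 in the conventional TCAG codon order.
# It differs from the standard code at CTG, which encodes S instead of L.
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = ["".join(codon) for codon in product("TCAG", repeat=3)]
CODON_TO_AA = dict(zip(CODONS, TABLE_12_AAS))


def translate(sequence: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace.

    Trailing bases that do not fill a codon are dropped; a codon with
    characters other than A/C/G/T raises KeyError.
    """
    bases = "".join(sequence.split()).upper()
    usable = len(bases) - len(bases) % 3
    return "".join(CODON_TO_AA[bases[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence in this record.
first_dna_line = "ATGAACCACC AACGCCCGTC GGTTTCTGGC GGAACGTCCT CCACCGCTGG AACCAACGGC"
print(translate(first_dna_line))  # MNHQRPSVSGGTSSTAGTNG
```

The printed residues correspond to the first two blocks of the protein sequence listed above.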