Gene PICST_59020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_59020 
Symbol 
ID4838505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1659973 
End bp1662351 
Gene Length2379 bp 
Protein Length792 aa 
Translation table12 
GC content45% 
IMG OID640389820 
Productpredicted protein 
Protein accessionXP_001384270 
Protein GI150865166 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.200991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTCTCCACTA AGTTTGTGGA CTACTGGAGA TCAAAGAACA CACGCCAACA ACATACGTTG 
CTAGCCCTGG TCTCAATTGC GGCTTCGCTC TTCTACTACG TGCTTCTGCC GTCACTTCCT
TGGAACCGCA GTACTATGTT CGCGAAAAGA CCTGACAAAT ACACTACGGG CCTAATCAAC
CTACGGAACG ATTGCTTTGC CAACTCGTCC GTCCAAGCCT ATCTGGCTCT TCCTGGACTT
ACCGACTACT TGAACAAGTT CATCACCAGT TTCAATGAGC TTTCGGCTTT TGCGAAGTCG
AAAAATATCG ACATCAACAA TGTAATCCAC CACCAGAGAC TCAGTACGAA AAATGGAGAT
TCCAGTGCAG CCTCAACTTC AAAGTTCAAA AATACTATAT CCAAGTTTGA CATCTCGCTC
CATATTGGGC TCGCCGACAT CATCAAGAAA CTCCAGGAAA CCCAGATGGC ATCACGAACT
ATATCCGTAT GGACTTTTCT CCACGTACTA GAGGGTATCT TCAATGCAAA AATATCTCGA
TCACAACATG ATGCCCATGA ATTGACTCAG CTTATCAACG AGACGTTAGA GAATGAAAAT
ATCAAAATCA AAAACTTGCA CAAGTACATC AAGACCAATC TCCACACCAT TCTCGGACTG
AGGGACACTC CTTCACCGAG AGACTACTCG ACATTGGACA AGATCCAGGT GCCAGAGTTT
CCCTTCTCGG GGTTAACTTT GAGCCAGCTC AAGTGTCTCA AGTGTCTTGG GGTATCTACA
CCGAACTTTG CTCCGTTCTT GATGATCACA TTGCACACGC CACAAAAACT GTCTAGTGAC
ATCGAAGACA TGTTGAACGA CAACAAGACA GAGTCCATAG AAGGCTACCA ATGCCTCAAG
TGTAGGATCG TCGCGATAAT AAATAACGAA AACCACATGA AGCGTACCAT TCCTGATCAA
GATGTCAAGC ACATCAACGA ACTCAAGAAG TTGAACAACA ACTCCAAACT TTGTATCAAT
GACGATTTGC CAAAAGAATT GGAAGACTTC ATCCGCGATT ATAACGTAGG CGGAGTTAAT
ATTTCCCAGA TCACTTCGAC AGTGTTTAGG CATACGCAGA TCTTGAAGCC TCCCAAAATA
TTCGGTGTGC ACCTCTCTCG TTCAAGCTTC AACGGAGGCA ATGCCACAAG GAATCCCTGC
AAAGTTTCGT TTAAAGAGCA CTTGACGTTG TCTATAGGTA AGGAGTACCA CGAGCAGTTA
AGACAATTCC AACACCAGGC AGAGGAAGAG GAGGAAAAGC AGTTGGAGTC GAAAATCGAA
GTCACGGCTG CACATGTCTT AACCCGCGAT GTAAACGATA TGGAAGACGA AGACGTCCAA
AGAGAAGATG TCGATGTTAA AGGAACAGAA GATGTAGATG TTGAAGCTAA TGTAGTCGAT
AATGGCACTT CTACAGATGA TGCTGGAGAA GAAAACGATC TAGAAGACAA CGACACTTCG
TCTACTTCAA CTGAAGAATC GATGCAGCCC TCAGTTAGTA CCACGGCCAC GATGCAGAAC
AGTTCAATTA CCAATGCCTC CGACAAGTCT CGTACCATAA ACAGCGCTCC AATTTCAGAC
GACCAATCAG AAAAGTTACG GGACCACTTC AAAAAGTTCA AGTTCAACGA AAATGACGTT
TACAAGTACA GACTCAAAGC CATGATCAAG CACCAAGGCT CGCACACCCA AGGTCACTAC
GAGTGCTACA AGAAGAAGCC TTTGTTTGTC AAGGATAAGG ACGGGAACAT ATTCAAATTG
TTCCCCGAGA TTATTGACGA TTTTAATGGT GACACAACAT ACGATGTAGT TCCGGCTACC
TCTTCTGAGC TTGCATCAGT GTCCATGAGG ACTTCCTCAG AAGGAACCAA ATCATCTATT
AACACGGGTC ATTCTTCTTT GGACAAACGT AGATCATCTA GCAATAGCTC TATGGGCTCC
AAGGATGGAA ACGTTCGTCG TAGACTCTCC ACTATGATGG GCCGTCGTCC ATCTGTGTTC
CAAGCTGATC CGGAGGAAGC CGGTATTCAA GAGATTGTCA ACTCAGGGTC GGCTACTCCA
GCAGAGTTGT TAGTAGATGA GCCTCGCGAG TACTTCTCAG CTGAGTTGGC CCTGGCTGCT
ATTAGCAAAT CGGTCCACGA TTCGCAGAAT AGCCAACATT CGGATAAGGT CAAGATGAAG
AAGATTCCTT CTGCCATAAA ACAACCCTAC TGGAGAATCA GCGATTCCAA AGTGACTGAG
GTAAGTCGAA GCACAATGAT GCTTGAAATG ACAAGTGTCT ACATGTTGTA CTACGAGAGA
GTCGACCGCA AACAAATCAA ACATTCCCAA ATAGTTTAG
 
Protein sequence
VSTKFVDYWR SKNTRQQHTL LASVSIAASL FYYVLSPSLP WNRSTMFAKR PDKYTTGLIN 
LRNDCFANSS VQAYSALPGL TDYLNKFITS FNELSAFAKS KNIDINNVIH HQRLSTKNGD
SSAASTSKFK NTISKFDISL HIGLADIIKK LQETQMASRT ISVWTFLHVL EGIFNAKISR
SQHDAHELTQ LINETLENEN IKIKNLHKYI KTNLHTILGS RDTPSPRDYS TLDKIQVPEF
PFSGLTLSQL KCLKCLGVST PNFAPFLMIT LHTPQKSSSD IEDMLNDNKT ESIEGYQCLK
CRIVAIINNE NHMKRTIPDQ DVKHINELKK LNNNSKLCIN DDLPKELEDF IRDYNVGGVN
ISQITSTVFR HTQILKPPKI FGVHLSRSSF NGGNATRNPC KVSFKEHLTL SIGKEYHEQL
RQFQHQAEEE EEKQLESKIE VTAAHVLTRD VNDMEDEDVQ REDVDVKGTE DVDVEANVVD
NGTSTDDAGE ENDLEDNDTS STSTEESMQP SVSTTATMQN SSITNASDKS RTINSAPISD
DQSEKLRDHF KKFKFNENDV YKYRLKAMIK HQGSHTQGHY ECYKKKPLFV KDKDGNIFKL
FPEIIDDFNG DTTYDVVPAT SSELASVSMR TSSEGTKSSI NTGHSSLDKR RSSSNSSMGS
KDGNVRRRLS TMMGRRPSVF QADPEEAGIQ EIVNSGSATP AELLVDEPRE YFSAELASAA
ISKSVHDSQN SQHSDKVKMK KIPSAIKQPY WRISDSKVTE VSRSTMMLEM TSVYMLYYER
VDRKQIKHSQ IV