Gene PICST_47023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_47023 
Symbol 
ID4839236 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp922570 
End bp924465 
Gene Length1896 bp 
Protein Length581 aa 
Translation table12 
GC content42% 
IMG OID640390551 
Productpredicted protein 
Protein accessionXP_001384844 
Protein GI150865573 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGGAG AAAGAAGGTC CGGCAGAACG AACAAGGGCC AACATACCAA GAGATATCTC 
GACGAGTTCG AGGACAATGC TGTTAGCATA GATGGAACTA GAACCAAAAA GTCCAGACTT
TCTAACGAAT ATGAGAACGA CAGTCACAAC GAAGAAGGCG TGGTTAGGTG TAATCCTTGT
GGCACTAATC AGGATAACTA TGACGAGGAA CATGACAAAG GCGGAACCTT CATCCAATGC
GACGAGTGCA ATACATGGCA GCATGCTAAG TGCATGGGGT TCAAGAAGGC TAACATTCCT
GACCTCTACA ATTGTGATGT TTGTGATCCT TCCCTGTTTA AGAAGAGAAT CGAAGATGCC
GAAATTGAAA AGAAGAACAA GAAAAAGTCG AAGGAAGAAC CAGATGCAGA AGTGCGTCAA
AAGAAAAAGG AAAACGAATA TGTTGAACTG ACTCCTACCG TTACTACTTC TACCGATAGC
CCCAGCCAAG AGCCAGAACC AGTAAGTAAA CCTAAACCTG TAGAAGAACT TAGTATGAAG
GATGTGTTGA AGGATGACCA CCGAGTCAGT ATTGGCAAGG CATTCTATAA CTACTTCAGA
AAAAGTTTCC CGGCTGACTA TAACATCACC GAGGAAGAAA AAGACAAGAA GGCTACTCGT
TGGGCGCTAG AAATCGAAGA CATAATTTTC CGTACCTACT CTGGTAAGCA GTATATATCA
GAAGGTAGAC GTATCTTATT CTTGCTAAAA AAGTACTTCA TGAAAGATAT CATAGCTGGA
ACCATAACGT TGAGTGATGT TGTTAACAAA ACCCCCAAGG AAATCAACCA GGATATCGAA
AGAATAGAAG CTGAAAACAG AGGCAATATC AAAAACATCA TTTTGACTGA AAATGACCAC
TCTGATATCA TTAGAAGAAC CCATAAGGGA GACATTGTTC GAGAGAATGA GAACGATGAA
CCTAGCTATT TAGAAGAGAG CATTGCTACA AGAAAAGTAG ACCACAGAAT CTTTTCTGCT
GACGATGCAC CTAAACCTAG AATTATCTCT GATGAAAACA AATTTCATTC TTACCAGAAT
TTGAACCCAA GATTTTTTGA TGATAATGAC GAAGATGAAG ATGAAGTCGA ACCCGTAGAA
GAAGAAGCCA ATGAAACCTC GAATCAGCAA CGACGGACTT CTACAGAAAG AAATTCGTCA
TCTGAGTCAA GTGATTTGGA ACATGTAACA GAACCGATCG AGGACGAGTT ATTGCCTATT
CTTGGAGTGG AGAGCAAAAG TCCAAAAGTG GGTCCTTCGG TATGGTCTGG TTCGATTGAG
TTTCCGGACT TTGCCAGTTT CAAAGCTTCT GCTAACTTTT ACTCTAGTAG CAATGAAGAA
AGCAGGGACA CGTCGATTAA CGCTTGCTAT GATTTGATGA CCCAGAATAC GTACTCCATT
TTGGGACGTT TAGACAGAGT CACAGCCGAT AAGTACTTGA ACAAGATCAT TAGTTCACGG
GATCTCTACT TAGTTGAAAT AAAGAGTACC AGTGACACCG AAACTGAGTT TCAGAAGTTG
TACCAGTATT TACTTATAGA GAACAAGGTG GGTGTTCTTT CTGGTAGACC TGAATTCGTA
AAGGACTCCT ACATTATGCC CATCGACTTT CGTGACAGCA GATTGCCTTT CTATTTGGAA
GAACATAAAC GAGACATGAG AATCGGCTTG TTTGTGTTGT TTGTAGTCAA GAGAGGCTAT
ACCCCAGTAA GCGATGCTGC AATTGCCCCA ACCCACACAC ATAGCCATAA TGTAAGCAGA
AACAACAGCA TTAGCAGACC TGAAGAATCT GCCCTGACGA AGTTGGGAGC AATTATGTCG
CAACTTGGCG GTGGTTCTAG CTACCAGTAT GTATAA
 
Protein sequence
MPGERRSGRT NKGQHTKRYL DEFEDNAVSI DGTRTKKSRL SNEYENDSHN EEGVVRCNPC 
GTNQDNYDEE HDKGGTFIQC DECNTWQHAK CMGFKKANIP DLYNCDVCDP SLQEPEPVSK
PKPVEELSMK DVLKDDHRVS IGKAFYNYFR KSFPADYNIT EEEKDKKATR WALEIEDIIF
RTYSGKQYIS EGRRILFLLK KYFMKDIIAG TITLSDVVNK TPKEINQDIE RIEAENRGNI
KNIILTENDH SDIIRRTHKG DIVRENENDE PSYLEESIAT RKVDHRIFSA DDAPKPRIIS
DENKFHSYQN LNPRFFDDND EDEDEVEPVE EEANETSNQQ RRTSTERNSS SESSDLEHVT
EPIEDELLPI LGVESKSPKV GPSVWSGSIE FPDFASFKAS ANFYSSSNEE SRDTSINACY
DLMTQNTYSI LGRLDRVTAD KYLNKIISSR DLYLVEIKST SDTETEFQKL YQYLLIENKV
GVLSGRPEFV KDSYIMPIDF RDSRLPFYLE EHKRDMRIGL FVLFVVKRGY TPVSDAAIAP
THTHSHNVSR NNSISRPEES ASTKLGAIMS QLGGGSSYQY V