Gene PICST_88514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_88514 
Symbol 
ID4837936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1735311 
End bp1737246 
Gene Length1936 bp 
Protein Length555 aa 
Translation table12 
GC content46% 
IMG OID640389251 
Productpredicted protein 
Protein accessionXP_001383618 
Protein GI150864683 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3118] Thioredoxin domain-containing protein 
TIGRFAM ID[TIGR01126] protein disulfide-isomerase domain
[TIGR01130] protein disulfide isomerases, eukaryotic 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AATTGATTTG TTCCATACTC TTTCTACTTC ATACGCTGTA TACGTAGTAT ATTAGTTCCC 
ACTATACGAG AATATCAGTA TTCAAACTCA GTGTATTAGT TTTCAGTATT TTTCTCTACC
AGAATTACTA CTATTACTAA TAAATACTCC ATCTACCGCT ACTACCTAGT ACAATATATC
ATAGAATGAA GTTCTGGAAA TTCTCATCTT CTGTGCTTGC CACCCTTCTC GCTGTCGTCT
CTGTCCAAGC TTCGGGACCA GCCGAAGGCG ACGCTGTCGC CGATCCAAAC TCCGCTGTCG
TAAAGCTTAC GGCCGAGACA TACAAACAGT TCCTTGACGA GAACCCTCTT GTTCTCGCCG
AATACTTTGC CCCATGGTGT GGCTACTGTA AGATGTTGGG ACCTGAATAC GCCAAGGCTG
CCAACTCGTT GAACGAAACC AACCCAAACA TCAAGTTGGC CCAGATCGAC TGTACCGAGG
AAGAAGAACT CTGTCGTGAC CAAGGTATCA GAGGCTACCC TACCTTGAAG GTTGTCTCCA
ACGGCGCCTA TGCCGACTAC GATGGCCCCA GAGATGCCGC CGGTATCGCC AACTATATGG
TCAAACAGTC TTTGCCTGCC GTCCAAGTGC CAGCTGACGC TGACGCTTTG ACTGCTGCTA
TTGAAGAACA GACCAAGCCA TATGTCATCC AAGTAGGTGC TTCTACTGAC TCTGACGCCG
CTTCCGCCTA CGAGCAAGTC GCTAAGGCCA ATAGAAACGA CTACTCTTTC TTCTCAGTGG
AAGAGCCAGC TTTGGTCAAG GAATTGAACA CGAAGTTTAC CAATGTTAAA GTAACTGGCA
AGTCCCCTTC ATACTACGTA GTCCATCCTG GTCAATTGGA TGACGTAAGA GAATTTGAAG
GCAAGGACAT CAATGCTGAC ACTTTGACCC TGTTTGTTAC CACCGAAGTT GTTCCATACT
TTGGCGACAT CAACAGAGAC ACCTACTTGA CATACATGGG TTCTCCATTG CCTCTCGGCT
ACTACTTCTA CAACACTGCT GAACAGAGAG CTGCTTTTGC TGACGAATTC TCGAAGTTGG
GTAAGCAATA CCGTGGAAAG ATCAACTTTG TCGGTTTAGA CGCTACCCAA TTCGGAAAGC
ACGCCGAGTC CATCAACATG GACCCAGCAA TCGTGCCTTT GTTCGCCATC CAAGACACAC
CAAACAACAA GAAGTATGGT GTTAACCAAA AGGAAAACCC AGAAGGTCCA TCTTTGAAGA
CGATCAAACA GTTCGTTGCT GACTACCTCG ACGACAAGTT GACTCCTATC GTCAAGTCTG
AAGATTTGCC AACCGAAGAA GAAAAGAAAG CCAACCCAGT TGTCAAGTTG GTAGGCCACA
ACCACAACGA AATCATCGAA GATGTCTCCA AGGACATCTT TGTCAAGTAC TATGCTCCAT
GGTGTGGCCA CTGTAAGAAG ATGGCTCCTA TCTGGGAAGA ATTGGCTTCC GTTTTTGGCT
CCAACAAGGA CGACGCCAAG GTGGTCGTTG CCGACATTGA CCATACCAAC AATGACGTCG
TTCTTCCCTT CGAAATCGAA GGCTACCCAA CCTTGGTTTT ATATCCTGCC AACGGTGAAG
TTGACGAAAA GACCGGCTTG AGAAAGCCAG TTGTTTTCTC TGGCGCAAGA GAATTAGATG
CCTTCATTGA CTTTGTAAAG GAAAATGGTG CCCTTGGTGT TGACGGCCAT GTATTGAAGG
CTGCTCAAGA CAAGGCAGCT GCTGAAGCTG CTCCTGAAGA AGAAGAAGAA GCCGCTGAAG
AAGTTAAAGA AGAAGCTGCT GAAGATGAGG ATGTTGAACA CGACGAGTTG TAAGTTTCCT
GAAATCAGGC TATAGCTTTC TGTATAATGT CATGTTTAAA AGTGAGTGCA GTATTGTAAT
ATAAACTTGA ATTTGT
 
Protein sequence
MKFWKFSSSV LATLLAVVSV QASGPAEGDA VADPNSAVVK LTAETYKQFL DENPLVLAEY 
FAPWCGYCKM LGPEYAKAAN SLNETNPNIK LAQIDCTEEE ELCRDQGIRG YPTLKVVSNG
AYADYDGPRD AAGIANYMVK QSLPAVQVPA DADALTAAIE EQTKPYVIQV GASTDSDAAS
AYEQVAKANR NDYSFFSVEE PALVKELNTK FTNVKVTGKS PSYYVVHPGQ LDDVREFEGK
DINADTLTSF VTTEVVPYFG DINRDTYLTY MGSPLPLGYY FYNTAEQRAA FADEFSKLGK
QYRGKINFVG LDATQFGKHA ESINMDPAIV PLFAIQDTPN NKKYGVNQKE NPEGPSLKTI
KQFVADYLDD KLTPIVKSED LPTEEEKKAN PVVKLVGHNH NEIIEDVSKD IFVKYYAPWC
GHCKKMAPIW EELASVFGSN KDDAKVVVAD IDHTNNDVVL PFEIEGYPTL VLYPANGEVD
EKTGLRKPVV FSGARELDAF IDFVKENGAL GVDGHVLKAA QDKAAAEAAP EEEEEAAEEV
KEEAAEDEDV EHDEL