Gene PICST_50192 details

Gene Information

Locus tag: PICST_50192
Symbol:
ID: 4840884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009048
Strand:
Start bp: 265747
End bp: 266820
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 12
GC content: 39%
IMG OID: 640392199
Product: predicted protein
Protein accession: XP_001386450
Protein GI: 150866753
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3118] Thioredoxin domain-containing protein
TIGRFAM ID: [TIGR01126] protein disulfide-isomerase domain
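
The length fields above follow from the coordinates. As a minimal sketch, assuming Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon (the gene sequence in this record ends in TAG), the arithmetic can be checked as follows. The variable names are illustrative and not part of the record.

```python
# Coordinates taken from the Start bp and End bp fields of this record.
start_bp = 265747
end_bp = 266820

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1074 bp
print(protein_length, "aa")  # 357 aa
```

Both values agree with the Gene Length and Protein Length fields.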


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000211726
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TCGAATCTCC TCCAGGTGAA TGACAAAAAT TTCAAAGAGA TTGTCATTGA CTCCGGTAAA 
TTCACCTTCG TAGATTTCTA TGCTGACTGG TGTCGTCATT GCAAGAACTT GATGCCAACA
ATTGAAGAAC TTGCAGATGT TTTCGAGCCA TTCCAAGACC AAGTCCAAGT TGTGAAAATA
AACGGAGACA AGGACGGAAA GAAGATGTCT AAGAAATACG TCTTCAAAGG CTATCCAACC
ATGTTGCTTT TCCACGGCAA CGATGAACCA GTTGAATATG ACGGTATTAG GGATTTGCAG
GCTTTGAGCA ATTTTGTTCA ACAAATCACA GGAGTCAGAT TAGCAAGCAT AAAACCGGAA
GGGGAGGTCG AAGAGTCTAA GGTAGAACAG GAACCAACTG GTTTGATTCG ATTGAATGAT
ATCAATTTTG AAGACAAAAT TAGAGAGACT CCGTATTCAA TTGTAGTATT CACTGCCACT
TGGTGTCAAT TCTGTCAGAA GTTAAAGCCG GTACTTGAAA CTCTTGTTGA TGTCGTATTC
GCCAACGAGA AAGAAAAAAT ACAGATAGCA ATTGTGGAGC TTGACACGGA ACCCGGAGAC
AAACTAAGTG ATAGATACCA CATACTGACG TTACCAACAA TCCTCTTCTT CAGTAATGAG
TATGATGAAC CTAGCATATA TGATGGTGAA AAGGAATTAT TGCCTTTGCT TGCCCTGATC
AATGAATTCA CAGACTCACA CCGCGACGTT GAAGGAAGGC TTTCTAATAC TGCGGGGAGA
ATTCAAGAAG TTGACAATTT GATTAGCCAG AAGATTTTAC AAGGATTCAA AGGAGATTTG
AGTACAGCAG GAATAGAGCT TTTAGGCGAA ATTTCACATT TATCAAATGA GAATTACGAG
ATGCTTCCCT ATTATAAGAA GTTGGTAAGC AAGATCATAA ATAATGAAAT GGACTTCTTC
AAGAATGAGT TTTCCAGATT AGCGACTATA TTGGAGAATG ACATCTCAAA ATTGACGCCA
AATACAATCG ACTCGATGCA AAAGAGATCT AATATTTTAT CTTCATTTAT CTAG
 
Protein sequence
SNLLQVNDKN FKEIVIDSGK FTFVDFYADW CRHCKNLMPT IEELADVFEP FQDQVQVVKI 
NGDKDGKKMS KKYVFKGYPT MLLFHGNDEP VEYDGIRDLQ ALSNFVQQIT GVRLASIKPE
GEVEESKVEQ EPTGLIRLND INFEDKIRET PYSIVVFTAT WCQFCQKLKP VLETLVDVVF
ANEKEKIQIA IVELDTEPGD KLSDRYHIST LPTILFFSNE YDEPSIYDGE KELLPLLASI
NEFTDSHRDV EGRLSNTAGR IQEVDNLISQ KILQGFKGDL STAGIELLGE ISHLSNENYE
MLPYYKKLVS KIINNEMDFF KNEFSRLATI LENDISKLTP NTIDSMQKRS NILSSFI
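
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below illustrates the effect on two short stretches copied from the gene sequence above. The helper names `build_table` and `translate` are hypothetical, and the table is built by taking the standard code and overriding CTG, which is an assumption about how table 12 differs that is sufficient for these examples.

```python
BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
STANDARD_AA = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_table(ctg_as_serine):
    """Return a codon-to-amino-acid dict; optionally read CTG as Ser."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"  # the table 12 difference used in this sketch
    return table


def translate(dna, table):
    """Translate in frame 0, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


table12 = build_table(ctg_as_serine=True)
standard = build_table(ctg_as_serine=False)

# First 60 nt of the gene sequence (contains no CTG codon).
first_60 = "TCGAATCTCC TCCAGGTGAA TGACAAAAAT TTCAAAGAGA TTGTCATTGA CTCCGGTAAA"
print(translate(first_60, table12))  # SNLLQVNDKNFKEIVIDSGK

# In-frame stretch from the gene sequence that contains a CTG codon.
ctg_region = "CACATACTGACG"
print(translate(ctg_region, table12))   # HIST
print(translate(ctg_region, standard))  # HILT
```

The table 12 output matches the listed protein sequence, which reads HIST at this position.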