Gene PICST_42049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_42049 
Symbol 
ID4837280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp197995 
End bp199263 
Gene Length1269 bp 
Protein Length422 aa 
Translation table12 
GC content42% 
IMG OID640388595 
Productpredicted protein 
Protein accessionXP_001382272 
Protein GI150863710 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1331] Highly conserved protein containing a thioredoxin domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.614219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCTACTACAA CTAGCCCATC TACTACAACT AGCCCATCTA CTACAACTAG TTCACCTACA 
ACTTCTTCAA GTTCGACGTC TTCCACTACT TCAAGTACTT CCTCTTCTTC CCTGTCTTCT
CTGTCTAGTG CTCCATCGGC CTCTGCTGTT CCTGTGAATG CTTTCCAAGA CACAGTTGAT
GACATTTGGA ATAGATTTTG GAGTGAGCCA AATGCAGCTT GGTCAGACAA TGATAATATA
TGTGGACTGA CCAGTTTTGC TCCTCTTGTT GTTTGGGATC AGGCAGTTGT CGGTACTGCA
ATTGTCAACA CCTTGAACAA ACCCAGAATT GATGCTACAC TCACAAACCT TTTGCAATAC
AAGAACAACG AAAATGGAGC ATTTTCTGCT ACAACAGCTG GAGATGACGA TATCTACAAC
GATGACAACG CCCAGGCCGC TTGGGTATTC ATTGACGCTT ACAAAGCAAC TGGAAATACC
GACTACTTAG ACAATGCCAA AGGCATTGTG AATTGGGTCA TATCCCAATG GGCCAGCAAC
GGTGGAGTGT TCTGGCACCT TAACAATAAC TACGTTGCCT CCATTTCTAC TACGGAAGCA
GCCTTGGCAG CAGTAAGATT ATACGAAATT GAGCAAGATA ACAAGTTGTT GGAATTTGCA
TCCAACTGTA TGGACTGGAT GTTTGCCAAT CTTCAGGATA AGAGCGACTA CTTGTTCTAC
GATGGATTAA ATAATGATGA TTACAGTGAC ATCAACAAGG GTAAGCTTAC TTACTCTGTT
GGCTGTGGTT TAAGCGCTAT GGCTTACTTA CACTATTACA CTGGAAACCA AAAATGGTTG
ATTGAATCTA AGAATATCGC TACGTCTGTC ACCAAGGGTT CTGGTGCATT TTACGGCAGT
GACGGTGCTT GGAATAACCA ATTACAATAT GTTCATTTAT TATTCATGGG ATTTGCAGAT
CTCTTCAAAT ATATTCCATG GCAACCTGAA TTCGATGGCT ATAAGACCGA AGTCTTGAAA
CAAGGTTCGT ACGTCTACAC CACTAAGCAG GACCCTGACG ATCCTAACAT GTACTTCAAT
CTGCCAACTA CGACTCCACA GATGACCAAG AAATACAATG CTTTATTTAA CCAAGACGCT
TCCAGCTCAG ACACTGCGGC TCAGCATTGT GATTCAAATG AGAACAATCC AATTCCAAAA
TCCTTGATGG ACAATGCTTC GGCTGCGCAA ATAATGTTTG CAATCTCCCA GATAGAATCT
CCACAATAG
 
Protein sequence
STTTSPSTTT SPSTTTSSPT TSSSSTSSTT SSTSSSSSSS SSSAPSASAV PVNAFQDTVD 
DIWNRFWSEP NAAWSDNDNI CGSTSFAPLV VWDQAVVGTA IVNTLNKPRI DATLTNLLQY
KNNENGAFSA TTAGDDDIYN DDNAQAAWVF IDAYKATGNT DYLDNAKGIV NWVISQWASN
GGVFWHLNNN YVASISTTEA ALAAVRLYEI EQDNKLLEFA SNCMDWMFAN LQDKSDYLFY
DGLNNDDYSD INKGKLTYSV GCGLSAMAYL HYYTGNQKWL IESKNIATSV TKGSGAFYGS
DGAWNNQLQY VHLLFMGFAD LFKYIPWQPE FDGYKTEVLK QGSYVYTTKQ DPDDPNMYFN
SPTTTPQMTK KYNALFNQDA SSSDTAAQHC DSNENNPIPK SLMDNASAAQ IMFAISQIES
PQ