Gene PICST_88034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_88034 
Symbol 
ID4837336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2403543 
End bp2405583 
Gene Length2041 bp 
Protein Length530 aa 
Translation table12 
GC content43% 
IMG OID640388651 
Productpredicted protein 
Protein accessionXP_001382686 
Protein GI150864015 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCGCTTACAA CGTATCCCCG ATCGACCCAT CCATATCACA CTTTCTTCGG ACGTCCAACA 
TTGCGCAGAA CGGCAATTGA CTGTCATCTA TCTGAATAAG CGAGAACTTC TATCACAGTA
GTCTCAGTTC ATCCCAGTTC TTTCAAAGCT TTTCTCAATC TCAGTGTCAT TTCTCAGTAC
TCATTTGCAA TTATTTTCCC TAATTTTTGA CGGGCTTTTC ATCGACATCT CAATCCTATC
AACTTTTCCT TGACATTCGT ACACATCTCA CTTCTCACGT TCATCACTTT CCGTTAGCTT
TCGTCATCCC AATCCATCAG AATGTCCAAT CCCCCCTTCC AAAGACAACA ACCCTCGTCG
CCCTTAACTA CGCCCAACCA CCACCCTTCT AACCATCAGC ACCATGCATC ACGTAAACCC
TCCATCGTGG AATTGTTGAG CTCGCCACCT CCCTTGCCCA ACAACACCAT CGATGACGAA
ATCCACCAGT TCAGCTTGTC CCGCAACACG TCCATAAGCT CACGGACTTC GTCTTTCAGT
CAACAGCAGC ACGGTGCTCC ACATCACGGC TCTAATTCCC ACCATTTGTC GGTATCAGGT
ATGGACTGGT CGGAGATCCC TTTGAGCGAA TTGACCGAGT CCAACAAATT GATCTATATC
AATTCGTCTT ACTCGGTGCA AAAAGCATTC GAAACGTTGG TATCAAACAA CTTGACCTCG
GTGCCTGTCT CAATTTCGTC CTCTAATGAA AACGATTTAA GTAGCTGTTT GACGTTTGAC
TACTCAGACT TGAACACATA TCTCTTATTG ATCATGAACA AGATCAATCT CAGTGAGCTT
TCTGTCAGTG AGATAGGCAA TGAACACGAC TCTACAGCCA AGAAGCATGA GATCATCACC
CAGACAATCA ACAAAGCCAA AAGAGGAGAA GAAGTACCCA TAGAATTCAT CATCAAACTT
CATCCAAAAA ACCCCTTCAT AAAGTTTACG GAAAACGACA CATTGTTCAA GGTGATGGAG
ACATTGGGTA ACGGAGTTCA CCGTGTAGCC ATCACCAATC TTGAGTCTAC CAAGATTACA
GGAATATTGT CGCAAAGAAG ATTGATAAAA TACATGTGGG AGAACGCCCG GAGATTCCCC
TCGCTTGACT TTTACTTGAA CTCTACATTG CAAGACTTGA AGATCGGCTC CAGTACCCCC
ATATTCATCT ACGAAGACCA GTTATTGATC GAAGCCTTAT ACAAGATGTT CAACGAAAGA
GTAAGCTCCT TGGCTGTCAT TGACAGAACA AAGTCACTTA TCGGCAATAT ATCTATTGTT
GACGTCAAGA ACGTTTCAAG CTCCAAGAAC TCTCACTTAT TGTTCAAGTC AGTGTTGACT
TTCATCAGCT ACAACTTGAG CCAGAAAGGC ATTGAAGAGG GCCAAGACCA GTATCCTATC
TTCCATGTCA ACAAGCAGAG TTCGCTAGGC AGAGTCATTG CCAAGTTGGT GGCTACGCAA
TCACATCGTT TGTGGATTGT GGAGTCCAAC ACAAGAACTC ACCAAAACTC CATCTCATCG
CCTGTCACTA TTGAAGCAAC TTTGAATGTT AGTGCCAACC CTTCGTCAGC ATCTTCTTCT
AATGCAAACA CTCCTGAAGG TAACTTCGGT TTACCAGGAA AATTAATTGG TGTCGTCACC
TTGACCGATA TCTTGGGATT GTTTGCTACA TCTAAAGGTA CCAAGACCGA TCCACAATTC
GCGAGAAACC AAAGAAGAAG ATCGTCTACT TCAACTACGC GCTCATCTAT AGACAGTGCT
ATCAGTGTAG GTGACGGAAG CGCCAGAACA ACCAATGCCA ATGCTGACCT GGAGATCTTC
CGCAAATCGT ACACTGCCGC TGCAAAGAAT GAAAGTGCCA TTTCCAAGGA CTAGAGAAAG
AACTGATAGC ATGGCAGAAT CAGCTCAATC CATGTATCAG TAGTTATTCT CTATCCATAT
TGTTCTACAG TTTATTTATA TTTATTATTT ACCAATTCTT ATAAATGCAT AAACCATACA
G
 
Protein sequence
MSNPPFQRQQ PSSPLTTPNH HPSNHQHHAS RKPSIVELLS SPPPLPNNTI DDEIHQFSLS 
RNTSISSRTS SFSQQQHGAP HHGSNSHHLS VSGMDWSEIP LSELTESNKL IYINSSYSVQ
KAFETLVSNN LTSVPVSISS SNENDLSSCL TFDYSDLNTY LLLIMNKINL SELSVSEIGN
EHDSTAKKHE IITQTINKAK RGEEVPIEFI IKLHPKNPFI KFTENDTLFK VMETLGNGVH
RVAITNLEST KITGILSQRR LIKYMWENAR RFPSLDFYLN STLQDLKIGS STPIFIYEDQ
LLIEALYKMF NERVSSLAVI DRTKSLIGNI SIVDVKNVSS SKNSHLLFKS VLTFISYNLS
QKGIEEGQDQ YPIFHVNKQS SLGRVIAKLV ATQSHRLWIV ESNTRTHQNS ISSPVTIEAT
LNVSANPSSA SSSNANTPEG NFGLPGKLIG VVTLTDILGL FATSKGTKTD PQFARNQRRR
SSTSTTRSSI DSAISVGDGS ARTTNANADS EIFRKSYTAA AKNESAISKD