Gene PICST_88690 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_88690 
Symbol 
ID4838125 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp492599 
End bp495499 
Gene Length2901 bp 
Protein Length629 aa 
Translation table12 
GC content45% 
IMG OID640389440 
Productpredicted protein 
Protein accessionXP_001383729 
Protein GI150864761 
COG category[R] General function prediction only 
COG ID[COG5354] Uncharacterized protein, contains Trp-Asp (WD) repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTCTTCCACA ACAAGTGTCG AAACGGGACT AACAAACCCA ACTGTGTGAT TGCTATCTTT 
AATAGCCGAT TTCGTCCTTG TAAGCCACTT CTCCAGCTGA TTTCCGTCGA TTATTCACTT
AGTTCGTCTA CTACTGGAGT CCGTAGAAAA TTCACAACCA GATAGGATTC CCGATCACAA
GTAAAAGCAT AAAGCAAACA ATACCATGTC CAAGTCCACC GAGTTCTTCT GTATGTATCT
GACTTTGAAA GAGGGAATGG CAAGAAGCCT GACTAGTGAG AATTAAAGAG AATTGAGATC
AGCGTAGTAC GGCTGATTTG GTTGATTTGA TTCTGAAATT TCTGGAAATT CTGGAGAAGA
GAATTTTTGT CATGAAACTA TTCCAGATCT CTACCCGGAG ACAGACATTG TTAAGATTGA
GGGTATATGA CTACGAGTAT AGAGGATAAT ACCGATAATG CTGATATCCA GATAATGGCA
TTGCCACTGT TAGTGAATCT CATCTCTCAT CTGAAATCTG TAAATGCATC TGAAAATCAT
TTGAGTATTC GCGTACAAGA ATTATGAACT TTTGAATTTC AATAAATTCA TTCTACAAAC
ATCTACAAAA TCCTCAGAAT GTACAGAATC CACAGAATAT GCCAAATCTA CAGATCTACG
AAATCAACAA TTTCATCAAT ATTACGAAAT TTTTGAAAAT AATTCTCATT TTCTCTCATT
CTCATACTCT TCATACTTAA CATCTTATAC TAACCTGCCA GGTCGTCTAC CACGGTCGAT
AGAATTGACA CATGACTACG AGGAAATCAC GCCGTCTCCC AAAGCGGCGG AAGAATGCCG
TTCGGCCTTG TACTCGCCCA ATGGAGCATT CTTTGCTTAC ACCCAGCCCA ACGAGGTGAT
AGTATTGAAT ACTAAGAGTA AGGCTGAGTT GTACCACAAG ATCGAGTTGC CCGAGGTGTT
TGATATATAT TTTTCACCAC AAGGAACGTT CTTATGTTTA TGGTGCAAAC CCATTCAAAT
CAACCGTGAA AACGGAACCT GGAATAACAA CTTGAAGATT TTCAACTTGA AGACCAAGTC
CTTGATAGTG GAATGGCTGC AGAAACACCA GAGCGGATGG AAGCCCCAGT TTACCCAGGA
CGAGAAACTC GTAGCCAAAA ACTTCAACAA CAAGGAAATC CATTTCTTTG ACATCAGCAC
GTCTCTGCAG GAAACCATAA ATATAAACCA GCCTACCCAC AAGTACAAAG TGGCTGATGC
CAAACAGCCG TTCCAGAACT TCCAGATTTC ACCAGGTTTA AACCCCTCTG TTGCTATCTT
CATACCAGAA GCTAGTGGCA AACCAGCCTC GGTGTTGATC TATAACGTTC CAAACTTCAA
CCAACCCACC TGTAGCAAGA ACTTCTTCAA GGCTGAACGG TGTCAATTGA AGTGGAACTC
GTTGGGTACA GCATTGTTAG CATTGGCCTC TACCGACCAC GATACCAGTA ACAAGTCGTA
CTACGGCGAA ACCAACTTGT ACCTCTTGGG AATTGCCGGT TCATATGATT CCAGAATCGA
CTTGAAGAGA GAAGGACCCA TTCACGACAT CACCTGGTCT CCTTCTGCCA GAGAGTTTGC
TGTTATCTAC GGTTACATGC CATCAGAAAC GACTTTCTTT GATGCTCGTG GTAACGCTAT
CCACTCGTTA CCTACAGCTC CACGTAACAC CATCTTGTAT TCTCCTCACG CTCGTTACGT
GTTGGTTGCC GGTTTCGGCA ATTTGCAGGG AACTGTAGAT GTATACGATC GTCAGAACAA
GTTCAGTAAA GTCGTAACCT TTGAAGCTGC CAACACTTCT GTGTGCGAAT GGTCACCTTG
TGGACGTTAC ATCTTGACTG CAACGACTTC TCCTCGTTTA CGTGTGGATA ACGGCTTGAA
GGTGTGGCAT GCATCTGGTC AATTGGTTTA CTTGAAAGAG TACCAAGAGT TGTATGCAAT
TGGATGGAAG CCTCAGACTA TCGCTGAGTT TCCTCCTTTG AAGCAATTGG AGCCTGCTCC
ACCAGCCCAT GACTCGGCAC GTGAATACAT CGCCAAGAAA GCTGCTGCTT CAGCTACTGC
TGCTTCCAAG CCTGCTGGTG CATACCGTCC TCCTCATGCC AGGGGCAGTT CTGCTCCAAG
CACAGCTACT TCCTTGTACC AGAAGGAGCT TCAAAACAAC TTGAAGTTAC AACAACAGCA
GCGTAATGGA ATCAACCCTA GTAGTGGCAG AGCCCGTGTT GTTCCAGGAG CCAACCCTGT
AGAGATCAAG GAATCCAAGA CAGCTCAAAA AAACAGAAAG AAGAGAGAAG CTAAGAAGAA
CCTGAAGGAA GACTCTCCCT CTATAGAAGG CTCTCCTGCA CCATCTGCTG CTCCTTTGGG
TCCACCACCA GGACTTGGCC AATTATCGTC TGCCCCAGCT TCTGCCCCAG CTTCTACTCC
AGCTGCTCCA GCTACTCCCT CACCAGTGGC TCCGGCTGTT GCAGCTACTT CCTCTCAGGG
AGGTGTTGTT GTTGGCGGAG TTGCGCTGTT GGAAGAGAAG AAAATCAGAT CGTTGTTGAA
GAAGTTGAGA GCCATCGAGA CCTTGAAGAT GAAGCAGGCC AGCGGCGAAC CTTTGGAAGA
CACCCAGGTC AGCAAAATCA ACAAGGAGGA CGACATCAGA AAGGAGTTGA GTGCATTGGG
CTGGAACGAT TAGGCGACCT CAGTCTACTA CACTCTACCA TCTATATAAA TAATGTAGCA
TTGCAGGTGT CCAGACCACA ATTCAAAATA TAACAGAGTA GAAATGTCTC CTGTAAAATG
GGACTAGAAC TATAGAGATG TACCACTACG GCGCATTGTA AAATCTTGAG TATTGCCACT
CCACAATATA TGATAAAAGT C
 
Protein sequence
MSKSTEFFCR LPRSIELTHD YEEITPSPKA AEECRSALYS PNGAFFAYTQ PNEVIVLNTK 
SKAELYHKIE LPEVFDIYFS PQGTFLCLWC KPIQINRENG TWNNNLKIFN LKTKSLIVEW
SQKHQSGWKP QFTQDEKLVA KNFNNKEIHF FDISTSSQET ININQPTHKY KVADAKQPFQ
NFQISPGLNP SVAIFIPEAS GKPASVLIYN VPNFNQPTCS KNFFKAERCQ LKWNSLGTAL
LALASTDHDT SNKSYYGETN LYLLGIAGSY DSRIDLKREG PIHDITWSPS AREFAVIYGY
MPSETTFFDA RGNAIHSLPT APRNTILYSP HARYVLVAGF GNLQGTVDVY DRQNKFSKVV
TFEAANTSVC EWSPCGRYIL TATTSPRLRV DNGLKVWHAS GQLVYLKEYQ ELYAIGWKPQ
TIAEFPPLKQ LEPAPPAHDS AREYIAKKAA ASATAASKPA GAYRPPHARG SSAPSTATSL
YQKELQNNLK LQQQQRNGIN PSSGRARVVP GANPVEIKES KTAQKNRKKR EAKKNSKEDS
PSIEGSPAPS AAPLVAPAVA ATSSQGGVVV GGVASLEEKK IRSLLKKLRA IETLKMKQAS
GEPLEDTQVS KINKEDDIRK ELSALGWND