Gene PICST_89595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_89595 
Symbol 
ID4839361 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp586873 
End bp588840 
Gene Length1968 bp 
Protein Length503 aa 
Translation table12 
GC content42% 
IMG OID640390676 
Productpredicted protein 
Protein accessionXP_001385121 
Protein GI150865774 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.136395 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTTTTTCAAA TCAAATTCAC AATTCAGTTC ACGGAATATC TATTCCATTA TTTCTCAATT 
CTGTACATAC ATGTACCATG AACGCGTCCA CCATAACCGT CCATTGGCAC AACGACAATC
AGCCCATTTA CTCTGTGGAC TTCCAGCCCT CGGCATCTGG CCCTTCTCCT AGATTGGTCA
CAGGAGGAGG AGACAACAAT ATCCGAGTCT GGAAATTGCA CCATAAACAT GACCAAGTAA
ACCTAAATCA ACATGACCAA GGGGGCCAAT CTCAACAAAG TCAACAATAT CAAAATCAAA
ACCTACAAAA TACAAGCGTA GAGTATCTCA GCACATTGAG AAAACATACA CAGGCTGTCA
ATGTCGTCAG ATTTAATCCC TTGGGTACTA TATTGGCAAC AGCTGGTGAC GATGGAACGT
TGATTCTTTG GAAACTCGCT GACCGGGTCT TGAAGGATTT TGAAGCGGAG GATGAAGACG
ATGATGATGT TCAGGAGTCG TGGCAGGCTG TGTGTCAGTT CCGATCGTCG ACTTCTGAAA
TCAATGATAT ATGCTGGTCG TCAAATTCCC GGTATTTGGT CACTGGTTCC ATGGATAACA
TCACTAGAGT GTATCATATC GATTATGCCA ATGATAAGGT CACTGGTACT CTTGTGACAT
CGAGCAAGAA CCATAACCAC TACATCCAGG GTGTCTACTG GGACCCACTT GACCAGTACA
TAGTAACACA ATCAGCAGAC AGATCCGTCT GCGTATACAG AATCGTTAAG CATAAGAAGA
AGGACGAGAT TGAAGACATA AAACTAGCAC ACAGATTCTT AAAATTTAAT AACCAGCACC
TTTACCACTC AGAAACATTG CAATCTTTCT TCAGAAGGTT GTGCTTTTCA CCAGATGGAA
GTTTGGTAAT AACACCAGCA GGTTTGGAAA GTGATCCTAA CACTCAAAAC GACGAAACGA
CATCAACAAC TGAAAATCTC GAGTATGATT CTTCTAATAA TACCAACATT GACATCAATA
ATATCAAAAA CAATAATAAC TCCAATAACA AGACAGACGA TTCGACCGCT ATCAACACTG
TCTATGTGTA TTCCAGATAC AGTCTCTTGC ATACGCCCAT CTACAAGATA TCGAACTTGA
ACAAGCCAGC TATTGCCGTG GCATTCAACC CATTTCTATA CGAGCCTAGT GCAACCAGTC
CAGTTCTAAA GTTAGCCTAC AAGATGATAT TTGCCGTTGC AACCCACGAC TCGATCCTAA
TATATGATAC GGAGAATTTC AAGCCTTTGG GTTACGTTTC CAACTTACAC TACAGTTCCA
TAACTGATCT CAAATGGGAT TCCGACGGTA CAAAGATCAT CGTGAGTTCA ACTGATGGAT
TCTGTCTGAT AATATCGTTT GATGACAATG TGTTCGGCCA GCGATATGCA AAGAAGGAAG
AGAAATCAGA GGGTGTGCCT TTGACTGTTC CTGTCACTGA TCCTCCGACA CCTGTGGCAA
CAAATTCAAG AAGCTTGACT CCTATCAACA ACCTAAAAGC TCTTCATTTG TCCAGTGATG
TGGGGGAAAT AGAGGACTAC AAGTCGGATT TCGACTCATC GGAGGCAAAG GACGTAGAAA
TGATACTGGG AGACACCAGT CCTGAAGTTG AAATAGTAGA GATAATATCC GAAGAAGAAA
CTACGGATGT AGCTGCTCCT TCCATGGGAA CTATAGATAA GTTTTTCATG AGGCTGAAAG
AGCTCTCGCC CAACAAGGAC AAGAACAAGC GTAGAGTTGT GCCTACATTG GTAAATAACT
AGAAAGTGCT ATTTGTTATC TGTTAACATT AGTTAGCTAG TTAACGATAG ATAATAAGAA
TACTAAGTTA ACAATACGTT AACCATGGGG TACTTAGTTA GAATTAGATA CCAGTTAGCA
ACATTGAAAT ATGACGAGCG AGAACGGTCC TTCTCGTGAG GTGAATCT
 
Protein sequence
MNASTITVHW HNDNQPIYSV DFQPSASGPS PRLVTGGGDN NIRVWKLHHK HDQYLSTLRK 
HTQAVNVVRF NPLGTILATA GDDGTLILWK LADRVLKDFE AEDEDDDDVQ ESWQAVCQFR
SSTSEINDIC WSSNSRYLVT GSMDNITRVY HIDYANDKVT GTLVTSSKNH NHYIQGVYWD
PLDQYIVTQS ADRSVCVYRI VKHKKKDEIE DIKLAHRFLK FNNQHLYHSE TLQSFFRRLC
FSPDGSLVIT PAGLENDSTA INTVYVYSRY SLLHTPIYKI SNLNKPAIAV AFNPFLYEPS
ATSPVLKLAY KMIFAVATHD SILIYDTENF KPLGYVSNLH YSSITDLKWD SDGTKIIVSS
TDGFCSIISF DDNVFGQRYA KKEEKSEGVP LTVPVTDPPT PVATNSRSLT PINNLKALHL
SSDVGEIEDY KSDFDSSEAK DVEMISGDTS PEVEIVEIIS EEETTDVAAP SMGTIDKFFM
RSKELSPNKD KNKRRVVPTL VNN