Gene PICST_54680 details


Gene Information

Locus tag: PICST_54680
Symbol:
ID: 4837496
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1201638
End bp: 1205360
Gene Length: 3723 bp
Protein Length: 1228 aa
Translation table: 12
GC content: 40%
IMG OID: 640388811
Product: predicted protein
Protein accession: XP_001382447
Protein GI: 150863836
COG category:
COG ID:
TIGRFAM ID:
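
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The record states neither convention, and the variable names are made up for this example.

```python
# Values copied from the Gene Information table.
start_bp = 1201638
end_bp = 1205360
gene_length_bp = 3723
protein_length_aa = 1228

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the protein length excludes the stop codon,
# so the coding sequence is (residues + 1 stop) * 3 bases.
coding_bp = (protein_length_aa + 1) * 3

# Bases inside the gene span that are not coding; for a gene
# flagged as spliced these would be expected to be intronic.
noncoding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, noncoding_bp)  # 3723 3687 36
```

Under these assumptions the span equals the listed gene length, and the 36 bp difference between gene length and coding length is consistent with the gene being marked as spliced.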


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTATCG ATGATGATTC GCTCTATCTC TACCATCTCA CGCTAAGGGC TCCTTCCAAT 
TTCACGCTGT CGGTGTTAGG GCAGTTTTTG GGCGAAAAGA AGTCACAGGA GATTCTAGTA
TCATCCGTTA GCACTCTTCA ATTGTTACGT CCGAATGCCG AAACTGGCAA GATAGAAGTG
GTGGCGAGTC AAAATACTCT CGGTGTAATA CATAAAATTG AAAAGATCCG AATCGTGGGT
ACTCAAAAGG ACCTAGCAGT TGTAGTAGGA GAGTCTGGAA AAGTGGTGTT TCTAGAGTTT
GACGTAGATT TGCATAGGTT TGTGCCTGTT TTACAAGAAC CGTATGCAAA GACAGGATTT
GGAAGAGTCA ATCCAGGAGA GTACCTTGCT GTAGATCCTC AGAGTCGCTG CATTTTTCTT
GGTGCCATAG AAAGAAACAA ATTGATTTTC AAAGTAGAGA CAGACTCTCA AGGAAAGGTA
GAGCTCTCTT CACCTTTAGA GGCACACTCG AAACATACTT TGACATTAAG TGTCGTAGCG
TTGGATACTC AGTTTTCCAA CCCTGTTTTC GCTGCCATTG AGTGCGATTA TTCTAACTAC
CACAGTGATG GAAAGGTTCA GTTCGATGCG GATTCCTCGC CTCTTCTACT CAATTATTAC
GAGCTTGACC AGGGGTTGAA TCATATAGTC AAAAAAAAGT CCACAAATAC CATTCCTTCA
TCGGCTACGC ATTTGATCCC ATTACCGAGC CATGTTGGTG GGGTTTTCGT CTGTTGCAAA
AACTATATCA TATATGATAA TCTTCATAAA AATCTCGAAA GACTCTATCT TCCTTTACCA
CTTAGAAAGG ACAGCGAATA TACCGTAGTT GTCAGTCATG TCGTCCATAA ATTGAAAAAG
AACAATTTCT TTGTTTTACT TCAATCTTCT ATGGGTGACT TATTCAAAGT GACTGTTGAA
TACAACAGCG ACAAAGAGTT GATTGAGGAT ATTCAGATCG GCTATTTTGA CACCATTCCC
GTTTCATCGT CGTTGAACAT TTTGAAAAGT GGATTTCTCT TCGCAAATGT GTTGAACAAT
GACAAGCTCT ACTACCAGTT CGAGAAATTG GGAGACGACG ACGAAAATAT TCAGTTGAAA
GCATCTCCTG ATATTTCATC TATTGACGAA GAAGATAGAA GCAACAGAAC ATTTACAGTG
AAAGCTCTCG ACAACTTAGC ATTAGTAGAG ATTTTCACTT CTCTCAGTCC GATAACTGAT
GCTGGCATTG TGGAGTCGAT TTCTAGTGGT ACAGCTGACT CATTACAACA GATGATTACC
GCATCTTCAC ATTCGCATTT GAAATCACTA GTACATGGAA TTCAAACCTC GACTCTTGTA
TCTTCACCAT TGCCTATCAT CCCAACTGGA GTGCTCACAA CGAAGTTGTT TGCAGATTCT
CGTAGCGATG AATACTTGGT CATATCCTCC ACAGTAGCTT CCCGAACTCT TGTATTGTCT
ATTGGCGAGG TAGTTGAAGA AGTCGAAAAC TCTCAATTTG TCAATGATCA ACCTACTCTT
GCAGTTCAAC AAGTAGGGAC TTCTTCAGTA GTTCAAATCT ACACAAATGG GATACGACAC
GTAAAACACA CAAGAACCGA AGATAAAGAA CAGTCTATAT CCAGAAAGAT CACAGATTGG
TATCCGCCGG CTGGCATCAC TATTGTCAAT GCCAGCACTC ACAGGGAACA GGTGATCATT
GCTCTTTCAA ATGCCGAGAT TTGCTACTTT GAAGTAGATG CCACAGACGA CCAGTTAATA
GAGTATCAAG ACAGAGTAGA AATGTCTAAT TCAATTACAT CGATCGCTAT ATGCGAGGAG
ACAGCAAACA AAAAGAACTT GTTTGCTGTA GTTGGCTGTT CTGATGAAAC CATTCAGGTT
TTATCTTTGC AACCACACAA TTGTCTTGAG ACATTATCGT TGCAGGCTCT TTCAGCTAAC
AGTACTTCAT TGCTGATGCT CCAGAACGAC AACACAACAA TGGTTCACAT TGGAATGGAC
AATGGACTTT ATGTTCGGAC TTCTATAGAA GAAATCAGTG GAAATCTATC TGACACTCGC
ATAAAGTATC TTGGCTCCAA GCCCGTGACT TTATCAGTAA CCAAATTACC AAATGGAAGC
AAAGCGATTT TGGCTATCTC CTCCAGACCT TGGATTTGTT ACTACAATAG GAGCGAGTTC
AAAGTCACTC CTCTCCTTGG TGTCAAAATC TTGAAAGGAG CATCTTTCAG TTCTGAGGAC
ATAGGAGGTG AAGGAATTGT TGCTCTATCT GACAACAATT TAATTGTCTT CACTATAGGC
AAAGAAGACG TAGAGTTCGA TATAAACCAG GACGTCAATA TTGAAAAAAT CCGTTTAAGG
TATACACCCA GGAAGTTAAT TATAGACGAT GACGGAAAGA GCTCTAAAGT AAATTACATA
TATGCCTTAC AGTCAGAATA CGGAACTAAG AGTCCATTCT CACCTTCAAA CTTGAATTCG
GAAGATGACC CTGAAAGTGA AATAGACCAA GATTATTACG ACGCTTTTGG CTATGAGACT
GAAGTTGATA AATGGGCATC ATGCATTCAA GTGGTAGATT TTGAAAATCT GAGTATCATT
CAAACAGTTG AGTTCTCTAG CAACGAGAGT GCTATATCCA TGGCAAAATT GCACTTTGTA
TCGTCAGGCA AGGGTAATAT GGAACATTTA ATTATTGGAG TTACTACAGA TAGGAAGTTT
CTCAAAAATT CAGTTGGGAA AAGTTACCTA TTCACATTCA AAATCCAAAA GAATACCAGA
AAATCCAATA AGAAAAGACT AGAGTATCTT CATAAGACAG AGATAGATTG TTCACCTACA
GTGATGATTC CTTTTAATGG AAGATTGTTG GTTGGCATGG GAAAGTATTT ACGACTTTAT
GATATTGGAC ATCGCCAATT GCTTCGAAAA TCGTCTACAA ATATTGACTA CATTTCTTCT
ATAGTAGACC TTGTACATAC TGGAGGAGAG AGAATAGCAT TTGGAGATTC TCATTCGTCC
ATTGTATTTG CCAAGTTTGA CTCTGCTGAG AACAGATTTG TACCATTTGC TGACGACATA
ATGAAGCGAC AAATTACAGC AGTCGCAGCT TTGGATTACG ATACCGTTAT AGGCGGTGAC
AAATTTGGAA ATGTATTCGT TTCTCGAGTT CCCGATTCCG TTTCGAAAAA GTCTGATGAA
GACTGGAGTC TATTGAAAGT CCAGGAATCA TATTTGAATG CTTCTCCATC TAGAACGAAG
AACCTCTGTG AGTTTTTCCT TCTGGATACA CCAACTTCCT TCACCAAAGG CAGTATGACG
ATTGGTGGAC ATGATGGCAT TATTTACACT GGTATTCAAG GAACTGTAGG ATTGCTTTTG
CCTCTTTCTA CAAAGCTGGA AGTCCAGTTC ATAAACAGTT TGGAGCAATC GTTGCGACAA
GTATTCGACT ATAACTTTGA TGACTACGAT AGTAAGCAAA TGGGTTTCAA TTTACTTGGT
ATGGATCACT TGAAATTCAG AAGTTATTAT AATCCAGTGA AGAACGTCAT TGATGGGGAT
TTGATAGAGA AGTACTATGA GCTTAGCCAA AGCTTGAAAA TAAAAATTGC CCGTGAATTG
AATAGAACAC CAAAAGAAGT CGAGAAGAAG ATCTCTGACT TACGAAATAG ATCAGCATTC
TAG
 
Protein sequence
MSIDDDSLYL YHLTLRAPSN FTSSVLGQFL GEKKSQEILV SSVSTLQLLR PNAETGKIEV 
VASQNTLGVI HKIEKIRIVG TQKDLAVVVG ESGKVVFLEF DVDLHRFVPV LQEPYAKTGF
GRVNPGEYLA VDPQSRCIFL GAIERNKLIF KVETDSQGKV ELSSPLEAHS KHTLTLSVVA
LDTQFSNPVF AAIECDYSNY HSDGKVQFDA DSSPLLLNYY ELDQGLNHIV KKKSTNTIPS
SATHLIPLPS HVGGVFVCCK NYIIYDNLHK NLERLYLPLP LRKDSEYTVV VSHVVHKLKK
NNFFVLLQSS MGDLFKVTVE YNSDKELIED IQIGYFDTIP VSSSLNILKS GFLFANVLNN
DKLYYQFEKL GDDDENIQLK ASPDISSIDE EDRSNRTFTV KALDNLALVE IFTSLSPITD
AGIVESISSG TADSLQQMIT ASSHSHLKSL VHGIQTSTLV SSPLPIIPTG VLTTKLFADS
RSDEYLVISS TVASRTLVLS IGEVVEEVEN SQFVNDQPTL AVQQVGTSSV VQIYTNGIRH
VKHTRTEDKE QSISRKITDW YPPAGITIVN ASTHREQVII ALSNAEICYF EVDATDDQLI
EYQDRVEMSN SITSIAICEE TANKKNLFAV VGCSDETIQV LSLQPHNCLE TLSLQALSAN
STSLSMLQND NTTMVHIGMD NGLYVRTSIE EISGNLSDTR IKYLGSKPVT LSVTKLPNGS
KAILAISSRP WICYYNRSEF KVTPLLGVKI LKGASFSSED IGGEGIVALS DNNLIVFTIG
KEDVEFDINQ DVNIEKIRLR YTPRKLIIDD DGKSSKVNYI YALQSEYGTK SPFSPSNLNS
EDDPESEIDQ DYYDAFGYET EVDKWASCIQ VVDFENSSII QTVEFSSNES AISMAKLHFV
SSGKGNMEHL IIGVTTDRKF LKNSVGKSYL FTFKIQKNTR KSNKKRLEYL HKTEIDCSPT
VMIPFNGRLL VGMGKYLRLY DIGHRQLLRK SSTNIDYISS IVDLVHTGGE RIAFGDSHSS
IVFAKFDSAE NRFVPFADDI MKRQITAVAA LDYDTVIGGD KFGNVFVSRV PDSVSKKSDE
DWSLLKVQES YLNASPSRTK NLCEFFLSDT PTSFTKGSMT IGGHDGIIYT GIQGTVGLLL
PLSTKSEVQF INSLEQSLRQ QMGFNLLGMD HLKFRSYYNP VKNVIDGDLI EKYYELSQSL
KIKIARELNR TPKEVEKKIS DLRNRSAF
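
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. As a minimal sketch of what that means for this gene, the snippet below translates the first 90 bases of the gene sequence with a hand-built codon table. The `translate` helper and the table construction are illustrative and are not part of the source database.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

codon_table = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
}
# Translation table 12 differs from the standard code here: CTG -> Ser.
codon_table["CTG"] = "S"


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(codon_table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 90 bases of the gene sequence listed in this record.
first_90 = (
    "ATGTCTATCG ATGATGATTC GCTCTATCTC TACCATCTCA CGCTAAGGGC TCCTTCCAAT"
    "TTCACGCTGT CGGTGTTAGG GCAGTTTTTG"
)
peptide = translate(first_90)
print(peptide)  # MSIDDDSLYLYHLTLRAPSNFTSSVLGQFL
```

The output matches the first 30 residues of the protein sequence listed here. The serine at position 23 comes from a CTG codon, which the standard code would translate as leucine.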