Gene PICST_50168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_50168 
Symbol 
ID4841069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp390012 
End bp392030 
Gene Length2019 bp 
Protein Length672 aa 
Translation table12 
GC content41% 
IMG OID640392384 
Productpredicted protein 
Protein accessionXP_001386664 
Protein GI150866912 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CAACCGGTAG ATTCGGTGGT AAGTATACTT CCTCCGTCCA TAATCACCGA AGTATTTCAA 
CACGTAGACC AGTACGATTT GCTTAACTTA CTGACGACGT GCACAGCGTT GTACTCTTTG
GCCACGGACC GCTTATACAA GCGAGTAACT GTTCTTCTTA ACGCGGAATT CCCTCTTCGC
TTCAGCACTG CACGAGATTA CGTTCTGGAG AACGGGATCA GATACATGGA CAGCTCCATC
ATCTTGACTC TCGATTCTTT GGTCAAATTT CTTTCAACGA TGAATTCGAG ACCAGACTTG
ATCCAAAAAA TCAAGTTCTT TGTATTTGAC AAATGCCAAA ATCTTGAAAA GATCGACGTA
AATGCGGTCC AGCTGAACAT AATCGAGTTC TTTGGAGCTA ACGCTCGTGA GCTCAACTTC
TTGCACATCA CTTTTGTAGA CTTCCTGACG GGAATAGTCA AGTTGACCAA CTTCTTAAGA
AATGCAAACA TCAGAAACAA GATCTTCAAG TTATTTGTCA CGAAGACCAG TGAGCTCTAT
GAGCCGTGTA TACCCCCTAG CTTGACGAAT CTCTTCTTGA TGTTGAATGA AGTCGAGCTC
ATAGATCAAG AGTACCTTTT TGACTTGTCC AAGCATCCTT ACGATGTATT CAACTCTCTT
TTTACTTTGA CTTGTAGCAC AAACCACCAG GTGGGATTAG AGATATTAAG AAAATTCAAG
CTCTTCGCGC CTGGAATGAA GCTCAAGTTG AAAGCATTAT CATTGTTTCA TTGCCATAAA
GAAGCAGTAG AAGGTAGAAG CGAATTTACA GATGAATACG CTTCTCAGTT CAGTTTGTTG
AATCAAGACA ATGACAAAGT TCTTCTTGAG AAGTACATGC AACAGATCAG CAAGAAGTTG
GACTTCAGCG TCATTGACGA AAAAGTTGAA GTAGCTAACT TGACGCATTT GTACTTAAAA
GTAGAATGTA TTGAACAGAG GCACAGCCAA TGTAACTGTT TCGAGACATT CTTCAAAGAC
TTGACAAAGT ATTCTGAAAG TCATGGAGGT TTGCCTAACT TGGTTAATCT TGAAGTGGAG
TCATTTCCCA ACTTGGATTG GCTCAGACCT CATCAGATAC TCGAAGATAT ATTAACTCCG
TTAGGTGGTT TTGTTAAGAC TTTGAACAAC TTAACAAGAC TAGCGATCGA CTTCTCTACA
CCTGGTTTCA AGATGTTTGA CAACAACATG GGCATGTCTA CGTGGTTGTT GAATAAGTTG
AACGAAAGCT TGATGGAATC TTTTTTTCTC TGCTTTTTCA CTGCTTCTAA CAAGCTGAAC
TTGGTGGCAA ACTTGAAGAC GTTGCAGTTG CCTGATTTCT TGACTTCTTT TATATACTAT
AAGCCAGATT TCTTAGAGTC ATTGTTGCAC ACCTGCCAGT GTTGGGGTTG TGCTTTGGTT
TTGGAGAAGT TGGCAGAGTC GTTCTATCCG ATTTTCAATG ACCAGGACGA CGACGATTTG
GATGAAGAAA GAGACGAAAC CGCTACACTT GATTTAGAGT CGACTTACTA CGTGTTGATA
GGATACATTT TGGGAAAACT ACAAGCAGAT CGTGAAGTGT GTATTCCCAT CAAGGAAAAG
ACATTCAGCT ACAGAAACTA TCCTATTTTC AAGGGCCAGC CTCATACTTT ACATAACGGT
TTCCATAAGG CTACTGGAGA CCAAAATGGA GCCGATGCTT GCAGCTGTGC TGTAGATGAA
GATCCACAGG GCAGGAGTAG CATGAACATA GACAACTTGG TGTGTACTTA TATTGTTCAT
CAGTTGGAAC CAATTATCCA ATACTTGTCG AATATTTTCA CCAATTTGGA CAACTTGATG
ATCCATGGAA TTTATTATGA AATCGACAAA TACAACGATA AATTGGTTCC AATATTCGAC
AGCAGCGAAT ATCCTGCCCA GTTTCTCGAA AATAAGAAGG ACGAAATGGA TCGTGGAGTA
AAGCCCAGTG GGCCGTTTGG CTACTTCAGA AACTGCTAG
 
Protein sequence
QPVDSVVSIL PPSIITEVFQ HVDQYDLLNL STTCTALYSL ATDRLYKRVT VLLNAEFPLR 
FSTARDYVSE NGIRYMDSSI ILTLDSLVKF LSTMNSRPDL IQKIKFFVFD KCQNLEKIDV
NAVQSNIIEF FGANARELNF LHITFVDFST GIVKLTNFLR NANIRNKIFK LFVTKTSELY
EPCIPPSLTN LFLMLNEVEL IDQEYLFDLS KHPYDVFNSL FTLTCSTNHQ VGLEILRKFK
LFAPGMKLKL KALSLFHCHK EAVEGRSEFT DEYASQFSLL NQDNDKVLLE KYMQQISKKL
DFSVIDEKVE VANLTHLYLK VECIEQRHSQ CNCFETFFKD LTKYSESHGG LPNLVNLEVE
SFPNLDWLRP HQILEDILTP LGGFVKTLNN LTRLAIDFST PGFKMFDNNM GMSTWLLNKL
NESLMESFFL CFFTASNKSN LVANLKTLQL PDFLTSFIYY KPDFLESLLH TCQCWGCALV
LEKLAESFYP IFNDQDDDDL DEERDETATL DLESTYYVLI GYILGKLQAD REVCIPIKEK
TFSYRNYPIF KGQPHTLHNG FHKATGDQNG ADACSCAVDE DPQGRSSMNI DNLVCTYIVH
QLEPIIQYLS NIFTNLDNLM IHGIYYEIDK YNDKLVPIFD SSEYPAQFLE NKKDEMDRGV
KPSGPFGYFR NC