Gene PICST_76300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_76300 
Symbol 
ID4836908 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2300422 
End bp2303560 
Gene Length3139 bp 
Protein Length904 aa 
Translation table12 
GC content44% 
IMG OID640388223 
Productpredicted protein 
Protein accessionXP_001382669 
Protein GI150864000 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000841792 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AGGAGATTAC ATTCTTCTGA TACAACTGGA CCTGTTGAAT CTTGATCTCA TCCATATGCT 
CTTGATCTGT TCTGATCTGA TTTTGAACTT CTGAGCCAGT CTTTAATCAT CTATTACGTC
GTTCTTACTA CAGCATTATT CTCGTTCTCA GTACACCATA ATTGAAATCT TCAGTCTCCA
AAACTTCACC AAACCGCACC TCCAGGCTTT CCCAAAATGC ATATCTTCAC TGCCAAACAC
CAGAAGCTCA TTCTCCAGGT GTATCCGCCT GGAAAGGCAG TCGACAAAAG AGCCAACCCG
TCGGAGCTCT CGTACTTACT CTACTACGCC TCCACCAGAA GAGTCAAGTT GGAGAAGGTA
GTCACTTTTC TCGAGGCAAA GACAGCTAGT GATGTCTCCC ACAACCGTTC AGGTAACCTC
CAAGTGACTT TGGAGATCAT CTCCTCTCTT ATCGAGAAAT GTGCCGACAA CTTGAACGTG
TTTGCTCTGT ACGTGTGTTC GATTCTCACG TCAATTCTTA ACACCAAGGA CTTGTCACTC
TGCAAAAGCG TTGTAAACAC CTATGGAGTG TTCTGTGAGA ATCTCGACGG CGGCTTGTTT
TCAGGTGACA AGGCTTTTGT AGACTTGTTC ACCAGCTTGT CTGAAGACTT GATCTTGACA
GGCTCCAAGA ACCTGCAAGC CCCGGGCCCT AACCAGTTGG AATGGAAGAT GATAGCACTT
ATGGCCATTA GACACATTTC GCCATGTTTG AGTCATTCGA CCAAAATTGC CACCAAGTTC
ATCAACATCT CAATTCCTTT CTTGCTGAAG ACGATTCACA GCACAAACTC ACAAAGTAAC
TTGCTTTCAC GTGTCCGGAG CAACATAAAT GTAGAAGGGG ACGATCGCCG TCTTCAGAGA
GTTCTTCTGT CAAAGGTTCC TCAAAATATC CACAAGCAAA TCCAAGAAGA TTTCGACAAC
GATAATTTGC TGGAAGATGA CATCACCGAA GAAGCTCTCC GTAGTTTGAA GGCTTTGTTT
AACACCACCC TAACAAGTCA GATCTCACTG GCCACTAGAG CTGTTGTCAA ATACAATTAC
GAAAGTAAAA CCGACCTGAA ATGGGGAGCC ACTTTCTTAG AGATGTGCAC CACCTGGATC
CCAGTCCAGC TTCGATTTGT AAGTTTGTCG ACCTTGATGG CCAGATTGAC TACAATATCT
AGCGAGGCCA CCAACAGTAG TCCCAGTTTT GGTTTACAAA CGCAGTACGC TAGTCATATC
TTGGCTTTGG TATCTTCTGA TGTCAACATG ATTGGTTTGT CTGTTTCTGA CATCATTCAG
CAGATATTGT CGTTGCAATC AAATTTGATC TTGGATCAGC TGCTGTTCTT GAAGCCAGAA
CAGGTCAAGA GATTGTCGTC CATATACTCT GGCTGTATTT GTAACTTGGC ATCCCACATC
TACTACTACG ACCAGGTTCC TGATGCCATC CAAGAGATCT TGGTCAAGAT CGACACCGTT
GTAGACTTTT CATTTGTCGA CAGAGACCAG GACGTTCTGC AAAACATCAA CGCCAACAAC
ATCCACAACT TGGTGATGAC ATTGTTGGGC GATGTTTCTG TGATCTTTAG TACTTTGACC
AAGAAGTCAA GCTCGATAGC CAGAAACCAT GTAAACTTGG AGCATTGGGA AATAAGCTTG
CCGTTGTTGT CACCTGAGAG TCTGTATGAC AACGATACGC TTAGAAAGAC TGTCTTCACA
CCTTCACAGA TCATTGACAT TCAAATTAAG TACTTGAAGG TGTTGCACGA TTTCTTAACT
AACGAGTTGT CTTCAGCTGG CAACACCCCT CAGCCGGTTA AGAAGAATTC GTTAGATCCA
AAGACAATTG AGTCCAAGAG ATCGTCATTT GAGTCTGTCA ACTCTGAGAA TTTGAACGGT
ACAGTCAAGG ACTTGTTGAA ACCCAACATC AATCAGTATA TATCCAACCC CAACAACGTG
ATCTCTCATT TCTTGCTCTA TGTGCACAAA TTCTTCAACT TCCACGAGTC TCCCAATACT
GAAGTTGTAC TTTCTTTGAT TTCTGTGATG AAAGACATGC TCAACATATT GGGTGTCAAC
TTTGTAACCA ACTTCTTACC TTTCCTTTAC AACTGGTTGA TTCCCTTGAA CGATATTAGC
GATGTTCCAT CCAATGCGAA GTTCAGAGAT TCCATTGCAC ATATTGTGAC CTACTATTGT
TTGAAGACAT TGGATGACAA ATACCCCGAC GACTTGGAAG GTTATGCTTG TGGCAGTAAG
TTTTTCTCGA AGTTGTTGCG AGCAGTGGAG TACAGAAAAG TCAACAAGTT GTGGATCCAG
GGACTTGATT CGTCGCCTAC CGATCTTGAG ATCATCAAGA ACACGGCCAG TATCAACGCC
GTTGTTCACA GTTCTGATGT TTACGAGAAC GATACCAATT CGAAGTTCTC CTTTACCAGA
AAGGACTACG ACGACTTTGT CTGTGGCAAT AACTTCACCA TTGTCCACAT CAATCCAGCC
AAGTCTTTGG ATTTGAACGG AATCTCTACT GTAGTGCATG CTGATGCAAG AGGTTCCTCT
GGAAGCGGAA TCTTAGGTCC CGGTGATATT AGCAATAAGT TTGGTGGTTC ATTCCAGGGC
TCGGGTTCGA CTCCAGAAAG TGAAGTTCAC TCGCTGATTG AACTGTTAAG GCACAATGGC
AACTCTGGCT TAGGCTATGG GTTAGGCACT GTAGGCGATA TCAGCTCTAT CCATTCTGAG
ATCTTGCAGA ATACCCAGCA TTTGAACGGT AGAAACTTCA GTTTGACAGC GAACGGAACC
TTCAACAGTG ACATCACCAG CTCGTCGATT CTAACCTCTG ATGGCAGATA TGTCAATTCG
CCTAGAGTCG CCGACTTGAA GGACTTGATG ACAGACCCCA GAAGAGGCTA CAGAAAGACG
TCGTTGATAA GCGATAACAT CTCTACCTCT ACTACAAGCC ACATTGGCCA GACACCAGGT
TCTGTTCTTG GAAAGCAGAT GGTGTCTACA GACTTGGAAT CGATTCTCAC CAGCTTGAAC
ACTGAAGATG ATTCACGGAT TATAGTCTAG TATTTATTGT TATAGAGATG TTCATGTAAA
ATAAAATTGA TAAGATGAA
 
Protein sequence
MHIFTAKHQK LILQVYPPGK AVDKRANPSE LSYLLYYAST RRVKLEKVVT FLEAKTASDV 
SHNRSGNLQV TLEIISSLIE KCADNLNVFA SYVCSILTSI LNTKDLSLCK SVVNTYGVFC
ENLDGGLFSG DKAFVDLFTS LSEDLILTGS KNSQAPGPNQ LEWKMIALMA IRHISPCLSH
STKIATKFIN ISIPFLSKTI HSTNSQSNLL SRVRSNINVE GDDRRLQRVL SSKVPQNIHK
QIQEDFDNDN LSEDDITEEA LRSLKALFNT TLTSQISSAT RAVVKYNYES KTDSKWGATF
LEMCTTWIPV QLRFVSLSTL MARLTTISSE ATNSSPSFGL QTQYASHILA LVSSDVNMIG
LSVSDIIQQI LSLQSNLILD QSSFLKPEQV KRLSSIYSGC ICNLASHIYY YDQVPDAIQE
ILVKIDTVVD FSFVDRDQDV SQNINANNIH NLVMTLLGDV SVIFSTLTKK SSSIARNHVN
LEHWEISLPL LSPESSYDND TLRKTVFTPS QIIDIQIKYL KVLHDFLTNE LSSAGNTPQP
RSSFESVNSE NLNGTVKDLL KPNINQYISN PNNVISHFLL YVHKFFNFHE SPNTEVVLSL
ISVMKDMLNI LGVNFVTNFL PFLYNWLIPL NDISDVPSNA KFRDSIAHIV TYYCLKTLDD
KYPDDLEGYA CGSKFFSKLL RAVEYRKVNK LWIQGLDSSP TDLEIIKNTA SINANDTNSK
FSFTRKDYDD FVCGNNFTIV HINPAKSLDL NGISTVVHAD ARGSSGSGIL GPESEVHSSI
ESLRHNGNSG LGYGLGTVGD ISSIHSEILQ NTQHLNGRNF SLTANGTFNS DITSSSILTS
DGRYVNSPRV ADLKDLMTDP RRGYRKTSHI GQTPGSVLGK QMVSTDLESI LTSLNTEDDS
RIIV