Gene PICST_31188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31188 
Symbol 
ID4838630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp155750 
End bp159415 
Gene Length3666 bp 
Protein Length1221 aa 
Translation table12 
GC content47% 
IMG OID640389945 
Productpredicted protein 
Protein accessionXP_001383992 
Protein GI150864962 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0032345 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0791309 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCAT TCTGGGACAA CAACAAAGAC ACGTTTAAGT CTGCTGGCAA AGCTACGATG 
CGTGGAGTGG CCAGCGGAAC GAAGGCTGTG AGCAAAGCAG GCTATAGAAC CTATAAAAAC
CATTCAGGAG GTGGTGGAGG CAGTAGCAAC AATGATACTA CTGGTGAAGT AGACCAGCCT
GAGTATATTG GACCACCGAG ACCGCTTCCC TCCAAAGACC AATTGCTGGC ACTTCCTCCT
CCACCAAAGA GAAATATTCC CACCTATGAA GTTCCAGAAA AAGGAGCTCC ATCTCAGTAC
AGTGTTCCTC AGCCTCAGCA GTATCTTCAA CAACAACCGC CACAAAATCA ACCGCAACAA
ATTCAACCGC AACAAATTCA ACCACCACCA CAAAATCAAC AACTAGTACA ACAACCCCAA
TTGTATCAAC AACCAAATCA ATATCAAAAT CAGTATCAAA ATCAAAATCA GCCAAATCAA
ATTCCGCAAC CTCAGACGAA TTTGTACACT CCCGCAAATT CGCAGCAAAC ATTTTCGCAG
CCAAATGCAC CAGCTCAACA ACTTCCACAG CAACTCCCTC CACCAGTTAA TGGATACTCA
GATGCTAATG GTCAACAAGG GTATTATCAA CAGGGCTACC AAACTACACC TCCGGCTGGT
AATGGCTATG GGGTTCAACA TCCACAGCAG CCTCAACAGA CACAACAACT TCCTCAAGGG
CAACAGTCGC AGAATGAGAT GTATGCTCAG GCAGCCAAAC TGGCACTTCC TGTTCTTCTG
AATTTGTATC AACAACATCA ACAGGGCCAG ACCCAGAACT CTGATGCTTC GCAACAGTTT
CAACAGACTC AGCAGACTCC GCAACAGCAG CAACAAGCAA TGTACTCTCT GGCCGCAAAA
CATGTTCTTC CTGTTCTCCT GAACATGTAC CAGAACCAAC AGCAGACTAG TGAGCTTCAG
CAAGGTCAGC AAGGTCAACA GCCTCAAGGT CAGCAGCCTC AAGGTCAGCA GCCTCAAGGT
CAGGTGCCTC AAGGTCAAGC TGATTTGTAT GCTCTGGCAG CGAAGCACGC ATTGCCTGTA
CTTCTGAATA TTTACCAGCA ACAATCTCAA CAACAGCAAC AACCTCAACA ACCTCAACAA
CAGCAACAAT ATCAGCAACC TCAGCAATTT CAGCAACCTC CGCAACCTCA ACAACCTCAA
CAGCCTCAAC AACCTCAACA ACCTCAACAA CCTCAACAAC CTCAACAACA GCAACAATAT
CAGCAAGTCC AATTGCCATC AAACCAATTT CAAAATGAAC TTGGTCAGCT TCCTCAACCT
GGCCAACTTC CTCAACCTCC AGCTAGACAA TTGCCACCAC CTCCTCCAGA ACGCACTGTT
CCTGCAACCC CAGGTATTCC AATTGCTTCA GGACAGTTCC AAGCTCAAGC TCCATTATCT
TACGATTCAC CTTATTACCA ACCTCCATCT ACCCAACCCG CTGCAGCTGC TCAGCCAGCA
CCGGCAAAGA CTTACGGGTT TGGTGTTGCA CAAGTTCAGG ATCCAGATGA AGCTCCTAAA
CCCAAGAAAG AGTTGCCTGA TCCATCTCTG TTCGCTCCTC CTCCTATTCG TGCAGATAGA
GTAGGACCAC CATCTACAAA ATCGGCTACT TCTGCAACTT CTCCTTCTCC TACGCAGCTT
CCAGCCACTA CTTCTGGCAG CAAACTGTCT GTTGGAACTG CTCCTCCTTC AGCTCCCCCT
AGAGCATCTT CTGCTGAACA AACCACTGCT CCTTCTGAAG TCAAAGAGGA ACCGAAGCTT
CCTTCCAAGA CAAACCTCAT GGACTTTGAT ATATCCAAAT TCGGTGCTCC TCCTCCTAAA
ATATATAGAG GACCACAAGA CGCTCTTCCC CTAAAGAAAT CTTCTGCTAA TGCTTCGGCT
TCAACGGTTT CTTCTTCTTT TACACCTCCA CCTCCACCGC CTGCTCGTGT TTCCCCTTCT
CCTCTGCCCC AAGCGTACTC TGAACAACCT GTAAAACCAC CAAAACCACC AAAACCAACC
AAACCAACTA AGCCAAAGAC TTTGGAAATG GAGGATATAC CTCCTCCAAG GCCCGCTAGA
GCAGACTCTA CAGAGGCTAC TCCACCACCG ATGCCAGCTA GAAAGCATGA TATTGTTGAA
ATGGAAACCC CACCTCCTAA ACCTTCTAGA CCAGTAATAA CAGTAGATCT GAAGAAGGCT
CCTCCTCCAA CTCCTTCAAG AAAGGCAGGG GTAGCAACAG ATCGCCCAAC TCCACCTCCT
CCATATTTGG AAGTTTCTCC CCATCCCGAA ACATCGAATT CGCCTGTTCC TCCTCGTACA
CCTAATTTTG CTGCAGAAAT CGCAAAAAGA AATGGTCACA GTGCTTCTCC GCAGCCAGAA
CCAATTCAGA AGAAGGCAGC ACCTCCACCT GTATCCAAGA AGCCACTGAG TCTTCTGACC
CACGATGCTA AAGAAGAGGA ACATGTATCT CTGAGGAATC CTTCTGCAAA AGGAGCTTTT
ATTGAACAGC TTCAATCGCA ACTTCAAGCT ACCCATGTTG GAGAAATTCC TCATAAGGTT
CCTCCTCCAG TACACTCAAA ACTAGGACCT AAACCCTTCG AAAAAAATGC ACCAGTAAAA
GAACTGGAAC CCATAGAAAA GGCAAAACCA GCAGTGAAAC CAAAGCCAGC CGTTCAACCC
AAATCTGTTG TGCTGCCTGT TGTTCCTTCA GTACACTCTG TTGTTCCTGT TATTCCTGCT
CCAGCCGTTA TACCTTCTGT TCCTGCTCCT AGATCAACAG CTCCGGAAGT TTCTGCTGTA
CCACCACCTC CTCCCACAAG AAACTACGTC AGATCCAAAG CTCCAATACC AGTAGCCCAT
CCATCAAATG AACCACCGCA ACTTGATTTG GAGATGTATT CGGGTTGGTA CGCTGATGTC
AATGGACCAA TTAATTTTCC TGAGGCATTA GCAGGCTTGA ACAACCAAAG TTCAATGCTG
TACTCTACAT CGGGTGGCAT CACTAACTAC GAGAGACGTA TAAGTTTAAG ATTGAAAGAT
TTGTCGTCTA TCAGATATGT AATAAAGTGG TCTAGTAATA ATGTAGCAGG AGCTACGGTG
AAAATCGACA AGTTTATTCC TTCTCCAATC TCAAGCAATA TTCCATCCAA GGAGGAATTG
GTAGGGTATC TGCAACAATA TGGAGAGCAT GTTGCTTCGT GGTGTGAACA TAGATATGGC
CAGCAAGTTG GCCGAGGTGA GTGCTGGGAT CTTGCCAAAG AAGCTTTGGA GAAAGGGTGT
GGAAAGCACG CTTTTGTCAG TGAATACTAT CATCACGGTT ATCCTATACT TCTGGTTAGA
GGTGTTAATG GTATTATGCA ATTGATAGAT GACAAACAAC CGTTGGACGA AGTGAGGCGT
GGAGATATCC TCCAGTTCAA GAGCTGTACA TTCTACAATG CTGCTAGTGG AAGAACCCAA
ACCGTCGGAG CTCCAGACCA TACTTCCGTA GTTTTGGGTA ATGTAGGCGG CAAGATTCTT
GTGGCAGAGC AGAACGTTAA CAACGTTAGA ACCGTTCAGA ATGGAGAGTA TATTTTGAGA
GATTTGACTC TGGGAGACGT TTGTGCATAT AGACCAGTTC CTGCCAGTTG GGCAGGGTCA
TTGTAG
 
Protein sequence
MSSFWDNNKD TFKSAGKATM RGVASGTKAV SKAGYRTYKN HSGGGGGSSN NDTTGEVDQP 
EYIGPPRPLP SKDQLSALPP PPKRNIPTYE VPEKGAPSQY SVPQPQQYLQ QQPPQNQPQQ
IQPQQIQPPP QNQQLVQQPQ LYQQPNQYQN QYQNQNQPNQ IPQPQTNLYT PANSQQTFSQ
PNAPAQQLPQ QLPPPVNGYS DANGQQGYYQ QGYQTTPPAG NGYGVQHPQQ PQQTQQLPQG
QQSQNEMYAQ AAKSALPVLS NLYQQHQQGQ TQNSDASQQF QQTQQTPQQQ QQAMYSSAAK
HVLPVLSNMY QNQQQTSELQ QGQQGQQPQG QQPQGQQPQG QVPQGQADLY ASAAKHALPV
LSNIYQQQSQ QQQQPQQPQQ QQQYQQPQQF QQPPQPQQPQ QPQQPQQPQQ PQQPQQQQQY
QQVQLPSNQF QNELGQLPQP GQLPQPPARQ LPPPPPERTV PATPGIPIAS GQFQAQAPLS
YDSPYYQPPS TQPAAAAQPA PAKTYGFGVA QVQDPDEAPK PKKELPDPSS FAPPPIRADR
VGPPSTKSAT SATSPSPTQL PATTSGSKSS VGTAPPSAPP RASSAEQTTA PSEVKEEPKL
PSKTNLMDFD ISKFGAPPPK IYRGPQDALP LKKSSANASA STVSSSFTPP PPPPARVSPS
PSPQAYSEQP VKPPKPPKPT KPTKPKTLEM EDIPPPRPAR ADSTEATPPP MPARKHDIVE
METPPPKPSR PVITVDSKKA PPPTPSRKAG VATDRPTPPP PYLEVSPHPE TSNSPVPPRT
PNFAAEIAKR NGHSASPQPE PIQKKAAPPP VSKKPSSLST HDAKEEEHVS SRNPSAKGAF
IEQLQSQLQA THVGEIPHKV PPPVHSKLGP KPFEKNAPVK ESEPIEKAKP AVKPKPAVQP
KSVVSPVVPS VHSVVPVIPA PAVIPSVPAP RSTAPEVSAV PPPPPTRNYV RSKAPIPVAH
PSNEPPQLDL EMYSGWYADV NGPINFPEAL AGLNNQSSMS YSTSGGITNY ERRISLRLKD
LSSIRYVIKW SSNNVAGATV KIDKFIPSPI SSNIPSKEEL VGYSQQYGEH VASWCEHRYG
QQVGRGECWD LAKEALEKGC GKHAFVSEYY HHGYPILSVR GVNGIMQLID DKQPLDEVRR
GDILQFKSCT FYNAASGRTQ TVGAPDHTSV VLGNVGGKIL VAEQNVNNVR TVQNGEYILR
DLTSGDVCAY RPVPASWAGS L