Gene PICST_68531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_68531 
Symbol 
ID4841150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp755469 
End bp758850 
Gene Length3382 bp 
Protein Length1114 aa 
Translation table12 
GC content37% 
IMG OID640392465 
Productpredicted protein 
Protein accessionXP_001386553 
Protein GI150866828 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.134601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0417383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGGA ACGGTTTAGA TGACCGTGTG AATAGATTGT ACTCGTTGAG CAGACGATGG 
TATCCAGAAT TGAAGAAAGA CTTCTCCAAT ACACTCATCA ACAATGTACA CTCTGAGCTC
TACGATCTGG AGAATCTGTA CTTGCCCCAG ACTTCATGTC TTGAAGTATT GTTAGAAACA
AAATACTTCG AGAAAGTGCT ATGGGCGAAC TTTAATGAAG ATGTCACAAC GACCCACATT
GAACAAATTT TGCATTTGGA CTGGCTCTCG ACCTATTTTG AATATGAGAA GTTTCGTCAC
AATTCAATCT CTAGTATGTT GAGGAGCGAA GACAACATCA ACTTTATCAT TAGAATCTTG
GAAATTACCT TCAAACTCAC AGCATCTACC AATTATAGAC TCAATTCCCT CGTATTGCGG
TTTTTCAGTA TCTCCAAACC CCTTCCTCTG TGTTTGGATG AGATAACAGA TATAATCATA
TGGAATAATG TACCTAGTCT AACGTCGAAT CTCGCCAGCC CCTACAAAGA ACGCTTAAAA
GAAGCAGTTG TTAAATTTGA AAAGATAGAA GATGCTTCTC AAAGAAGAAT ATCAACGCTC
ACAAACAAAT GGCTCTTTAA TTTATTGCAT GATACAGCTA GACAACACAT ACTAATATCC
ACCAAATCGT CCATTCCTCC ATTTTACTTC GAGTATCTAA ATGAACTTTT AAATTTCTTA
GCTTTTCTCG TTTTTAACTA TCCGGAGAAA GCAGATGAAT TTATTTTACA GTCCAACATA
GTCTCTATTG TTTCATTCAA TTCGTCCCTC AAGAATCTGC TAGAACAAGT GAAGCAGAAA
TTTCTTTCCC ATTTTGCTAC AAAGCAGAGT AGCTTCGAGA TACTACAACA TTTAATTTAC
AATAGAACTA AGGTTGTTGT AGATGTTGAT ATTGCTGTTC CACATAACAT AGCTATAGAT
GAGATTTCAA AGCATTTACA GAATTTTAGT ATTCTAGATT TAGCATCATT GGCTAGTGAT
TTCAGATCAA TTAAGGATCC ATTGGTTCTC TTTGGGTCTA CAGAGTTGGA TGATAAGATG
AAACTTAGTA TTTTAGTTCA AACAGTATGT GATGGAATTG CTCAACCTAA TGAATCTATC
TCAAAATTTA TTTCTAAGTT GGATGAATAC GACTTGATGG ACAATCCAAA GTTTTCGTAC
CCTCGAGGAT GTATTTCTTC CATTCTCAGT CCATACAGAT ATCCATTTGC GAATAAAAAT
AATTCGTCTT TCACAATTAA GAATCTTCAG GAAGAGTTTC AAATTTATCT TCATCAACAT
ATTTCAGGAG TATTGGAAAG GCTACGGATT GATCCAAAAA GTGGCATTCA AGGCAAGAGC
AAATATTTCC ACAAAGTTGA ATCACTTGAG AATGTTAATG GAAGTACATT TTCTATGAAA
ACCAAATCAA TTGTCCCTGC ATCAATACAG TTTATCGTCT TGGTAGAGAT GTTAAAACCA
GTACAGTATT CTAAACAGAA GCGTATGAAG GAATTTGGAG TTAATGTGAT TAGAATCGTA
CAAATCAGTT CGAATTCTGT GGATGGAAAA GATGCCACTT TCAAGTTCAC TTTTAATGAA
GAAATTCAGT CATTTTCAAG TAGAATAAAT CATTTTATTT CTGTTCCGTT TTCGGTGCCG
GGAAGTTCAC TCCTTGCACT TTCAGACAGT AAAGCTGTTC CAATACAAAA AAAGAATGAA
TCAATATCTC CAGTAGACTA CCAAACTCCA TTGGGAATCT CCATTTTGTC TGAAACAAAG
AGTGAAAATG GTAAAGACCG ATCGCATGAG AACGAATTAT GGGAAGTTGA AAGAAATGGA
GACATATTGA GCAGTTATCA AAGAGATATT ATGGCAAATA TTATTAAAGG ACATGGCAAA
GCGTTCAGGT TTGAGAAAGG GTGTGGCATG AAAAAGTTGA TTTCACTAAT CTTGACTCTG
AATCTTAGTA ATTCCCGATG TCTTGTCATA GTACCCAGCA GAAATTACTC AAGACAAATT
CCTGCTTCTA TTTTAGAAAA TCGAGTCTCA TACTTGAGAT ACGGTAGCGA ACAGGACATT
CGGTCTCTCA GCAATTTTGT CAAGGAAACA TATAAAGCGA CGATCGGTTT AATAGGCGAG
AAATGCGAAG TGGAAGAGAA TGAGCTGGTA TCCATTGCTG GTATTTACAA GTTCGAACAA
CGTCTTAAAT TGGAGTGGTC TAAGATTGCG AACTCTTTAT CTCTAGGAGT AGAGACAAAA
AATGAATTGA AGAGCGCCTT CGCTTTCTTT AGTGGCGCTG ATTCACCAGT GCCACAATCG
GTGAACTTCA AGATGGTATT CAAGGCATAT AAGAAACGCC GATATGCCTT AGCACTTGCT
GCTACATTAA TTCCGATTAT TGAATTGCAT AGCAAGGGGA GAAACGATGA CGTATGGAAC
CTTTTGTTTC GCAAGTTCAG TTCTATCATT GCTTATGAAG ACTATTTGAA CTTATTACAA
AACAATTCTC ATCATAATAT GAACAATTTC GATAATATAA TTGTCGTCAA CGGATGGCCG
GGAGCAGCAT TAGTGGCGGG TTTGAAAAAT ACCCATACAA GGAAAGTAGT TGAAATTGGT
GCTAGACATG TGCTCAGTTC GACTAAATTA GGCGAGCAGG AACCAGTATT GTTTCAGTGG
AGGCCAGAAT TTATACCAAT CTCTAAACAG AATCTCAAAC TCGTTAACAA GAAGATTAGA
AATTTCAACC CTGGTTTGAA GCATGTCTTT CAAGTTATAT ATACCGAGGA TCAAGTCGAT
AGCATGGAGT ATAGTGTATT GTTGTATCAA TACTTAAGAC TTCTCGGATA TCCGTCATCA
AAGATATGCA TTTCTGTCGG TTCTCTTTTG CATAGAGCAC TCTTAGAGGA AGTCTTGAGT
AAGCATTGTA CAAAGATTAG CAAAGATAAG TCTATCAAGT CAGCCAATGA AAGTAGTGAT
GATCCAAAAG ATTTCCAATT CGGATGGCCC GACATCTTAA TTTATGATGA TCCAGATTAC
TACTTTGATA CTTATGAATA TGGTATCATT TCAGCAGTCA GTCAGACTGC AAACAGGTTG
GAAGTCATTA GTTTACCAGG AAGATTTGGC AATTACATTA TTGGACTGCA AATATATTCA
CATGAGTTCG AGCTTCCTGA GGTGGCTGAT CTTGAAGTTG TTGTAGGAGA GAACTACAAT
ACTGAAGTGC GGAAGCAGTC GCAGCTGTAT CCTATCGAAA GCAAAGACCA TTTTGAACAG
TACATTCACA ACATGACAAA GGTAAGGCTC GGGCACAAGA AGTAGCATAC ATAATAGAAC
TAATAGAATA AAATACCATT TT
 
Protein sequence
MSRNGLDDRV NRLYSLSRRW YPELKKDFSN TLINNVHSEL YDSENSYLPQ TSCLEVLLET 
KYFEKVLWAN FNEDVTTTHI EQILHLDWLS TYFEYEKFRH NSISSMLRSE DNINFIIRIL
EITFKLTAST NYRLNSLVLR FFSISKPLPS CLDEITDIII WNNVPSLTSN LASPYKERLK
EAVVKFEKIE DASQRRISTL TNKWLFNLLH DTARQHILIS TKSSIPPFYF EYLNELLNFL
AFLVFNYPEK ADEFILQSNI VSIVSFNSSL KNSLEQVKQK FLSHFATKQS SFEILQHLIY
NRTKVVVDVD IAVPHNIAID EISKHLQNFS ILDLASLASD FRSIKDPLVL FGSTELDDKM
KLSILVQTVC DGIAQPNESI SKFISKLDEY DLMDNPKFSY PRGCISSILS PYRYPFANKN
NSSFTIKNLQ EEFQIYLHQH ISGVLERLRI DPKSGIQGKS KYFHKVESLE NVNGSTFSMK
TKSIVPASIQ FIVLVEMLKP VQYSKQKRMK EFGVNVIRIV QISSNSVDGK DATFKFTFNE
EIQSFSSRIN HFISVPFSVP GSSLLALSDS KAVPIQKKNE SISPVDYQTP LGISILSETK
SENGKDRSHE NELWEVERNG DILSSYQRDI MANIIKGHGK AFRFEKGCGM KKLISLILTS
NLSNSRCLVI VPSRNYSRQI PASILENRVS YLRYGSEQDI RSLSNFVKET YKATIGLIGE
KCEVEENESV SIAGIYKFEQ RLKLEWSKIA NSLSLGVETK NELKSAFAFF SGADSPVPQS
VNFKMVFKAY KKRRYALALA ATLIPIIELH SKGRNDDVWN LLFRKFSSII AYEDYLNLLQ
NNSHHNMNNF DNIIVVNGWP GAALVAGLKN THTRKVVEIG ARHVLSSTKL GEQEPVLFQW
RPEFIPISKQ NLKLVNKKIR NFNPGLKHVF QVIYTEDQVD SMEYSVLLYQ YLRLLGYPSS
KICISVGSLL HRALLEEVLS KHCTKISKDK SIKSANESSD DPKDFQFGWP DILIYDDPDY
YFDTYEYGII SAVSQTANRL EVISLPGRFG NYIIGSQIYS HEFELPEVAD LEVVVGENYN
TEVRKQSQSY PIESKDHFEQ YIHNMTKVRL GHKK