Gene PICST_36494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_36494 
Symbol 
ID4840114 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp6080 
End bp8092 
Gene Length2013 bp 
Protein Length629 aa 
Translation table12 
GC content40% 
IMG OID640391429 
Productpredicted protein 
Protein accessionXP_001385687 
Protein GI150866182 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.215422 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCTA CTCCATTCAA AATACGGCAT ACCCGTATCT TGAAAGCGTG TGACAACTGC 
CGACAGAGAA AGATAAAATG CTCTGGCGAG TCCGTATGTT CTTATTGCAA GAAATATGGC
GATCCGTGCA TTTATAGAGA GAAAACTAGA CAATCCAAAA AAGTGGTAGA TAGAGGGAAT
ATTGACGTCG ATGCCACATC AACAGATTTG CATCAAAAGC CCGTCTCTCC AGGAAAGTGT
GGACAGGAGG TTTTGGATAT GCCTACGTTC TTATCAGTTA GAAAATCCAG CGTCAATTCT
TCCACAATGG AGTACTTTGG GCCCGCAAGT AACTTCTCTT TCGTCAACCA GTTGAACGAT
TACTTGAGGT TGTTGGGTAA GAACACCACT CCTATTGACG AGGATCAAGG GTTGCGTAGA
TTCGGAATGA ATTTGATGGT ATTATCCAAT CCTGCCGAGG ATTTTGACTG TACTTTAAGT
ACGATATCAG TAGAAACGGT TAATCAGTTG ATCACAGCAT TCTTGGAAAC ATGGCATATT
CCTTGCCCAA TATTTACCGG CGAAGACTTG TTTGATCTCT CTGTCACAAC ATGGAAACAG
GGCTCTGCTC CCAAGCACAG AAAAGCTTTA TTATACTTAG TGCTTTCTAT CGGAGCAGCA
GCATCTTACT TTGAGTCCAC CCATTGCAGT GCTTCATCAA CTTTGCCTCT TGCGAGAGGT
TTCTTTGAGC TCTCGATTCG GACTGTTCCT GAAATATTCA CGGAGGTTTC TTTTGATGCA
GTTAGAATAA TATTCCTTAT GAGCTTAAGT GCGTGTAATT TAGGTGATAC TGCCCAGTCC
TATCTATACT CAGGCTATTC TGTCAGAATA GCAATAGCTC TTGGTTTACA TAAATTGACG
AAATTCGAAT CTCAGCACCA ATGTCGAGTG TGGACTTGCG CTTGGCAATG GGAAAATTAT
TGGAGTTTCT GCGTGGGACG TCCAAGTTGC TCAAGAGAGG ATATGCTGAT TCCTATGGTA
CCAGAGGATG CTTTTACAGC ACTGGGATAT GGAAACAAGG ACAGATTTGC AATACATCAT
CAGCATATGG AGCTCAGAGT CTATTTTGGA GCCAACTGCT CGAGGATTCA TTCGCAGCTA
TATGACTCCG AAAGTGATTT GCTTGCAGTA TTAAAATCAG TGGAGAAGCT CTCCACTGAC
ATTGACAACA AGTATTTGGG TTGCTCAGAT CCTTTATTGA AGGAATCCCA GGTCAGCGAT
TTGCTTTTGC AAAATACCGA CGCTAATGCT TGTAGAGAAT GGTTTTGGAT TCGAATCTAC
TATTTGTATT TGAAAATGGT CATATACCGA CCCTTTATGA TCTTTTATGC TTACTTGAAT
AACTCTAAAA CAGAGGCTTC AGAGAAAATT ACTTTATTAC TAAAATCAAA GTCTAATCTT
TGTGTACAAG TTGCTATTGA CATCTCAAGG TTCATTATAG ACTTGAATAG AAAGATTAAA
ATGCGACAAC CCATTTTCTT TATCTGCACA TATTTGGAAA GTGCGTCTAC AATTCTACTC
TTCTTTATTA TCAGTAATCG TGATAACATT CCGGACACTT TGGCAGAGAG CATTTGGGAA
GTTCTTCAAG ATACATGCGC ATTTTTAAGT GGGTCGTCGG GACCCTATGT TGGTAGTATA
AAGATAATTG CAAATGATGC GTTGAAATCC CTCCATGATA TTTTACTCTC AAATAACTCA
GAGATTGCCG AGCGGACTTA TTTTGGAAAG GTTCTTCAAG GAGTGGTAAA ATGTGATGTA
TTGGGAAGTA CCAATGATGT GGTAAATGAA GGAGTTTTGA AGGAAAGAAA CCTAGAGCCA
AATCATTCAC CTGATTCAAA TGTACTCACA GAGCCTTCTT CAGATGCTAC AAGTCGATTT
AATGCTGAAA AAGACGCACA GTTAGGCGAC ATGTCCACAT ATGGATTGGA AGACTTCTGG
CAGCAAACTT TAGATTGGAT TAGCATTACA TAG
 
Protein sequence
MSATPFKIRH TRILKACDNC RQRKIKCSGE SVCSYCKKYG DPCIYREKTR QSKKVVDRGN 
IDCGQEVLDM PTFLSVRKSS VNSSTMEYFG PASNFSFVNQ LNDYLRLLGK NTTPIDEDQG
LRRFGMNLMV LSNPAEDFDC TLSTISVETV NQLITAFLET WHIPCPIFTG EDLFDLSVTT
WKQGSAPKHR KALLYLVLSI GAAASYFEST HCSASSTLPL ARGFFELSIR TVPEIFTEVS
FDAVRIIFLM SLSACNLGDT AQSYLYSGYS VRIAIALGLH KLTKFESQHQ CRVWTCAWQW
ENYWSFCVGR PSCSREDMSI PMVPEDAFTA SGYGNKDRFA IHHQHMELRV YFGANCSRIH
SQLYDSESDL LAVLKSVEKL STDIDNKYLG CSDPLLKESQ VSDLLLQNTD ANACREWFWI
RIYYLYLKMV IYRPFMIFYA YLNNSKTEAS EKITLLLKSK SNLCVQVAID ISRFIIDLNR
KIKMRQPIFF ICTYLESAST ILLFFIISNR DNIPDTLAES IWEVLQDTCA FLSGSSGPYV
GSIKIIANDA LKSLHDILLS NNSEIAERTY FGKVLQGVPN HSPDSNVLTE PSSDATSRFN
AEKDAQLGDM STYGLEDFWQ QTLDWISIT