Gene PICST_33357 details


Gene Information

Locus tag: PICST_33357
Symbol:
ID: 4840676
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 405241
End bp: 407370
Gene Length: 2130 bp
Protein Length: 709 aa
Translation table: 12
GC content: 39%
IMG OID: 640391991
Product: predicted protein
Protein accession: XP_001386279
Protein GI: 150866620
COG category:
COG ID:
TIGRFAM ID:
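
The coordinates and lengths listed above are mutually consistent, which can be checked with a short Python sketch. It assumes 1-based inclusive coordinates and an unspliced CDS whose length includes the stop codon (the record states the gene is not spliced). The function name `feature_lengths` is hypothetical and used only for illustration.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the CDS ends with a
    stop codon, which is not counted as an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Values taken from the record: Start bp 405241, End bp 407370
print(feature_lengths(405241, 407370))  # (2130, 709)
```

The result matches the Gene Length (2130 bp) and Protein Length (709 aa) fields.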


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.344504
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.306168
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGGATG TGCAATTCAG GCATCTAAGA AGTGCAGTTA CTTACCTTGG GTTAGGTTTA 
CGAGGTTTTC ACTCTTCTGA AGGCCTTTTC AATTCAAATT TGAGAGGATA TTTGCGAAGT
GGAACTACAA AATACAGAAA ACAGAAAGCA GACAGAACAA CTAAAGATGG AGTAGCAAAA
CAACCCTCTA GGTTGTTAAA GTACGATCCG AATGCCCAAA TTGCGGATTT CACAACTTTT
GGAGTGTTTC CATTTTTGCA GAAGAAATTG AACGATTTTC TTTTGCCAGA AACTCAGAAG
ACAAATGCAA TCCCCGATTT CAACGCTTCT CCTACTCCAG ACCAGAAGAG GATTCTTTCC
GTTCTTAAAA GTGGCCACAA CTTGCTAATA AACGGAGGCT TTCAAACCGG CAAATCTATT
GCAATGTTGA CATATTGTAT CGAACAGACT CTTTCAACTA CCCCAGCATT TGATCATCAT
AATAGACCGG ACAGAGTTCA AAGTTTGATT ATCGTCCCCA CAGATGAACT AGTCAACAGG
TACGCTACAT ATACAAGGTA TCTTTTGAAG GATATCCCTC CCAATTGCTG TCCCAAGCGT
TTAGTAGCTT CTTCGAGTAG CCAAAGAAAA ACTGTCTACA CTCTTGAAAG AGGGCCCATG
AATCTTGCGT TTCTCTCCCA CGACAGCAAA CCACAATTTT ATAGTACTAA TGTAGACAGT
CACAATGTCG GAGTTCCAAA CATAGTAGTT ACTACTGCTG CCCTGTTGGA AAAAGCCATC
CAGGGTCAGT TAAATTTGGA ACCAAGCGCT CTTCAAGATC TCAAATTTAT TGGTGTTGAC
GAACTTGATA TCTTTCTTCT GACAACTAGT ATCGGCGAGT GGGATGTCGT AGTTCAGGAA
GGAAACAAGA ATAAGTATGT TAACAAGATC GAGAAGATGA TTCAGAAGTT GAAGTCGGAT
CTGATATCGT GTTATTCAGA TAGCCTCACG AACGATCTTG AACGAGTTGA AAAGAAGCAT
TACTTTAAGA TGTCATCTGA ATCTTCTTTT AACTATTCAG AAATAGTCAT AAATAACAGA
ATTGACGATG TGAAAAGTTT CATTGCTAAT GAATTTAAAG GTACCTCGAA ACCAGATCTG
TCGTTGCTCA AAAAGCTTAT CAAATTGAAG AGAAAGAACT TGTACAAACC AATCCAGTAC
TGTTTCCTCA ATAGTGCCAA ATCAAATCAA GCATACAATT TCGGAAATCT TCCATCGCTT
TCAAAGCCTG AAGATAATGA CCTTTTGATC TCATTCGTAG ATAAAGCTTT AAGGCTCACG
AATTCTCAGA AAAAGTGGAA GGAAAATGAA AGGCGTCTAT TTCAAGTAGA AAATTTCAGT
ATCCCTAACA ATGCGATATT GGACAACTCT GGCAAGACAG TCTTTGGAGG AATTATAAAT
GCTTACATAT TAAATTCTTC GAAGTCTAAA GTTAAAGATG TGCAACTTGA AAATGAAGTT
CGCGATGATC TCTCATTCGG AGATTGGAAA AACCTTGAGA AAACGTATCT AAGACTGAAG
TTGAAACAAA TATCAGCATT GAATGAAGAC AATTACGTTT CAGTCGTCTA TGAGACACTT
TGTATAGCCA AAAAGTTGGG TATAACGAAA CCCTTCCTTA TTATGATACC CGAAGGAGTG
GATAATTCAA TGGTTGCTGC TAGCTTGCAA CAGTTCGGTA ATATTGGAAA GATCTCTTGT
TTGTCAAGTG GTACAGTACA CGAGGAGTCT CATGTCATAG CCCATCCTTC TCAACTTTTG
GGAAATACAA TACCTTGTGT TTCTAATATC CTAGTTGTTG GAATCGAGAG TTTACTCCCA
GAGTTTGCAT TGGGAAAGAA GGCCACATGT ACCACTAAGG ATAACATTCC TGGACTTGTT
GATCCCGTCT CTGATCTCTC CATCTTTTAT CTTTCGAGAT TGCTTAGCTC TCCGTCCTCA
GTTCCTAAGA ATTTGATTTT TGCAATTAGC AACTGGAACT TGGATCCACA AGACATTCAA
GTAACAAACG ATCTCAACAA ACTCTCTCGC TCAATGGCAT TCAATGGTCT TCTAAATCAT
GTCCGCCTCA AGAAAACCTC AAGTAATTAG
 
Protein sequence
MLDVQFRHLR SAVTYLGLGL RGFHSSEGLF NSNLRGYLRS GTTKYRKQKA DRTTKDGVAK 
QPSRLLKYDP NAQIADFTTF GVFPFLQKKL NDFLLPETQK TNAIPDFNAS PTPDQKRILS
VLKSGHNLLI NGGFQTGKSI AMLTYCIEQT LSTTPAFDHH NRPDRVQSLI IVPTDELVNR
YATYTRYLLK DIPPNCCPKR LVASSSSQRK TVYTLERGPM NLAFLSHDSK PQFYSTNVDS
HNVGVPNIVV TTAASLEKAI QGQLNLEPSA LQDLKFIGVD ELDIFLSTTS IGEWDVVVQE
GNKNKYVNKI EKMIQKLKSD SISCYSDSLT NDLERVEKKH YFKMSSESSF NYSEIVINNR
IDDVKSFIAN EFKGTSKPDS SLLKKLIKLK RKNLYKPIQY CFLNSAKSNQ AYNFGNLPSL
SKPEDNDLLI SFVDKALRLT NSQKKWKENE RRLFQVENFS IPNNAILDNS GKTVFGGIIN
AYILNSSKSK VKDVQLENEV RDDLSFGDWK NLEKTYLRSK LKQISALNED NYVSVVYETL
CIAKKLGITK PFLIMIPEGV DNSMVAASLQ QFGNIGKISC LSSGTVHEES HVIAHPSQLL
GNTIPCVSNI LVVGIESLLP EFALGKKATC TTKDNIPGLV DPVSDLSIFY LSRLLSSPSS
VPKNLIFAIS NWNLDPQDIQ VTNDLNKLSR SMAFNGLLNH VRLKKTSSN
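
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The following Python sketch shows how the protein sequence can be derived from the gene sequence under that table. It assumes the gene sequence is an in-frame CDS starting at its first base. The names `codon_table` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_is_serine: bool = True) -> dict[str, str]:
    """Build a codon -> amino acid map; table 12 differs only at CTG."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_is_serine:
        table["CTG"] = "S"  # alternative yeast nuclear code
    return table


def translate(dna: str, ctg_is_serine: bool = True) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, stop is '*'."""
    seq = "".join(dna.split()).upper()
    table = codon_table(ctg_is_serine)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence from the record
first_line = "ATGTTGGATG TGCAATTCAG GCATCTAAGA AGTGCAGTTA CTTACCTTGG GTTAGGTTTA"
print(translate(first_line))  # MLDVQFRHLRSAVTYLGLGL

# A fragment containing CTG: serine under table 12, leucine under table 1
print(translate("GCTGCCCTGTTGGAA"))                       # AASLE
print(translate("GCTGCCCTGTTGGAA", ctg_is_serine=False))  # AALLE
```

The first output matches the opening 20 residues of the protein sequence, and the CTG fragment corresponds to the "AASLE" stretch in the fifth line of the protein sequence, where the standard code would give leucine instead.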