Gene PICST_33482 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33482 
Symbol 
ID4840792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp748776 
End bp750005 
Gene Length1230 bp 
Protein Length409 aa 
Translation table12 
GC content39% 
IMG OID640392107 
Productpredicted protein 
Protein accessionXP_001386340 
Protein GI150866672 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGAAA GCAAACGAAA ATATGGCAGT CTCAAGCGAA ATAGTAGTCC TAACCAAGGT 
AGCAGGATTT CTCCACCTGA AAACGAGTCT CCCACTAGAA TAGCCACACA TTCACAACAA
TTGTTTGAAT CTAGACCAGA CCATATCAAT GTAAAAACCT ACAAAAGGCG AAACACAGCC
AAGAAACCAT ATATTCATAT TTCTAAAAAG TCTGACCCGA TTCTTCGAAC ACGAGACAGT
TCCAGTTTTT TGGATTTAGA AGGTGAAACG AATTCTAGCA TTTACTCAAA ATACGATAAT
GAATTGAGGT CAGATCTGGA CATAGACGAC TTTGATGCTG GACTCATAGA TTTGGAAGAC
ATTGGAAAAA CTCAAAACAT ACAAAAATCT CATGATTATT CTGACGAAGT AGAAGGACTT
ATACAAGTTG TGGAAGATGA GAATACAGAC AGAGTCCCAT CTGTCTCTGA TTCGTTTGCT
TTGAAAATAG CCAATGGAAA CGTGTTGCAA GTCATGAAAG AGCATCAGGA GTCAAAGACA
GATGCTGTTA TAGACAAATT CAAGAAGTAC GCATTTCCGT CGCCTATAAG GTCGCGAAAA
GAGTTGATGA GGAGAGCAGA TAAATATTTT GATGTGCTAC CTTTGATCCT AAAAGGTAAA
CAAGCACCGT CAGCATATTA TTTATTGGCT AAGAATCAGG CAAACAGTTC TGTTCATGAA
ACACTTTCAG CTACAGAAAA ATGGCAGATA AATTGGGACA AGTTCTGTGG AGGCTATTAT
GGCTTCAAGA GACAGCTGTT GATAGGAAAC AGTATTAGTG TAAAATTGGC CAAGGAACTA
AGAGCGGCGC ATAGAAACAA AACCGTTTCC TATTGGACGA CGTCAGGCTT TGCAACACAT
GTTCTAGCAA ACGAAGTCAT CATAAGAATG GCTATGGAAG ACTTGCTGTG CGATTTTGAC
AGTGCTGAGA GGATAGTGAT GGAAAGTGTT GAGTATGGCA AAGTTATCGC AGATGCTACT
GAAATAGAAG ACGATCTACA GGCAGACGAA TTGGTGCTGA AACAGTCGAA AAAGTTTATG
AAACAAATTG ATATTGTATC TAAGGTAAAC GAACATATGG AAGAGAAAGA ACAGGAGGAG
GAAATACCCA GTGCCAAAAG TCAAGGCACA CGAGACTTCC TTGATCAATT GGTAGATAGC
GATTCTGACT CTGATCCTGA ATCTGAGTAA
 
Protein sequence
MFESKRKYGS LKRNSSPNQG SRISPPENES PTRIATHSQQ LFESRPDHIN VKTYKRRNTA 
KKPYIHISKK SDPILRTRDS SSFLDLEGET NSSIYSKYDN ELRSDSDIDD FDAGLIDLED
IGKTQNIQKS HDYSDEVEGL IQVVEDENTD RVPSVSDSFA LKIANGNVLQ VMKEHQESKT
DAVIDKFKKY AFPSPIRSRK ELMRRADKYF DVLPLILKGK QAPSAYYLLA KNQANSSVHE
TLSATEKWQI NWDKFCGGYY GFKRQSLIGN SISVKLAKEL RAAHRNKTVS YWTTSGFATH
VLANEVIIRM AMEDLSCDFD SAERIVMESV EYGKVIADAT EIEDDLQADE LVSKQSKKFM
KQIDIVSKVN EHMEEKEQEE EIPSAKSQGT RDFLDQLVDS DSDSDPESE