Gene PICST_28349 details

Gene Information

Locus tag: PICST_28349
Symbol:
ID: 4851126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 993696
End bp: 994910
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table:
GC content: 44%
IMG OID: 640392834
Product: hypothetical protein
Protein accession: XP_001387831
Protein GI: 126274112
COG category:
COG ID:
TIGRFAM ID:
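
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only a consistency check, not part of the record. It assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 993696
end_bp = 994910
gene_length_bp = 1215
protein_length_aa = 404

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# which does not contribute an amino acid.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # prints: 1215 405 404
```

Under these assumptions the span matches the listed gene length, and 405 codons minus one stop codon gives the listed 404 aa.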


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGTAG CCGTGCCCAT GGCTATACCT GTAGAAGATG CTATTTTCGA CATATCCGAA 
TATTGGGAAG AGTTCATGGA CAACAGTCTC AGCTACGACC AAATAGACAC ATATTACGAA
CTTCTCGACT CCTCCAACGA GCTCTCCAAG AGAGACGTTC AGACTATTGC TGGACTCTTG
GAATTGCTCA ACAGATCAGA GATCATCTGG GACGTTTTGG ACCAAGTAGC TGGCCATCCT
GACAGAATTC AGACCCTTGC CAATCTCACC AGTGGTTTGA TCGATTTCAT TGGTGCTTCC
AACATCAGTC TTGCTGATTT AACGAGTGCT GCTTCGTCGC TCAGCGGTTC TGTTAATGTT
ACCGCCATTG TCGGTGCTGT TCTTGATTCT GGAGTCGTCA CCAACTTGTT GGACGGAATT
CTCTTAGATG ACAGTTTCAG ACCCGTTTTG GTGAACTTGA TTAACGACGT TGTTCTATCG
TTTAAGACTG AGATTTGGTT CCTTCTCAAG GGTGTATTTG CCAAGAGAGA GTACTTGGAA
CCAATGTCTC AAGAAGAAAT CATCGAAGTG TTCAGAAGAG CTTCCAACGA GGGATCTCTT
GAAACCTTTG CTAGCAACGT TGTGGGCACT ATTCTTGAAT CACAATTGGT TAAAAACATC
TCGTTGGACT TGCTTGCTGC CTTGAATGAA ACAAGCTTCT TGACTTACAC CGTTAAGAGA
TTCCTTGCCA CCCCAGCCTA TATCAACATG ACAACTGATT TGATATCTGA CATTTACAAC
ACTGCACACA TCAATATTGA TCTTAGCTCC ATTAACATCA GTGCAATTGT AGGTTCTGCC
TTGGCTAACC CTAAGTTAAT TAGCAATGCT GTTGGTCTGC TCTTGTCAGG AAATTTGGAT
CTCAGCTTCT TAGGAAAGTA TGCCTCGGCT GTGAGTAAGA TCATTACTGG CTTAGAGGAT
AAGGGTCTCT TCCAGGAATT GAACGACTTC ATCTTCCCTT CTACGAGTAA ATCTGCTACT
GCTACAACTT CGGAGAAAAA CAAAGATCAA GTTGTTACTG GAAGCACTTC TGCAACTACT
ACTCAATCTA AGTCCAGTTC TACGGCAACG AATGGAGCTC CATCCGTGGC AGGAAGCCGC
TTGAGCAATC CCATGATCAA AATGATTTTC TTTTTGCAAT CACTTGCTTT CGGTGGAGTC
TTGTTGATTC TCTAA
 
Protein sequence
MAVAVPMAIP VEDAIFDISE YWEEFMDNSL SYDQIDTYYE LLDSSNELSK RDVQTIAGLL 
ELLNRSEIIW DVLDQVAGHP DRIQTLANLT SGLIDFIGAS NISLADLTSA ASSLSGSVNV
TAIVGAVLDS GVVTNLLDGI LLDDSFRPVL VNLINDVVLS FKTEIWFLLK GVFAKREYLE
PMSQEEIIEV FRRASNEGSL ETFASNVVGT ILESQLVKNI SLDLLAALNE TSFLTYTVKR
FLATPAYINM TTDLISDIYN TAHINIDLSS INISAIVGSA LANPKLISNA VGLLLSGNLD
LSFLGKYASA VSKIITGLED KGLFQELNDF IFPSTSKSAT ATTSEKNKDQ VVTGSTSATT
TQSKSSSTAT NGAPSVAGSR LSNPMIKMIF FLQSLAFGGV LLIL
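
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the DNA, and compute GC content. It is a minimal illustration, not the database's own method. The Translation table field in this record is blank, so using the standard genetic code here is an assumption, and the helper names (`clean`, `gc_percent`, `translate`) are hypothetical.

```python
from itertools import product

# Standard genetic code (an assumption; the record leaves "Translation table" blank).
# Codons are enumerated in TCAG order, matching the amino acid string below.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence as listed above.
first_row = "ATGGCTGTAG CCGTGCCCAT GGCTATACCT GTAGAAGATG CTATTTTCGA CATATCCGAA"
print(translate(first_row))  # prints: MAVAVPMAIPVEDAIFDISE
```

The result for the first row agrees with the first twenty residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.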