Gene PICST_85034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_85034 
Symbol 
ID4840737 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp535407 
End bp536514 
Gene Length1108 bp 
Protein Length361 aa 
Translation table12 
GC content49% 
IMG OID640392052 
Productpredicted protein 
Protein accessionXP_001386122 
Protein GI150866496 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.737973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.53722 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AATGTCTGAT ACTAACATCG ATATCACTAT CAAGCTGTCT GGAGACACGA AGTACGAGTT 
GCTGGTGTCG CCTCTGCTCA CTGTATATGA TTTGAAGGAG CTTATCGCAG ATAAAGCCGA
CATCCCTGCG GACAGACAGA GACTCATTTA TTCTGGAAAG GTGTTGAAGG ATACTGAAAC
TATTGCTTCG TACAAGGTTC AGACTGGACA TACTATTCAT ATGGTGAGAT CTGCTGCACG
AGCCACCGGA GCTCCAAGTG CTTCTAATGC TACTGGTACC TCCGGAAATA CAACTTCTGC
AAGTGCTACT CCTTCTGGAA GTACTAATAT TACCGGCAAT TCTGCAGCTG GAACCGGAGT
CCCTTCCAAT ATCGCTGCTG GACAAGGACT GTTTAACCCT CTTGCGGACT TAACGGGAGC
GCGTTATGCT GGCTATGCTC AACTTCCCCT GGCTTCTATG TTTGGCCCAG ATGGAGGTAT
GAATGCCATG CCGGATCCGG ATCAGTTGGC TCTGATGATG AACGATCCTA TGGTCCAACA
GCAGCTCAAT GCCATGTTGC TGAATCCACA AATGATCGAC TACATGATCA ACCAGAACCC
GCAATTGCGT GCTATGGGCC CTCAAGCTAG ACAGATGTTA CAGTCCCCTA TGTTCAGACA
AATGATGACG AATCCGGAAA TGATGCGCAT GATGATGAGT ATGGGTCCAA TGATGGGAGG
GGCAGGTCCC GGTGCTGGAC AAGGAGCTTC GGCATTTCCA GCTCCTGGGG CTAATCCCAA
CGTAGCTGAT ACTTCTACAG ATTCTACTGC TAGTGCTGCT GATACTCCTA CTACTAACGC
TACTGCTAAT AACGCTAACG CTGCCGCTGC TGCTAATCCG TTTACATCAT TGTTTCCTGG
TGGAGTACCT CCTGTCGATC CGTTTGCGTT GTTTGGAGGT GGTGCACCTG CTCCCGTCGA
TAACCGTCCT CCAGAGGAAA GATATGAGAG CCAGTTGAGG CAACTCAACG ATATGGGCTT
CTTCGACTTC GACCGCAACG TTGAAGCTCT CAGACGAACA GGTGGCAGTG TCCAGGGTGC
TATCGAGTAC TTGTTGTCGA ACAACTAG
 
Protein sequence
MSDTNIDITI KSSGDTKYEL SVSPSLTVYD LKELIADKAD IPADRQRLIY SGKVLKDTET 
IASYKVQTGH TIHMVRSAAR ATGAPSASNA TGTSGNTTSA SATPSGSTNI TGNSAAGTGV
PSNIAAGQGS FNPLADLTGA RYAGYAQLPS ASMFGPDGGM NAMPDPDQLA SMMNDPMVQQ
QLNAMLSNPQ MIDYMINQNP QLRAMGPQAR QMLQSPMFRQ MMTNPEMMRM MMSMGPGAGQ
GASAFPAPGA NPNVADTSTD STASAADTPT TNATANNANA AAAANPFTSL FPGGVPPVDP
FALFGGGAPA PVDNRPPEER YESQLRQLND MGFFDFDRNV EALRRTGGSV QGAIEYLLSN
N