Gene PICST_31470 details

Gene Information

Locus tag: PICST_31470
Symbol:
ID: 4839020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 887354
End bp: 888397
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 12
GC content: 44%
IMG OID: 640390335
Product: predicted protein
Protein accession: XP_001384122
Protein GI: 150865065
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1097] RNA-binding protein Rrp4 and related proteins (contain S1 domain and KH domain)
TIGRFAM ID:
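
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 887354
end_bp = 888397
gene_length_bp = 1044
protein_length_aa = 347

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS, so every codon is translated except the final stop.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, expected_protein_aa)  # 1044 347
```

Under these assumptions the span matches the reported gene length and the codon count minus one matches the reported protein length.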


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTCA GTGAGGTCAT CTCCATCACC AAGCCTGTTG GACTTGACAA TGACATAGAT 
TCAGATGTTG AAATGTCAGA TTCGGAAAAC CAGGCTCAAG ACTCGTTCAA GCAGTCTATA
GTAACACCTG GGGAGTTGGT GACGGACGAT CCCATCTGGA TGAAAGGCCA CGGGACGTAT
TTCCTCGAGG ATAGGACATT TTCGTCTGTG GCTGGGAATA TTCTGAGAGT GAATCGTTTG
TTGAGTGTAA TACCGTTAAA AGGCAGGTAT CAGCCTGAGA CCGGTGACCA TATTGTAGGC
AGAATCACAG AGGTAGGCAA CAAAAGATGG AAGGTCGACA TTGGAACTAA GCAGGACGCT
GTTTTGATGT TGGGATCTGT CAATTTACCT GGAGGTGTAT TAAGAAGAAA ATCCGAGAGT
GATGAATTGC AAATGAGAAA CTTCTTGAAG GAGGGAGACT TGTTGAACGC AGAGGTACAG
ACAATTTTCA ACAACGGCAT TGCGTCGTTA CATACGCGTT CATTAAAGTA CGGAAAATTG
AGAAACGGGA TGTTCTTGAA GGTACCAAGC AGTTTGGTAA TCAAGTCAAA GAATCACTCG
TATGATTTGC CAGGAAATGT CAGTATAGTA TTGGGAGTTA ATGGCTATAT CTGGCTCTAC
AAGACATCTA CAGGCATCAA CAGCGCTACT AACACTAGTG TTACATCTAA TACCAACATG
TTCCGGGCCT CTGTCGACAC TACTGGTTCG TATGCTATCG GGCAGGGTTC GGTTTCTATT
ACTAGATTGG AAGAAGAAAG TTCGTGGGAA ATTTACTCAG ACAAAAACGA TCCAAATATC
TCCAACTCTG TACGTTCTAA CATTACTCGA TACAACAACG TCCTCCGGGC AATGAGCTTC
TGCGAGTTGG GGATAACAGA ACAGCGGATT ATCATGGGCT ATGAGGCTAG TTTGTCGTAT
TCGAATATAG GCAGTTTGAT AGACAAGGAG TCGATGGAGA GCATTTGCCA GGATATCATA
AATAACGAGA AGATGAGAGG TTAG
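
A GC content figure such as the one in the record can be recomputed from a sequence printed in this 10-base block layout. This is a minimal sketch and not the database's own method; the function name `gc_content` is hypothetical, and it assumes the sequence contains only A, C, G and T plus whitespace.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to group bases into blocks of 10.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence above.
first_block = "ATGAACGTCA"
print(f"{gc_content(first_block):.0%}")  # 40%
```

Passing the full gene sequence as one string (spaces and newlines included) gives the value for the whole gene.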
 
Protein sequence
MNVSEVISIT KPVGLDNDID SDVEMSDSEN QAQDSFKQSI VTPGELVTDD PIWMKGHGTY 
FLEDRTFSSV AGNISRVNRL LSVIPLKGRY QPETGDHIVG RITEVGNKRW KVDIGTKQDA
VLMLGSVNLP GGVLRRKSES DELQMRNFLK EGDLLNAEVQ TIFNNGIASL HTRSLKYGKL
RNGMFLKVPS SLVIKSKNHS YDLPGNVSIV LGVNGYIWLY KTSTGINSAT NTSVTSNTNM
FRASVDTTGS YAIGQGSVSI TRLEEESSWE IYSDKNDPNI SNSVRSNITR YNNVLRAMSF
CELGITEQRI IMGYEASLSY SNIGSLIDKE SMESICQDII NNEKMRG