Gene PICST_39422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_39422 
Symbol 
ID4851871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp3057285 
End bp3058703 
Gene Length1419 bp 
Protein Length472 aa 
Translation table 
GC content44% 
IMG OID640393579 
Productpredicted protein 
Protein accessionXP_001387152 
Protein GI126275867 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.423389 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGCTC TCGTTTCGCT CCCGATGATG GGTGCCTCTT CCTTGGCTTC GTGCTTCGGA 
GCTGCTGCCT GTTCTGCACT TTGTTCCACG ATAGGAGGCA CATTCCAATC GTCTATTATG
ACCAGAATAA CATATGCCAT GTTGCTTTTA GTGAACTCAT TGATATCGTG GATAGCGCTA
TCGCCGTTTA TTGTTCACAA AATCGAGAAA GCCACCTTCG GCTTTATCAA TAGCAAGTGT
GGCCAAGATG GCTCTCAGTG TATTAGCTTT TCATCTGTCC ACAGAGTCAA CTTTGCTTTG
GGGGTCTTAC ATTTAGTCTT GGCTGTGTTG TTGATAGATG TCAAGTCTAC AGCCAACCCT
CGTGCAGTAA TCCAGAACGG GTGCTGGAGA ATCAAGATAT TCAGCTGGTT GACGTTTATT
GTCATCAACT TCTTGCTTAT CCCCGATCAT TTCTTTGTTT TCTACGGTAA CAACATCGCC
ATCATATTTT CCACCATTTT CTTAGGAATC GGACTTATCT TGCTTGTAGA CTTTGCACAT
GCCTGGGCTG AGAAATGCTT GGAAAAGATC GAGTTAGAGG AATTGACTGG TGAAGGAGAT
TCCTCTTTCT GGAAGAAGTT GTTAGTGGGA GGTACTTTGA CTATGTATAT TTCGAGCATA
ATCTTAACTG TGCTCATGTA CTGGTTCTTT GCTGGAAACG GCTGTAGTAT GAACAAGACC
GCTATCTCGT TGAACATGAT CTTTGGCTTA ATAATCTCAG CCATGTCTAT TAACCAGACT
ATCCAAGAAT ACAATCCTCA CGCTGGACTT GCCCAATCTT CCATGGTAGT CTTCTATTGT
ACGTATCTTG TCATGAGTGC TGTCGCATCA GAGCCAGACG ACAAGTTCTG CAATCCATTG
GTAAGATCTA GAGGTACTAG AACTGCCAGT GTCATCTTAG GTGCCTTTTT CACGTTTATT
GCAGTAGCCT ATACCACCAC TAGAGCAGCA GCAAACTCCG CTTTCAGCTC AGAACCAACT
GCAGATCCTT ACATCAATGC CCAGCCAGCG GTTAGAAACG AAATGAGATA CCAGGCTATA
AAGCAGGCTG TAGACGAAGG CTCTTTGCCT GAAAGTGCCC TTAACCAAAT GGACTTGTAT
GACGAAGACA TGGAAGGCAA CAGCAACGAT GAAGAAAGAC AGAAAGTCAA GTATAACTAC
TCATTGTTCC ACATTATCTT CTTTTTGGCT ACCCAGTATG TCGCTACGTT GTTGACTATC
AACGTGAAGC AAGACGAAGT CGGTGACTTT GTACCTGTTG GCAGAACATA CTTTGCCAGT
TGGGTCAAGA TTATTAGTTC GTGGGTATGT TTTGTTTTAT ACGGATGGAG TTTGGCTGCC
CCTGTAGTTT GGCCAGACAG ATTTGGTGTT CAATTGTAA
 
Protein sequence
MGALVSLPMM GASSLASCFG AAACSALCST IGGTFQSSIM TRITYAMLLL VNSLISWIAL 
SPFIVHKIEK ATFGFINSKC GQDGSQCISF SSVHRVNFAL GVLHLVLAVL LIDVKSTANP
RAVIQNGCWR IKIFSWLTFI VINFLLIPDH FFVFYGNNIA IIFSTIFLGI GLILLVDFAH
AWAEKCLEKI ELEELTGEGD SSFWKKLLVG GTLTMYISSI ILTVLMYWFF AGNGCSMNKT
AISLNMIFGL IISAMSINQT IQEYNPHAGL AQSSMVVFYC TYLVMSAVAS EPDDKFCNPL
VRSRGTRTAS VILGAFFTFI AVAYTTTRAA ANSAFSSEPT ADPYINAQPA VRNEMRYQAI
KQAVDEGSLP ESALNQMDLY DEDMEGNSND EERQKVKYNY SLFHIIFFLA TQYVATLLTI
NVKQDEVGDF VPVGRTYFAS WVKIISSWVC FVLYGWSLAA PVVWPDRFGV QL