Gene PICST_47989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_47989 
Symbol 
ID4840138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp205831 
End bp207153 
Gene Length1323 bp 
Protein Length440 aa 
Translation table12 
GC content40% 
IMG OID640391453 
Productpredicted protein 
Protein accessionXP_001385374 
Protein GI150865951 
COG category[T] Signal transduction mechanisms 
COG ID[COG5409] EXS domain-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.129932 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAAA ACCAAAAAAA ATCCGACGAG ATTCTCTTCG ATGACTTGGT GCCGCTTCCA 
TTCCGAATCC TTTTTCTCGT CCAACTCGGA GTTTTTTTCT GGTACTACCT AGTCTATTCG
TGTTACAACT TGAGGAAGTT AAACATTTTG CACCTAATCA AGTTGTCATA TTCAGCACAT
GACTACTCAC AACTCGATGA CCACTACATA CCCAATGGAG AGTTTGCTAC GACTCTCGTT
CCGGATTTCA ATTCCAACCT CATTCTAGCC AATGGAATTT GGGCTAACCT TCGACCTGTG
ACTATCGTCA ATGTTATAGG TTGGGCTGTG TTTAAGATTA TTCAACGTAA AGTGAGTCTG
AACGACGATG TTTCGCCAGC CATTTTTATT CCGTTGTCCT ATGTGATCCC GTTAGCATTA
TTTTTTCATT TGTTCTATAG ATTGTTCTAC AAATCCAAAG TGCAAAATTC TATGGGACAG
TACAGAGCAT TTACCACCAT GAAAAGAATC TTGTTGGGTA AGATAAACTC TAGCACAATG
AGAACTAACG ATATTTTGAT ATCAGATAGT TTGGTCTCCT ACAGCAAGGT ATTGAATGAC
TTCGGCTTGT ACCTTTGGAA CTACTACTAC GCCAGAGACA TACCATACAG TGTCGAGTTA
GAATTCATTT TACTATGTAT ACCGACATTT ATTCGCATGA AGCAATGTTA TTCTGAATAC
AGAAGCACCG CAAACAGACA GCACTTATTC AATTTCATCA AGTATTCTAC AACCTTGGGT
CCATTATTCG TAAACCTGTT GATCAAATCT ATTATCACTT CGCCAGGAAA GGATCTTAAT
GAACCTGCAT TCTTGGACAA ATTGCAGTCC TTGAACAGGT GGTGGTACTT GCTTTCGTTT
GTAAACTCAA CGTATTCGTT TATTTGGGAT GTGAAGATGG ACTGGGGGCT TAAGATGTTT
GATTTTCTCT TCGAATCCAA AACTTACTAC TTCAAAATGG TTCTCTTGAG ACCTAAATTA
GCATTTGAGC CCGTTGTCTA TTTCGCTGTC ATCTTGTTTG ACTTCATAGT GAGGTTTGTC
TGGATTCTCA AAGTTTTCAT TGTTAAAGAA GGACAGGACC AAGTCAAATG GACGACGTTG
CATATGTTGT CAACCTTTTT ATTCGGTTAC GATGCATTTT CGTTTGGGTA CACCGTGATT
GAATTCCTTG AGATCCTCCG TAGATGGGCC TGGTGTTTCA TCAAACTCGA CTCAGACTGG
GCCACGCTTG AACAAGCTAC CGGTAACGAT ATTGAGTTGG TCAATACCTC AAAATTGGGC
TAA
 
Protein sequence
MDQNQKKSDE ILFDDLVPLP FRILFLVQLG VFFWYYLVYS CYNLRKLNIL HLIKLSYSAH 
DYSQLDDHYI PNGEFATTLV PDFNSNLILA NGIWANLRPV TIVNVIGWAV FKIIQRKVSS
NDDVSPAIFI PLSYVIPLAL FFHLFYRLFY KSKVQNSMGQ YRAFTTMKRI LLGKINSSTM
RTNDILISDS LVSYSKVLND FGLYLWNYYY ARDIPYSVEL EFILLCIPTF IRMKQCYSEY
RSTANRQHLF NFIKYSTTLG PLFVNSLIKS IITSPGKDLN EPAFLDKLQS LNRWWYLLSF
VNSTYSFIWD VKMDWGLKMF DFLFESKTYY FKMVLLRPKL AFEPVVYFAV ILFDFIVRFV
WILKVFIVKE GQDQVKWTTL HMLSTFLFGY DAFSFGYTVI EFLEILRRWA WCFIKLDSDW
ATLEQATGND IELVNTSKLG