Gene PICST_47702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_47702 
SymbolPDZ1 
ID4839935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp546870 
End bp549821 
Gene Length2952 bp 
Protein Length983 aa 
Translation table12 
GC content44% 
IMG OID640391250 
Producthypothetical signalling-associated PDZ domain containing protein 
Protein accessionXP_001385455 
Protein GI126137864 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.849223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTGC CCACAAAAAG AAGACTCTCG TTTGACGAGT CGACTAATAA ACGGTTCCTC 
AACGGAACCC ATTCTACTGA GAACAATACC AGTAATATCG AAGTCGACGA AGATTATGGT
TCTGACGGTT CAGAACAGAT CTTTGACTTG CCAGCTACCA CCAACAATAA CCAGTGGCAA
GAAACCATCA CCAAGGTCGT CAACGCTGTT GTTTCTATTC AGTTCACGCA TGTGTCCAAT
TTTGACACCG AAACTTCGCT TGTTTCAGAA GCCACGGGTT TTGTTGTAGA TGCCACCAGA
GGGTTGATCT TGACTAACAG ACATGTGGTG GGTCCTGGAC CTTTCTGTGG CTATGTAGTG
TTTGACAACC ACGAAGAAGC TGTCGTCAAG CCTATCTATA GAGACCCTGT GCATGATTTC
GGGTTCTTAC AGTTTGACCC CAAAGAAGTG AAATATTTAC AGTTGACCCA GTTGGAGTTG
AAGCCCGACT TAGCTAAAGT CGGTACTGAG ATCAGAGTTG TAGGTAACGA TGCTGGTGAA
AAGTTGTCTA TCTTGGCTGG GTTCATATCT CGTCTCGATA GAAATGCTCC TGAATACGGA
AGTTTGACCT ACAATGATTT CAATACTGAG TATATCCAAG CTGCAGCTTC TGCTTCTGGA
GGTTCTTCAG GCTCTCCAGT AGTGAACGAA GACGGTGACG TTGTAGCACT ACAGGCTGGT
GGTTCTACTG AAGCATCTAC CGATTTCTTT CTCCCTATCT ACAGACCTTT GAGAGCATTA
CAGTGTATCC AGAAGAAGCA ACCTATCACC AGAGGCGATA TCCAAGTGGA GTGGCAGTTG
AAGCCATATG ACGAATGTAG AAGATTGGGT TTAACTCCGG AAGCCGAAGC CAGAGCAAGA
AAGTTGTTTC CTAATAAAAT TGGTCTTCTT GTAGCTGAAC TCGTCTTGCC TCAAGGTCAA
GCTGATGGGC TCATCAAGGA AGGTGACACC TTGATTTCAA TTGACGACAT AGACATTTCT
ACCTTTATTA AGGTTGATGA AATCTTGGAT GAGAACGTTG GAAATGAGTT AAAGTTTGTC
ATCCAAAGAG GAGGAGAGGT GATTACCCAA ATGATCAAGA TTGGCGATTT GCATTCCATC
ACTCCAGACA GATACGTCGA TGTCGGTGGT GCCTCCTTTA ATAATCTTTC CTACCAGGTC
GCCAGATGCT ACTGTATACC TGTCAAGGGA GTTTTCATCA ACGACGCTTC TGGCTCCTTT
GAATTTGCTT CTTATGAAAA GTCAGGCTGG TTGTTGGAAA CAGTTGACGA CATGCCCACA
CCAGATTTGG ATACTTTGAT CGAAGTGATG AAGATGATTC CAGACTGTCG TAGAGTTCCC
ATCACCTACA GACATGTTTC TGATTTGCAC ACAGAAAACA TTCAGATCAT TTACATTGAA
AGACACTGGC AGTCCAGTTT CAGATTGGCT GTTAGAAACG ACACTACTGG ATTATGGGAT
TTCACAGATC TCCAGGAAAA ACCTCTTCCT CCATTATCCC ACGAGCCACA AAACGCCAAG
TTCATCGATA TCCCCTTCAG CGATGAAACC AGGTCTGGCT GCTCTTCTTT GGTTCGTTCA
TTTGTTCAAG TCAGACTTAT CGCTCCTGTT CCTATGGACT CTTACCCATA CCGTAAAGAG
ATCTGTTATG GTGTGGTTGT CGATTCAGTC AACGGTTACG TCTTGGTTTC CCGAAGATTC
GTACCCCACG ATATGTGCGA TATCTTTCTT ATTTTCGCTG AGTCTATTGA TGTTCCAGCC
AAGGTTGTAT TCCTTCATCC TAACCAGAAT TATGCCATCT TGAAATACGA CCCTAGTTTG
GTTTTGGCTG ATGTCAAGAC TCCCAAGTTT GGCGACAAAC CATTGAAGAG AGGTGAAAAA
TCCTACTTTA TTGGCTACAA CTACAACTTG AGATTAGTCA CTGATGATGT GAAGATCAGT
GGTGTTTCTT CGTTGAACAT CCCCCCTGCC TCGTTGTCGC CCCGTTACAG AGGAACCAAC
TTGGAGTGTA TTCTCTTGGA TAGTAAGATC AGCGTAGAAT GTGATAGTGG TGTTTTGGCT
GATGATGATG GTACAGTACG TGCATTCTGG ATCACCTACT TGGGTGAAGC TACTTGTGAT
CAAGGCAGTG ATAGAATGTA CAGAATGGGC TTGGATGTCA CTGATGTCTT GAGTGTGATC
GAAAAATTGA AGGTAAACGA AATCCCTAAG CAATTGAGAC TCTTGGAAGC CGAGTTTACT
TCCGTAACCA TTCTCCAGGG TAGAACCAGA GGTGTGTCAC AGGAATGGAT CAATAAGTTC
GAAGAAGTTT GTGAAGACGA AATCAAATTC TTGGCTGTAG AAAGAGTGTC TGCCCCTACT
TTACACCAGG AAAAGAACCC ATTGAAAGCA GGTGATATCA TCTTGTCTGT CAACGACATC
ATTGTGAAGA ACATGAGAGA CTTGAAGCCA ATGTTCACTG AACAAGAGTT GAAGTTTCGG
ATCATCAGAC AAAAGAAGGA AACTGAAATT GTGGTTCCAA CCATCGATAC CACGACCATC
AACACTTCAC ACGTTGTCTT CTGGTCTGGA GCTATCATTC AGGCTCCGCA CTACGCTGTT
CGTCAACTTA TGGAAAGGGT TCCTTCAGAA GTCTATGTCA CTCGTAAGAG TGCTGGAGGT
CCAGCTCATC AATACGGTAT AGCCACTAAC AGTTTTATTA CCCATGTGAA CGACGTCGAA
ACCAAGGATC TCGTCAGCTT GATGAAGGTT GTCAAAGACA TTCCTGACAA CACCTACATT
AAGTTGAGGT TAATGTCTTT TGACAATGTT CCTATTGCCA TCTCGTTGAA GACTAACTAC
CACTACTTTC CAACGTCTGA GTTGAAGAAA AAAGAAGGCT CCGACGAATG GATCGAGATC
GAACACAAGT AG
 
Protein sequence
MSVPTKRRLS FDESTNKRFL NGTHSTENNT SNIEVDEDYG SDGSEQIFDL PATTNNNQWQ 
ETITKVVNAV VSIQFTHVSN FDTETSLVSE ATGFVVDATR GLILTNRHVV GPGPFCGYVV
FDNHEEAVVK PIYRDPVHDF GFLQFDPKEV KYLQLTQLEL KPDLAKVGTE IRVVGNDAGE
KLSILAGFIS RLDRNAPEYG SLTYNDFNTE YIQAAASASG GSSGSPVVNE DGDVVALQAG
GSTEASTDFF LPIYRPLRAL QCIQKKQPIT RGDIQVEWQL KPYDECRRLG LTPEAEARAR
KLFPNKIGLL VAELVLPQGQ ADGLIKEGDT LISIDDIDIS TFIKVDEILD ENVGNELKFV
IQRGGEVITQ MIKIGDLHSI TPDRYVDVGG ASFNNLSYQV ARCYCIPVKG VFINDASGSF
EFASYEKSGW LLETVDDMPT PDLDTLIEVM KMIPDCRRVP ITYRHVSDLH TENIQIIYIE
RHWQSSFRLA VRNDTTGLWD FTDLQEKPLP PLSHEPQNAK FIDIPFSDET RSGCSSLVRS
FVQVRLIAPV PMDSYPYRKE ICYGVVVDSV NGYVLVSRRF VPHDMCDIFL IFAESIDVPA
KVVFLHPNQN YAILKYDPSL VLADVKTPKF GDKPLKRGEK SYFIGYNYNL RLVTDDVKIS
GVSSLNIPPA SLSPRYRGTN LECILLDSKI SVECDSGVLA DDDGTVRAFW ITYLGEATCD
QGSDRMYRMG LDVTDVLSVI EKLKVNEIPK QLRLLEAEFT SVTILQGRTR GVSQEWINKF
EEVCEDEIKF LAVERVSAPT LHQEKNPLKA GDIILSVNDI IVKNMRDLKP MFTEQELKFR
IIRQKKETEI VVPTIDTTTI NTSHVVFWSG AIIQAPHYAV RQLMERVPSE VYVTRKSAGG
PAHQYGIATN SFITHVNDVE TKDLVSLMKV VKDIPDNTYI KLRLMSFDNV PIAISLKTNY
HYFPTSELKK KEGSDEWIEI EHK