Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_47702 |
Symbol | PDZ1 |
ID | 4839935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 546870 |
End bp | 549821 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640391250 |
Product | hypothetical signalling-associated PDZ domain containing protein |
Protein accession | XP_001385455 |
Protein GI | 126137864 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.849223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTGC CCACAAAAAG AAGACTCTCG TTTGACGAGT CGACTAATAA ACGGTTCCTC AACGGAACCC ATTCTACTGA GAACAATACC AGTAATATCG AAGTCGACGA AGATTATGGT TCTGACGGTT CAGAACAGAT CTTTGACTTG CCAGCTACCA CCAACAATAA CCAGTGGCAA GAAACCATCA CCAAGGTCGT CAACGCTGTT GTTTCTATTC AGTTCACGCA TGTGTCCAAT TTTGACACCG AAACTTCGCT TGTTTCAGAA GCCACGGGTT TTGTTGTAGA TGCCACCAGA GGGTTGATCT TGACTAACAG ACATGTGGTG GGTCCTGGAC CTTTCTGTGG CTATGTAGTG TTTGACAACC ACGAAGAAGC TGTCGTCAAG CCTATCTATA GAGACCCTGT GCATGATTTC GGGTTCTTAC AGTTTGACCC CAAAGAAGTG AAATATTTAC AGTTGACCCA GTTGGAGTTG AAGCCCGACT TAGCTAAAGT CGGTACTGAG ATCAGAGTTG TAGGTAACGA TGCTGGTGAA AAGTTGTCTA TCTTGGCTGG GTTCATATCT CGTCTCGATA GAAATGCTCC TGAATACGGA AGTTTGACCT ACAATGATTT CAATACTGAG TATATCCAAG CTGCAGCTTC TGCTTCTGGA GGTTCTTCAG GCTCTCCAGT AGTGAACGAA GACGGTGACG TTGTAGCACT ACAGGCTGGT GGTTCTACTG AAGCATCTAC CGATTTCTTT CTCCCTATCT ACAGACCTTT GAGAGCATTA CAGTGTATCC AGAAGAAGCA ACCTATCACC AGAGGCGATA TCCAAGTGGA GTGGCAGTTG AAGCCATATG ACGAATGTAG AAGATTGGGT TTAACTCCGG AAGCCGAAGC CAGAGCAAGA AAGTTGTTTC CTAATAAAAT TGGTCTTCTT GTAGCTGAAC TCGTCTTGCC TCAAGGTCAA GCTGATGGGC TCATCAAGGA AGGTGACACC TTGATTTCAA TTGACGACAT AGACATTTCT ACCTTTATTA AGGTTGATGA AATCTTGGAT GAGAACGTTG GAAATGAGTT AAAGTTTGTC ATCCAAAGAG GAGGAGAGGT GATTACCCAA ATGATCAAGA TTGGCGATTT GCATTCCATC ACTCCAGACA GATACGTCGA TGTCGGTGGT GCCTCCTTTA ATAATCTTTC CTACCAGGTC GCCAGATGCT ACTGTATACC TGTCAAGGGA GTTTTCATCA ACGACGCTTC TGGCTCCTTT GAATTTGCTT CTTATGAAAA GTCAGGCTGG TTGTTGGAAA CAGTTGACGA CATGCCCACA CCAGATTTGG ATACTTTGAT CGAAGTGATG AAGATGATTC CAGACTGTCG TAGAGTTCCC ATCACCTACA GACATGTTTC TGATTTGCAC ACAGAAAACA TTCAGATCAT TTACATTGAA AGACACTGGC AGTCCAGTTT CAGATTGGCT GTTAGAAACG ACACTACTGG ATTATGGGAT TTCACAGATC TCCAGGAAAA ACCTCTTCCT CCATTATCCC ACGAGCCACA AAACGCCAAG TTCATCGATA TCCCCTTCAG CGATGAAACC AGGTCTGGCT GCTCTTCTTT GGTTCGTTCA TTTGTTCAAG TCAGACTTAT CGCTCCTGTT CCTATGGACT CTTACCCATA CCGTAAAGAG ATCTGTTATG GTGTGGTTGT CGATTCAGTC AACGGTTACG TCTTGGTTTC CCGAAGATTC GTACCCCACG ATATGTGCGA TATCTTTCTT ATTTTCGCTG AGTCTATTGA TGTTCCAGCC AAGGTTGTAT TCCTTCATCC TAACCAGAAT TATGCCATCT TGAAATACGA CCCTAGTTTG GTTTTGGCTG ATGTCAAGAC TCCCAAGTTT GGCGACAAAC CATTGAAGAG AGGTGAAAAA TCCTACTTTA TTGGCTACAA CTACAACTTG AGATTAGTCA CTGATGATGT GAAGATCAGT GGTGTTTCTT CGTTGAACAT CCCCCCTGCC TCGTTGTCGC CCCGTTACAG AGGAACCAAC TTGGAGTGTA TTCTCTTGGA TAGTAAGATC AGCGTAGAAT GTGATAGTGG TGTTTTGGCT GATGATGATG GTACAGTACG TGCATTCTGG ATCACCTACT TGGGTGAAGC TACTTGTGAT CAAGGCAGTG ATAGAATGTA CAGAATGGGC TTGGATGTCA CTGATGTCTT GAGTGTGATC GAAAAATTGA AGGTAAACGA AATCCCTAAG CAATTGAGAC TCTTGGAAGC CGAGTTTACT TCCGTAACCA TTCTCCAGGG TAGAACCAGA GGTGTGTCAC AGGAATGGAT CAATAAGTTC GAAGAAGTTT GTGAAGACGA AATCAAATTC TTGGCTGTAG AAAGAGTGTC TGCCCCTACT TTACACCAGG AAAAGAACCC ATTGAAAGCA GGTGATATCA TCTTGTCTGT CAACGACATC ATTGTGAAGA ACATGAGAGA CTTGAAGCCA ATGTTCACTG AACAAGAGTT GAAGTTTCGG ATCATCAGAC AAAAGAAGGA AACTGAAATT GTGGTTCCAA CCATCGATAC CACGACCATC AACACTTCAC ACGTTGTCTT CTGGTCTGGA GCTATCATTC AGGCTCCGCA CTACGCTGTT CGTCAACTTA TGGAAAGGGT TCCTTCAGAA GTCTATGTCA CTCGTAAGAG TGCTGGAGGT CCAGCTCATC AATACGGTAT AGCCACTAAC AGTTTTATTA CCCATGTGAA CGACGTCGAA ACCAAGGATC TCGTCAGCTT GATGAAGGTT GTCAAAGACA TTCCTGACAA CACCTACATT AAGTTGAGGT TAATGTCTTT TGACAATGTT CCTATTGCCA TCTCGTTGAA GACTAACTAC CACTACTTTC CAACGTCTGA GTTGAAGAAA AAAGAAGGCT CCGACGAATG GATCGAGATC GAACACAAGT AG
|
Protein sequence | MSVPTKRRLS FDESTNKRFL NGTHSTENNT SNIEVDEDYG SDGSEQIFDL PATTNNNQWQ ETITKVVNAV VSIQFTHVSN FDTETSLVSE ATGFVVDATR GLILTNRHVV GPGPFCGYVV FDNHEEAVVK PIYRDPVHDF GFLQFDPKEV KYLQLTQLEL KPDLAKVGTE IRVVGNDAGE KLSILAGFIS RLDRNAPEYG SLTYNDFNTE YIQAAASASG GSSGSPVVNE DGDVVALQAG GSTEASTDFF LPIYRPLRAL QCIQKKQPIT RGDIQVEWQL KPYDECRRLG LTPEAEARAR KLFPNKIGLL VAELVLPQGQ ADGLIKEGDT LISIDDIDIS TFIKVDEILD ENVGNELKFV IQRGGEVITQ MIKIGDLHSI TPDRYVDVGG ASFNNLSYQV ARCYCIPVKG VFINDASGSF EFASYEKSGW LLETVDDMPT PDLDTLIEVM KMIPDCRRVP ITYRHVSDLH TENIQIIYIE RHWQSSFRLA VRNDTTGLWD FTDLQEKPLP PLSHEPQNAK FIDIPFSDET RSGCSSLVRS FVQVRLIAPV PMDSYPYRKE ICYGVVVDSV NGYVLVSRRF VPHDMCDIFL IFAESIDVPA KVVFLHPNQN YAILKYDPSL VLADVKTPKF GDKPLKRGEK SYFIGYNYNL RLVTDDVKIS GVSSLNIPPA SLSPRYRGTN LECILLDSKI SVECDSGVLA DDDGTVRAFW ITYLGEATCD QGSDRMYRMG LDVTDVLSVI EKLKVNEIPK QLRLLEAEFT SVTILQGRTR GVSQEWINKF EEVCEDEIKF LAVERVSAPT LHQEKNPLKA GDIILSVNDI IVKNMRDLKP MFTEQELKFR IIRQKKETEI VVPTIDTTTI NTSHVVFWSG AIIQAPHYAV RQLMERVPSE VYVTRKSAGG PAHQYGIATN SFITHVNDVE TKDLVSLMKV VKDIPDNTYI KLRLMSFDNV PIAISLKTNY HYFPTSELKK KEGSDEWIEI EHK
|
| |