Gene PICST_31676 details


Gene Information

Locus tag: PICST_31676
Symbol:
ID: 4838863
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 1410166
End bp: 1413203
Gene Length: 3038 bp
Protein Length: 449 aa
Translation table: 12
GC content: 43%
IMG OID: 640390178
Product: predicted protein
Protein accession: XP_001384568
Protein GI: 126136088
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2939] Carboxypeptidase C (cathepsin A)
TIGRFAM ID:
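
The coordinates and lengths above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


gene_bp = span_length(1410166, 1413203)  # start and end bp from the record
cds_bp = coding_length(449)              # protein length from the record
noncoding_bp = gene_bp - cds_bp          # part of the span not used by codons

print(gene_bp, cds_bp, noncoding_bp)
```

Under those assumptions the span agrees with the reported gene length, and the protein accounts for less than half of it, which is consistent with the gene being marked as spliced.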


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.499026
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAG TAGGCTATGG AAGCCCACCA GATCCTCGGT ATCCCTATCC TAGCCCTTGG 
TGTATTATTG TTACCGACAA TGTATTTTCT GCGTATTTTC TCATCGTGGT ATTTTCAATC
GGATAACAGA ATGCTACGAG GGTTGGATCT AGTGGTGTTC AACCACAGCT TGTAGTCGTA
GGCAGCTAGT GGCAACTAGG TGGGTCAAGA TCCTCGTCGA CAATTACAGG TTCTAGAAGG
ACTAAAATGG GATTTCTGCT ACGGTAATTC ATGGCAACCA GAGATCTGTC TCAGGTCTTG
CATTGTCTTA CATACGAAAA CTTCCTTAAT TAGTGGAAAT TTTGCACTAT TTTCTGGAAC
ATACAGTATA TGCCTGGCAT TGTCGGCGGG GCTCTTTGTG CAGTTTGGAT CCTTAGGCGG
CTAGCGTAGT GACTTTGACG TGGTAGTCTA CGGGTCTTTG TCAAATCTCA AAATGTGATA
ATTTGGTAGC AGCCGATATC ACAGTCTTCT GGAACTGAAA AATACAGCCG AGTGGGGAAG
ATACCATTGG CCTTGTTGAA GGTATAGTCG GTGTGGGCTG TTCTGGACGC GCAGTTACGC
AGCCTTTTTC GTCAACTCCC CAATCTTAGC TCCCGCTTGC ATGGACATAC TTCCACTCCT
AAGGGATAGT TGACCTTTTT CTAAATGTAG CGCGGATGTC GATCATTCCT GAATGTAATT
TTCAACTCAC CCCTTGAACA ATTGCAATTT TAATTTTTTA AAGATGGGTT TAAAGGCTAT
TGTCGAGATA TCGGACCCAG CTATCACTTT GGCCATTGAC GAATTATAAT TAATTGCCGT
CAAATTTATT TGCTTGCAAT GGATACGTTT TCACGGTGTT CTCGCATTGG GTATCTAAAC
AGCTTCTTGT TGCTGGCCCA GGGCTACAGA TATTTACGCC TCGTTTAGAC ACCAGCTATT
GTTCTCGAGC TTCTACGATA AACCCCCTTT CCACGTCATC TAAAAGCCAA AAATCCACCG
TAATATCCTG GATCTATGGT TATCAAACCT TGGCTCACTT TCTTGACGTG CAGATAGCAG
CACTGGACCA TCGCCAGAAA CCAAAGTCAG AATAAGCCTA CTTTGTATAA CATCCTCTGA
CTTGAACGAA ACCGAGGAAA CTCCTGTTCG GAAAATGGAC ATGAAGAAGT TCGGACCATA
GTGGCACCTG TCGATTTACT TCATTGGATC CGTGAACTTT ATTTTATCTG AATAGTGATT
TCATACGCAT TCTTGTCTTG CGTGCCTATG AAACAGGGAC CCCTCGTGTT TGGAGAGCCA
ACGCAACCCA ATCGGCATTA ATTGGGACGT TTCTATCCCA TTTTTGGCTT CACATTGTAA
GTTGAGTTCT GTGTCTAAGT ACTCTCCACA TAAATAAAAT CAGCTCCATA TCGCTTCATG
AATTCAATTA CCTTTGAATA TCTTCACGTG ATCGTACGGC TCAAGTATGG CTCTCCCAGT
TTAACCAGCT AGTTCTTATC TAGTCTATCT CTTTAGTTTT CTCTGCCTCA GGTCAATTGA
GTAGTGCGAT GAAACTTCTG ATCGGCACTA TTCTACTTTC GGGCTTGTGT GGCTCTGTTG
CCACAGCGTA TTCTTTATCT TCCTTGCAAC AGTTTTTGGG CTTCAAGGAA GTTTCGGATG
CACACGATAT TGCAAATCTT GGTAATTCTG GGCCTTTGGA CAATTTATTT GATGTAGTTT
CAAACGAGAA CTATGCCAGT CACAGGTTGC GAGTTAAGCA CATAGATCCG CTTGTTCTCG
GTCTAGATAA AGTAAAGCAA GTCACGGGAT ATTTGGATAT CGAAGATGAC AAGCACTTGT
TCTATTGGTT CTTTGAATCG AGAAACGATC CCCAGAACGA CCCAGTAGTA TTATGGTTGA
ATGGAGGGCC AGGTTGCTCT AGCTCAACGG GACTCTTCTT TGAGTTGGGG CCCTCTTTTA
TCAATTCAAC CCTTCAGCCA GAATATAACC CCTATTCGTG GAACTCGAAT GCGTCTGTTA
TCTTCTTGGA TCAACCTGTA GACGTAGGAC TCTCGTACTC GGATGACAAC GAAGTTTCAA
CTACGGCTGC TGCTGCAAAA GATGTATACA TATTCTTGGA ATTGTTCTTC CAAAAGTTTC
CACAATTCCA AAGCAGAGAC TTTCATATGG CTGGAGAATC GTATGCTGGC CATTACATTC
CTAAGTTCGC GTCGGAGATC CTCAGTCATC CGGAAAGGTC GTTCAACGTG ACTTCAGTTC
TCATTGGAAA TGGGTTCACT GATGCTATTC CACAATATAA AGCTCTTATT GGAATGGGAT
GTGGACAAGG AGGTTATGAT TCAATCTTGT CAGAACAAGA TTGCAAGGAA TTGGAAGAGA
ATTACTATCC CAAATGCAAG CAATTCCTTG AACTATGCAA CAGGGAACAG GATGCATTGA
CATGTGTACC AGCTTATCAT TACTGTGAAA CAAGAATGTT TATTCCTTTC TCCAAGACGA
ACTTGAACCC ATATGACATA CGTGAAGAAT GTGAAAGGGG TGGAACTTGC TACGAGGAAC
TAGACGATGT GGACGCTTAT CTCAACCTTG ACTTTGTCAG GAGTGCCATT GGGGTTTCTC
CTGAAGTCAA GAAGTATGAA GGTTGTTCTG ATGTTGTATC AAAGAACTTT GCCTTGGAAG
GCGATAAAGC ATTGCCCCAT CAGCAGTATG TTGCCGAACT TCTTGAAAAG GAGGTAGCAG
TATTGATATT TGCTGGAGAT AAAGACTATA GATGTAATTG GTTAGGTAAC TACGAGTGGA
CAGACCAATT AGACTATGAT GGTCATGATG AATTTTCAAG TAAACCTTTG GTGCCATGGC
AAACTTCTGA CGGCAGTATT GGTGGAGAGT ACAGGAACTA CGAAAAGTTC ACTTATTTGA
GATTCTACGA TGCTGGCCAT TTGGTCCCTC ACGATCAACC CCAGAGGGCA TTGGAAATGG
TTAACAGTTG GTTACAAGGA CAGTATTCAT TGAACTAA
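
A GC percentage like the one reported for this gene can be recomputed from a sequence printed in 10-base blocks. The following is a minimal sketch, assuming that only A, C, G and T are counted and that spaces and line breaks are ignored; `gc_percent` is an illustrative name, and the database may compute or round its figure differently.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    # Keep only nucleotide letters so block spacing and newlines are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, copied from the record with its spacing.
first_line = "ATGTCTGAAG TAGGCTATGG AAGCCCACCA GATCCTCGGT ATCCCTATCC TAGCCCTTGG"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of a single line gives the gene-wide value.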
 
Protein sequence
MSEVGYGSPP DPRYPYPSPW FTQPFSLRVK HIDPLVLGLD KVKQVTGYLD IEDDKHLFYW 
FFESRNDPQN DPVVLWLNGG PGCSSSTGLF FELGPSFINS TLQPEYNPYS WNSNASVIFL
DQPVDVGLSY SDDNEVSTTA AAAKDVYIFL ELFFQKFPQF QSRDFHMAGE SYAGHYIPKF
ASEILSHPER SFNVTSVLIG NGFTDAIPQY KALIGMGCGQ GGYDSILSEQ DCKELEENYY
PKCKQFLELC NREQDALTCV PAYHYCETRM FIPFSKTNLN PYDIREECER GGTCYEELDD
VDAYLNLDFV RSAIGVSPEV KKYEGCSDVV SKNFALEGDK ALPHQQYVAE LLEKEVAVLI
FAGDKDYRCN WLGNYEWTDQ LDYDGHDEFS SKPLVPWQTS DGSIGGEYRN YEKFTYLRFY
DAGHLVPHDQ PQRALEMVNS WLQGQYSLN
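
The protein sequence can be related back to the gene sequence by translating exonic stretches with translation table 12, which reads CTG as serine rather than leucine. The following is a minimal sketch, assuming NCBI codon ordering for the standard code. It translates only the first 60 nt and the last 39 nt of the gene sequence, because the record does not give intron boundaries and the full spliced coding sequence is not reconstructed here. `translate`, `CODON_TABLE` and `CODON_TABLE_12` are illustrative names.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AAS)
}
# Translation table 12 differs from the standard code at CTG (Ser, not Leu).
CODON_TABLE_12 = dict(CODON_TABLE, CTG="S")


def translate(seq: str, table: dict = CODON_TABLE_12) -> str:
    """Translate an in-frame, intron-free nucleotide string; '*' marks stop."""
    clean = "".join(seq.upper().split())  # drop block spacing and newlines
    usable = len(clean) - len(clean) % 3  # ignore a trailing partial codon
    return "".join(table[clean[i:i + 3]] for i in range(0, usable, 3))


# First 60 nt and final 39 nt of the gene sequence, copied from the record.
five_prime = "ATGTCTGAAG TAGGCTATGG AAGCCCACCA GATCCTCGGT ATCCCTATCC TAGCCCTTGG"
three_prime = "GTTAACAGTTGGTTACAAGGACAGTATTCATTGAACTAA"

print(translate(five_prime))
print(translate(three_prime))
```

Under these assumptions the two stretches reproduce the first 20 residues of the protein sequence (MSEVGYGSPP DPRYPYPSPW) and its last 12 residues (VNSWLQGQYSLN), followed by a stop.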