Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32238 |
Symbol | |
ID | 4839375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 1021573 |
End bp | 1023081 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390690 |
Product | predicted protein |
Protein accession | XP_001384866 |
Protein GI | 150865588 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2866] Predicted carboxypeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.237691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTCGA AAAACTGGCT CTTGTCCATT TCGCTAGCTA CCCTCGCTAT GGGTTTCCAG ATGCAGATGC CCTTTGACAC CTCCAGATTC CAAGACTGGC TCCCGCTGAA TGGCAACTCG TACCGTCACT ATCTGACTGT GCCTCTTGAC CAGTTACCTA TAGACGTCCA TCAGTACAAA AACGATATGG TGATAAGGAT CAACTATGCT GAGAATAAAG AGTTGAAAAA ATTATTGTTT TCTCTGCAGC CTGAGAACAA ACCTGAAGAT TCAAACAACC ATAAGGACAC CAAAATCCAC TACAGCAGAT GGGCCCACAA CAATGTCAAG TCGACAGTGG ACTTACAGAT AGACGAAGAC AACTTGGTTG GGCTCATTGG CCGTTTTCCT CTGTTGGACT ATACTGTGAT CATCCAAGAC TTGGCTCAGA AAGTATACGA GACATATCCG CAAGACTTTG AAAGACCAGG CAAAAGCAAG ATATATGCGC AGAAAGACGA TTACGTTTAC AAATCAACGG CTGAAGTCCT TGATTCAATG AAAGTCGATG TCATGTCCGA GCTATTTTTC CGCGAATACA GACCGCTCGA AACTATCGAT GCCTGGTTGG ATATCATTCA GCAGACATAT CCTGACATCA TTACACTTGA AGAAATTGGC CATAGTTTTG AAAACAGAGC CTACAAAGTT GTCCATTTCT CTGTACCCGA CGGCAATGTG GACCACAGCC AAAAGAAGAC AATTGTGGTC AACGGAGGAG TGCACTCACG TGAATGGATT TCTGTCTCTT CTGTGTTGTA TACGGTCTAC CAGCTTATAC AGCTTTATAA CGAGAATCCT ACTTCTAAGA TCTTCTCTCA CTTAGATTTC TTGTTTATTC CTATTTCCAA CCCTGATGGT TACGAATACA CCTGGAGATC TGATCGGTTG TGGCGTAAGA ACAGACAGCT GACCCTTTAT CCTGGCTGCT TTGGCATAGA TATTGACCAT TCCTACGATT ACCACTGGAT CAAGTCTTCT GACTGGGCCT GTGGAGAAGA GTACAGTGGA GAGCAGCCTT TTGAAGCCTA CGAATCTCAG ATCTGGGAAG ATTACTTGAA CGCTACTAAC AACGACCACA AGATCTGGGG CTATATCGAC TTGCATTCGT ATGCCCAAGA GATCTTGTAC CCATATGCAT ATTCTTGTAG TGAGCAGCCT AGAGACGAAG AGAACTTGAT TGAGTTGGCC TATGGTATCT CCAAGGCTAT TAGAGTGCAA TCGGGAAAGA CCTATGATGT CTTGCCAGCT TGTATTGATA AGGATGCTGA TCTTCTTCCT GATTTAGGTT CCGGTAGTGC TTTGGACTTT ATGTATCACA ACAGAGCATA TTGGGCTTAC CAGTTGAAGT TGCGAGACAG TGGTAGTCAT GGGTTCTTGC TTCCTAGTAA GTACATCGAG CCTGTGGGTG AAGAGATATT TGCCGGAATC AAGTATTTCT GTTTGTTCAT CTTGAGCGAC GATCGTTAG
|
Protein sequence | MLSKNWLLSI SLATLAMGFQ MQMPFDTSRF QDWLPSNGNS YRHYSTVPLD QLPIDVHQYK NDMVIRINYA ENKELKKLLF SSQPENKPED SNNHKDTKIH YSRWAHNNVK STVDLQIDED NLVGLIGRFP SLDYTVIIQD LAQKVYETYP QDFERPGKSK IYAQKDDYVY KSTAEVLDSM KVDVMSELFF REYRPLETID AWLDIIQQTY PDIITLEEIG HSFENRAYKV VHFSVPDGNV DHSQKKTIVV NGGVHSREWI SVSSVLYTVY QLIQLYNENP TSKIFSHLDF LFIPISNPDG YEYTWRSDRL WRKNRQSTLY PGCFGIDIDH SYDYHWIKSS DWACGEEYSG EQPFEAYESQ IWEDYLNATN NDHKIWGYID LHSYAQEILY PYAYSCSEQP RDEENLIELA YGISKAIRVQ SGKTYDVLPA CIDKDADLLP DLGSGSALDF MYHNRAYWAY QLKLRDSGSH GFLLPSKYIE PVGEEIFAGI KYFCLFILSD DR
|
| |