Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_36810 |
Symbol | |
ID | 4840258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 1402330 |
End bp | 1403703 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 12 |
GC content | 48% |
IMG OID | 640391573 |
Product | predicted protein |
Protein accession | XP_001385968 |
Protein GI | 126138890 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.820131 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCTCG AGTTCCCTGA GACCGTTTCC AAGTTGAACT TCAAGGCTCC AGCCAAAAAG TCCTCTAGCA AGTCGCAGTT CGACTACCAT GTATCTGACG CCAAGCTTCC TAACCACAAG CTTAGAGTCA AAAACACTCC TAAGGACTTG GGCATTGACT CCGTCAAGCA GTACAGTGGT TACTTGGATG TTGAGGACGA AGACAAGCAC TTCTTCTACT GGTTCTTCGA ATCGAGAAAC GACCCCAAGA ACGACCCCGT TATCTTGTGG TTGAACGGTG GTCCAGGATG TTCGTCTTTG ACCGGTTTGT TCTTTGAATT GGGCCCAGCA TCCATCGGCG CCGACTTGAA GCCTGTTCAC AACCCATACT CATGGAACAG TAATGCCTCG GTAATCTTCT TGGACCAGCC AGTAAATGTT GGATACTCCT ACTCTTCTCA GTCTGTTTCC AACACCATTG CTGCTGGCCA GGACGTGTAT GCCTTCTTGG AATTATTCTT CAAGCAGTTC CCAGAATACA ACACTCTTCC TTTCCACATT GCTGGTGAAT CCTACGCCGG CCATTACATC CCAGTGTTCG CCAGTGAGAT CTTGAGCCAT GAGGACCGTT CTTTCAACTT GACCTCGGTG TTGATCGGAA ACGGTTTGAC CGACCCTTTG ACCCAATACG AATACTACGA GCCTATGGCC TGTGGTGAAG GAGGAGAACC TTCCGTCTTG GAACCAGAAG AATGCCAAGC CATGTCCAAC GCCATTCCTA GATGTTTGTC TTTAATCAAG TCCTGTTATG AGTCCGGCTC TTTGTGGTCG TGTGTTCCTG CCACGATCTA CTGTAACAAC GGTCAGATGG GTCCTTACCA AAAGACTGGT AGAAATGTCT ACGACATCAG AACCATGTGT GAAGGCTCCA ACTTGTGCTA CAAAGATTTG GAATACATCG ACCAATACTT GAACCAGCCG GAAGTCAAGG CTAAGCTTGG TGCCGAGGTG GACGAGTATG AATCCTGTAA CTTCGACATT AACAGAAACT TCTTGTTGGC CGGTGACTGG ATGAAGCCTT ACTACAAGAA TGTCATTGAA TTATTGGAAG CTAAGCTCCC AGTGTTGATT TATGCCGGTG ACAAGGATTT CATCTGTAAC TGGTTGGGAA ACCAAGCCTG GACCAACAGT TTGCCATGGT CTGGAGCTGC CAAGTTTGCC ACAGAAAAAA TCAGAACCTG GACAGTAGGA AAGAAGGCTG CCGGTGAAGT CAAGAACTTT GCCAACTTCA CCTTCTTGAG AGTGTTTGGT GGTGGTCACA TGGTGCCATA CGACCAACCA GAGAATGCTT TGGACATGGT CAACAGATGG GTTTCTGGCG ACCGCAAGTT CTGA
|
Protein sequence | MMLEFPETVS KLNFKAPAKK SSSKSQFDYH VSDAKLPNHK LRVKNTPKDL GIDSVKQYSG YLDVEDEDKH FFYWFFESRN DPKNDPVILW LNGGPGCSSL TGLFFELGPA SIGADLKPVH NPYSWNSNAS VIFLDQPVNV GYSYSSQSVS NTIAAGQDVY AFLELFFKQF PEYNTLPFHI AGESYAGHYI PVFASEILSH EDRSFNLTSV LIGNGLTDPL TQYEYYEPMA CGEGGEPSVL EPEECQAMSN AIPRCLSLIK SCYESGSLWS CVPATIYCNN GQMGPYQKTG RNVYDIRTMC EGSNLCYKDL EYIDQYLNQP EVKAKLGAEV DEYESCNFDI NRNFLLAGDW MKPYYKNVIE LLEAKLPVLI YAGDKDFICN WLGNQAWTNS LPWSGAAKFA TEKIRTWTVG KKAAGEVKNF ANFTFLRVFG GGHMVPYDQP ENALDMVNRW VSGDRKF
|
| |