Gene Information |
Locus tag | PICST_57141 |
Symbol | |
ID | 4837873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1327779 |
End bp | 1328897 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640389188 |
Product | predicted protein |
Protein accession | XP_001383880 |
Protein GI | 150864880 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
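
The coordinate and length fields above agree with one another, and a short sketch can confirm it. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the Gene Length of an unspliced CDS counts the stop codon. The record states neither assumption, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1327779, 1328897)
print(length)                  # 1119
print(protein_length(length))  # 372
```

Both values match the Gene Length (1119 bp) and Protein Length (372 aa) fields in the record.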
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.371029 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGTAG ATCTCGATAA ATACGTGAAG CCTGGTAGAG ACTATTATAT TGCACTTGGA CTAGAAGGTT CAGCCAACAA GTTAGGAGTG GGAATTATCA GACAACCAGT GGGCCAATTG TCGCAGACTA ACCGAGCAGA AGTCCTCCTG AATGTTAGAG ATACATATGT CACTCCTCCA GGAGAAGGCT TCCTTCCTCG GGATACAGCC AGACATCACC GAAACTGGGT AGTGAGAATT ATTAAAAGAG CATTGAGCGA AGCCAAAGTC ACGGGAGCAG ACCTTGACTG TATTTGTTTC ACCCAGGGCC CTGGAATGGG TGCTCCATTA CAGAGTGTAG TTGTTGCTGC TCGTACATTG GCCCAGTTAT GGGAGTTGCC TCTAGTTGGA GTAAATCACT GCGTGGGCCA CATTGAGATG GGAAGAGAGA TCACCGGAGC CGACAACCCC GTAGTTTTAT ATGTCAGTGG TGGAAATACG CAGGTGATTG CATATTCTAA ACAGAGATAC AGAATCTTTG GAGAAACGCT TGATATCGCC ATAGGGAATT GTTTGGACAG ATTTGCTAGA ACACTCAAGA TACCGAACGA GCCTGCGCCA GGATACAACA TCGAACAAAT GGCCAAAAAG GGTAAACATT TGGTCAACTT ACCTTATACT GTAAAAGGAA TGGACTTGTC AATGTCTGGG ATATTGGCCC ATGTAGACGG CTTGGCCAAG GACATGTTTG GCAAACAGGG TAAAAAGCTC GTAGACGAAG AAACAGGAGA ACTTATTACT GCGGAGGATC TTTGCTTCTC TCTTCAGGAG ATCTTGTACT CTATGTTGGT AGAAATCACA GAACGTGCTT TAGCCCATGT GAATAGTAAC CAGGTGTTGA TTGTAGGTGG TGTAGGGTCC AACGAGCGGT TACAAGAGAT GATGAAGTTG ATGATCCAAG ATAGGAAAAA CGGCCAGATC TATGCCACCG ACGAAAGATT CTGTATAGAT AACGGCATAA TGATAGCTCA TGCCGGGTTG TTGAGTTACA GAACCGGTCA GACAAACGAC CTCTGGAACA CTGTCTGTAC ACAGAGATTC AGAACTGACG AAGTATTTGT AAAATGGAGA GATGACTAG
Protein sequence | MTVDLDKYVK PGRDYYIALG LEGSANKLGV GIIRQPVGQL SQTNRAEVLS NVRDTYVTPP GEGFLPRDTA RHHRNWVVRI IKRALSEAKV TGADLDCICF TQGPGMGAPL QSVVVAARTL AQLWELPLVG VNHCVGHIEM GREITGADNP VVLYVSGGNT QVIAYSKQRY RIFGETLDIA IGNCLDRFAR TLKIPNEPAP GYNIEQMAKK GKHLVNLPYT VKGMDLSMSG ILAHVDGLAK DMFGKQGKKL VDEETGELIT AEDLCFSLQE ILYSMLVEIT ERALAHVNSN QVLIVGGVGS NERLQEMMKL MIQDRKNGQI YATDERFCID NGIMIAHAGL LSYRTGQTND LWNTVCTQRF RTDEVFVKWR DD
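
Translation table 12 (the alternative yeast nuclear code) reads the codon CTG as serine rather than leucine. This matters here: codon 50 of the gene sequence is CTG, and residue 50 of the protein sequence is S. The following is a minimal sketch of translating under that table. It assumes the Gene sequence is written 5' to 3' on the coding strand, which the record does not state but which the leading codons support. The helper name `translate_table_12` is hypothetical.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G; third base varies fastest).
BASES = "TCAG"
# Amino acids for genetic code table 12: identical to the standard code
# except that CTG encodes S (serine) instead of L (leucine).
TABLE_12_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AMINO_ACIDS)
}


def translate_table_12(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in the record.
print(translate_table_12("ATGACAGTAG ATCTCGATAA ATACGTGAAG"))  # MTVDLDKYVK
# Bases 139-156 of the gene sequence, covering the CTG codon at 148-150.
print(translate_table_12("GAAGTCCTCCTGAATGTT"))  # EVLSNV
```

Under the standard code the second example would give EVLLNV, which would disagree with the recorded protein sequence.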