Gene Information |
Locus tag | PICST_4493 |
Symbol | |
ID | 4838661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 934969 |
End bp | 936315 |
Gene Length | 1347 bp |
Protein Length | 449 aa |
Translation table | 12 |
GC content | 48% |
IMG OID | 640389976 |
Product | predicted protein |
Protein accession | XP_001384483 |
Protein GI | 150865320 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
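The coordinate and length fields in the Gene Information table are related by simple arithmetic. The sketch below checks that relationship, assuming Start bp and End bp are 1-based and inclusive (an assumption, though the reported Gene Length is consistent with it). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 934969
end_bp = 936315
reported_gene_length = 1347      # bp
reported_protein_length = 449    # aa

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Each amino acid is encoded by one codon of three bases.
codons, leftover = divmod(gene_length, 3)

print(f"gene length: {gene_length} bp")
print(f"codons: {codons}, leftover bases: {leftover}")
print(f"matches reported gene length: {gene_length == reported_gene_length}")
print(f"matches reported protein length: {codons == reported_protein_length}")
```

Because 1347 is exactly 3 × 449, every codon in the reported span corresponds to one residue of the reported protein, which suggests that the listed coordinates do not include a stop codon.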
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0320757 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GCCAACGCAT TGGTTATCCC TGATGTCAAC TCATTCTTCA ACTTCAACCA GTTTGTGCCT CTGACTGTTG CTGTCAATGC TGCTCAACAA CAACCACAAC AGCTCTGGAC AGCTAACAAG AACGAGGCTC CTGTTCTTGC AGCCGGTTCC AAAAGCGTAG CCATCCCAAA CAGATACATT GTTATCTACA AGGAAGACGT TACCGAGGCC CAGAGAAACC ATCACAAGAA GTGGTTGATT GCCGAACATA CGGAAATGGT TGCCACAGCC GGAATTCGTC CTTCGGTAGG TGTTCTCGAC TTCTTCGATG TCGACACGCT ACTCCTGGGC TACTTCGGCT ACTTCACTCC CGAGATGCTC CGCAAGATCC AGAAGGACCC TCGCATCAAG TTCATTGAGC AGGATACCGT AATGAAGGTC AATGAGTTCG ACGTCGAGAA AGATGCCGAA TGGGGTTTGA GCAGGATTTC ACATCGTGAA AGCAGCCCTC AACTCGAATA CCTTTACGAT AATGAAGGTG GCAAGGGTGT CACTGCTTAC GTCATTGACA CCGGTATCAA AGTTGAACAC GAAGAATTCG AAGGCAGAGC CCTGTGGGGT GAAGCCGTGG CTTTCCCCAA GTTGAAGATT GATGGACACG GCCATGGAAC CCACTGTGCT GGTATCATTG GATCCAAGAC GTATGGTGTA GCTAAGAATG TTGAATTAGT AGCTGTAGGT GTTATGAACT TGTTGGGTAG TGGTACGACC TCAGACATCA TCAAGGGTGT CGAATTTGTT GTCGGCGACC ATAAATCAAA CTTCCTGGCA AAGAAGAAGG GCTTCAAGGG CTCCACAGTC AACATGTCTA TTGGTGGAGG AGAATCTGAA GCTTTGGACT TGGCTGTTAA TGCTGCTACC AAGGCTGGCT TGCATGTAGC TGTAGCTGCT GGTAACGACA ATGCTGACAC TTGTACTTTT TCTCCAGCAA GAGCCAGCGG ACCAATAACC GTAGGAGCTT CTGATATCAA CGACAACAAG GCTGAATTCT CCAACTGGGG TTCTTGTGTA GACATCTTCG CACCTGGGGT TGACATTGTT TCTACATACA TCTGGAGCAA CACTGCTTCT ATGTCTGGTA CTTCAATGGC TTCTCCTCAC ATTGCTGGAT TGCTTTCGTA CTACTTGTCG CTCTACCCTG AGCCTGAATC CGAGTACAGC GTAGCTGTAT TGGACCCAGC AACCTTGAAG GACAAGGTGA TCAAGTATGC CACCAAGGGC GTTATAAAGG GCTTGAAGAA TGACGGTTCG CCTAACTTAT TGGCCTTCAA TGGTGCTGGC GCCAATATCA CCGACTTCTG GAGCTTA
|
Protein sequence | ANALVIPDVN SFFNFNQFVP STVAVNAAQQ QPQQLWTANK NEAPVLAAGS KSVAIPNRYI VIYKEDVTEA QRNHHKKWLI AEHTEMVATA GIRPSVGVLD FFDVDTLLSG YFGYFTPEML RKIQKDPRIK FIEQDTVMKV NEFDVEKDAE WGLSRISHRE SSPQLEYLYD NEGGKGVTAY VIDTGIKVEH EEFEGRASWG EAVAFPKLKI DGHGHGTHCA GIIGSKTYGV AKNVELVAVG VMNLLGSGTT SDIIKGVEFV VGDHKSNFSA KKKGFKGSTV NMSIGGGESE ALDLAVNAAT KAGLHVAVAA GNDNADTCTF SPARASGPIT VGASDINDNK AEFSNWGSCV DIFAPGVDIV STYIWSNTAS MSGTSMASPH IAGLLSYYLS LYPEPESEYS VAVLDPATLK DKVIKYATKG VIKGLKNDGS PNLLAFNGAG ANITDFWSL
|
| |
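The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below translates the first 69 bases of the gene sequence above under table 12 and under the standard code (table 1) to show where the two differ. The helper names `codon_table` and `translate` are hypothetical, and only the CTG reassignment is modelled; differences in start codons are ignored.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(table_id):
    """Return a codon -> amino acid dict for NCBI table 1 or 12."""
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    table = dict(zip(codons, STANDARD_AAS))
    if table_id == 12:
        table["CTG"] = "S"  # the only sense-codon change in table 12
    elif table_id != 1:
        raise ValueError("only tables 1 and 12 are handled in this sketch")
    return table


def translate(dna, table_id):
    """Translate frame 1 of a DNA string, ignoring spaces and trailing bases."""
    seq = "".join(dna.split()).upper()
    table = codon_table(table_id)
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# First seven 10-base blocks of the gene sequence in this record.
fragment = ("GCCAACGCAT TGGTTATCCC TGATGTCAAC TCATTCTTCA "
            "ACTTCAACCA GTTTGTGCCT CTGACTGTTG")

print(translate(fragment, 12))  # table 12: CTG read as serine
print(translate(fragment, 1))   # standard code: CTG read as leucine
```

Under table 12 the fragment yields ANALVIPDVNSFFNFNQFVPSTV, which matches the start of the protein sequence in the record. Under the standard code the residue at position 21 would be L instead of S.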