Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_76282 |
Symbol | |
ID | 4837585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2076311 |
End bp | 2079516 |
Gene Length | 3206 bp |
Protein Length | 1046 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388900 |
Product | predicted protein |
Protein accession | XP_001382625 |
Protein GI | 150863965 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.967871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.769643 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCATATTTAG CAGCCGCAAA CCATTATGTT AGCCGCTCGT TCCCGAAGCA TAGTCAACGC AGCAAGACGG AGCTTTGCTA CGGCAGCTTC TCCCTCTGCA GCAACTTCCG CCATTCTCTC CAAATACCCC ATTGGCCTCA ACCTCTATGG GTTTGTCATA GACAATGTTC AGCCCATTCC TGAGTTTTCG CTTGTGGCTG TCCATCTAAA GCATGAAAGA AGCGGAGCAG CACACCTTCA CTTGGACTCA CCTACAGATA ATAATAACGT GTTTCTGATT GCGTTCAAAA CCAATCCTCC AGATGCTACT GGTGTACCTC ATATCTTGGA ACATACCACC TTGTGTGGTT CGTATAAGTA CCCTGTACGA GACCCATTCT TCAAGATGAC AAACAGGTCG CTTTCCAACT TCATGAATGC CATGACTGGC CACGACTTCA CCTTCTATCC GTTTGCCACG ACTAACGCAA AGGACTTTGA CAACTTGATG GATGTCTATC TTTCATCGGT ATTCGAGCCT CTTTTGTCAT ACAATGATTT CATCCAAGAA GGATGGAGGT TGGAAAATGA GGATATAAAT GACCCTGAGA GCAAGCTCGA ATTGAAGGGG GTAGTATACA ACGAGATGAA GGGCCAGAAT TCCAACACGT CGTACTACTT CTACATCAAG TTCTTGGAAT CCATATATCC TTCTTTAAAC AATGCTGGTG GGGATCCAGC AAAAATCCCT GACTTGCAGT ACGAAGACTT AGTCGATTTC CACCACAGAA ATTACCATCC TTCCAATGCC AGAACTTTCA CGTATGGTAA TTTGCCGTTG ATGAACCATT TGAAACATTT GAGCGACTAC TTCCAGACGT TTGGCGTCCG TCCTCAATCC AAGGATTTGA AATTGCCAAT CTTCTCCAAC ACGACTTCGC CATCGTCAAC CACCGTTGTC AAGGTTCCTG GTCCTGTAGA TACCATGTCG TCGAAACCAG CTGAACAACA GTACAAGGCT TCGATCACAT GGAACTTAGG AAATGCTCTC GAAGAAGCCA ACCAGTACGA GTTGTTCAAA TGGAAAATCT TGAGTTCTCT TCTCTGTGAC GGCCATAATG CTCCATTTTA TCAAGAGCTT ATTGAAACTG AGTTTGGTGA TGACTTTTCA GCAAACTCAG GAATAGATGC CACGACTGCT TTGATATCGT TCACTATTGG GTTGAACAAT TTGACTGTAG AAAAAGTTGG CCAGTTGGAA TCCAAAGTGT TGGATATTAT TAGAAACAAG GTGCTACCCG AGTTCGAAAA TCCGGAAAGC TCCTACAAAA CTAGAATTGA AGCTATTTTG CATCAGATAG AATTGAACTT GAAAAAACAT AAGCCAGATT TCGGTTTGAG CTTGTTGAAT GTGATCGTTC CTACATGGGT GAATGGCTTG GACCCAATCA AATCGTTAAG GGTTGAACCG ATTTTGAACC AGTTCAAGTC TGACTTTGAG TGCAAGGGAT TACTTGTTTT CAAGGAACTC CTTGATAGTT CAATTTTGAA TCCACAATGT GAAAAGTTCT CGTTTGTCAT GGAACCCCAG AATGAATTCA ACAAGAATTT AACCACAGTT GAAGCTGAGA GAGTAAAGAC GATGGTACAA AGCTTATCTG AAGAAGATAA AAAGATTATA AACGAAAGAG GCCAAGAGCT AGCTAGAAAA CAGACAGAAG AACAGGACGG AGAAGTATTG CCTACTTTGA CCATTAAGGA TATACCAGAG AAGGGAGATT TCCATCCACT TTTGTACTCG CAAATTGGCT CCAATACATT GCAAAAGAGA ATTGTAGATA CAAATGGTTT GGTGTACACT GCTGCTTTAA AGAATATCAA ATATTTGCCA AAGGAGTACT ATAAATACCT TTCTTTGTTC AGTTCCTGTC TCACCAATTT AGCAGGGACT TCTCATACAT CCGTTACCGA TCTTGAGACT AAGATAAGCA GATTGACAGG TGGTATCTCC TTTAGCCACA GAATATCAAC CAATCCCTAT GATATCAGAG AAGTTGACTT GTACTTTCAG ATGTCGGGAA TGGCTCTAAA AGAGAACTCT GAACATATCT ACAACTTGTG GCATGAAGTT CTTACTGATA CGCAATTTGA TTCCAATGAC GAATTGGTCG TTGACAAGTT ATTCACTTTA ATCAAGAATA TGAGCCAGAA CCAGCTAAAT GTAATAGCAG ACAGGGGTCA TTCATATGCG AGTGGATATT CCAATGCCAA GTTGACTCCT GCTAGGTATA TTTCTGATAT TACCTCAGGA ATTAGCCAAG TGGAATTTAT AATGGAGTTA AATTCAAATA TCGAAAAGAA GGGTAAGCAG TATCTTGCTG ACGAAATTTT GCCAATTTTA AAGGAAATCC AGTTATTAAT CATTGACAAT GCCAAGGGCG AATTCAAGTA TAGATTGGTA GGTGACAAGG ACATTATTGG AGAGAATGAG AAGTTGGTAC AAGAATTTAA CGACAAAATT TCACATTTCA GCAACACTTC TAGTCAAGGA AATGCGTCTG AATTGAAGAA TTTGGTCAAT GCTTTCAATA ACAACAATTT GGGAATAAAT GCAAACCAAA ATACCTTGCT TAACTTACCT TTTCAAGTCA GCTATGCTTC TCTCGGGAAA CTTGGTGCAG AGTATGCTTG CAAAGATGGA GCTAGTTTAC AGATATTGGC CCAGTTATAC ACTTTCAAGC ATTTGCACTC TGTAATTAGA GAATCTAATG GAGCTTATGG TGGAGGCTTG CTTTATGATG GATTGGGTGG TACTTTGAAC TTCTATTCCT ACAGAGATCC TAATCCGTTG AAGTCAGTAG AATCTTTTGA GAAATCGTTT GATTTTGGGC TTCTGGTAAA CTGGGAGCCT AAGGACTTGC AAGAAGCCAA ATTGAGAATA TTCCAGAGTG TTGACGCTCC TACCAATATT GCTAGTCAGG GTTCTACTGA ATTTTACGAA GGAATCACAG ACGTGATGAG ACAGGAAAGA AGAGAGAATT TCTTGAGCGT GAACAACCAG GACTTGACGA ACGTCATCCA GAAGTATTTG GTTGACCAGA AGGATAACCT GGTTACAGTT ATCGGTGATA ACACAACTTT AAATGTTGGA AAGGACTGGA CTGTCCAAGA GTTGAATGTT AACTAAACCT GTAAATAGAT AATAGACGTT ATGTATATAT TAATAT
|
Protein sequence | MLAARSRSIV NAARRSFATA ASPSAATSAI LSKYPIGLNL YGFVIDNVQP IPEFSLVAVH LKHERSGAAH LHLDSPTDNN NVFSIAFKTN PPDATGVPHI LEHTTLCGSY KYPVRDPFFK MTNRSLSNFM NAMTGHDFTF YPFATTNAKD FDNLMDVYLS SVFEPLLSYN DFIQEGWRLE NEDINDPESK LELKGVVYNE MKGQNSNTSY YFYIKFLESI YPSLNNAGGD PAKIPDLQYE DLVDFHHRNY HPSNARTFTY GNLPLMNHLK HLSDYFQTFG VRPQSKDLKL PIFSNTTSPS STTVVKVPGP VDTMSSKPAE QQYKASITWN LGNALEEANQ YELFKWKILS SLLCDGHNAP FYQELIETEF GDDFSANSGI DATTALISFT IGLNNLTVEK VGQLESKVLD IIRNKVLPEF ENPESSYKTR IEAILHQIEL NLKKHKPDFG LSLLNVIVPT WVNGLDPIKS LRVEPILNQF KSDFECKGLL VFKELLDSSI LNPQCEKFSF VMEPQNEFNK NLTTVEAERV KTMVQSLSEE DKKIINERGQ ELARKQTEEQ DGEVLPTLTI KDIPEKGDFH PLLYSQIGSN TLQKRIVDTN GLVYTAALKN IKYLPKEYYK YLSLFSSCLT NLAGTSHTSV TDLETKISRL TGGISFSHRI STNPYDIREV DLYFQMSGMA LKENSEHIYN LWHEVLTDTQ FDSNDELVVD KLFTLIKNMS QNQLNVIADR GHSYASGYSN AKLTPARYIS DITSGISQVE FIMELNSNIE KKGKQYLADE ILPILKEIQL LIIDNAKGEF KYRLVGDKDI IGENEKLVQE FNDKISHFSN TSSQGNASEL KNLVNAFNNN NLGINANQNT LLNLPFQVSY ASLGKLGAEY ACKDGASLQI LAQLYTFKHL HSVIRESNGA YGGGLLYDGL GGTLNFYSYR DPNPLKSVES FEKSFDFGLS VNWEPKDLQE AKLRIFQSVD APTNIASQGS TEFYEGITDV MRQERRENFL SVNNQDLTNV IQKYLVDQKD NSVTVIGDNT TLNVGKDWTV QELNVN
|
| |