Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_67906 |
Symbol | ECM17.1 |
ID | 4839565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 1228967 |
End bp | 1231866 |
Gene Length | 2900 bp |
Protein Length | 934 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390880 |
Product | Sulfite reductase [NADPH] beta subunit (Extracellular matrix protein 17) |
Protein accession | XP_001384919 |
Protein GI | 126136791 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0155] Sulfite reductase, beta subunit (hemoprotein) [COG0369] Sulfite reductase, alpha subunit (flavoprotein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0700093 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGATCA TTGATTCTGA ACCATACCAG GACAAAGCTG CCGGTAGGTC CTCCAACGGT TACCGTAAGA AGGATGTTGG TCTCTACGCC ATAAACTACG GTAATGTCTA CGTTGCCTCT GTAGCCATCT ACTCTTCTTA CACCCAGGTA TTGCAAGCTT TCATTGAAGC CGAAAAGTTC AACGGTCCAT CGATTGTGTT AGCCTACTTG CCATACAACA AGGAAACCGA CAACACCTTG TCCATCTTGC AAGAAACGAA GAAGGCTGTG GACAGCGGTT ACTGGCCATT GTACAGATAT AATCCTAGCA TAGAGAGCGA GGCTGACTCC TTCAAGTTGG ACTCTTCCAA TTTGAGAAAG CAGTTGAAAG AATTCTTGGA CCGTGAAAAT AAGTTGACTT TGTTGGCCGC CAAGAACCCT GTTTTGTCTA GAAACTTAAC TGCTACTGCT ACTTCTGAAG TCAAAAAGCA ACAGAAGAAG GTAGCTGCCG ATTCCTATGC CAAGTTACTT CAAGGTCTTT CTGGTCCACC ATTGACTGTC GCCTTTGCTT CAGATGGCGG CAATGCTGAA GCTGTTGCCA AAAAAGTAAA CAGACAAGCC TTGGGTAGAG GGTTGAAATC GGTAGTGTTG GCTATGGATG ACTTGTCAAT TGAAGATTTG CCTACTGAAA CCAACATTGT CTTCATTACC TCTACTTCTG GTCAAGGTGA GTTCCCAACA AACGGTAAGC AGTTCTGGGA CGGTTTGAAG AACACCACCG ACTTGGAGTT GTCTACTGTT AAATATTCTG TTTTCGGCTT GGGTGACTCT CAGTACTGGC CCAGAAAGGA AGACAAGCAT TACTACAATA AGCCAGCTAA AGACTTAGAA TCCAAGTTGA AACTCTACGG TGGTATTGAA TTGGCTGAGA TTGGTTTGGG TGACGACCAG GATGCTGATG GATTCAACAC TGGTTTCAAC GAGTGGATTC CAAAGATTTG GAAGGCGCTT GGAGTTGACA ATGTTGAGGG AGTCGAAGAA CCAAAACCAA TCACCAACGA AGACATGAAA ATCAACTCTG ATTTCTTGCG AGGCACCATT GTAGAAGGAT TGAACGATCA GTCTACTGGT TCTATTAGTG CTGTTGATCA ACAGTTGACC AAGTTCCACG GTATCTACAT GCAAGATGAT CGTGATATTA GAGACGAACG TAAGAGTCAA GGGTTGGAGC CAGCTTACGC TTTTATGGTT CGTGTTAGAT TGCCTGGAGG ACAAGCCTCT CCAGCCCAGT ACTTGAAGAT GAACGAATTG TCCGACTTCA GAGGTAACGG GACCTTGAAG ATCACCACCA GAGCCACCTT CCAGTTGCAC GGTGTTGTCA AGCACAATTT GAAACCAGCT ATCAGAGGTA TGAATGCTTC CTTGATGGAT ACTTTAGCTG CCTGTGGTGA TGTTAACAGA AACGTCATGG TTTCTGCCTT ACCCCACAAT GCTAAGGTAC ATGCTCAAAT TTCCAGTATT GGTGCTAAGA TCTCAGAACA TTTGTTACCA AATACTACTG CTTATCACGA AATCTGGTTA CAGGGCGAAG ATGAAAGTGA CAAGCCAGGT GACAGAGACA ACTGGGAAAC TCGTAAGAAT GGTCCAACCA AGACCAAGAC GTTGGTTGCT GGTAACGCTT TAGCTGATCT TGAGCCTCTT TATGGTAGTA CTTATTTACC AAGAAAATTC AAGATCGTTA TCACCGTACC TCCATTCAAT GATGTCGATG TTTATGCTCA TGACATTGGT TTGATTGCCA TTGTGATCAA CAACGAAGTC ATTGGTTTCA ACATGTTAGT GGGTGGTGGT ATGGGTTCGA CCCATAACAA CAAGAAGACA TACCCTAGAA CCGGTTCCAT GTTTGGTTTC GTTCCTGTTG ACAAGATCCA CTTGGCTTGT GAGAAGGTGA TGCTTGTACA GAGGGACTTT GGTGATAGAA CCAACAGAAA GCACGCCAGA TTGAAGTACA CAATTGACGA CATGGGAGTT GATATTTTCA AGGGTAAGGT CGAAGACTTG TTGGGCTTCA AATTCGAGGA ACCAAAAGAA TTCAAGATCG AGTCCAACAT CGACTACTTC GGTTGGTGCA AGGATGAATT GGGTTACAAC CACTTCACAT GCTTCATCGA AAACGGTAGA ATAGAAGACA CTGCTGAACT TCCTCAAAAA ACTGGTTTGA AGAATATTGC CGAATACTTA AACGGTGGCA GATCTGGTGA GTTCCGTTTA ACTGGTAATC AGCACTTAGT CATTTCCAAC ATCGAGGACG CAGACTTGAC CCATGTCAAG TCTTTATTGG CTGAGTATAA GTTGGACAAC ACTGATTTTT CCGCCTTGAG ATTGTCTTCA TCGTCTTGTG TTGCTTTCCC TACCTGTGGT TTGGCTATGG CTGAATCTGA ACGTTACTTG CCAGAGTTGA TCACCAAGTT GGAAACCTCT TTGGAAGAGT TCGGCTTGAG ACATGACTCT GTGGTGATGA GAATGACAGG TTGTCCAAAC GGTTGTGCTA GACCATGGCT TGCTGAAGTT GCCTTGGTTG GTAAAGCTTA TGGAGCCTAC AATTTGATGT TAGGTGGTGG ACACCATGGT GAGAGATTAA ACAAGATCTA TAGATATTCT ATCAAGGAAG ATGAAATATT AGACATCTTG TTGCCATTGT TCAAGAGATG GAGTTTGGAA AGAGAAGAAG GCGAACCATT TGGTGACTTC GTTATTAGAA TCGGAATCAT CAAGCCTACT ACCGAAGGTA AATACTTCCA CGATGATATT CCCGAAGATG CTTAATGTTT CATTTCGACA CATATACAAT AGATTCATGA ACATATCCCT ATAATTCTGC ATTATCTTTT ATACTATTTA ATAAAAATGT TACTATATTA
|
Protein sequence | MLIIDSEPYQ DKAAGRSSNG YRKKDVGLYA INYGNVYVAS VAIYSSYTQV LQAFIEAEKF NGPSIVLAYL PYNKETDNTL SILQETKKAV DSGYWPLYRY NPSIESEADS FKLDSSNLRK QLKEFLDREN KLTLLAAKNP VLSRNLTATA TSEVKKQQKK VAADSYAKLL QGLSGPPLTV AFASDGGNAE AVAKKVNRQA LGRGLKSVVL AMDDLSIEDL PTETNIVFIT STSGQGEFPT NGKQFWDGLK NTTDLELSTV KYSVFGLGDS QYWPRKEDKH YYNKPAKDLE SKLKLYGGIE LAEIGLGDDQ DADGFNTGFN EWIPKIWKAL GVDNVEGVEE PKPITNEDMK INSDFLRGTI VEGLNDQSTG SISAVDQQLT KFHGIYMQDD RDIRDERKSQ GLEPAYAFMV RVRLPGGQAS PAQYLKMNEL SDFRGNGTLK ITTRATFQLH GVVKHNLKPA IRGMNASLMD TLAACGDVNR NVMVSALPHN AKVHAQISSI GAKISEHLLP NTTAYHEIWL QGEDESDKPG DRDNWETRKN GPTKTKTLVA GNALADLEPL YGSTYLPRKF KIVITVPPFN DVDVYAHDIG LIAIVINNEV IGFNMLVGGG MGSTHNNKKT YPRTGSMFGF VPVDKIHLAC EKVMLVQRDF GDRTNRKHAR LKYTIDDMGV DIFKGKVEDL LGFKFEEPKE FKIESNIDYF GWCKDELGYN HFTCFIENGR IEDTAELPQK TGLKNIAEYL NGGRSGEFRL TGNQHLVISN IEDADLTHVK SLLAEYKLDN TDFSALRLSS SSCVAFPTCG LAMAESERYL PELITKLETS LEEFGLRHDS VVMRMTGCPN GCARPWLAEV ALVGKAYGAY NLMLGGGHHG ERLNKIYRYS IKEDEILDIL LPLFKRWSLE REEGEPFGDF VIRIGIIKPT TEGKYFHDDI PEDA
|
| |