Gene PICST_67906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_67906 
SymbolECM17.1 
ID4839565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1228967 
End bp1231866 
Gene Length2900 bp 
Protein Length934 aa 
Translation table12 
GC content43% 
IMG OID640390880 
ProductSulfite reductase [NADPH] beta subunit (Extracellular matrix protein 17) 
Protein accessionXP_001384919 
Protein GI126136791 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0155] Sulfite reductase, beta subunit (hemoprotein)
[COG0369] Sulfite reductase, alpha subunit (flavoprotein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0700093 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGATCA TTGATTCTGA ACCATACCAG GACAAAGCTG CCGGTAGGTC CTCCAACGGT 
TACCGTAAGA AGGATGTTGG TCTCTACGCC ATAAACTACG GTAATGTCTA CGTTGCCTCT
GTAGCCATCT ACTCTTCTTA CACCCAGGTA TTGCAAGCTT TCATTGAAGC CGAAAAGTTC
AACGGTCCAT CGATTGTGTT AGCCTACTTG CCATACAACA AGGAAACCGA CAACACCTTG
TCCATCTTGC AAGAAACGAA GAAGGCTGTG GACAGCGGTT ACTGGCCATT GTACAGATAT
AATCCTAGCA TAGAGAGCGA GGCTGACTCC TTCAAGTTGG ACTCTTCCAA TTTGAGAAAG
CAGTTGAAAG AATTCTTGGA CCGTGAAAAT AAGTTGACTT TGTTGGCCGC CAAGAACCCT
GTTTTGTCTA GAAACTTAAC TGCTACTGCT ACTTCTGAAG TCAAAAAGCA ACAGAAGAAG
GTAGCTGCCG ATTCCTATGC CAAGTTACTT CAAGGTCTTT CTGGTCCACC ATTGACTGTC
GCCTTTGCTT CAGATGGCGG CAATGCTGAA GCTGTTGCCA AAAAAGTAAA CAGACAAGCC
TTGGGTAGAG GGTTGAAATC GGTAGTGTTG GCTATGGATG ACTTGTCAAT TGAAGATTTG
CCTACTGAAA CCAACATTGT CTTCATTACC TCTACTTCTG GTCAAGGTGA GTTCCCAACA
AACGGTAAGC AGTTCTGGGA CGGTTTGAAG AACACCACCG ACTTGGAGTT GTCTACTGTT
AAATATTCTG TTTTCGGCTT GGGTGACTCT CAGTACTGGC CCAGAAAGGA AGACAAGCAT
TACTACAATA AGCCAGCTAA AGACTTAGAA TCCAAGTTGA AACTCTACGG TGGTATTGAA
TTGGCTGAGA TTGGTTTGGG TGACGACCAG GATGCTGATG GATTCAACAC TGGTTTCAAC
GAGTGGATTC CAAAGATTTG GAAGGCGCTT GGAGTTGACA ATGTTGAGGG AGTCGAAGAA
CCAAAACCAA TCACCAACGA AGACATGAAA ATCAACTCTG ATTTCTTGCG AGGCACCATT
GTAGAAGGAT TGAACGATCA GTCTACTGGT TCTATTAGTG CTGTTGATCA ACAGTTGACC
AAGTTCCACG GTATCTACAT GCAAGATGAT CGTGATATTA GAGACGAACG TAAGAGTCAA
GGGTTGGAGC CAGCTTACGC TTTTATGGTT CGTGTTAGAT TGCCTGGAGG ACAAGCCTCT
CCAGCCCAGT ACTTGAAGAT GAACGAATTG TCCGACTTCA GAGGTAACGG GACCTTGAAG
ATCACCACCA GAGCCACCTT CCAGTTGCAC GGTGTTGTCA AGCACAATTT GAAACCAGCT
ATCAGAGGTA TGAATGCTTC CTTGATGGAT ACTTTAGCTG CCTGTGGTGA TGTTAACAGA
AACGTCATGG TTTCTGCCTT ACCCCACAAT GCTAAGGTAC ATGCTCAAAT TTCCAGTATT
GGTGCTAAGA TCTCAGAACA TTTGTTACCA AATACTACTG CTTATCACGA AATCTGGTTA
CAGGGCGAAG ATGAAAGTGA CAAGCCAGGT GACAGAGACA ACTGGGAAAC TCGTAAGAAT
GGTCCAACCA AGACCAAGAC GTTGGTTGCT GGTAACGCTT TAGCTGATCT TGAGCCTCTT
TATGGTAGTA CTTATTTACC AAGAAAATTC AAGATCGTTA TCACCGTACC TCCATTCAAT
GATGTCGATG TTTATGCTCA TGACATTGGT TTGATTGCCA TTGTGATCAA CAACGAAGTC
ATTGGTTTCA ACATGTTAGT GGGTGGTGGT ATGGGTTCGA CCCATAACAA CAAGAAGACA
TACCCTAGAA CCGGTTCCAT GTTTGGTTTC GTTCCTGTTG ACAAGATCCA CTTGGCTTGT
GAGAAGGTGA TGCTTGTACA GAGGGACTTT GGTGATAGAA CCAACAGAAA GCACGCCAGA
TTGAAGTACA CAATTGACGA CATGGGAGTT GATATTTTCA AGGGTAAGGT CGAAGACTTG
TTGGGCTTCA AATTCGAGGA ACCAAAAGAA TTCAAGATCG AGTCCAACAT CGACTACTTC
GGTTGGTGCA AGGATGAATT GGGTTACAAC CACTTCACAT GCTTCATCGA AAACGGTAGA
ATAGAAGACA CTGCTGAACT TCCTCAAAAA ACTGGTTTGA AGAATATTGC CGAATACTTA
AACGGTGGCA GATCTGGTGA GTTCCGTTTA ACTGGTAATC AGCACTTAGT CATTTCCAAC
ATCGAGGACG CAGACTTGAC CCATGTCAAG TCTTTATTGG CTGAGTATAA GTTGGACAAC
ACTGATTTTT CCGCCTTGAG ATTGTCTTCA TCGTCTTGTG TTGCTTTCCC TACCTGTGGT
TTGGCTATGG CTGAATCTGA ACGTTACTTG CCAGAGTTGA TCACCAAGTT GGAAACCTCT
TTGGAAGAGT TCGGCTTGAG ACATGACTCT GTGGTGATGA GAATGACAGG TTGTCCAAAC
GGTTGTGCTA GACCATGGCT TGCTGAAGTT GCCTTGGTTG GTAAAGCTTA TGGAGCCTAC
AATTTGATGT TAGGTGGTGG ACACCATGGT GAGAGATTAA ACAAGATCTA TAGATATTCT
ATCAAGGAAG ATGAAATATT AGACATCTTG TTGCCATTGT TCAAGAGATG GAGTTTGGAA
AGAGAAGAAG GCGAACCATT TGGTGACTTC GTTATTAGAA TCGGAATCAT CAAGCCTACT
ACCGAAGGTA AATACTTCCA CGATGATATT CCCGAAGATG CTTAATGTTT CATTTCGACA
CATATACAAT AGATTCATGA ACATATCCCT ATAATTCTGC ATTATCTTTT ATACTATTTA
ATAAAAATGT TACTATATTA
 
Protein sequence
MLIIDSEPYQ DKAAGRSSNG YRKKDVGLYA INYGNVYVAS VAIYSSYTQV LQAFIEAEKF 
NGPSIVLAYL PYNKETDNTL SILQETKKAV DSGYWPLYRY NPSIESEADS FKLDSSNLRK
QLKEFLDREN KLTLLAAKNP VLSRNLTATA TSEVKKQQKK VAADSYAKLL QGLSGPPLTV
AFASDGGNAE AVAKKVNRQA LGRGLKSVVL AMDDLSIEDL PTETNIVFIT STSGQGEFPT
NGKQFWDGLK NTTDLELSTV KYSVFGLGDS QYWPRKEDKH YYNKPAKDLE SKLKLYGGIE
LAEIGLGDDQ DADGFNTGFN EWIPKIWKAL GVDNVEGVEE PKPITNEDMK INSDFLRGTI
VEGLNDQSTG SISAVDQQLT KFHGIYMQDD RDIRDERKSQ GLEPAYAFMV RVRLPGGQAS
PAQYLKMNEL SDFRGNGTLK ITTRATFQLH GVVKHNLKPA IRGMNASLMD TLAACGDVNR
NVMVSALPHN AKVHAQISSI GAKISEHLLP NTTAYHEIWL QGEDESDKPG DRDNWETRKN
GPTKTKTLVA GNALADLEPL YGSTYLPRKF KIVITVPPFN DVDVYAHDIG LIAIVINNEV
IGFNMLVGGG MGSTHNNKKT YPRTGSMFGF VPVDKIHLAC EKVMLVQRDF GDRTNRKHAR
LKYTIDDMGV DIFKGKVEDL LGFKFEEPKE FKIESNIDYF GWCKDELGYN HFTCFIENGR
IEDTAELPQK TGLKNIAEYL NGGRSGEFRL TGNQHLVISN IEDADLTHVK SLLAEYKLDN
TDFSALRLSS SSCVAFPTCG LAMAESERYL PELITKLETS LEEFGLRHDS VVMRMTGCPN
GCARPWLAEV ALVGKAYGAY NLMLGGGHHG ERLNKIYRYS IKEDEILDIL LPLFKRWSLE
REEGEPFGDF VIRIGIIKPT TEGKYFHDDI PEDA