Gene PICST_30164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30164 
SymbolSPI1.1 
ID4837136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2112353 
End bp2115085 
Gene Length2733 bp 
Protein Length910 aa 
Translation table12 
GC content44% 
IMG OID640388451 
ProductPutative serine proteinase inhibitor (KU family) 
Protein accessionXP_001382633 
Protein GI150863972 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATAG ACGCCATCAA TCAACTGGAA GGGCCTCCTG GAGACTCTTC CCACTTCAAC 
GCTTCAAATA GTGCTTTAAA CGAACCGAAA CCGCCAGATA TTGGCCGAAA ACGAAAAACC
GACGTCGCAC CCGATGACCT GATGGACCTT GATGGCGAGT CCTCAGATGC CGTAGAAAAC
GGCGAATTGG AGCCATCTTT CGTGACAGCA GGATCAGTTA TTGCTGATGA CTCTTTTGAC
GATACCCAAG AAGCCTCAAA CGCTTTAAAC ACCTTGAACG AAGGGCCAGA AATGTCGATT
ATGGACAGCT TTGAAACCTC AGTTTCGAAA AATTCTTCAC ATCACGTGAT CGAGGAAGAA
CACGACCCAG TTTTGGCTGC AAAAGGAGAT ACTGTTATGA TATCTCCAAG TCCTTCTTCC
AGTAGCGATT TGTCAAAGCC TACATCCACC AGTGACTCAG TTGAAATTAC AAATGTTTAT
ATTAAAGAAA AAAACCCAAA ATCCCAAAAA AATAACCTTT CAGAAAATGA AGAAATCAAA
AAAAATCAGG AAAACAGAAA ATCAAAAAAA CAAACCTCAG GAGCTCCTCT TACTGATCTG
ATCTGGAACG GTCAGAACGC CTCGCTCCAA CATCACAAAA ACAATTCATC AATTAAACTT
CTCTCCAAAG ACAGGTTAAT AAAGGATCTT GAAAAACTCG GATCTGGCTT AAACAAGAAT
TGGGAAGAAG ATAAATTCAC AGACGATTTG ACCTTTAAGA AGATCAGGAG GTTAGGGTTT
GCCAAAAACT ATGTCACTTC CCACTCACAA CAAGAATGGG CAGGCGAACA GGGCAGAATT
GACAGATGGA ACATCCCGCA AGAATTTCTC ATTGATTCAT TGTCACTCCT TGGAGAACAC
GAATATCAAA AAAGGGCAAT CGAATTGCGG ATTGATCGGT TTGAGTACAA GGATACCCAA
TTAGACCCGT CCAAAGAGTT GTCTTTAAAA GAAAAGCTAG AGGTTATTGA AGACCACTTT
TCCCATCTCC CCGAGAAATT TTATGAAACA GCTCTTCAGC TCGAAACCAA ATTGAAACAA
GTAAGAGCAG CTAAAACAAA TTTCCTAGAC TTACTTGCCA AGGCGACAGG TGAGGATAAG
GAGGCGAGAC CACTGTACCT CAGAGCCATA CATGAAAACG AGATACAAGA AAGAGATCTC
ATGGCCCAAT TGACAAAGGA AAACACCCTT CGTGCTATTG TTCCCTTGAT GATGATCAGA
CCACTCAGCA ACTCCACTGC CTATATGTCC CTCAGTCCCA TTGAGGGACC TAAAACCTCA
ACCCAGGGAC CCAGCCTTCT AATGTCTCTT AAAAAATTGT TAAGACAAGG TTATGAAGGA
TCAACTTCCG ATTACTATTT CAAATTGAGC ATGATTCCCT CTAAACCTCT CGCAAACATG
TTTCTTGTCG GACTTGAACA CCCATCAGAT CACAAAAACC CAGAGATGGC GATCAGTGAA
TTATTCAATA GAGGTTCCAT GCAAATTACA TCGTCCCATA ACCTCAATTT TTCAAAATTC
CCATCAGACG TCTACTCAGT GGAAGAAAAA GTCAAGTATG AGATTCTCGT TGCCCATTCA
GAGCGCATTC GCAATACATA CTATTTAGAC AAAGACAAAG AAAAGAGTAA ACTGTTCTAC
CTCATTGCAA CGGACTCTGG CAAAGTCCCT ACTCGAAAGA ATGCAAATCT TCCGAACTAT
AGTCTCGAGA TCATTTCGAC TTTTAAATTC TGCAGTTATT GTCACGAAAG TGATCACCCA
ACCTTTAAAT GCACCAAAAC GAACAAAAAC CGCACCACGT ACTATCCACC TAGTACACCA
ATGGTAGCTA CTTACACTCT GACCAGAGAA AGTACTACAG ACAATGGCAG TAAATTTCGT
AAAGTACGGA CAGGGGCCTT CACCGAAGTC ACCAAGGGCG TAAAACCCCA GAAAATCCAA
CAAACAATTG TAAACCATGG ATCAAATTCA TTCATGAACT TGCCTATGGA AGAACCGCCC
AGCTCAACCA CGGACCCGAC CAGCACCACG GCCTTTTCGC GTCAGCCAAC AAGTACACCA
ACAACCCCTC GCCAACGCCA GACACAAAGC CAAACACCGG CTACAAAGAC GACGACAACT
CAAGATCTTG ATTCCGAGGA CATCGTGATC CAGACCACAA CGTCGATCCA GACAGAAGCA
AACACCATAA CTCCCCCCAG CACGCCTTTT ACCGTCCAAA CACATAAAAC TACCGACACT
CCCTCTTCTA TGGGAGCTTC CTCAACAAGA AGAACTCCAT ACTCAAGACC CATCAACAAG
CGTCAAGATG CGACCTCCCC AACCCAGCGA TTGGAGCCGT TACGGGCAAG AAAAACACCC
TCTCGCCAAA ACACGGCACA ACTTGCAGAC GCAAGACTGT GGCTGGAAAG ATACCCCCAG
CGACTACGAG ACAACCCTCC CAAAACGGGA TCGGTAGATT TCTCTTTCTC CCAGATAGGA
TCATCCATAG ACCCTCTCAA ATCCACAAAT ACTACACCTG CCATGATCAG CTCCCCAATC
GGATCGTCAT TCAGTATCAC GGAGGCATCA CAACAACAGC TAGCAAACCC CATTCAGTTG
AGGTCCGAAG TCCTTCCATC ACAGGATACG ATTCCTGACT GGTCGACGTT ACAGGACACT
AGCACCAATG ACACGCCTTC TCAATTAAAT TAA
 
Protein sequence
MDIDAINQSE GPPGDSSHFN ASNSALNEPK PPDIGRKRKT DVAPDDSMDL DGESSDAVEN 
GELEPSFVTA GSVIADDSFD DTQEASNALN TLNEGPEMSI MDSFETSVSK NSSHHVIEEE
HDPVLAAKGD TVMISPSPSS SSDLSKPTST SDSVEITNVY IKEKNPKSQK NNLSENEEIK
KNQENRKSKK QTSGAPLTDS IWNGQNASLQ HHKNNSSIKL LSKDRLIKDL EKLGSGLNKN
WEEDKFTDDL TFKKIRRLGF AKNYVTSHSQ QEWAGEQGRI DRWNIPQEFL IDSLSLLGEH
EYQKRAIELR IDRFEYKDTQ LDPSKELSLK EKLEVIEDHF SHLPEKFYET ALQLETKLKQ
VRAAKTNFLD LLAKATGEDK EARPSYLRAI HENEIQERDL MAQLTKENTL RAIVPLMMIR
PLSNSTAYMS LSPIEGPKTS TQGPSLLMSL KKLLRQGYEG STSDYYFKLS MIPSKPLANM
FLVGLEHPSD HKNPEMAISE LFNRGSMQIT SSHNLNFSKF PSDVYSVEEK VKYEILVAHS
ERIRNTYYLD KDKEKSKSFY LIATDSGKVP TRKNANLPNY SLEIISTFKF CSYCHESDHP
TFKCTKTNKN RTTYYPPSTP MVATYTSTRE STTDNGSKFR KVRTGAFTEV TKGVKPQKIQ
QTIVNHGSNS FMNLPMEEPP SSTTDPTSTT AFSRQPTSTP TTPRQRQTQS QTPATKTTTT
QDLDSEDIVI QTTTSIQTEA NTITPPSTPF TVQTHKTTDT PSSMGASSTR RTPYSRPINK
RQDATSPTQR LEPLRARKTP SRQNTAQLAD ARSWSERYPQ RLRDNPPKTG SVDFSFSQIG
SSIDPLKSTN TTPAMISSPI GSSFSITEAS QQQLANPIQL RSEVLPSQDT IPDWSTLQDT
STNDTPSQLN