Gene PICST_29712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29712 
SymbolSPI1.2 
ID4836861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp937285 
End bp940041 
Gene Length2757 bp 
Protein Length918 aa 
Translation table12 
GC content44% 
IMG OID640388176 
ProductPutative serine proteinase inhibitor (KU family) with thrombospondin repeats 
Protein accessionXP_001382950 
Protein GI150864216 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.355589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACA TCTTGCCAAA AACGATGGAT ATAGACGCCA TCAATCAACT GGAAGGGCCT 
CCTGGAGACT CTTCCCACTT CAACGCTTCA AATAGTGCTT TAAACGAACC GAAACCGCCA
GATATTGGCC GAAAACGAAA AACCGACGTC GCACCCGATG ACCTGATGGA CCTTGATGGC
GAGTCCTCAG ATGCCGTAGA AAACGGCGAA TTGGAGCCAT CTTTCGTGAC AGCAGGATCA
GTTATGGCTG ATGACTCTTT TGACGATACC CAAGAAGCCT CAAACGCTTT AAACACCTTG
AACGAAGGGC CAGAAATGTC GATTATGGAC AGCTTTGAAA CCTCAGTTTC GAAAAATTCT
TCACATCACG TGATCGAGGA AGAACACGAC CCACTTTTGG CTGCAAAAGG AGATACTGTT
ATGATATCTC CAAGTCCTAC TTCCAGTAGC GATTTGTCAA AGCCTACATC CACCAGTGAC
TCAGTTGAAA TTACAAATGT TTATATAAAA GAAAAAAACC CAAAATCCCA AAAAAATAAC
CTTTCAGAAA ATGAAGAAAT CAAAAAAAAT CAGGAAAACA GAAAATCAAA AAAACAAACC
TCAGGAGCTC CTCTTACTGA TCTGATCTGG AACGGTCAGA ACGCCTCGCT CCAACATCAC
AAAAACAATT CATCAATTAA ACTTCTCTCC AAAGACAGGT TAATAAAGGA TCTTGAAAAA
CTCGGATCTG GCTTAAACAA GAATTGGGAA GAAGATAAAT TCACAGACGA TTTGACCTTT
AAGAAGATCA GGAGGTTAGG GTTTGCCAAA AACTATGTCA CTTCCCACTC ACAACAAGAA
TGGGCAGGAG AACAAGGCAG AATTGACAGA TGGAACATCC CGCAAGAATT TCTCATTGAT
TCATTGTCAC TCCTTGGAGA ACACGAATAT CAAAAAAGGG CAATCGAATT GAGGATTGAT
CGGTTTGAGT ACAAGGATAC CCAATTAGAC CCGTCCAAAG AGTTGTCTTT AAAAGAAAAG
CTAGCGGTTA TTGAAGACCA CTTTTCCCAT CTCCCCGAGA AATTTTATGA AACAGCTCTT
CAGCTCGAAA CCAAATTGAA ACAAGTAAGA GCAGCTAAAA CAAATTTCCT AGACTTACTT
GCCAAGGCGA CAGGTGAGGA TAAGGAGGCG AGACCACTGT ACCTCAGAGC CATACATGAA
AACGAGATAC AAGAAAGAGA TCTCATGGCC CAATTGACAA AGGAAAACAC CCTTCGTGCT
ATTGTTCCCT TGATGATGAT CAGACCACTC AGCAACTCCA CTGCCTATAT GTCCCTCAGT
CCCATTGAGG GACCTAAAAC CTCAACCCAG GGACCCAGCC TTCTAATGTC TCTTAAAAAA
TTGTTAAGAC AAGGTTATGA AGGATCAACT TCCGATTACT ATTTCAAATT GAGCATGATT
CCCTCTAAAC CTCTCGCAAA CATGTTTCTT GTCGGACTTG AACACCCATC AGATCACAAA
AACCCAGAGA TGGCGATCAG TGAATTATTC AATAGAGGTT CCATGCAAAT TACATCGTCC
CATAACCTCA ATTTTTCAAA ATTCCCATCA GACGTCTACT CAGTGGAAGA AAAAGTCAAG
TATGAGATTC TCGTTGCCCA TTCAGAGCGC ATTCGCAATA CATACTATTT AGACAAAGAC
AAAGAAAAGA GTAAACTGTT CTACCTCATT GCAACGGACT CTGGCAAAGT CCCTACTCGA
AAGAATGCAA ATCTTCCGAA CTATAGTCTC GAGATCATTT CGACTTTTAA ATTCTGCAGT
TATTGTCACG AAAGTGATCA CCCAACCTTT AAATGCACCA AAACGAACAA AAACCGCACC
ACGTACTATC CACCTAGTAC ACCAATGGTA GCTACTTACA CTCTGACCAG AGAAAGTACT
ACAGACAATG GCAGTAAATT TCGTAAAGTA CGGACAGGGG CCTTCACCGA AGTCACCAAG
GGCGTAAAAC CCCAGAAAAT CCAACAAACA ATTGTAAACC ATGGATCAAA TTCATTCATG
AACTTGCCTA TGGAAGAACC GCCCAGCTCA ACTACGGACC CGACCAGCAC CACGGCCTTT
TCGCGTCAGC CAACAAGTAC ACCAACAACC CCTCGCCAAC GCCAGACACA AAGCCAAACA
CCGGCTACAA AGACGAAGAC AACTCAAGAT CTTGATTCCG AGGACATCGT GATCCAGACC
ACAACGTCGA TCCAGACAGA AGCAAACACC ATAACTCCCC CCAGCACGCC TTTTACCGTC
CAAACACATA AAACTACCGA CACTCCCTCT TCTATGGGAG CTTCCTCAAC AAGAAGAACT
CCATACTCAA GACCCATCAA CAAGCGTCAA GATGCGACCT CCCCAACCCA GCGATTGGAG
CCGTTACGGG CAAGAAAAAC ACCCTCTCGC CAAAACACGG CACAACTTGC AGACGCAAGA
CTGTGGCTGG AAAGATACCC CCAGCGACTA CGAGACAACC CTCCCAAAAC GGGATCGGTA
GATTTCTCTT TCTCCCAGAT AGGATCATCC ATAGACCCTC TCAAATCCAC AAATACTACA
CCTGCCATGA TCAGCTCCCC AATCGGATCG TCATTCAGTA TCACGGAGGC ATCACAACAA
CAGCTAGCAA ACCCCATTCA GTTGAGGTCC GAAGTCCTTC CATCACAGGA TACGATTCCT
GACTGGTCGA CGTTACAGGA CACTAGCACC AATGACACGC CTTCTCAATT AAATTAA
 
Protein sequence
MADILPKTMD IDAINQSEGP PGDSSHFNAS NSALNEPKPP DIGRKRKTDV APDDSMDLDG 
ESSDAVENGE LEPSFVTAGS VMADDSFDDT QEASNALNTL NEGPEMSIMD SFETSVSKNS
SHHVIEEEHD PLLAAKGDTV MISPSPTSSS DLSKPTSTSD SVEITNVYIK EKNPKSQKNN
LSENEEIKKN QENRKSKKQT SGAPLTDSIW NGQNASLQHH KNNSSIKLLS KDRLIKDLEK
LGSGLNKNWE EDKFTDDLTF KKIRRLGFAK NYVTSHSQQE WAGEQGRIDR WNIPQEFLID
SLSLLGEHEY QKRAIELRID RFEYKDTQLD PSKELSLKEK LAVIEDHFSH LPEKFYETAL
QLETKLKQVR AAKTNFLDLL AKATGEDKEA RPSYLRAIHE NEIQERDLMA QLTKENTLRA
IVPLMMIRPL SNSTAYMSLS PIEGPKTSTQ GPSLLMSLKK LLRQGYEGST SDYYFKLSMI
PSKPLANMFL VGLEHPSDHK NPEMAISELF NRGSMQITSS HNLNFSKFPS DVYSVEEKVK
YEILVAHSER IRNTYYLDKD KEKSKSFYLI ATDSGKVPTR KNANLPNYSL EIISTFKFCS
YCHESDHPTF KCTKTNKNRT TYYPPSTPMV ATYTSTREST TDNGSKFRKV RTGAFTEVTK
GVKPQKIQQT IVNHGSNSFM NLPMEEPPSS TTDPTSTTAF SRQPTSTPTT PRQRQTQSQT
PATKTKTTQD LDSEDIVIQT TTSIQTEANT ITPPSTPFTV QTHKTTDTPS SMGASSTRRT
PYSRPINKRQ DATSPTQRLE PLRARKTPSR QNTAQLADAR SWSERYPQRL RDNPPKTGSV
DFSFSQIGSS IDPLKSTNTT PAMISSPIGS SFSITEASQQ QLANPIQLRS EVLPSQDTIP
DWSTLQDTST NDTPSQLN