Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_30164 |
Symbol | SPI1.1 |
ID | 4837136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2112353 |
End bp | 2115085 |
Gene Length | 2733 bp |
Protein Length | 910 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388451 |
Product | Putative serine proteinase inhibitor (KU family) |
Protein accession | XP_001382633 |
Protein GI | 150863972 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATAG ACGCCATCAA TCAACTGGAA GGGCCTCCTG GAGACTCTTC CCACTTCAAC GCTTCAAATA GTGCTTTAAA CGAACCGAAA CCGCCAGATA TTGGCCGAAA ACGAAAAACC GACGTCGCAC CCGATGACCT GATGGACCTT GATGGCGAGT CCTCAGATGC CGTAGAAAAC GGCGAATTGG AGCCATCTTT CGTGACAGCA GGATCAGTTA TTGCTGATGA CTCTTTTGAC GATACCCAAG AAGCCTCAAA CGCTTTAAAC ACCTTGAACG AAGGGCCAGA AATGTCGATT ATGGACAGCT TTGAAACCTC AGTTTCGAAA AATTCTTCAC ATCACGTGAT CGAGGAAGAA CACGACCCAG TTTTGGCTGC AAAAGGAGAT ACTGTTATGA TATCTCCAAG TCCTTCTTCC AGTAGCGATT TGTCAAAGCC TACATCCACC AGTGACTCAG TTGAAATTAC AAATGTTTAT ATTAAAGAAA AAAACCCAAA ATCCCAAAAA AATAACCTTT CAGAAAATGA AGAAATCAAA AAAAATCAGG AAAACAGAAA ATCAAAAAAA CAAACCTCAG GAGCTCCTCT TACTGATCTG ATCTGGAACG GTCAGAACGC CTCGCTCCAA CATCACAAAA ACAATTCATC AATTAAACTT CTCTCCAAAG ACAGGTTAAT AAAGGATCTT GAAAAACTCG GATCTGGCTT AAACAAGAAT TGGGAAGAAG ATAAATTCAC AGACGATTTG ACCTTTAAGA AGATCAGGAG GTTAGGGTTT GCCAAAAACT ATGTCACTTC CCACTCACAA CAAGAATGGG CAGGCGAACA GGGCAGAATT GACAGATGGA ACATCCCGCA AGAATTTCTC ATTGATTCAT TGTCACTCCT TGGAGAACAC GAATATCAAA AAAGGGCAAT CGAATTGCGG ATTGATCGGT TTGAGTACAA GGATACCCAA TTAGACCCGT CCAAAGAGTT GTCTTTAAAA GAAAAGCTAG AGGTTATTGA AGACCACTTT TCCCATCTCC CCGAGAAATT TTATGAAACA GCTCTTCAGC TCGAAACCAA ATTGAAACAA GTAAGAGCAG CTAAAACAAA TTTCCTAGAC TTACTTGCCA AGGCGACAGG TGAGGATAAG GAGGCGAGAC CACTGTACCT CAGAGCCATA CATGAAAACG AGATACAAGA AAGAGATCTC ATGGCCCAAT TGACAAAGGA AAACACCCTT CGTGCTATTG TTCCCTTGAT GATGATCAGA CCACTCAGCA ACTCCACTGC CTATATGTCC CTCAGTCCCA TTGAGGGACC TAAAACCTCA ACCCAGGGAC CCAGCCTTCT AATGTCTCTT AAAAAATTGT TAAGACAAGG TTATGAAGGA TCAACTTCCG ATTACTATTT CAAATTGAGC ATGATTCCCT CTAAACCTCT CGCAAACATG TTTCTTGTCG GACTTGAACA CCCATCAGAT CACAAAAACC CAGAGATGGC GATCAGTGAA TTATTCAATA GAGGTTCCAT GCAAATTACA TCGTCCCATA ACCTCAATTT TTCAAAATTC CCATCAGACG TCTACTCAGT GGAAGAAAAA GTCAAGTATG AGATTCTCGT TGCCCATTCA GAGCGCATTC GCAATACATA CTATTTAGAC AAAGACAAAG AAAAGAGTAA ACTGTTCTAC CTCATTGCAA CGGACTCTGG CAAAGTCCCT ACTCGAAAGA ATGCAAATCT TCCGAACTAT AGTCTCGAGA TCATTTCGAC TTTTAAATTC TGCAGTTATT GTCACGAAAG TGATCACCCA ACCTTTAAAT GCACCAAAAC GAACAAAAAC CGCACCACGT ACTATCCACC TAGTACACCA ATGGTAGCTA CTTACACTCT GACCAGAGAA AGTACTACAG ACAATGGCAG TAAATTTCGT AAAGTACGGA CAGGGGCCTT CACCGAAGTC ACCAAGGGCG TAAAACCCCA GAAAATCCAA CAAACAATTG TAAACCATGG ATCAAATTCA TTCATGAACT TGCCTATGGA AGAACCGCCC AGCTCAACCA CGGACCCGAC CAGCACCACG GCCTTTTCGC GTCAGCCAAC AAGTACACCA ACAACCCCTC GCCAACGCCA GACACAAAGC CAAACACCGG CTACAAAGAC GACGACAACT CAAGATCTTG ATTCCGAGGA CATCGTGATC CAGACCACAA CGTCGATCCA GACAGAAGCA AACACCATAA CTCCCCCCAG CACGCCTTTT ACCGTCCAAA CACATAAAAC TACCGACACT CCCTCTTCTA TGGGAGCTTC CTCAACAAGA AGAACTCCAT ACTCAAGACC CATCAACAAG CGTCAAGATG CGACCTCCCC AACCCAGCGA TTGGAGCCGT TACGGGCAAG AAAAACACCC TCTCGCCAAA ACACGGCACA ACTTGCAGAC GCAAGACTGT GGCTGGAAAG ATACCCCCAG CGACTACGAG ACAACCCTCC CAAAACGGGA TCGGTAGATT TCTCTTTCTC CCAGATAGGA TCATCCATAG ACCCTCTCAA ATCCACAAAT ACTACACCTG CCATGATCAG CTCCCCAATC GGATCGTCAT TCAGTATCAC GGAGGCATCA CAACAACAGC TAGCAAACCC CATTCAGTTG AGGTCCGAAG TCCTTCCATC ACAGGATACG ATTCCTGACT GGTCGACGTT ACAGGACACT AGCACCAATG ACACGCCTTC TCAATTAAAT TAA
|
Protein sequence | MDIDAINQSE GPPGDSSHFN ASNSALNEPK PPDIGRKRKT DVAPDDSMDL DGESSDAVEN GELEPSFVTA GSVIADDSFD DTQEASNALN TLNEGPEMSI MDSFETSVSK NSSHHVIEEE HDPVLAAKGD TVMISPSPSS SSDLSKPTST SDSVEITNVY IKEKNPKSQK NNLSENEEIK KNQENRKSKK QTSGAPLTDS IWNGQNASLQ HHKNNSSIKL LSKDRLIKDL EKLGSGLNKN WEEDKFTDDL TFKKIRRLGF AKNYVTSHSQ QEWAGEQGRI DRWNIPQEFL IDSLSLLGEH EYQKRAIELR IDRFEYKDTQ LDPSKELSLK EKLEVIEDHF SHLPEKFYET ALQLETKLKQ VRAAKTNFLD LLAKATGEDK EARPSYLRAI HENEIQERDL MAQLTKENTL RAIVPLMMIR PLSNSTAYMS LSPIEGPKTS TQGPSLLMSL KKLLRQGYEG STSDYYFKLS MIPSKPLANM FLVGLEHPSD HKNPEMAISE LFNRGSMQIT SSHNLNFSKF PSDVYSVEEK VKYEILVAHS ERIRNTYYLD KDKEKSKSFY LIATDSGKVP TRKNANLPNY SLEIISTFKF CSYCHESDHP TFKCTKTNKN RTTYYPPSTP MVATYTSTRE STTDNGSKFR KVRTGAFTEV TKGVKPQKIQ QTIVNHGSNS FMNLPMEEPP SSTTDPTSTT AFSRQPTSTP TTPRQRQTQS QTPATKTTTT QDLDSEDIVI QTTTSIQTEA NTITPPSTPF TVQTHKTTDT PSSMGASSTR RTPYSRPINK RQDATSPTQR LEPLRARKTP SRQNTAQLAD ARSWSERYPQ RLRDNPPKTG SVDFSFSQIG SSIDPLKSTN TTPAMISSPI GSSFSITEAS QQQLANPIQL RSEVLPSQDT IPDWSTLQDT STNDTPSQLN
|
| |