Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_28139 |
Symbol | SPI1.3 |
ID | 4850918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 437339 |
End bp | 440071 |
Gene Length | 2733 bp |
Protein Length | 910 aa |
Translation table | |
GC content | 44% |
IMG OID | 640392626 |
Product | Putative serine proteinase inhibitor (KU family) |
Protein accession | XP_001387724 |
Protein GI | 126273879 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATAG ACGCCATCAA TCAACTGGAA GGGCCTCCTG GAGACTCTTC CCACTTCAAC GCTTCAAATA GTGCTTTAAA CGAACCGAAA CCGCCAGATA TTGGCCGAAA ACGAAAAACC GACGTCGCAC CCGATGACCT GATGGACCTT GATGGCGAGT CCTCAGATGC CGTAGAAAAC GGCGAATTGG AGCCATCTTT CGTGACAGCA GGATCAGTTA TTGCTGATGA CTCTTTTGAC GATACCCAAG AAGCCTCAAA CGCTTTAAAC ACCTTGAACG AAGGGCCAGA AATGTCGATT ATGGACAGCT TTGAAACCTC AGTTTCGAAA AATTCTTCAC ATCACGTGAT CGAGGAAGAA CACGACCCAG TTTTGGCTGC AAAAGGAGAT ACTGTTATGA TATCTCCAAG TCCTACTTCC AGTAGCGATT TGTCAAAGCC TACATCCACC AGTGACTCAG TTGAAATTAC AAATGTTTAT ATAAAAGAAA AAAACCCAAA ATCCCAAAAA AATAACCTTT CAGAAAATGA AGAAATCAAA AAAAATCAGG AAAACAGAAA ATCAAAAAAA CAAACCTCAG GAGCTCCTCT TACTGATCTG ATCTGGAACG GTCAGAACGC CTCGCTCCAA CATCACAAAA ACAATTCATC AATTAAACTT CTCTCCAAAG ACAGGTTAAT AAAGGATCTT GAAAAACTCG GATCTGGCTT AAACAAGAAT TGGGAAGAAG ATAAATTCAC AGACGATTTG ACCTTTAAGA AGATCAGGAG GTTAGGGTTT GCCAAAAACT ATGTCACTTC CCACTCACAA CAAGAATGGG CAGGAGAACA AGGCAGAATT GACAGATGGA ACATCCCGCA AGAATTTCTC ATTGATTCAT TGTCACTCCT TGGAGAACAC GAATATCAAA AAAGGGCAAT CGAATTGAGG ATTGATCGGT TTGAGTACAA GGATACCCAA TTAGACCCGT CCAAAGAGTT GTCTTTAAAA GAAAAGCTAG CGGTTATTGA AGACCACTTT TCCCATCTCC CCGAGAAATT TTATGAAACA GCTCTTCAGC TCGAAACCAA ATTGAAACAA GTAAGAGCAG CTAAAACAAA TTTCCTAGAC TTACTTGCCA AGGCGACAGG TGAGGATAAG GAGGCGAGAC CACTGTACCT CAGAGCCATA CATGAAAACG AGATACAAGA AAGAGATCTC ATGGCCCAAT TGACAAAGGA AAACACCCTT CGTGCTATTG TTCCCTTGAT GATGATCAGA CCACTCAGCA ACTCCACTGC CTATATGTCC CTCAGTCCCA TTGAGGGACC TAAAACCTCA ACCCAGGGAC CCAGCCTTCT AATGTCTCTT AAAAAATTGT TAAGACAAGG TTATGAAGGA TCAACTTCCG ATTACTATTT CAAATTGAGC ATGATTCCCT CTAAACCTCT CGCAAACATG TTTCTTGTCG GACTTGAACA CCCATCAGAT CACAAAAACC CAGAGATGGC GATCAGTGAA TTATTCAATA GAGGTTCCAT GCAAATTACA TCGTCCCATA ACCTCAATTT TTCAAAATTC CCATCAGACG TCTACTCAGT GGAAGAAAAA GTCAAGTATG AGATTCTCGT TGCCCATTCA GAGCGCATTC GCAATACATA CTATTTAGAC AAAGACAAAG AAAAGAGTAA ACTGTTCTAC CTCATTGCAA CGGACTCTGG CAAAGTCCCT ACTCGAAAGA ATGCAAATCT TCCGAACTAT AGTCTCGAGA TCATTTCGAC TTTTAAATTC TGCAGTTATT GTCACGAAAG TGATCACCCA ACCTTTAAAT GCACCAAAAC GAACAAAAAC CGCACCACGT ACTATCCACC TAGTACACCA ATGGTAGCTA CTTACACTCT GACCAGAGAA AGTACTACAG ACAATGGCAG TAAATTTCGT AAAGTACGGA CAGGGGCCTT CACCGAAGTC ACCAAGGGCG TAAAACCCCA GAAAATCCAA CAAACAATTG TAAACCATGG ATCAAATTCA TTCATGAACT TGCCTATGGA AGAACCGCCC AGCTCAACTA CGGACCCGAC CAGCACCACG GCCTTTTCGC GTCAGCCAAC AAGTACACCA ACAACCCCTC GCCAACGCCA GACACAAAGC CAAACACCGG CTACAAAGAC GAAGACAACT CAAGATCTTG ATTCCGAGGA CATCGTGATC CAGACCACAA CGTCGATCCA GACAGAAGCA AACACCATAA CTCCCCCCAG CACGCCTTTT ACCGTCCAAA CACATAAAAC TACCGACACT CCCTCTTCTA TGGGAGCTTC CTCAACAAGA AGAACTCCAT ACTCAAGACC CATCAACAAG CGTCAAGATG CGACCTCCCC AACCCAGCGA TTGGAGCCGT TACGGGCAAG AAAAACACCC TCTCGCCAAA ACACGGCACA ACTTGCAGAC GCAAGACTGT GGCTGGAAAG ATACCCCCAG CGACTACGAG ACAACCCTCC CAAAACGGGA TCGGTAGATT TCTCTTTCTC CCAGATAGGA TCATCCATAG ACCCTCTCAA ATCCACAAAT ACTACACCTG CCATGATCAG CTCCCCAATC GGATCGTCAT TCAGTATCAC GGAGGCATCA CAACAACAGC TAGCAAACCC CATTCAGTTG AGGTCCGAAG TCCTTCCATC ACAGGATACG ATTCCTGACT GGTCGACGTT ACAGGACACT AGCACCAATG ACACGCCTTC TCAATTAAAT TAA
|
Protein sequence | MDIDAINQLE GPPGDSSHFN ASNSALNEPK PPDIGRKRKT DVAPDDLMDL DGESSDAVEN GELEPSFVTA GSVIADDSFD DTQEASNALN TLNEGPEMSI MDSFETSVSK NSSHHVIEEE HDPVLAAKGD TVMISPSPTS SSDLSKPTST SDSVEITNVY IKEKNPKSQK NNLSENEEIK KNQENRKSKK QTSGAPLTDL IWNGQNASLQ HHKNNSSIKL LSKDRLIKDL EKLGSGLNKN WEEDKFTDDL TFKKIRRLGF AKNYVTSHSQ QEWAGEQGRI DRWNIPQEFL IDSLSLLGEH EYQKRAIELR IDRFEYKDTQ LDPSKELSLK EKLAVIEDHF SHLPEKFYET ALQLETKLKQ VRAAKTNFLD LLAKATGEDK EARPLYLRAI HENEIQERDL MAQLTKENTL RAIVPLMMIR PLSNSTAYMS LSPIEGPKTS TQGPSLLMSL KKLLRQGYEG STSDYYFKLS MIPSKPLANM FLVGLEHPSD HKNPEMAISE LFNRGSMQIT SSHNLNFSKF PSDVYSVEEK VKYEILVAHS ERIRNTYYLD KDKEKSKLFY LIATDSGKVP TRKNANLPNY SLEIISTFKF CSYCHESDHP TFKCTKTNKN RTTYYPPSTP MVATYTLTRE STTDNGSKFR KVRTGAFTEV TKGVKPQKIQ QTIVNHGSNS FMNLPMEEPP SSTTDPTSTT AFSRQPTSTP TTPRQRQTQS QTPATKTKTT QDLDSEDIVI QTTTSIQTEA NTITPPSTPF TVQTHKTTDT PSSMGASSTR RTPYSRPINK RQDATSPTQR LEPLRARKTP SRQNTAQLAD ARLWLERYPQ RLRDNPPKTG SVDFSFSQIG SSIDPLKSTN TTPAMISSPI GSSFSITEAS QQQLANPIQL RSEVLPSQDT IPDWSTLQDT STNDTPSQLN
|
| |