Gene Information |
Locus tag | PICST_29712 |
Symbol | SPI1.2 |
ID | 4836861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 937285 |
End bp | 940041 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388176 |
Product | Putative serine proteinase inhibitor (KU family) with thrombospondin repeats |
Protein accession | XP_001382950 |
Protein GI | 150864216 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
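The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 937285
end_bp = 940041
gene_length_bp = 2757
protein_length_aa = 918

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS holds one codon per residue plus one stop codon.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(span % 3 == 0)                                 # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 2757 bp is 919 codons, which is 918 residues plus the stop.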
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.355589 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACA TCTTGCCAAA AACGATGGAT ATAGACGCCA TCAATCAACT GGAAGGGCCT CCTGGAGACT CTTCCCACTT CAACGCTTCA AATAGTGCTT TAAACGAACC GAAACCGCCA GATATTGGCC GAAAACGAAA AACCGACGTC GCACCCGATG ACCTGATGGA CCTTGATGGC GAGTCCTCAG ATGCCGTAGA AAACGGCGAA TTGGAGCCAT CTTTCGTGAC AGCAGGATCA GTTATGGCTG ATGACTCTTT TGACGATACC CAAGAAGCCT CAAACGCTTT AAACACCTTG AACGAAGGGC CAGAAATGTC GATTATGGAC AGCTTTGAAA CCTCAGTTTC GAAAAATTCT TCACATCACG TGATCGAGGA AGAACACGAC CCACTTTTGG CTGCAAAAGG AGATACTGTT ATGATATCTC CAAGTCCTAC TTCCAGTAGC GATTTGTCAA AGCCTACATC CACCAGTGAC TCAGTTGAAA TTACAAATGT TTATATAAAA GAAAAAAACC CAAAATCCCA AAAAAATAAC CTTTCAGAAA ATGAAGAAAT CAAAAAAAAT CAGGAAAACA GAAAATCAAA AAAACAAACC TCAGGAGCTC CTCTTACTGA TCTGATCTGG AACGGTCAGA ACGCCTCGCT CCAACATCAC AAAAACAATT CATCAATTAA ACTTCTCTCC AAAGACAGGT TAATAAAGGA TCTTGAAAAA CTCGGATCTG GCTTAAACAA GAATTGGGAA GAAGATAAAT TCACAGACGA TTTGACCTTT AAGAAGATCA GGAGGTTAGG GTTTGCCAAA AACTATGTCA CTTCCCACTC ACAACAAGAA TGGGCAGGAG AACAAGGCAG AATTGACAGA TGGAACATCC CGCAAGAATT TCTCATTGAT TCATTGTCAC TCCTTGGAGA ACACGAATAT CAAAAAAGGG CAATCGAATT GAGGATTGAT CGGTTTGAGT ACAAGGATAC CCAATTAGAC CCGTCCAAAG AGTTGTCTTT AAAAGAAAAG CTAGCGGTTA TTGAAGACCA CTTTTCCCAT CTCCCCGAGA AATTTTATGA AACAGCTCTT CAGCTCGAAA CCAAATTGAA ACAAGTAAGA GCAGCTAAAA CAAATTTCCT AGACTTACTT GCCAAGGCGA CAGGTGAGGA TAAGGAGGCG AGACCACTGT ACCTCAGAGC CATACATGAA AACGAGATAC AAGAAAGAGA TCTCATGGCC CAATTGACAA AGGAAAACAC CCTTCGTGCT ATTGTTCCCT TGATGATGAT CAGACCACTC AGCAACTCCA CTGCCTATAT GTCCCTCAGT CCCATTGAGG GACCTAAAAC CTCAACCCAG GGACCCAGCC TTCTAATGTC TCTTAAAAAA TTGTTAAGAC AAGGTTATGA AGGATCAACT TCCGATTACT ATTTCAAATT GAGCATGATT CCCTCTAAAC CTCTCGCAAA CATGTTTCTT GTCGGACTTG AACACCCATC AGATCACAAA AACCCAGAGA TGGCGATCAG TGAATTATTC AATAGAGGTT CCATGCAAAT TACATCGTCC CATAACCTCA ATTTTTCAAA ATTCCCATCA GACGTCTACT CAGTGGAAGA AAAAGTCAAG TATGAGATTC TCGTTGCCCA TTCAGAGCGC ATTCGCAATA CATACTATTT AGACAAAGAC AAAGAAAAGA GTAAACTGTT CTACCTCATT GCAACGGACT CTGGCAAAGT CCCTACTCGA AAGAATGCAA ATCTTCCGAA CTATAGTCTC GAGATCATTT CGACTTTTAA ATTCTGCAGT 
TATTGTCACG AAAGTGATCA CCCAACCTTT AAATGCACCA AAACGAACAA AAACCGCACC ACGTACTATC CACCTAGTAC ACCAATGGTA GCTACTTACA CTCTGACCAG AGAAAGTACT ACAGACAATG GCAGTAAATT TCGTAAAGTA CGGACAGGGG CCTTCACCGA AGTCACCAAG GGCGTAAAAC CCCAGAAAAT CCAACAAACA ATTGTAAACC ATGGATCAAA TTCATTCATG AACTTGCCTA TGGAAGAACC GCCCAGCTCA ACTACGGACC CGACCAGCAC CACGGCCTTT TCGCGTCAGC CAACAAGTAC ACCAACAACC CCTCGCCAAC GCCAGACACA AAGCCAAACA CCGGCTACAA AGACGAAGAC AACTCAAGAT CTTGATTCCG AGGACATCGT GATCCAGACC ACAACGTCGA TCCAGACAGA AGCAAACACC ATAACTCCCC CCAGCACGCC TTTTACCGTC CAAACACATA AAACTACCGA CACTCCCTCT TCTATGGGAG CTTCCTCAAC AAGAAGAACT CCATACTCAA GACCCATCAA CAAGCGTCAA GATGCGACCT CCCCAACCCA GCGATTGGAG CCGTTACGGG CAAGAAAAAC ACCCTCTCGC CAAAACACGG CACAACTTGC AGACGCAAGA CTGTGGCTGG AAAGATACCC CCAGCGACTA CGAGACAACC CTCCCAAAAC GGGATCGGTA GATTTCTCTT TCTCCCAGAT AGGATCATCC ATAGACCCTC TCAAATCCAC AAATACTACA CCTGCCATGA TCAGCTCCCC AATCGGATCG TCATTCAGTA TCACGGAGGC ATCACAACAA CAGCTAGCAA ACCCCATTCA GTTGAGGTCC GAAGTCCTTC CATCACAGGA TACGATTCCT GACTGGTCGA CGTTACAGGA CACTAGCACC AATGACACGC CTTCTCAATT AAATTAA
|
Protein sequence | MADILPKTMD IDAINQSEGP PGDSSHFNAS NSALNEPKPP DIGRKRKTDV APDDSMDLDG ESSDAVENGE LEPSFVTAGS VMADDSFDDT QEASNALNTL NEGPEMSIMD SFETSVSKNS SHHVIEEEHD PLLAAKGDTV MISPSPTSSS DLSKPTSTSD SVEITNVYIK EKNPKSQKNN LSENEEIKKN QENRKSKKQT SGAPLTDSIW NGQNASLQHH KNNSSIKLLS KDRLIKDLEK LGSGLNKNWE EDKFTDDLTF KKIRRLGFAK NYVTSHSQQE WAGEQGRIDR WNIPQEFLID SLSLLGEHEY QKRAIELRID RFEYKDTQLD PSKELSLKEK LAVIEDHFSH LPEKFYETAL QLETKLKQVR AAKTNFLDLL AKATGEDKEA RPSYLRAIHE NEIQERDLMA QLTKENTLRA IVPLMMIRPL SNSTAYMSLS PIEGPKTSTQ GPSLLMSLKK LLRQGYEGST SDYYFKLSMI PSKPLANMFL VGLEHPSDHK NPEMAISELF NRGSMQITSS HNLNFSKFPS DVYSVEEKVK YEILVAHSER IRNTYYLDKD KEKSKSFYLI ATDSGKVPTR KNANLPNYSL EIISTFKFCS YCHESDHPTF KCTKTNKNRT TYYPPSTPMV ATYTSTREST TDNGSKFRKV RTGAFTEVTK GVKPQKIQQT IVNHGSNSFM NLPMEEPPSS TTDPTSTTAF SRQPTSTPTT PRQRQTQSQT PATKTKTTQD LDSEDIVIQT TTSIQTEANT ITPPSTPFTV QTHKTTDTPS SMGASSTRRT PYSRPINKRQ DATSPTQRLE PLRARKTPSR QNTAQLADAR SWSERYPQRL RDNPPKTGSV DFSFSQIGSS IDPLKSTNTT PAMISSPIGS SFSITEASQQ QLANPIQLRS EVLPSQDTIP DWSTLQDTST NDTPSQLN
|
| |
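The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below shows the effect on the first 60 bases of the gene sequence above. It builds the standard code from the usual TCAG-ordered amino-acid string and patches the one codon that differs. The helper names `build_table` and `translate` are illustrative and do not come from any library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(ctg_as_serine):
    """Return a codon -> amino acid map; optionally apply the table 12 change."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"  # the one sense-codon difference in table 12
    return table


def translate(dna, table):
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence shown in the record.
gene_prefix = "ATGGCCGACA TCTTGCCAAA AACGATGGAT ATAGACGCCA TCAATCAACT GGAAGGGCCT"

table12_result = translate(gene_prefix, build_table(True))
standard_result = translate(gene_prefix, build_table(False))

print(table12_result)   # MADILPKTMDIDAINQSEGP
print(standard_result)  # MADILPKTMDIDAINQLEGP
```

The table 12 result matches the first 20 residues of the protein sequence in the record. The standard code would put leucine at position 17 where the record has serine.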