Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_80311 |
Symbol | SEC31 |
ID | 4851057 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 837981 |
End bp | 842128 |
Gene Length | 4148 bp |
Protein Length | 1244 aa |
Translation table | |
GC content | 47% |
IMG OID | 640392765 |
Product | component of the COPII coat of ER-Golgi vesicles |
Protein accession | XP_001387795 |
Protein GI | 126274043 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.812288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ACACAATCCA GTGCACGACA CAGGCGAGAC TCCAAACATA CCAGCACAAG CCTGTGACGC GAACCCTGCT CGCGACTTTT GTCTGAACAT TCTCAATTGA CTCTTAGTCT TCTCTTCTAG TGCTCAAAGA TATAATACAA AAGACATAGT ACTATTAGCA CGCCAGGATA TATATAGGTA AGAAAAATGG TCAAGATCTC TGAAATCGCT CGAACTTCGA CGTTTGCTTG GAGTCTGGAT ACTTTACCTA TTCTTGCCAC TGGGACCGTT GCCGGAGCCG TAGATGCTAG CTTCAACTCT CTGCTGCTGT TGGAACTCTG GGACATTTTT TCAGCTACCA ACACCAACGA GCCCATTTTT AGTGCAGCTG TAGAACACCG TTTCTACGCT TTAGCATGGT CGAAACCGTT TGAAGGACGT CCCAGGGGTT TGATTGCTGC TGCCTTTGAG AACGGAGTGA TCGAGTTCTG GGATGCCGAA GTTCTCATCA TACTGAAAGA CTTGGCCAAA GCTTCCGTCC ACAAGCTGAG CAAACACTCT GGACCCGTCA GAAGTTTGCA GTTCAACCCT CTTCAGAGTC ATGTGTTGGT TTCCGGAGGA TCTCACGGCC AGATCTTCAT ATGGGATACA AAGAAGTTCA CCGAACCCTT CTCTCCGGGA CTGGCTATGA CTCCTATGGA CGAAATCAGC TCCGTAGCCT GGAACAACTC GGTCAGCCAC ATTCTCGCCA GTACAGGAAA TAGTGGCTAC ACGTCGATCT GGGACTTGAA ATCAAAGCGA GAAGTGTTGC ACTTGTCGTA CACTGGTGCA TCAGGAAGAG CCAACTTCTC CCATGTTGCT TGGCATCCTA CTAAGCTGAC CGAATTAATC ACCGCCAGTG ACAATGATGC CTGCCCATTG ATATTGACGT GGGATTTACG TAACTCCAAC GCTCCAGAGA AGATCTTAGA AGGGCACAAG AAGGGAGTTT TGTCGCTTGA CTGGTGCCAA CAGGATCCGG AGCTCTTGAT CTCCAGCGGA AAAGACAACA CAACTTTCTT GTGGAACCCT ACTACTGGAC AGAAACTTGG TGAATACCCT ACTACAGCCA ACTGGGCTTT CCAAACTGCT TTTGCACCAA AAGTTCCTGA TATCTTTGCC ACAGCTTCGT TTGACGGCAA GATCGTCGTG CAGTCGCTCC AAGATACCTC TCCTCCAGTG TCGGAAAAAG TCACATCCAA CGATGACAAC GTCTTCTGGA ATCAGTTGTC TACAACTGAC ACCCAGCAGC CGGTATTTGA TATCAAACAG GCTCCTCAGT GGTTGAAGAC GCCTTCTGCT GTGTCATTTG GCTTTGGTTC CAAGTTGGTG CAGGTGCTGA AAGACAGCAA CGGCAAGTCC ATTATCAATA TCCAAAAGTT TGTCGCCAAA GGACAAAGTT CATCATCTGA ATTATATACT GCTCTCAAGA ACAACAACTT CAAATCAATC ATAGACGAGA AGATCTCGTC GAACGTTGCC AGCGACCTTG ACAAGAGCGA CTGGAAGCTC TTACAGAAGT TGGCTGAATC CGGAAAGGAT GAAATCTTGA CTGAGGTTAC TACAGAAGAA GAAGAAAAGA AGCCAGAAAC TGAAATTGAA CTGGAAGATA AGAAGAACGG TGATTCTGAA GACGTTCCAG CTTCTGCTGA TGATTCCTTC TTTGACAATC TTGGAAACGG CAAGGTAGTT CTTGAAAATG AAGCACCTTT TGTTCCATCG GGTTCGTTCA AGATCTTTTC CGCCAAAGTC AGCGAAGAGG ACAAGTCTTT GATCAAATTG GTTTTGGGCA ACAAAATCGA AGATGCCGTC CATGACTGTA TCGAAAGAGG CAAATTGTTG GAAGCATTGG TGTTGGCCTT GGATGCTTCT GATGACATCA AGGAAAAAGT TAAAAACGCC TACTTTAAGA AGAACGTCAA AAAGGAAGTG TCGAGAGTTC TCTATTCGGT CTCATCGCAG AATATCACCG ACATTGTATC CAACGCCAAT GTAGCCAACT GGAAGGAAAT CGCTGCTGGC ATCACCTCCT TCACCAACGA TCCGGACGAT TTTAACAGCA AGATCACCGA GTTGGGTGAT CGTATCTTGG AATCGAAGAC TGTAGCTGAT AGCAGAGACA CTGCCATCAG ATGCTACTTG GCTGGTAATG CTTTGGACAA AATTGCAAGT ATCTGGTTGA AGGAGTTGCC AGCCTTGGAG GCTCATTTGT TGGAGTCCGA CAACGCTGAA AACGTCTCAT CTCCTTCCGA AGCTCGCCTC ATCGCTTTGA CTAACTTTGT TCTGAAGATT GCAGCCTACA GATCGATTTC TAACATCAGC GGCGAGATCT CTGGCCCTTC AGCTGAACCT ATCTCCAAGG CTATTGTAGA GTACACAAAT TTGGTAGCTG GTAACGGAGA GTTTGAATTG GCCAACATAT TCTTGCAATT GTTGCCCAGT GACTTGGCAG GAACCGAAAA GGACAGAATC AACAAGGCTA CTGGTGCCGT TGCTGCTGTT ACTGCTTCCA AAACTGTGAA ATCTGGCACA AGCGCTGTTG CTAACTCTGT CACCGCTAAG ACTAGCAAGG TTTCGAGAGG TCCTTATGGT AGGACGACCC CAGTGATTCC AGAAGTCCTG TCAACTCCAA AACCTTCGTA CCAGGCAACG ATGCCTCCTA TTGGAGCTCC ACTTGCTCCC TCAGCTATTC CATCTGCCAG TGTTCCAAGC AGCAATCCAT ATGTCAGAGC TTCTAACCCA TATGCTCCTC ATGTGTCCTC AACCAACATC TACAAACCTG CTGCTCCAGT CGTACAAGCA CCACCTCCAG CCCAAGCTAC AGCAGTAAGT CCGCCTCCTA CTGGCCCACC TAAGCCGGTA TACAAACAGG AAACTGACGG TTGGAACGAT TTGCCAGATA CGTTCAAATC GAAGGCTCCA GCACGCCGAG CTGCTGCTGT TGTTACAGCT ACACCATCTC CTACACCATT ACCTCAAACT ACAGTTCCAC CAATGTCCAT TCCTCCAGGT CCGAAGAGAT CGATGTCAAG CGGAAGTGCT GCTCCTCCTC CACCAAAAGG TAGCCGTGCT AATTCTAAAG TAGCTGTACC CACTATCCAG TCGTCACCAC GTCCTGCTCC TGTTCATGTT AACAATCGTT ACGCTCCACC TCCTTCAGCA GATGTGAATG CTCCGTCCAA CACTCACTCT TCTCCAGTAG GAGTATCTCC TTCTACCAAG AAGAACCCGT ACGCTGTAGC ACCAGAGGTA GCTCCTAGAG TAGCCTACGC TCCTCCTCCA GCTTCTCTTT CAGGATTAGG ATTCTCCGGA GGCGCTGCTG CACCACCAGC ACCCCCAAAG AATCCATATG CTCCTCTGGC CAGTTCTGTA ATTTCTCCTA GAGTTAGCAA CGCTGGTATA GTACCACCTC CTATGGGTAG AGGCATTGTA TCTCCTCCAA CTTCGTTTGG CTCAATGCAC GCAGCCCCTA TTCAGCCAGC TTTTAGCGGA GTCCCACCAC CACCTCCAGC TATCGGGCAC CAGCCAGCTG CTTCTGCCCC TCCTCCTCCT CCGGCAGCTA AAACTCCAGT TCCAACCAAG AGTAAATATC CCAAAGGAGA CAGGTCTCAC ATCCCCGAAA AGTCGGTTTT GATCTACCAA TATTTGACCA AGGTGCTTGA GGCTGTCAAG CCTAACATCC CAGAAAAGTA CACGGCTCAC GGCGAGGACT TGGAAAAGAG ACTCAACATA TTGTTTGACC ACTTGAACAA TGAAGACTTG TTAACCGACG ATGCCATTGA AGACTTGAAG GAAGTTTGCA CAGCATTGGA AAGCAAGGAT ATTGAATCTG CCAGCTCATT GAACACTCTG TTTGCTGCCA ACCATATCGA TCAACTCGGT AATTGGCATA GAGGTATCAC CCGTCTCATT ACCATGGCTG AAGCTATGTA TTAGCGCAAA ATAACTTTTT TTAATGTATA GTCATCTCCA CTCGGTGCAT TATTAGATAT GCTAACTATG CCTACCATAG AAATAACTAA AATGATTTTT GTCAAGTTTC TAGACATGTA CAAATATAGT AGAGTCACCC CTAAAAGATT ACAACATTGT CTAAAAATGT TGTAGAATAG ACATTTTGTT AACGTTAG
|
Protein sequence | MVKISEIART STFAWSLDTL PILATGTVAG AVDASFNSLL LLELWDIFSA TNTNEPIFSA AVEHRFYALA WSKPFEGRPR GLIAAAFENG VIEFWDAEVL IILKDLAKAS VHKLSKHSGP VRSLQFNPLQ SHVLVSGGSH GQIFIWDTKK FTEPFSPGLA MTPMDEISSV AWNNSVSHIL ASTGNSGYTS IWDLKSKREV LHLSYTGASG RANFSHVAWH PTKLTELITA SDNDACPLIL TWDLRNSNAP EKILEGHKKG VLSLDWCQQD PELLISSGKD NTTFLWNPTT GQKLGEYPTT ANWAFQTAFA PKVPDIFATA SFDGKIVVQS LQDTSPPVSE KVTSNDDNVF WNQLSTTDTQ QPVFDIKQAP QWLKTPSAVS FGFGSKLVQV LKDSNGKSII NIQKFVAKGQ SSSSELYTAL KNNNFKSIID EKISSNVASD LDKSDWKLLQ KLAESGKDEI LTEVTTEEEE KKPETEIELE DKKNGDSEDV PASADDSFFD NLGNGKVVLE NEAPFVPSGS FKIFSAKVSE EDKSLIKLVL GNKIEDAVHD CIERGKLLEA LVLALDASDD IKEKVKNAYF KKNVKKEVSR VLYSVSSQNI TDIVSNANVA NWKEIAAGIT SFTNDPDDFN SKITELGDRI LESKTVADSR DTAIRCYLAG NALDKIASIW LKELPALEAH LLESDNAENV SSPSEARLIA LTNFVLKIAA YRSISNISGE ISGPSAEPIS KAIVEYTNLV AGNGEFELAN IFLQLLPSDL AGTEKDRINK ATGAVAAVTA SKTVKSGTSA VANSVTAKTS KVSREVLSTP KPSYQATMPP IGAPLAPSAI PSASVPSSNP YVRASNPYAP HVSSTNIYKP AAPVVQAPPP AQATAVSPPP TGPPKPVYKQ ETDGWNDLPD TFKSKAPARR AAAVVTATPS PTPLPQTTVP PMSIPPGPKR SMSSGSAAPP PPKGSRANSK VAVPTIQSSP RPAPVHVNNR YAPPPSADVN APSNTHSSPV GVSPSTKKNP YAVAPEVAPR VAYAPPPASL SGLGFSGGAA APPAPPKNPY APLASSVISP RVSNAGIVPP PMGRGIVSPP TSFGSMHAAP IQPAFSGVPP PPPAIGHQPA ASAPPPPPAA KTPVPTKSKY PKGDRSHIPE KSVLIYQYLT KVLEAVKPNI PEKYTAHGED LEKRLNILFD HLNNEDLLTD DAIEDLKEVC TALESKDIES ASSLNTLFAA NHIDQLGNWH RGITRLITMA EAMY
|
| |