Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_91200 |
Symbol | |
ID | 4840948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 518342 |
End bp | 521272 |
Gene Length | 2931 bp |
Protein Length | 947 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640392263 |
Product | predicted protein |
Protein accession | XP_001386493 |
Protein GI | 150866782 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5096] Vesicle coat complex, various subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAGT CTGGATACAC ATTGATCTAC GACCCCAATG CGGCGTCCAA GACCTCAGTC AATGAGTTCA AAACTCTCCT CGAAAAAGGA AAAGATGAAG CCAAGGTTGA GGCCATGAAG AAGATCTTGA TTCTGATTCT CAACGGTGAC CCCATGCCCG ATTTGTTGAT GCACATCATC AGATTTGTGA TGCCCTCCAA AAACAAGGAG TTGAAGAAGT TGCTCTACCA CTATTGGGAA GTTTGTCCCA AGATGGACGA TCAGGGCAAA ATGAGACACG AGATGATTCT TGTTTGTAAC GCTATACAGA GAGATTTACA ACATCCTAAC GAGTACATCC GTGGAAATAC GTTGAGGTAC TTGACGAAGT TGAAGGAGGC AGAATTGTTG GAAACTTTGG TACCAAATGT CCGCCAGTGT CTTGAACACA GACACGCCTA TGTGAGAAAG AATGCTGTGT TTGCGCTTCA TTCCATCCAC AAGGTCAACG ACCATTTGGT TCCCGATGCG GACGAGCTTA TCTACAAGTT TTTGTATGAG GAAAGTGACT CGGTTTGTAA GAGAAATGCT TTTGTCTGTC TTGGAGACTT GAACAGAGAC GCATCCTTAC AGTATATCCA AGATAACATT TCCATTATCG AGACGTTGGA TCCGTTGTTA CAGCTTGCCT TCATCGAGTT CATCCGTAAG GATTCGGTCC AGAACCCAGT GTTGAAGTCT CAGTACACGA ACTTGGTGAC AGATATCATT GAAAGTTCTT CCAACGTAGT TGTATACGAA GCAGCCAATG CCTTGACGGT CTTGTCCAAC AATCCACAGT CTATCTTGTT GGCTGGTAGC AAGTACGTCG AGTTGGCCAC CAAAGAAGCT GACAATAATG TCAAGATCAT CACTCTTGAG AGAATTAACG ACCTCCACAA AAAGAACCCG GGTGTTCTTC AAGAGTTATC TTTGGAAATC TTGAGAGTAT TATCATCGCA AGACTTAGAC GTACGTAAAA AGGCTCTTGA CGTTACTTTG CAGTTCGTTA CCAGCAGAAA TGTAGAAGAT GTGGTGAAGT TGTTGAAGAC TGAGTTGCAG AGAACTTCGT CGGCTAATGA AGATAAGAGC GCTGAATACA GACAATTATT GATCAACGCC ATCCACCAAT TGGCCATCAA ATTTGTCGAA GTTGCAGCCA ACGTCATAGA TTTGTTGTTG GAGTCCATGA GTGACTTAAA CACCACTGCT GCTTACGAAG TTATCACTTT TGTTAAAGAA GTCGTGGAAA AGTTCCCAGA CTTGAGAAAG ACCATAATAA CCCGGTTGAT CAGCGTGTTG CCTTCTATCA AAAGTGGAAA GGTCTTCCGT GGAGCATTCT GGATTATTGG TGAATATGCT CTTGAAGAAA GCTTAGTTCA AGAAGCATGG AAATACATTA GATCTAGCAT TGGAGAGGTT CCTATTGTAG CCAGTGAAAA GAGAGCTGCC GAAGGAAATG CCCCAGATGT AGAAGAATAC TCCAACGGAT CTACTGAACA TACAAAGAAG GGACCTGTAG TTTTGCCCGA TGGTACCTAT GCTACTGAAA GTGCTTTGAC TGCTGAAGTA AAGGATACCA GCAACGATGA GAAGCCTCCA GTTCGTAAAC ATATCTTGGA CGGAGACTTT TATCTTGCTG CCGTGTTGTC CTCTACTTTG GTCAAATTGG TTTTGAGATT ACATCGTTTG AAGGCTCTGC AATCGGTGTT GAATGCTTCT AAGGCTGAGG CTTTATTGAT TATGGTTTCG ATCTTGAGAG CAGGTGAGAG TTCATATGTA GCCAAGAAGA TCGATGAAGA CTCAGCTGAT CGTATTCTCT CATACATCAA GGTATTGAAT GAAGAAGATG ATTTGGAATT AATCCTGGCC GGGTTCTTGG ACGAAACCAA AGATGCCTTC ACTGCCCAGA TTCAGAGCGC CGAGCTCAAG AAGGCAGAAG AGCAAGCCAG AGATTTCCAC GAAAACGCTG AGCAGGTGGA CGGTTCTATT GTTTTCAGAC AATTCGACAA GGATAATGCT GCCAAGAGTG CTGCTGTTGA TGATGTATCA TTGGCAAGTG GTAGTGCATT AAAGAAGGAA GACTTGTCGT CCAGATTAAA CAAGATCTTA CAATTGACAG GATTCTCTGA TCCAATCTAT GCCGAAGCCT TTGTCAAGGT TCACCAATAC GACGTCACCT TGGATGTCTT GTTGGTTAAC CAAACCACTG CTACGTTGCG TAACTTGTCA ATTGAGTTTG CTACTCTTGG TGATTTGAAG GTTGTGGACA AGCCTGCTAC AGCCAACGTA GGTCCACATG GATTCTACAA AATTCAAACC ACAGTGAAGG TCACCTCAGC AGACACTGGT GTCATCTTTG GTAACATTGT CTACGATGGC CAACACTCTG ATGAGTCTAC CATTGTGATC TTGAACGACG TTCATGTGGA TATCATGGAT TATATTAAGC CTGCCACTTG TTCTGAAAGC CAGTTCCGTA AGATGTGGAA CGAATTCGAA TGGGAAAACA AGATCACCAT CAAGTCACAG ATTCCTACTT TGAAGGAATA CTTGGACGAA TTGATGAAGG GCACCAACAT GAACTGTTTA ACTCCCGGTG CTGTCATAGG AGAAGAATGC CAGTTCTTGT CTGCTAACTT GTACTCTCGC TCATCGTTTG GTGAAGATGC CTTGGCCAAC TTGTGTATTG AAAAGCAGAG TGACGGTCCA ATAATTGGCC ATGTTAGAAT AAGATCCAAG GGTCAGGGTT TGGCATTGTC GTTGGGTGAC AGAGTTGCAT CAATTTCCCG TAAAAACAAG CCAGCAAGTG TGATCAAGGT CTAAAACATT TATTTCGTTT TATTGTATTC ACTAAATGTA TAGTGTATAA TCAAAACTTC ATATACAAAT CATTTACATA TATTATATTA A
|
Protein sequence | MSESGYTLIY DPNAASKTSV NEFKTLLEKG KDEAKVEAMK KILISILNGD PMPDLLMHII RFVMPSKNKE LKKLLYHYWE VCPKMDDQGK MRHEMILVCN AIQRDLQHPN EYIRGNTLRY LTKLKEAELL ETLVPNVRQC LEHRHAYVRK NAVFALHSIH KVNDHLVPDA DELIYKFLYE ESDSVCKRNA FVCLGDLNRD ASLQYIQDNI SIIETLDPLL QLAFIEFIRK DSVQNPVLKS QYTNLVTDII ESSSNVVVYE AANALTVLSN NPQSILLAGS KYVELATKEA DNNVKIITLE RINDLHKKNP GVLQELSLEI LRVLSSQDLD VRKKALDVTL QFVTSRNVED VVKLLKTELQ RTSSANEDKS AEYRQLLINA IHQLAIKFVE VAANVIDLLL ESMSDLNTTA AYEVITFVKE VVEKFPDLRK TIITRLISVL PSIKSGKVFR GAFWIIGEYA LEESLVQEAW KYIRSSIGEV PIVASEKRAA EGNAPDVEEY SNGSTEHTKK GPVVLPDGTY ATESALTAEV KDTSNDEKPP VRKHILDGDF YLAAVLSSTL VKLVLRLHRL KASQSVLNAS KAEALLIMVS ILRAGESSYV AKKIDEDSAD RILSYIKVLN EEDDLELISA GFLDETKDAF TAQIQSAELK KAEEQARDFH ENAEQVDGSI VFRQFDKDNA AKSAAVDDVS LASGSALKKE DLSSRLNKIL QLTGFSDPIY AEAFVKVHQY DVTLDVLLVN QTTATLRNLS IEFATLGDLK VVDKPATANV GPHGFYKIQT TVKVTSADTG VIFGNIVYDG QHSDESTIVI LNDVHVDIMD YIKPATCSES QFRKMWNEFE WENKITIKSQ IPTLKEYLDE LMKGTNMNCL TPGAVIGEEC QFLSANLYSR SSFGEDALAN LCIEKQSDGP IIGHVRIRSK GQGLALSLGD RVASISRKNK PASVIKV
|
| |