Gene Information |
Locus tag | PICST_88514 |
Symbol | |
ID | 4837936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1735311 |
End bp | 1737246 |
Gene Length | 1936 bp |
Protein Length | 555 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640389251 |
Product | predicted protein |
Protein accession | XP_001383618 |
Protein GI | 150864683 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3118] Thioredoxin domain-containing protein |
TIGRFAM ID | [TIGR01126] protein disulfide-isomerase domain; [TIGR01130] protein disulfide isomerases, eukaryotic |
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AATTGATTTG TTCCATACTC TTTCTACTTC ATACGCTGTA TACGTAGTAT ATTAGTTCCC ACTATACGAG AATATCAGTA TTCAAACTCA GTGTATTAGT TTTCAGTATT TTTCTCTACC AGAATTACTA CTATTACTAA TAAATACTCC ATCTACCGCT ACTACCTAGT ACAATATATC ATAGAATGAA GTTCTGGAAA TTCTCATCTT CTGTGCTTGC CACCCTTCTC GCTGTCGTCT CTGTCCAAGC TTCGGGACCA GCCGAAGGCG ACGCTGTCGC CGATCCAAAC TCCGCTGTCG TAAAGCTTAC GGCCGAGACA TACAAACAGT TCCTTGACGA GAACCCTCTT GTTCTCGCCG AATACTTTGC CCCATGGTGT GGCTACTGTA AGATGTTGGG ACCTGAATAC GCCAAGGCTG CCAACTCGTT GAACGAAACC AACCCAAACA TCAAGTTGGC CCAGATCGAC TGTACCGAGG AAGAAGAACT CTGTCGTGAC CAAGGTATCA GAGGCTACCC TACCTTGAAG GTTGTCTCCA ACGGCGCCTA TGCCGACTAC GATGGCCCCA GAGATGCCGC CGGTATCGCC AACTATATGG TCAAACAGTC TTTGCCTGCC GTCCAAGTGC CAGCTGACGC TGACGCTTTG ACTGCTGCTA TTGAAGAACA GACCAAGCCA TATGTCATCC AAGTAGGTGC TTCTACTGAC TCTGACGCCG CTTCCGCCTA CGAGCAAGTC GCTAAGGCCA ATAGAAACGA CTACTCTTTC TTCTCAGTGG AAGAGCCAGC TTTGGTCAAG GAATTGAACA CGAAGTTTAC CAATGTTAAA GTAACTGGCA AGTCCCCTTC ATACTACGTA GTCCATCCTG GTCAATTGGA TGACGTAAGA GAATTTGAAG GCAAGGACAT CAATGCTGAC ACTTTGACCC TGTTTGTTAC CACCGAAGTT GTTCCATACT TTGGCGACAT CAACAGAGAC ACCTACTTGA CATACATGGG TTCTCCATTG CCTCTCGGCT ACTACTTCTA CAACACTGCT GAACAGAGAG CTGCTTTTGC TGACGAATTC TCGAAGTTGG GTAAGCAATA CCGTGGAAAG ATCAACTTTG TCGGTTTAGA CGCTACCCAA TTCGGAAAGC ACGCCGAGTC CATCAACATG GACCCAGCAA TCGTGCCTTT GTTCGCCATC CAAGACACAC CAAACAACAA GAAGTATGGT GTTAACCAAA AGGAAAACCC AGAAGGTCCA TCTTTGAAGA CGATCAAACA GTTCGTTGCT GACTACCTCG ACGACAAGTT GACTCCTATC GTCAAGTCTG AAGATTTGCC AACCGAAGAA GAAAAGAAAG CCAACCCAGT TGTCAAGTTG GTAGGCCACA ACCACAACGA AATCATCGAA GATGTCTCCA AGGACATCTT TGTCAAGTAC TATGCTCCAT GGTGTGGCCA CTGTAAGAAG ATGGCTCCTA TCTGGGAAGA ATTGGCTTCC GTTTTTGGCT CCAACAAGGA CGACGCCAAG GTGGTCGTTG CCGACATTGA CCATACCAAC AATGACGTCG TTCTTCCCTT CGAAATCGAA GGCTACCCAA CCTTGGTTTT ATATCCTGCC AACGGTGAAG TTGACGAAAA GACCGGCTTG AGAAAGCCAG TTGTTTTCTC TGGCGCAAGA GAATTAGATG CCTTCATTGA CTTTGTAAAG GAAAATGGTG CCCTTGGTGT TGACGGCCAT GTATTGAAGG CTGCTCAAGA CAAGGCAGCT GCTGAAGCTG CTCCTGAAGA AGAAGAAGAA GCCGCTGAAG 
AAGTTAAAGA AGAAGCTGCT GAAGATGAGG ATGTTGAACA CGACGAGTTG TAAGTTTCCT GAAATCAGGC TATAGCTTTC TGTATAATGT CATGTTTAAA AGTGAGTGCA GTATTGTAAT ATAAACTTGA ATTTGT
|
Protein sequence | MKFWKFSSSV LATLLAVVSV QASGPAEGDA VADPNSAVVK LTAETYKQFL DENPLVLAEY FAPWCGYCKM LGPEYAKAAN SLNETNPNIK LAQIDCTEEE ELCRDQGIRG YPTLKVVSNG AYADYDGPRD AAGIANYMVK QSLPAVQVPA DADALTAAIE EQTKPYVIQV GASTDSDAAS AYEQVAKANR NDYSFFSVEE PALVKELNTK FTNVKVTGKS PSYYVVHPGQ LDDVREFEGK DINADTLTSF VTTEVVPYFG DINRDTYLTY MGSPLPLGYY FYNTAEQRAA FADEFSKLGK QYRGKINFVG LDATQFGKHA ESINMDPAIV PLFAIQDTPN NKKYGVNQKE NPEGPSLKTI KQFVADYLDD KLTPIVKSED LPTEEEKKAN PVVKLVGHNH NEIIEDVSKD IFVKYYAPWC GHCKKMAPIW EELASVFGSN KDDAKVVVAD IDHTNNDVVL PFEIEGYPTL VLYPANGEVD EKTGLRKPVV FSGARELDAF IDFVKENGAL GVDGHVLKAA QDKAAAEAAP EEEEEAAEEV KEEAAEDEDV EHDEL
|
| |
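
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch, not part of the database record, that checks the gene length against the start and end coordinates and translates a short fragment copied from the Gene sequence field above. The helper names `codon_table` and `translate` are hypothetical and introduced only for illustration; the only difference from the standard code handled here is the CTG reassignment.

```python
# Sketch: verify the record's coordinates and show the CTG -> Ser
# reassignment of NCBI translation table 12. Helper names are illustrative.

BASES = "TCAG"
# Amino acids of the standard code for the 64 codons in NCBI order
# (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop codon.
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg="S"):
    """Build a codon -> amino acid map; ctg="S" gives table 12, ctg="L" table 1."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD))
    table["CTG"] = ctg
    return table


def translate(dna, table):
    """Translate DNA in frame 0, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Inclusive coordinates from the record: Start bp and End bp.
gene_length = 1737246 - 1735311 + 1

# Fragment from the Gene sequence field that contains a CTG codon.
fragment = "ACT TTG ACC CTG TTT GTT"

table12 = codon_table("S")  # alternative yeast nuclear code
table1 = codon_table("L")   # standard code, for comparison

print(gene_length)
print(translate(fragment, table12))
print(translate(fragment, table1))
```

With table 12 the fragment reads TLTSFV, which agrees with the corresponding stretch of the Protein sequence field; with the standard code the fourth residue would be leucine instead.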