Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_67783 |
Symbol | |
ID | 4839635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 358243 |
End bp | 360684 |
Gene Length | 2442 bp |
Protein Length | 551 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390950 |
Product | predicted protein |
Protein accession | XP_001384728 |
Protein GI | 150865490 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5243] HRD ubiquitin ligase complex, ER membrane component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AATTTGGTTT TCTTGTACTT TTTCTGCTTA GCGAACTTCG GTGGCCACAA TAAAGCGAAC CAGAAGCTGA GCTATCTTAT TCTACATCTA ATCCTGATAC TCTGCTACTT CTCAGGATCA TCCCACAATT TCCGCGAAAC CCTTGAAGCT GCCAGAAGCT ACAGGTACCA CATGAAAAGC AGGTCTGGAC TCATAAGTTT CCAGTCTGAA AGTCTCAGTC CACTTAAGCA TTACTAATCC GTGTTTGACA GGTGCTCGTA TTTGTTCGAC CTTAACACGA GACACTATAG TCTCTGTTTC CACATCTATT GATCTGTTAC AACACATAGA CTCTAACAGA AACTGTTACC ATCCAGATTC TTATTATACT GAATCCTATT GGATATATTG ATTTACAAAG TCCTGAATCA ATATTCATCA TATTATTCAC TTGTTCCTAA TTTGCTAATA TTGAAGCATT CGATGCAATA GTCTTCACCA CCATGTTAGC ACCAGACAAA ACCATCTATT TAGCCATAAG GGAAACCCTG GCGACAGTAC TTCAAGTTAC TAATCTGCCT ACAAGCACCA CTACTACTGT CACCCAAACC CGGGGTACGC CGCCTGGGCT GACAGAAACA GACTCCTCAA CCACCGTTCG TACCTCGCCC ACATTTGTGG GAAATACGCC GTCCACGGTT CTTTTCTTCA TTGCTTTAGC TGTTGGTGTC TTCATTGCTT TGCTCTTCAT CTTTTTCACC ATCAGATACT TTGTTCGTTC CAAGTATGGA CTTCATGTAT ATCCACTTCT GCGGAGGCAT TTGATTGCCG GAGCAGCCAT CACCTCCGAT CGTTTCCATG ACACCATGAC TAACGAGGAG TTGCAGGAGC ATCTCAACTA TATCAGAGAT CACCACTTCA TCCAGACACT GTTCTTAGAA AGACGGTTCA CTGGCCGTAG AAGACGTAGA AGAAGAGGCC GTTATTCGCG AATGAAAAAG TTGTCGGAAG CCGAGGTCGA GATTCTATTC CCCAAAAAGA CCTACACAGA TTGGTTGAAT GGAGGTCAAG AACGGGATCA CGAGAAGCGA GACGGTGTTC TTCAAGAAGA AGGTAATACT GACTCGGGCA ACTTAAACAT CATCAACGAA GAGGACTCTT CTGATTTGCA TAGCCAAACC ACTGTCAGTG ATCGAGTTGC ATCTAGTTCA AGAGACATGG ACAACGACGA CTCCATAGAG TTGCACGAAT TGAAAAACGA AGCCACGAAC TCTGTTTCTG CAGATGCCGA AGATGACTTG CACTTCACTT CTGGATCTTG TGCCATTTGT TTGGAAACCA TCGGTGACGA AGACATTGTC AGAGGCTTGA TCTGTGGGCA TGTGTTCCAT GCTGAGTGTC TAGATCCATG GTTGACTAAG AGAAGAGCTT GTTGTCCTAT GTGCAAGCGA GATTATTTTT TCAAGAAAGA GGCAGCTGAA AGCACAGAAA CGCAAACCAG TACCACAAAT AACGAAACTA ACAACAATGT AAATGAAAAT AGCGACATCC AGAACCTGAA TGAAAATACA AACAACAATA ATGACAATAC GAACAATGTT GATACCAGTA TTATCGATGA TGATGACGAC ACGAGCATTG ACTATGATGC CATTCGCAGC AATCCCGCAT TCCGAGCTCT TCTTCAGGAA TTAGTTCCCA TTTCAGAAAG AGTGCGGCAC ATCATGAGTG ACCCGTCGAA CGACCACTTA GACCTTGAAG TCAGAGCACG TGCAGTGGCC AAGAAGACTT ATGGTCGTTA TTTCAAGGTT CTCTGGTGGA AGTTGATGGG CATCAGCAAG GAAGACCTTT TCAACTGGGC AGCTTTGACG ATCTTTCAAG ACTGGAGACG TGCGAACAAC CAAACAGGAA ATGAAGCAGA AAATGAAGCA GAAAATGAAG CACAGACTGG CTCTACTGCA GAAGGAACAG CTGACTCCAA CGATGGCACC AATACAAACG ACAGCAACTC CAACAATAAT GGTAATAATA ATGACACCCA TAGCGATTCT TTGGAGTCAC CAGTTGAAAA CAGAGACATG GAAGAAGTGG ATCTCGGTGA AAGAGATTCG CCAGAGCTTT CTGCTGCACG CAGAGACGTG GTGGATAATC GAGTATAGAT TTGTAAAGTT ATAGCAGTCT CTTGCTGTAT CTCCCGACAT ATTTAACTAT TTATTTATTT CTTATTCAAT TCTTGAGCTT CTGTTTCAAA GAACCACGCG GTGCCACACA GAAACTTCTA TTTCTTAATT TGATTAATTT ACACCTCAAT CCAGTAACCT AGAATTGATA AAATATCTGG CTGCGAAAAC TAATTTCGCA GCCACACGGT AATTGTGGTT TGCACCATGA ACAAATGGAA ACCATCGTAC TGTGACCCTA GCTTATACTA TGACCCTAGT TAATAATATG ACCTTTTGTG TT
|
Protein sequence | MLAPDKTIYL AIRETSATVL QVTNSPTSTT TTVTQTRGTP PGSTETDSST TVRTSPTFVG NTPSTVLFFI ALAVGVFIAL LFIFFTIRYF VRSKYGLHVY PLSRRHLIAG AAITSDRFHD TMTNEELQEH LNYIRDHHFI QTSFLERRFT GRRRRRRRGR YSRMKKLSEA EVEILFPKKT YTDWLNGGQE RDHEKRDGVL QEEGNTDSGN LNIINEEDSS DLHSQTTVSD RVASSSRDMD NDDSIELHEL KNEATNSVSA DAEDDLHFTS GSCAICLETI GDEDIVRGLI CGHVFHAECL DPWLTKRRAC CPMCKRDYFF KKEAAESTET QTSTTNNETN NNVNENSDIQ NSNENTNNNN DNTNNVDTSI IDDDDDTSID YDAIRSNPAF RALLQELVPI SERVRHIMSD PSNDHLDLEV RARAVAKKTY GRYFKVLWWK LMGISKEDLF NWAALTIFQD WRRANNQTGN EAENEAENEA QTGSTAEGTA DSNDGTNTND SNSNNNGNNN DTHSDSLESP VENRDMEEVD LGERDSPELS AARRDVVDNR V
|
| |