Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_47790 |
Symbol | |
ID | 4840410 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 1292416 |
End bp | 1293486 |
Gene Length | 1071 bp |
Protein Length | 169 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640391725 |
Product | predicted protein |
Protein accession | XP_001385944 |
Protein GI | 126138842 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5078] Ubiquitin-protein ligase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00101457 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACAC CAGCCAAAAG AAGACTCATG CGTGATTTCA AGGTAAGTTC TACTTTTGTA TCCCGAAAAA GGCCCGAGTC CAAAGGTAAC TTTTGCCATA ATTAATGGTC TTCTGACCAG CGGCAACAAG GGTTTGTTGC CAGAAAGTTA CAGTGAAAAT GAAAAATAAG GCCATAATAC TAACACCAAC TAGCGCATGC AACAGGATGC TCCCTCCGGT GTCAGTGCTT ATCCGTTACC GGACAATGTC ATGACATGGT ATGTGATTAC AAATGAGAAA GGATAAAAAG AGAAAGGACA TTAAAAGATT CATGAGGAGT GGGAGATATT GAGTTATGAA TAAATAAGAA ACATGACGAG ATGAAAGTCA TATGAAGAGC AGGAGAATGA ACTTAATATG GAAAATGAAT GGATGACTAA TGAATGAAAT TTATCCGGAA GAACAAGACA CGAATGAAAA ACTAAAGAAA TGATAGTCAT TATCTACATG TTTATAAGGT ACGCAACATC CAAAATGATA TGATCCTAGA TTTTCGTATC CCTGAGACGC CAATCCACAT TCACTTTCAA TACTATAAGC ATCACAGTTG AAGAGAAAAA TATTCATCCC CCCCCCATAT CTTTTTTTGC TTTCTTCTAC TTATTCCATT ACTTTACTAA CCTATTAGGA ATGCTGTTAT CATTGGTCCT TCAGATACAC CCTTTGAGGA TGGAACATTC CGTCTTGTGC TCCAGTTTGA CGAGCAATAT CCCAACAAGC CTCCTACCGT CAAGTTTATA AGTGAGATGT TCCATCCCAA CGTATATGGA TCAGGAGAGT TGTGTTTGGA TATTCTTCAA AACAGGTGGT CTCCAACTTA CGATGTATCA TCAATCTTGA CGTCTATCCA GTCGTTGCTT AACGACCCCA ATATCAGCTC ACCGGCTAAC GTTGAAGCTG CCAATTTGTA TAGGGACCAT AGGTCGCAGT ACGTGAAGCG AGTGAGAGAA ACTGTAGAGA ATAGCTGGAA TGAAGATGAC TTTGATGACG ATGACGATGA CGATGACGAT GAAGACGACG AGATCGAGTA G
|
Protein sequence | MSTPAKRRLM RDFKRMQQDA PSGVSAYPLP DNVMTWNAVI IGPSDTPFED GTFRLVLQFD EQYPNKPPTV KFISEMFHPN VYGSGELCLD ILQNRWSPTY DVSSILTSIQ SLLNDPNISS PANVEAANLY RDHRSQYVKR VRETVENSWN EDDFDDDDDD DDDEDDEIE
|
| |