Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32968 |
Symbol | |
ID | 4840178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 1101545 |
End bp | 1103935 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640391493 |
Product | predicted protein |
Protein accession | XP_001385565 |
Protein GI | 150866087 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAACAT ATAATCCTAG GAAGAAAGCA GACAGTGCTT CCAACGACCT ACCCACGACA GGGCCACCAT CTCATCGTGC TATATACAAT GCAATTCATC TACCGACAAC ATTCAATCTA TTGCGATACA AACATTTCAG TTCCAAGTCA TTTTCTAGTC GGCCGGAAAA CCCCACCAGA TCAGGACTTC CGTCAGATCC AGCCTCAAAT TCATCGAATA CTTCACTGAA CTCAAATTCG AGTCTGCAAT TTGGGTCCAG CAAGTATGCC CGATCGATGG ATCTAATGGA GAAGCAATTA CTTAATCGGT TGAATTTCAG AACTTTTAAT TCAGACTCAA TTCTTGTAAT TAATGCTGTA CTAGCCCTCA CAGCTTACTA CATGCGCTCA AACTGCTTAA AAGCATTCAG TAATTTTGGA TCTAATGATC CACAACAAAT TCAAAATTAC CAGCAATTTA GAAACTTTCT CATCAAGATA TCTGCAAAGT ACTACGGCAT TGCCATCCAT AAGCTAAGAA CAATGCTTTC CAGAAAGGAT TATAATGTAA CCATCGCGAT CATTGTGCTG TCTTTTATGA ACAAAATATC CATCTATGAA GATGCTGACT TGAACCAATC AGTTACATTT TCAAAAGGTG TCATAGGCAT ATTCAATGAC ATCTTGTCTA ACTTCAAACA GTCAAACGCT CGATTTCTCA GATTGTTTAA TATATCAAAT GAATCGTCAT ATACCAGTGC GAACGACGTA TCTTGGATTA TCAATTTCCT AGTGTTTGCT TCGAAGTCGA TATTTTTCCC TACATATGAC CCAACCATCC TATTGGAATA TAACCAAACA TTAATTGAAT ACAGAGACAT ACTTCATGAA CTAAATAGAC AGGGAATACA ACCGTCCCAA CATGTCATAT TTAATTTCAA CCACCTCTTC AGTTACACCC AACAATTACT CTTGTTTATT CAATCGCATA ACACTAATAA CATCAACCAA TATCCAATTT TACTCTACAA GTTGTTGCGG CATTGGTTGA TTATTCTCCC ATCAAAAGTT CATGTGCTCG AGTATCTATC CGATCCGTTG GAAAAGACCT TGATGTACTT ATTCCGTACT TTAACAAAAA TCCTTGATAA CCTTTTCCCG TCTGTGATAT TCTACTTTTT ACATACATTC AGTGGAGGCT TGTCATTATG GTATGAACCT AATTACGAAT TAGACCCAAA AATAGTAACC CCACAGGAAA TGTCATCATT TCCTACTCCC ATTCAACATC ACTTGATTAG GATTACTATT TACAGTGTTC GAGTCTGTAC ATTTTTTCAG AAGCGAAGTA ACATATTACT GATCTTTTTT GGGGACCAAA GATTGAAACA AGAATTGTCA CCACAAATTA TCCAGGCTAA CATCAATGAA GTCATGATAA AAGCTTTCAA AAAAACAAAA ATAAGGCTCT ATCATTACAT ACATTTGCCT AATGTAGTGC AGTTCACAAA GACGAATGTT ATCTTTGATA ATATACAACA AAGATTCGGT GCCAAGATGC TCTACTACTA CAACCAAGGA CATTACGACC ATTTCATGTC TAAAAATTCG GACAAATCGT ATAATAATGT GTTCAAGAAA GACTCGATAG ATCCTATCAT AAACTTTCAT ATGAATAATA TATCTCAAAG CGGGTTTGAA ACTGCCAACG ATAACAAGAG ATTTGGTAGT ATGGACGGAT CTAGCTCGTT CTCATCGTCC ACTACTGCTT CACCGCCTGA TACTGGAAGA AGCGTGCCTG TTTCTCTTCC ATTTACAACC ACTATGGGAA GTTCTGCACC GATAAACTAC TACACTATAT TCGAAGATAA TTTTTCTCTG ATATTGACGA AGAAACTTAG TCAAGAATAT GAAGATTTAT TTCGTTCTGG CGATAATTCG CTATTTGTGA ATCCAGACAG TGCACCATTC CTCGATTTGC AGCTCAATCC TCAAAACAGG TTATTTTTCG GTGACAACGA TCCATTACGA ATTGCCGAGA TCAGCACTGA CATCATTTTC AAAAGACGAC TTCAGAATTT GTTCACTAAT ACAGAAATTT ACAATCTCAT AGTCAACTTG CACAACACGA GGAATAGAAG ACTAAGTGCA ATAATGAGCG AACCGGCTTC CACCATAAGT ATCTCTCGAC AACCTTCTGT GACGTCCACC AATGTCCAAT TAATGGAATC CATTCATCAT GAGATTGAGG TGGAACATAG AGAAAACATC AAAAATGTCA GGAAACGCAG TGGCTCCAAA TCAAAAAAGA AGATATCTCC TGAAGAGAGT CCCGTCGAAA AAGTCCAAGA GAAGTTGAGC GACGTGGACA ATTTCTACGA CGACTTGAAT GAAAAGTACT TTGCACTTTA G
|
Protein sequence | MSTYNPRKKA DSASNDLPTT GPPSHRAIYN AIHLPTTFNL LRYKHFSSKS FSSRPENPTR SGLPSDPASN SSNTSSNSNS SSQFGSSKYA RSMDLMEKQL LNRLNFRTFN SDSILVINAV LALTAYYMRS NCLKAFSNFG SNDPQQIQNY QQFRNFLIKI SAKYYGIAIH KLRTMLSRKD YNVTIAIIVS SFMNKISIYE DADLNQSVTF SKGVIGIFND ILSNFKQSNA RFLRLFNISN ESSYTSANDV SWIINFLVFA SKSIFFPTYD PTILLEYNQT LIEYRDILHE LNRQGIQPSQ HVIFNFNHLF SYTQQLLLFI QSHNTNNINQ YPILLYKLLR HWLIILPSKV HVLEYLSDPL EKTLMYLFRT LTKILDNLFP SVIFYFLHTF SGGLSLWYEP NYELDPKIVT PQEMSSFPTP IQHHLIRITI YSVRVCTFFQ KRSNILSIFF GDQRLKQELS PQIIQANINE VMIKAFKKTK IRLYHYIHLP NVVQFTKTNV IFDNIQQRFG AKMLYYYNQG HYDHFMSKNS DKSYNNVFKK DSIDPIINFH MNNISQSGFE TANDNKRFGS MDGSSSFSSS TTASPPDTGR SVPVSLPFTT TMGSSAPINY YTIFEDNFSS ILTKKLSQEY EDLFRSGDNS LFVNPDSAPF LDLQLNPQNR LFFGDNDPLR IAEISTDIIF KRRLQNLFTN TEIYNLIVNL HNTRNRRLSA IMSEPASTIS ISRQPSVTST NVQLMESIHH EIEVEHRENI KNVRKRSGSK SKKKISPEES PVEKVQEKLS DVDNFYDDLN EKYFAL
|
| |