Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33654 |
Symbol | |
ID | 4840834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 121960 |
End bp | 123399 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640392149 |
Product | predicted protein |
Protein accession | XP_001386420 |
Protein GI | 150866732 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCAG AGTCCAAAAT TGGATATCCC ACGCGAGAGT ATAGAATCCC GTATGACAAT TCTGCATCTC TGGACCTTCA TACTGAATGG TACTTGATTC TCACTTCCAA TAATTACCAT TTCTATTTCA ACAGGCTATT GAAACAGTCA TACTGGCAAT TGGCAGACAT AGCTGCCGAA TTCAAAGATG TCGACATCGA AGAATTTGTG CTGGCTATCA ACTTCGATGT TATTTCTTTG ATGTTCGCCA GGAATGTGGG GCTTAAGGGT TTAGATGGCT ACTATTTTGA AAAGCAGGAT GCTGATCAAG ACACGATAGA AGTAGAAGAG TTTGAAGAAG AGGAGGAAGA AATCAGGGAA AGCGATACTG GAGATGCAGA AGAAGTAGAA ATCGACGTCG AAGCCAGAGA CGGCATGATC AGAGAGTTCT TACTAGAAGA AGGGTACGAA GTAAAGGAAG AGAAAGCGGA AGATGAAGTC AAAGAAGAAC CAAAAGCCCC AACGGGAATT TCTTTGGTTT CAGGATACTC TTCTAGTGAA GAGGAAGACG ACGAATCTGG AGAAGAAAAA CCTGCTGAGA AAGGAAGTAA TTCTGTTGAA AAAAACGATC ATCATGATAT ATCACAAGAT AAAGAAGAAC AAGTTGAAGA AGTTGACGAT CTTCAATCTG ATGAGTCTGA TTCCGAAAAT AGTGGCCTAG ACCTCAATAT TTCGGAAGAT GAAGATGGCT CAGAAAGATT GCAAACGTCA GCAGTTACAG AATTCATAGA GCTTTTGGAT ATGTTTGCCG ATCGAATAGA CAAATACCAA CCTTGGGATC TAATTGAAGA AGAATTGCTT CCTGACTTCG TCAAACATCC ACAGTACTAT GCCCTTGAAC ATGCTTCGCA AAGAGAGGAA GCATTTGACG AATGGTTGAA GAGTAGATCC CAGAAAGAAG ATAATTCTTC AGAGAAAGAA GTCACACAAG AACCTCCATT ATATCCTACA CCAACTTTGG ACTTTTACCA TTTCCTCCAA GATCATAAAA AGGAACTAAA GTCGGCTACA TACCAGGAAT TTTACAATAA GAACCACGAG CACATTAATG ATGTAGATCT CGTGTCTAAG GAGAAAGAAG CGCTTTTCCG AAAATTCAAG ATCATGCTCC AGGACCAGAC GGAATTCGAG AAGAGCGCTA AAAAATCGAA AGCATTATCC CCGGGAATCA ACCTAAAGAG GTATAAGTTA GACGAGTTTT TGTCTACCCA AGAATCCGTC GAAATCCATC CCGGCCAATT ACAGGAAATT ACAAATAGCG GAAGTACAGA CTACGGAAAA TGGCTCGCAT TAGCTAACAA ATTAAATCTC CGCAAGGAGT TAATTGAAAG CACTAAAAAC TTCATAGTAG GTGACGAGAA GAGACTAGCA GCTTACCTAG ATAAGTTTTC AAGTAATTGA
|
Protein sequence | MASESKIGYP TREYRIPYDN SASSDLHTEW YLILTSNNYH FYFNRLLKQS YWQLADIAAE FKDVDIEEFV SAINFDVISL MFARNVGLKG LDGYYFEKQD ADQDTIEVEE FEEEEEEIRE SDTGDAEEVE IDVEARDGMI REFLLEEGYE VKEEKAEDEV KEEPKAPTGI SLVSGYSSSE EEDDESGEEK PAEKGSNSVE KNDHHDISQD KEEQVEEVDD LQSDESDSEN SGLDLNISED EDGSERLQTS AVTEFIELLD MFADRIDKYQ PWDLIEEELL PDFVKHPQYY ALEHASQREE AFDEWLKSRS QKEDNSSEKE VTQEPPLYPT PTLDFYHFLQ DHKKELKSAT YQEFYNKNHE HINDVDLVSK EKEALFRKFK IMLQDQTEFE KSAKKSKALS PGINLKRYKL DEFLSTQESV EIHPGQLQEI TNSGSTDYGK WLALANKLNL RKELIESTKN FIVGDEKRLA AYLDKFSSN
|
| |