Gene Information |
Locus tag | PICST_31672 |
Symbol | |
ID | 4838861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1399208 |
End bp | 1401406 |
Gene Length | 2199 bp |
Protein Length | 732 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640390176 |
Product | predicted protein |
Protein accession | XP_001384227 |
Protein GI | 150865134 |
COG category | [R] General function prediction only |
COG ID | [COG2940] Proteins containing SET domain |
TIGRFAM ID | |
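
The coordinate and length fields above are linked by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a terminal stop codon that is counted in the gene length but not in the protein length. The listed values are consistent with both assumptions, but the record does not state either one. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon is not a residue."""
    if gene_length_bp % 3 != 0:
        raise ValueError("an unspliced CDS should be a multiple of 3 bp")
    codons = gene_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(1399208, 1401406)
print(length_bp)                  # 2199
print(protein_length(length_bp))  # 732
```

This shortcut only applies because the record lists the gene as unspliced; a spliced gene would need its exon coordinates, which are not part of this record.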
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.418717 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGATA TAGACAACTC TGAGGTGGAT TCTCCATTCA CGGCCGATAC AACACCAGCT CCCAAACAAA GAACTCCGCA GCTCTTCCTT GATGTAGAGG ATAAGACGGA AGAGGCCCGG TCCAAGTTTT CGGAGTTGGA AGCATGTACC TATTCTAGCA AGTTGATTGG TTCCTCGGGC CAACAGGAGT ATATGACGTG TGACTGTGTG GAAGATTGGG ATTCAGAGTC TCAGCGCAAC TTGGCCTGTA GCGATGATTC CAACTGTATA AACCGTGTAA CCTCTGTGGA ATGTATCAAT GGCCATTGTG GTTGTGGAAA GAACTGTCAG AACCAGCGTT TCCAAAAAAG GCAATATGCC TCAGTTTCTG TTTTTCAGAC AGAACTCAAG GGTTATGGTT TACGAGCAGA TGATGTTATT CCCGAAGGTG GATTCATCTA TGAATATATT GGCGAAGTCA TAGACGAAGC TAGCTTTAGA GCCAGAATGA TCGATTATGA TTCCAAGAAT TTCAAGCATT TCTACTTCAT GATGCTCAAG AACGACTCGT TCATCGATGC CACCATCAAA GGTTCATTGG CCAGATTCTG CAACCATTCA TGTAGTCCCA ATGCTTTCGT TGATAAATGG GTCGTTGGCG ACAAGTTGAG AATGGGGATC TTTGCCAAAA GATCCATTCT GAAGGGTGAG GAAATTACTT TCGACTACAA CGTGGACAGG TACGGAGCAC AATCACAGCC ATGCTACTGT GGAGAAGCCA ACTGTATCAA ATTCATGGGT GGAAAGACTC AGACTGACGC GGCGTTGTTC TTGCCAGATG GTATAGCTGA AGCCTTGGGA GTTACTCCCA AGCAGGAAAA GCAGTGGTTG AAAGAAAACA AGCATCTTAG AGCTAAACAG CAAAGTGATG ATGCTGCAAT CAATGAAAAG TTCGTCAGGG AATTGATTGT AGAAGAATTA AAGGAAAATG ATGTTTCGAA GGTGATGGGA GCTTTGATGA AATCGCAGGA TCTCAATATC ATCAAGAGAT TAGTAGAAAG AATTCACAAA ACCAACGACG ATTTCATCAA TTCCTTGATT ATCCGATTCC ATGGCTACAA GACATTATCG ACTATAATCA AGGAATTCAA ATCGGAGGAA GACTTGATTG TACCGATTCT TGAGATCTTG GGCAAGTGGC CAAAAGTAAC AAGAAACAAG ATTTCATCCT CTCAAATCGA AAGTGCAATC CAAGACATCA AGTCTTCGAC CAGTAACGAA GAAATCCGCA CATTGTCCAC TGAGTTATTG AATGAGTGGA GTAAATTACA GATGGCCTAC AGAATCCCCA AGAGTAAGAA CTCCTACTCT TTGCTGTATG GAAGAAACAC CAGATCGCCT GAACCGGAAG AACCCAATGC AAGCAGTGAA CTCAAAGAAT CTAGTGAAGA AGCATTGCCT GCTGGTTGGG AAGTGGCATA CGATCCAAAT ACTGGTAACA ACTATTACTT TCATAGAGAG TTGAGTCTCA CCCGTTGGGA TAGACCCACA GCATCTGTGC CTACTGGACC AAAGACTCCA CAAGGAAGAG GAGCTGGATC AAGAGGAACA TTGCCTAAAG GACCGAGAGA TAATGGATTC CAGCAGAAGG ATATGAACAG AAGAGAAGAA GAAAGATTGA AGCAGGAAAA GGAAGAGCAG TTCAACAAAC TCCAAGAAAA GGAGAAGCAG TTGTTATTAT TGATTCAACA GGCTGGAGAA CAGGAAAGAC AGAAACAGTT AGAGGAAAAG ACACAAAAGG TAACAAAGAA CAAGAAGTCG 
TCTAATGGTC AACACCGCCA TCACAAATCG CACGATGACT CAAAAAATAG TAGCTCCAAG TCCATTTCAG TAGAGGATAA ATGGAGGCAT GTTTTGGCCA AACATGTTCC CAATTTGATC AAAAAGCATG AGAAGGAAGT GGGTAGAGAA AATATAAAAG GTTGTGCTCG TGATTTGGTG AAGATTCTTG TGGCCAAGGA GTTGAAGAAA GATGCCGAAA TCTCTCCTCC CAAAGAGTTG GACAGTAACA AACTTAAGAA AATTAAAGAG TACTCGAAAG TGTTTATGGA CAAGTTCTTG ATTAAGTATA GAGCAAAACA TGAAAAACAC AGAGACGATA AAAAGAGGGC CCATGAAGAC GATGGTGAAG AAAACGATGT CAAGAGGGTA AAGGAATGA
|
Protein sequence | MSDIDNSEVD SPFTADTTPA PKQRTPQLFL DVEDKTEEAR SKFSELEACT YSSKLIGSSG QQEYMTCDCV EDWDSESQRN LACSDDSNCI NRVTSVECIN GHCGCGKNCQ NQRFQKRQYA SVSVFQTELK GYGLRADDVI PEGGFIYEYI GEVIDEASFR ARMIDYDSKN FKHFYFMMLK NDSFIDATIK GSLARFCNHS CSPNAFVDKW VVGDKLRMGI FAKRSISKGE EITFDYNVDR YGAQSQPCYC GEANCIKFMG GKTQTDAALF LPDGIAEALG VTPKQEKQWL KENKHLRAKQ QSDDAAINEK FVRELIVEEL KENDVSKVMG ALMKSQDLNI IKRLVERIHK TNDDFINSLI IRFHGYKTLS TIIKEFKSEE DLIVPILEIL GKWPKVTRNK ISSSQIESAI QDIKSSTSNE EIRTLSTELL NEWSKLQMAY RIPKSKNSYS LSYGRNTRSP EPEEPNASSE LKESSEEALP AGWEVAYDPN TGNNYYFHRE LSLTRWDRPT ASVPTGPKTP QGRGAGSRGT LPKGPRDNGF QQKDMNRREE ERLKQEKEEQ FNKLQEKEKQ LLLLIQQAGE QERQKQLEEK TQKVTKNKKS SNGQHRHHKS HDDSKNSSSK SISVEDKWRH VLAKHVPNLI KKHEKEVGRE NIKGCARDLV KILVAKELKK DAEISPPKEL DSNKLKKIKE YSKVFMDKFL IKYRAKHEKH RDDKKRAHED DGEENDVKRV KE
|
| |
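
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates that difference on two short fragments copied from the gene sequence above. It assumes the only change from the standard code that matters here is CTG to S, and the helper names `codon_table` and `translate` are hypothetical, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order at each position.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine: bool) -> dict:
    """Build a codon -> amino acid map; optionally read CTG as Ser (table 12)."""
    codons = ("".join(triplet) for triplet in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate from the first base, ignoring whitespace, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = table[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


table_12 = codon_table(ctg_as_serine=True)
standard = codon_table(ctg_as_serine=False)

# First 30 bases of the gene sequence: matches the start of the protein sequence.
print(translate("ATGTCAGATA TAGACAACTC TGAGGTGGAT", table_12))  # MSDIDNSEVD

# In-frame fragment containing CTG: the two codes disagree at that codon.
print(translate("TTGCTGTATGGA", table_12))  # LSYG
print(translate("TTGCTGTATGGA", standard))  # LLYG
```

The protein sequence in the record contains LSYG at this position, which agrees with the table 12 reading.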