Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_44458 |
Symbol | |
ID | 4839007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 976398 |
End bp | 977663 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640390322 |
Product | predicted protein |
Protein accession | XP_001384141 |
Protein GI | 150865076 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0219256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.240847 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTGT TCCAGGTTAG TGGCTGGGAC TTAAAGAACG AAACCGTGGC TGTCGGTGGC ACAGGCGCAA AGAAGAAGTC CAACAGAGAA AAAAAGAGAG CCAGGCAACA AATTAAGGAA CTTGAAAAGT CTCAGCAATC TGAAGATGTA GCCGAACAAG AAGATGAAAT CATAAAAGAA ATAGACGAAC CAGAGAAAGA AGAGAAGAAG ATCAAAAAAG AAAAGAAAAT AAAGAAAAGA AAACACGAAG AATCTGAAAA AAGCTCTTCA ACAACTTCTC CGGCTGCTGC TATAGTAAAT CCTACTGTAG ATGCACCTAT ACCTATTACT ACACAGAAAC TCACTCCATT GCAACAGAAG ATGATGGCTA AATTGTCTGG ATCCAGATTC AGATGGATAA ACGAACAATT ATATACAATC TCGTCGGAAG AGGCTCTCAG TTTGTTAAAG CTGCAACCTT CCTTGTTCGA CGAGTACCAT CAAGGATTCA GATCGCAAGT CCAAGCGTGG CCAGAAAACC CTGTAGATGT GTTTGTCGAC CAGATCAAGA CTCGTGCCTC TCAGAGACCT ATTAATGCTC CCGGTGGTTT GCCTGGTTTT CCCGACAAGA AAGTTGTTGT TGCCGATATG GGTTGTGGGG AAGCCCAGCT AGCCTTAGAT GTGAACAACT TTGTTAAACA ATACAACGCT CAAGGGGCTA AAAAGAAATT CTCGAAAGGT AACAATAACA AGAGATTACA AACTGGACCC AAAACATTGG AAATCGAAGT ACATAGTTTT GACTTGAAGA AGCACAACGA CAGAATAACC GTGGCCGATA TTAAGAATGT GCCGTTGCCA GATGGGTCAT GTACGGTGGT GATTTTCTGT TTGGCATTGA TGGGAACCAA CTTTTTAGAT TTCATAAAAG AAGCCTACAG ATTGTTGGCT CCTCGAGGCG AGTTGTGGAT TGCCGAAATC AAATCGAGAT TCACTGAGTC GTCCGAAAAG AAAACAGTCA AACCAGAGGA CGTCGGACAG GAATTCGTGG ACGCCTTGAA GTTGTGTGGT TTCTTCCACA AGAAGACAGA CAACGACAAT AAGATGTTCA CTCGTTTTGA GTTTTTCAAG CCACCTCAAG ACATTATCGC TGAGAGAAAC GCGAAGTTGG AAAGAAGAAA GAAATTCATT GAACAGGAGT CGGAAAAGGA AGACTTGGAG ACTAAAAGAG CACAAACTCC AGAAGGTAAA TGGCTCTTGA AGCCATGTAT TTACAAGAGA AGATAG
|
Protein sequence | MALFQVSGWD LKNETVAVGG TGAKKKSNRE KKRARQQIKE LEKSQQSEDV AEQEDEIIKE IDEPEKEEKK IKKEKKIKKR KHEESEKSSS TTSPAAAIVN PTVDAPIPIT TQKLTPLQQK MMAKLSGSRF RWINEQLYTI SSEEALSLLK SQPSLFDEYH QGFRSQVQAW PENPVDVFVD QIKTRASQRP INAPGGLPGF PDKKVVVADM GCGEAQLALD VNNFVKQYNA QGAKKKFSKG NNNKRLQTGP KTLEIEVHSF DLKKHNDRIT VADIKNVPLP DGSCTVVIFC LALMGTNFLD FIKEAYRLLA PRGELWIAEI KSRFTESSEK KTVKPEDVGQ EFVDALKLCG FFHKKTDNDN KMFTRFEFFK PPQDIIAERN AKLERRKKFI EQESEKEDLE TKRAQTPEGK WLLKPCIYKR R
|
| |