Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_56033 |
Symbol | |
ID | 4837401 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 2294614 |
End bp | 2297004 |
Gene Length | 2391 bp |
Protein Length | 767 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388716 |
Product | predicted protein |
Protein accession | XP_001383195 |
Protein GI | 150864404 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.517516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACGA ACTTGCTATA TGTGCCATTA CGGCAGTCTC GTCCCATAGA TATGGGCTCT GAATTACGCG AGGTTATCCG AAAAGACTAC TTCCAGACTC CATCTTCATT TGAACCTGAT CTCATGAGAA TTTCCAATGC TCGAAACAAG ATCACTCTAC TAACGAACGA AACGATTAGC CAAAAGAGCG AAATTTTACT CAAAGAGTAC TACGTCTACC TTCTTGCAGT TATGAAGAAA TTTCTGGATG GCTGTGTCGA GTTTGGCTGG TATGGCACTT TGACTTATGG CCCTAGTGGT CCTACCAAGT CTCGTTCACT TAAGGTAGAA TTGTGGAATA TCGTATTTCA ATTGGGAAGT TTCTACTCGC AGATGGCTCT ACAAGAATCC AGATTTACTG ATGATGGCTT GAAGAATGCG TGTGCGCTTT TTCAGCAGGC TGCTGGCTGT TTTGAGTATA TTTGTCAGTT AGTGAAAAGG GAAACAGACC AGAGTTCCAA TTCTTTGGCA ATACCGCGAG ATTTTTATGG CGACACTGTG CTCTGTTTGA AGTTCTTGAT GTTGGCACAG GCTCAGGAAA CAATCTGGCA GAAAGCTCTT GGTAACACTA CTTTGAAAGA TACGGTTATC GCTCGTTTGT CTATGCTGAC ATCTGATCTC TACGGCCAGG CTTTGGAATA TGGAAACCGT TCTGATTACA TCAAGCTTGA GTGGATTAAC CATATAGGTG TTAAGAAGTT CCACTTCAAA GCAGCAGCAT ATTACAGAAT GTCAATTGTA AGTCAGGATA GCTTTGAGTA TGGGGAACAG GTGGCACTTT TGCGAGTGGC GTCTTCATCG TGTGACTCGG CTCTCAAGTA CAAAAAGTAC GTAACTCAGC TTGTAGTAGA AGACTTGCGG GGTTTGAACC AGACGATCAA GGATGTTTTG CGTGGAGCAG AGAAGGACAA CGACTTGGTG TACATCAAGC CTGTTCCCGT CGAAAAGGAT CTCAAGCCCA TAGCAGCCGT TTCTATGGTT AAAGCTACCG TTCCTTCGGA TCTTGAGACT CCAGTAGAGA CTAGGAAACT GCTCTTCAAT GATCTTTTGC CCTATATTGT TATTCAGGTC GCACAAGCCT TTCGGGAAAG ACAGGATAAG TATATATATG AACGTTTTGT TGAGCCTATT CAGGCACTAA ACAACATGTT AGTCAAATTT ATTACCGAAA GAGGTCTTCC TGCTTCGATC GATGCACTTC AGCAGTCGGA AAATTTGCCC GATTCTATCA TCCAGCATTC CCAGGAGATC TTGGCCTTTG GTGGAACTGA CATTATTGAA GATTCCATCA CGGAAATCAA CAAGCTTTCT ATGGAATGTC AACAATTAAT AGACCATTGC AATGGAAGAC TCACCCTTGA TGCTAAAGAG GAAGATATGA TGCGGCAGAG GCATGGCCGT GAACATTGGA ACCATCAGAC GACAGAAGTT GCCGCTCGTG CACTTATAGA GAGAATAGAA AAGATGATTC AATATCTAGA TCAAGCTAGA GACGGAGATA GTTGTGTTCT CACTAAGTAC TACGAAATCA AGCCATATTT GGAGATCTAT TGTGGAGGAT ATAAGCCATT AAGCGAGTTC ATTCCCAACT CGGACTACTC CAAGGTCGAC AAGAACATGA GCAATATCAT TACGGATTTG AGAAACGCCG TAAATCAGGT ATCTGTACTA GAAGAACAAC GCAAAAGGTT TCTTCTGCAA GTAGAGTTGA AGGCTCGAGA ACACAATATC TTGCCTAGCG TGATTGAAGA GTTCAAGCTG AAACAAAATG AGATGTACGA TGAAAATGGA AATGTAAATG AGAGATCGTT TGAAGTAGTC TACGACAAAC ATATCAAGCT CTTCAGCAAA GAGATGAAAT TCATGGAGAG CACTAAAAGT ACCCAGATCT CGTTGGAAAA CGATATAGAT ACCTTGAACA GCCGCTTTAT TTCTGACTAC AACACCAGAA GTAGTGACTC GCAGGTTAAA CGAAAAGAAG CACTACAGTT GCTTGAGGCT GTTTACTCCA AGTATCTAGA GGTAATCTCT AACTTGAGCG AGGGATCGAA ATTCTACAAT GACTTTTTGG TCAAAGGCAA CGGAGTACTA AGTGAGTGTG AAGATTATCT TAATCAACGA CGCTTAGAGA GCAGAGAACT AGAACTTACC ATCAGCAAAC TGTTCAGGTC TGGTCCATCT CAACATTCAC ATGGATATGA CGAAGAACTC AGTCCAACTT CTGTTCATGA AAGTCGAGAA ATGGAAAGAT TGCGCAAAGA GGTAGAAGAA GAGACTCGTG CAACAAGTGT TGGGGCTCCC ATAACTAAGC CTGGTATATG GAGTCCAGAC CAGGGCATCA AGTTTGACTG A
|
Protein sequence | MNTNLLYVPL RQSRPIDMGS ELREVIRKDY FQTPSSFEPD LMRISNARNK ITLLTNETIS QKSEILLKEY YVYLLAVMKK FSDGCVEFGW YGTLTYGPSG PTKSRSLKVE LWNIVFQLGS FYSQMALQES RFTDDGLKNA CALFQQAAGC FEYICQLVKR ETDQSSNSLA IPRDFYGDTV LCLKFLMLAQ AQETIWQKAL GNTTLKDTVI ARLSMSTSDL YGQALEYGNR SDYIKLEWIN HIGVKKFHFK AAAYYRMSIV SQDSFEYGEQ VALLRVASSS CDSALKYKKY VTQLVVEDLR GLNQTIKDVL RGAEKDNDLV YIKPVPVEKD LKPIAAVSMV KATVPSDLET PVETRKSLFN DLLPYIVIQV AQAFRERQDK YIYERFVEPI QALNNMLVKF ITERGLPASI DALQQSENLP DSIIQHSQEI LAFGGTDIIE DSITEINKLS MECQQLIDHC NGRLTLDAKE EDMMRQRHGR EHWNHQTTEV AARALIERIE KMIQYLDQAR DGDSCVLTKY YEIKPYLEIY CGGYKPLSEF IPNSDYSKVD KNMSNIITDL RNAVNQVSVL EEQRKRFLSQ VELKAREHNI LPSVIEEFKS KQNEMYDENG NVNERSFEVV YDKHIKLFSK EMKFMESTKS TQISLENDID TLNSRFISDY NTRSSDSQVK RKEALQLLEA VYSKYLEVIS NLSEGSKFYN DFLVKGNGVL SECEDYLNQR RLESRELELT ISKLLRKEVE EETRATSVGA PITKPGIWSP DQGIKFD
|
| |