Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_45215 |
Symbol | |
ID | 4838922 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 303008 |
End bp | 304432 |
Gene Length | 1425 bp |
Protein Length | 456 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640390237 |
Product | predicted protein |
Protein accession | XP_001384356 |
Protein GI | 150865225 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.803161 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAAAT ATCATCTCGT TGTCTTGGTC CATGGGTAAG TATGCTTTTC ATTCACGTGC ACCCTAGTTC ATATATACTA ACATACTAGA CTTTGGGGCA ATCCCACCCA TATGGACTAT ATCGAGTCGC AGGTTTTGGA TAAAATACAG CCAGCTGATG AGGAGCTTGT GGTTTACAAG ACTGGGTCTC ACAGTGGCTA TTTGACATAT GACGGTGTAG ACATCAACGG TAAAAGAACC TCAGATGAGA TCTTGGAGCA AACGAATGCT CTTTCTCAGG AGGGAAATAA GGTTACAAAG TTGTCAATAA TAGGCTATTC TTTAGGAGGT TTGATCTCCA GATATGCTGT TGGAATTTTG TATTCGCAAG GTTATTTCGA TGACATCGAT CCAGTGAACT TTATCACTTT CTGTACACCG CATGTTGGGG TTCTTCATCC AATGAACCAC AGTATATCTG TTAGATTATT CAATAACTTT GCTCCCTACT TTTTGGCCCA TTCTGGTAGT CAGATGTTCT TGAAAGATAT GGTGTCCAAA ACTCAGAAGC CGTTGTTGGT AGTGATGGCA GATGTCAACT CCTATTTCTA CAAGGTATTG AAGTTGTTCA AACACAAATC TCTCTATGCC AACGTTGTCA ATGACAAGAG AGCTGCATTT TTTACAAGTG CAATTACTGC AATAGATCCG GTTAACTCTA TGATCAACCA GCTGGCTGAT AATCTCCAGA TGACTTACAT TAAGGGCTAT GAACCCATTG TTGTAGATAT CGAGAAGCCG TTGAAGTACG AAAAAATTGC CGATTCATTC GTTCCCGCTA ACCGGAAGTC TCAATCTCGA TTAACCAGAG CCGTCGGTTG GGTCAAGGTG TTTGGCAGCA TTGTCTTGTA TACACCTTTA TGGGTGGCTT TCTTTATTTC CAACTCCATT GTGCAACGTA TTAGATTGAA CAATCGAGTC AGCAATTTTC TTCAGGATGC TTCCAACAAC TTGCACCATT TGTATGACCA ATTGAGCGAT GATCCAATCC AATTGAAGGA AGAAGAAACC CACAATACTG CAGAAGAAGA AGAAGAACTG GAAAGAGCCA CTCTTCTTGA GAAGTTTGAA GACTTGGGTG ACCGTATTCA GGAACAGCAA TATTCCGTTG TCGAGTCAGT ATATTCAGTG ATGAATAACA ACGATGCAGC CATTGATCAA GAGAAAAATC CAGAGTCTGG ACAGGTTTCA GTTATGAATT TGAATGCCGA CCAGAAATTC ATTATCAGCC TGTTGAATAC CTTGGGTTGG AACAAGTATC CCGTTGTTAT CAGAAACAGC AAACATAGCC ATGCTGCAGC TATTGTTCGA TTTCACGATC CTAACTTCGA TGAAGGCAAA GTTGTTATAG ACCATCTCGT GAACGAGGTT TTCCAGTTGG AATGA
|
Protein sequence | MVKYHLVVLV HGLWGNPTHM DYIESQVLDK IQPADEELVV YKTGSHSGYL TYDGVDINGK RTSDEILEQT NALSQEGNKV TKLSIIGYSL GGLISRYAVG ILYSQGYFDD IDPVNFITFC TPHVGVLHPM NHSISVRLFN NFAPYFLAHS GSQMFLKDMV SKTQKPLLVV MADVNSYFYK VLKLFKHKSL YANVVNDKRA AFFTSAITAI DPVNSMINQS ADNLQMTYIK GYEPIVVDIE KPLKYEKIAD SFVPANRKSQ SRLTRAVGWV KVFGSIVLYT PLWVAFFISN SIVQRIRLNN RVSNFLQDAS NNLHHLYDQL SDDPIQLKEE ETHNTAEEEE ESERATLLEK FEDLGDRIQE QQYSVVESVY SVMNNNDAAI DQEKNPESGQ VSVMNLNADQ KFIISSLNTL GWNKYPVVIR NSKHSHAAAI VRFHDPNFDE GKVVIDHLVN EVFQLE
|
| |