Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_87156 |
Symbol | |
ID | 4837019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1910967 |
End bp | 1912606 |
Gene Length | 1640 bp |
Protein Length | 456 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640388334 |
Product | predicted protein |
Protein accession | XP_001382593 |
Protein GI | 150863939 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGACG AAGTGGATGA GGCTGCGATA GTAAAAGACG AGGCCAACTT TGATGTTGAG GACGAAGTTC AGGACTGGAA GTTTCTCAAC AAATCGTTGG CATCCTCAGG TATTCCCAAA AGAGGTGAGA AAGAATTTGC ACCTGATGGA ACTCAAGTCC AGAACAGCTC TTTGCAAGAG TCTAGAAACG CCATGTACGA CGCTCTTGAA GGTGTAAGAG GTCATCACTT GAAGCTGAAG CTTATAGCAG TATGGATAGC ACTGGAATCT AAATGCATAA TGCCTCATGC CAGAGGAAAC TACTTCCGAG ATATGGGAAT TCCCATTCAT ATAGGCTCTA AAGTTAAGGA TGGTGTGGAA CTAAACTCAC TTGAAGCAGT CTATTTAGTT GAAAGAGGTT CTGTAGTAAT ATATCTCGGA AACGAGAGCT ATGACGAATT TCTCAAGTCC AGTCAAGAAA GCGAATTTGA CTACGATTCG CTTATAGCTA TGGATTTGGA GTTTCTATAC AGCGTAGCAT TCAACAATAC GGGTTATCTG CTTGATCAGT ACCAGATATA CTCGTACTTG AAACGTCTTG GTTATTTAGT GCAAGACTTT AAGCAAATCA CTAAAACAGA CCAAAGGACT AGACTACCAA TGGTCCCTTC AACTTTGTCT GTTATCCCAT CCTTTCTAAA TAAAATAATG GCAACTACAG TTGTGAAATA TTTACAACTG AGATTACGTC AATGGGGCTT CTTGTCATAT CCCTTGTTCC ATAGCTTACA TTTCATGACT AAGCACTACT TTAGATACAC AGACATTTAC AAGTCGTTGA GACTTATTCC AGCATATTCT ACACATGACA GTGTAAAAGA GAATCTAGAA GCTAGTGGGT ACTCTATAAC CTTTAATGTG TGGAAACCTA CACCTTCATT CTCCAAAAAG AACCCGCCAT TCCCTGATTT CCAAGTAGCA GTAGCAAACA CGGATTCAGC CAAGTTTCCA ACCCTACAAA CCATACAAGC ATTACACAAC TCGTTGAACT ACACTTTTCC CAATGAACGG AAAACACAGG TAGTCCCCGC TGTTAGACAG CAGAAGAAAA ATACAATGCC TCCGTCCAAA AAAGTGATAC GACAAAAGAA ACAGCAAGAA AGACAATCCA AGTTGGATGA ATCTGTGCGG AAGAAGAATG ACTATAATAG AAAAAGAGAC AATCTATTGA AATACGGATC CAATGGGCGT ACGGTGGTGA TTGCAGTTAT AGATAGCGGA ATCTTGAACT TTGTCAATTT GAGTGAAGGA GACTTCAGTT TGCAATCTAG CACTGATTTG GAAGATATTT TCCCCAGGGA AGACCACGGG ATTATCTATA ATGAAAAATG AATATACAAG GGGCTTCAAT TGAGGGGATT TCCTTGCAAA GACCGATCTC ATAATTAGCC TGAAAAGCCA AGAGTTGGGT AGAGGAGAAA GTAATTAAAT AGATATATAC GGGATATTAC GGGCTCATAT TACCGACCTG AGAAAACAGA GATGGTAATA CTCTAAATAC GAAACCATAA GTCATTCATA TTACTTAATT ACTTAATTAG AAACATTTCA GTTATTTGCA ATCCTTGAAA TACGAAACGA GGGTTTCAGG TACAAAGAAG
|
Protein sequence | MADEVDEAAI VKDEANFDVE DEVQDWKFLN KSLASSGIPK RGEKEFAPDG TQVQNSSLQE SRNAMYDALE GVRGHHLKSK LIAVWIASES KCIMPHARGN YFRDMGIPIH IGSKVKDGVE LNSLEAVYLV ERGSVVIYLG NESYDEFLKS SQESEFDYDS LIAMDLEFLY SVAFNNTGYS LDQYQIYSYL KRLGYLVQDF KQITKTDQRT RLPMVPSTLS VIPSFLNKIM ATTVVKYLQS RLRQWGFLSY PLFHSLHFMT KHYFRYTDIY KSLRLIPAYS THDSVKENLE ASGYSITFNV WKPTPSFSKK NPPFPDFQVA VANTDSAKFP TLQTIQALHN SLNYTFPNER KTQVVPAVRQ QKKNTMPPSK KVIRQKKQQE RQSKLDESVR KKNDYNRKRD NLLKYGSNGR TVVIAVIDSG ILNFVNLSEG DFSLQSSTDL EDIFPREDHG IIYNEK
|
| |