Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33555 |
Symbol | |
ID | 4840602 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 946233 |
End bp | 948277 |
Gene Length | 2045 bp |
Protein Length | 572 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640391917 |
Product | predicted protein |
Protein accession | XP_001386370 |
Protein GI | 150866694 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTCG ATGATAGAGT TTCTCGTAAG AAGTCCCGTG TCCGAGGATC GGGGTAAGTG TTTTGATTGG AAAATTGAGG AAAATCGAAA GAATAAGCTC ATATATCTCA TATACCTAGA GTTTCTGGAA TGATTTCATA TGCGAGTTAA GACAACAAGA ATTATATCTT TATTGAAATC AAATGGTCGA TTGGCTCCAT TTGAAATAGG ATCTACTTTA TAAGAGATCA TGAGTACGAA AGCATACCTT TTCTTTATGT GCTATGCTAT CAACATAGTA TTTGATCAAC TTCTATTCCG ATACTAATTA ATTGAAACAA CCAATGATAA TTACGATTTC CGAGATGATT TCAGGATCGA AGTACTAACA CCTTTTAAGA AAACGAAATG TTCTAGTGGA TGATTTCTTC GTGCTAAACA AACAATCCAC TTCAGACAAA CCAAGTAAGA AACGACGTCT CACGGACAGT CTTAAGTCTG CATTGAACCA TTGGGATAAC ATCGAAAGTC TTCCCAGACT TCGAAATCGA AAAGTTTCGG CAATTGGAAA AGCAGATTTG GACAATGCTA ATTCTAAGAC TACTACACCA GTACCTCAAG ATAGGGGAGA GCAAGAAGAG TCTCAACCTC AGAACAAACT GGTTCATTCT GTGGCCCTTC ACTCTGAGGA GGCTGGGGAG TTGTTAACTC CGTCTCATTT GATCAACCAG TCTATCTATG ACGAAATTGA GTACACAGGA AATCATAATC AGGATGAAAG TCTCCAGGTA ACGAACAGCG ATGTTTCGGT AAGAAAAAGT GCAACTGATA AAACAAACAG TGGACGTGGA AGACCAAAAA AGAGAGCAAG AGTCAATGTA GGTCGTCCAC GAAAAGTGGA CAAGCCTAAG AACAATTTGA ATAGCACTAC TGAGGCTGCT CAGAAAGATA CAATAGAGAA CAACGAGCAC GATTCTTCAG CTGAAACGTC GTTTCGAAGA GAGAGTTTAA GAAAGACAAG GAGAATAAGC TACAAAGAAA TGGTCAGCGA CAACGAGAAA GAAGAGTCCA GTGAAGACGA GAAGATAGAG TATGCATTCC GATCGCTAGC TGCACGAACG TTGCGACAGA AGTCGAGACT CAAGCAGTTT CTTGGAGGTG TAGATTCGCC AGAAGAAGCA GAAATTGTAG AAGAACCACA TACAAGGATC AGAAAAATTC GAGATAATGT GAAGAAGCGA CAGAATGAGT TGAGAGAATC GCAAGAAAGG AATAAAGTAT CGAATGGAAA GACATTAAGA AAATCAAAAT CAAGTGACAG GACAACAACA AGCGACAATA CAAAATCAAA TGGAAATAAG TCAAGAAAAG CAAAATCCAA GTCTAGGGTC GAGAGCACTG AAACTGGCCC TGAGAGTAGA ATTGAAAGAG TACCATCACG AGTGAGGGAA AGAGTAGCGC CTATTCCGTT ATCGAGAAAG AGGCGGAACG AACGTCAGAA ACCAATGAAT ATTGATGTTG AGAGATTGAG AGATGAAGAA AACAAAGACA AACGCGTCAA GATTCACACA ATCGATGTAC TTAGACATTT AGTCAAAGAG TACGAACCGG AAGAAACTGC CTCAGAAGTA ATTAGAGAAC AAGTTGTTCA GGAAGACTTC AAGGCACATC TTGTCCATCA GCTAGACTAT CTTATGGATG TTCATTCCGC TATAAACGAT ATCACTACAA GAATCAACGA GGTTCAGAAG TTGAAAAACG AATACCGACA AAGGATATAC ACGTTGAAAC AGAATCATGT TGATGTGGGT ACCAAATTAA ACACATTAAG AAGTCAATAT AATCGAGACA AGGATAGACA TGCCGAAGTT CAAATGGTCG AAACCGAAAT GAAGTCGTTG CAACAGATAG GCAATACTAC AGAGGATGCA AAGCTGTCGT TGAGCCAACA GGTTACAGTT GCGTTGAGCC GTGCATCGTC GATTGTGAAT CCTTCTGCTG GAGTCCTACG TAAACTTCAG ATTGTAAACC AGAAGCTTGT TGACCTCGAC AAGGAACTAT TATAG
|
Protein sequence | MAVDDRVSRK KSRVRGSGKR NVLVDDFFVL NKQSTSDKPS KKRRLTDSLK SALNHWDNIE SLPRLRNRKV SAIGKADLDN ANSKTTTPVP QDRGEQEESQ PQNKSVHSVA LHSEEAGELL TPSHLINQSI YDEIEYTGNH NQDESLQVTN SDVSVRKSAT DKTNSGRGRP KKRARVNVGR PRKVDKPKNN LNSTTEAAQK DTIENNEHDS SAETSFRRES LRKTRRISYK EMVSDNEKEE SSEDEKIEYA FRSLAARTLR QKSRLKQFLG GVDSPEEAEI VEEPHTRIRK IRDNVKKRQN ELRESQERNK VSNGKTLRKS KSSDRTTTSD NTKSNGNKSR KAKSKSRVES TETGPESRIE RVPSRVRERV APIPLSRKRR NERQKPMNID VERLRDEENK DKRVKIHTID VLRHLVKEYE PEETASEVIR EQVVQEDFKA HLVHQLDYLM DVHSAINDIT TRINEVQKLK NEYRQRIYTL KQNHVDVGTK LNTLRSQYNR DKDRHAEVQM VETEMKSLQQ IGNTTEDAKS SLSQQVTVAL SRASSIVNPS AGVLRKLQIV NQKLVDLDKE LL
|
| |