Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_30636 |
Symbol | |
ID | 4838250 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 565684 |
End bp | 567384 |
Gene Length | 1701 bp |
Protein Length | 512 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389565 |
Product | predicted protein |
Protein accession | XP_001383387 |
Protein GI | 150864535 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.931484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATACGA TTCCCGTCTT TGGATCCTCG GTTCAAGAGG CCGTACAGGC CACAGTTCGT CTCGGTAAAC CACTCTTTGT CTTCTTATCG GTGAACTCGG AAGAGAATCT GGCTACGTTC CTCGAGCAGT TCTTCCACAG CCAGGAGGCA ATAGATAGCG AAATAGGACA ACTTGTCACG GAGTCATTTG TGACATTGAA GCTTGTAGAA GACACGGTAG AGTTTGGCTA TTTCCAGCAG ATATTTTCGA ACTTGATTGT TCCTAGCTTC TATATTATCC AGAATGGAAA ATTGCTAGAC GTGATTTCTG GAGAGACCAC TGAAGCACAA TTTGTCGAAA AAGTAACCAA TGTAATTATA GCTGAAACAA ACCAAATCTC AACAGGTCCA GAAACATCAC AAACTGATAC AAATGGTGCA AATGCCGCTA ATATTTCTAA TTCTTCCATT TCATCGGTTC CTGCTCCTAA CACAGCCACA CAAATACATT CAGATCCAGT ATCAGATACA TCTGCAACAA ATTCGACAGT CGCAAATGTA GATCTGCCAA CAGCTTCTTC ATCTTCGGAT AACCAGAGAC ATTCTCAAAG CCCGGTTGGA CAAGTGGAAA CTGTCAAATC TACCAAGCCA CAATCAGCCC ATGACAGAAC AGCTTCGGAG TACCATAAAC AATATCTAGC TTCTAGAAAG AAGCAGGAAG AAGAGAGACT CAGGCTTCGA GTACTCCTTC AGGCGGATCA AAAAGAAAGA CTCTCGAGAC AAAGAGAGAT GGACGAAATT CTTCATGGTT CAGAATCAAC TTCTCCACAG CCCAAATCAC AATCTCCAGC ACACCCAGCA CAACATGATG TGTGTTTTCT CTCAATAAAG CTTTTCGATG GAAGTTCATT GAAGCACGAG TTTCTGTCAT CAGATACATT GAATACTGTA AGGGAGTGGT TGGATAAAGA AACAGAAATA ATACCTCCCA CAGACTCCCT ACCATCTTTT GCAAGCTCTT CGTATCCGCA GCCTACAAAT TACGCCTTTC ACCGCCCGAT ATTACCGAGA GAGACTTATA CAGATGAGCA AGAGTTCCAG AAGCTTGTTG ACCTTGGATT GTGCCCTAGA TCTGCATTGA TCTTGAAGCC TATTTATGAC GATAAGTACC TGAGTTCGTA TCCTACCAAT AAGACTTCAG GAGGTATATT GAGGGGTGTA GGCGGAACTT TAGCCAGAGT AGGAAGTGCT TTATATTCGT TCTTTGATTA TGGGGTAGAT GACACTCAGG AACATCAGCA TCAAGACTAT GATGAACCGG ATGGTTCCAG AAGTCCACGT GATCCCACCA GTCCTTCCAG ACCTTCAGCT ACAGCATCTG GTTCGTCTCG TGTAGATTTC CCTGTGAGAC CACCATTGTT TTCAATCGAC AATAACGTGC CTTCATCGTC TTCTCTCATC AACATCTCGG AACCTGCGAA CAACAATTCG TCTTCTTCTT TACAACAGGA AGGGCCTACG TCGTTCTTCA TTGACGAGTC TAATAATCCT TCAGTATACA ACAGTAGAGC CTCAACACCC AAACCGTTAG GATTGTCGCT GATTAGTAGA GTCCAGACTA TTCATGATGA GCAAGATGAT AAGGACAAGA AAGACGTGGA TACATATAAT GGTAACTCAG TGAACCTTCG TGGAAAGGAT GATGAAGATA AGAGAGGTTA A
|
Protein sequence | MDTIPVFGSS VQEAVQATVR LGKPLFVFLS VNSEENSATF LEQFFHSQEA IDSEIGQLVT ESFVTLKLVE DTVEFGYFQQ IFSNLIVPSF YIIQNGKLLD VISGETTEAQ FVEKVTNVII AETNQISTGP ETSQTDTNGA NAANISNSSI SSVPAPNTAT QIHSDPVSDT SATNSTVANP QSAHDRTASE YHKQYLASRK KQEEERLRLR VLLQADQKER LSRQREMDEI LHGSESTSPQ PKSQSPAHPA QHDVCFLSIK LFDGSSLKHE FSSSDTLNTV REWLDKETEI IPPTDSLPSF ASSSYPQPTN YAFHRPILPR ETYTDEQEFQ KLVDLGLCPR SALILKPIYD DKYSSSYPTN KTSGGILRGV GGTLARVGSA LYSFFDYGVD DTQEHQHQDY DEPDGSRSPR DPTSPSRPSA TASGSSRVDF PVRPPLFSID NNEGPTSFFI DESNNPSVYN SRASTPKPLG LSSISRVQTI HDEQDDKDKK DVDTYNGNSV NLRGKDDEDK RG
|
| |