Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_50606 |
Symbol | SGE1.2 |
ID | 4840829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | - |
Start bp | 17184 |
End bp | 18602 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640392144 |
Product | hypothetical protein |
Protein accession | XP_001386606 |
Protein GI | 150866868 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.862046 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AACTTGGCTT CGTGGATTAC TACTTCTTAT TTAATCACTT CAACTGCATT TCAGCCTTTG TATGGCTCCT TTTCGGATGT TATTGGTAGA AGGAAATGCT GCTTCTTTGC TCTGGCGTCA TTCGCACTTG GTTGTCTAGG TTGCTCTATG GCAACTGATA TCATCACTTT CAATTTGATG AGAGCCTTAA CAGGTATCGG AGGTGGAGGG TTAATCACAT TATCTACAAT CGTCAACTCT GATGTCATCA CTAAGAGGAA GAGAGGATTA TTTCAAGCCG TGCAGAATTT GTTGCTAGGA TTTGGTGCAG TGTGTGGTGC ATCTTTTGGT GGTTCCATTG CTTCATATTT TGGTTGGAGA TGGTGTTTCA TCTTTCAAGT GTTTCCTTCC ATTTGGAGTC TTATAATTGG ATACAAGTAT ATCACAAATC AACCTGGATT TGATGAATCC CAGCAGCATT TATCCAGCTC TGTGTTAGAA AGAATAGACT ACAAGGGTTC AATTATCCTA GTGAGTGCAC TTACATGTCA ACTATTTGTT CTCACATTAG GAGGAAATGA ATTACTGTGG CTGGACTCGA GATTAATAAC ACTTGCAATT TTAGGAGCAA CATTATTATT CTACTTCATC TATATCGAAC TTCATACCAA AGCTAATCCA ATCATTCCCG TAAGGAGATT CAATAGCTTA TTCACAGTCT TGCTACTTGC CCAGAACTTC TTATTGGGAT TATGTGCTTA TGCTTATCTT TTTGCATTGC CATTATTATT TCAGATCGTC TTGGGCGACA CTCCGTCAAA AGCAGGGTTG AGGTTGGCAG TTCCTTCTTT ATCTACTCCT ATTGGTAGTG TGATAACTGG AGTAATGATG AATAAATATG GAGTGTTAAA AGGATTGTTG TACGTTGGAA CTATGACAAT GGCAATTGGA AACTTTTTGA CCTTGTTAGT CAGTCCCAGT ACTCCTAGTT GGCTCTTGAA TATTTTGCTA ATGCCAGCAA ATATCGGTCA AGGAATGGCA TACCCGAGCT CGCTATTTAC ATTCATATTT GCTTACGGAA CAACTCACCA GGCAACTTCT ACTTCGACAA TATATTTACT GAGAAGCATA GGGGGAGTAT TTGGTGTATC CAGTGTTTCA GCGATTATCC AAGCATATTT GAAATTCAAA GTGAGAAAGG ATTTGAGTGC CCTACCAGAA TTATCCCATA AAGAAATCCA TAAGATTGTT ATTGCAATTT CAAAATCTTC TGATGCCATA TACAAGTACC CTGACACCAT CAAGTCTATT ATTCTCCTTG ATTACGAAAG AGCCATAAGA CTTGCACAAT TGTTTTCCAG TATTTGTTGT GCTACAGCCT TTATTCTCTG TTTGATGAGA GATATAACAA GATCAAAGCC AGACACATCT GTTGCATGA
|
Protein sequence | NLASWITTSY LITSTAFQPL YGSFSDVIGR RKCCFFASAS FALGCLGCSM ATDIITFNLM RALTGIGGGG LITLSTIVNS DVITKRKRGL FQAVQNLLLG FGAVCGASFG GSIASYFGWR WCFIFQVFPS IWSLIIGYKY ITNQPGFDES QQHLSSSVLE RIDYKGSIIL VSALTCQLFV LTLGGNELSW SDSRLITLAI LGATLLFYFI YIELHTKANP IIPVRRFNSL FTVLLLAQNF LLGLCAYAYL FALPLLFQIV LGDTPSKAGL RLAVPSLSTP IGSVITGVMM NKYGVLKGLL YVGTMTMAIG NFLTLLVSPS TPSWLLNILL MPANIGQGMA YPSSLFTFIF AYGTTHQATS TSTIYLSRSI GGVFGVSSVS AIIQAYLKFK VRKDLSALPE LSHKEIHKIV IAISKSSDAI YKYPDTIKSI ILLDYERAIR LAQLFSSICC ATAFILCLMR DITRSKPDTS VA
|
| |