Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_78175 |
Symbol | |
ID | 4839399 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 332594 |
End bp | 334581 |
Gene Length | 1988 bp |
Protein Length | 584 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390714 |
Product | predicted protein |
Protein accession | XP_001384723 |
Protein GI | 150865487 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.586636 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCAT ATCAATATGG CTGGTAAACA CAAGAAGCTC ACCTTCAAGG AGCAGATGCA GGGGTTCCCT GTGAGACAAA TGTTCATTAT CAGTATCATC CGTTTCTCGG AACCCTTGGC TTTCACGTCT CTTTTCCCAT ACTTGTATTT CATGATTCGT GATTTCCACA TCGCAGAACG TGAGGAGGAT ATCTCCAAGT ACAGTGGGTA CCTTGCATCA TCGTTCGCCT TCTGCCAGTT CCTCTTTGCT GTCAGATGGG GGAAACTCAG TGATCGTATA GGTAGAAAGA TCATCTTGTT GATAGGCTTG TTTGGTACTT CACTCTCGTT GATTTGTTTT GGATTCAGTC AGACTTATTG GACAGCATTA ATATCGCGTA GTTTGGCTGG TGTTTTGAAC GGAAACGTCG CGGTTTTGAG AACGATGATT GGGGAAATCG CTACGGAAAG AAGACACCAA GCCTTGGCTT TCTCGACACT TCCACTCTTG TTCAATTTCG GTAGTATCAT AGGTCCAGCC ATTGGAGGTT CCAAGTACTT GACCCATCCT CGCAAGGAAA ACCCGTACCA CCCAGGGGAA ACTATGGCAT TGTTAGAAGA AGGACTTTCG TTTTACGAGC GTTTCATCAG GAAGTATCCT TATGCCTTGT CCAACATTGT AGTGAGTTGC TTCTTATGGT TCTCTTTAAT TTGTGGATTT TTGTTTCTTG AAGAGACCCA TGAAGTTTGC AAGTACAGAA GGGATTACGG AGTAGATTTG GGGGACTGGT TGTTGTTGAA ATTGGGCTTC CATCCACCGG TCAGACCATG GCACCATCAT AAACACACTA GTGCTGAAGC ACAAATAGCT TCTGAAAGCT CTCCTTTATT AGGAAATTCC AGTCTGGATA ACAACCAAGA TAATGCCTCT ATTTCTTCAG AAACTGTTGA TGAAGACGAT GATGATGATA TCCAAAGTGT CCGTCCTTAT ATCTCTAGAA GAATGTCACA GGCTATTGTA AAGACTTACT CCATGAGTGA ACACGAAATT GAGGTACATA GACCTTCTTA CGCTAATGCC TTCACATCCC GAGTAATCAC GGTCATCACA GGAAATTTCA TCATCTCTTT ACATAACGTC ACCTACAACG AATTCTTGCC AGTTTTTTTG GCATCTAGAT TCCGTAAAGA CGGGCTCAAA TTTCCGTTCA GAATAGAAGG TGGATTTGGT CTTGATGTCA GTTACATTGG TACTTTGTTT TCTTCTACAG GTATCATGGG AATGTTAATT GTCCTTCTAA TCTTCCCCAT GATCGACCGT AACCTTGGAA CCATTAACGG GTATAGATTG TCAGTATCAA TCTTCCCATT CGTCTACTTC ATGGTACCCT TAGCCATCTT TACTTTGCAC GACTACAACC CAGCCTTCCC CAAGTGGGTT ACCCCAGTTA TACTATACAC GTTCACGTCA TTGAAGACGT TGGCTAGTGC AACAGGGATG CCTCAGGTTA TGCTCTTGAA CCACCGTGCT GCTGCCAAGG AACACCGTGC CTATGTGAAC AGTGCTACTA TGAGTATTAT TGCTCTTGCC AGGTGTACTG GTCCGATTGT GTTTGGTTAC TTGATGTCTC TAGGTGACAA GCTTTCTACT GGGGAATTGA TCTGGTGGGT TATGTCTTTG CTTGCCGGAT TTGGAATGAT CCAGTCTTAC TGGATGGAAG ATTACGACGA CGATGAAGAT GAAAAAGTTC AACCTGAGCC GGCTGCCGAT CTCTTCGAAG CAGAAGCCTT GGGTGAAGAA GACATGCAAC AAGACTTCAC TACAATAGAG GTTTCTGATA ACGAGCAGTC TAGCTTGTCT ACCAATTAGA GAACATATCT AAAATGATAA TTCAAAAAAC TTAATTGACA TATTGAATAT GTGCATCGAA AGGAACTCTT CAAATATCAT AGAATTGTCT ATATCCATAT ATATTAGAAA CATATAGAAA CGTACGGAGT TACCCTTC
|
Protein sequence | MAGKHKKLTF KEQMQGFPVR QMFIISIIRF SEPLAFTSLF PYLYFMIRDF HIAEREEDIS KYSGYLASSF AFCQFLFAVR WGKLSDRIGR KIILLIGLFG TSLSLICFGF SQTYWTALIS RSLAGVLNGN VAVLRTMIGE IATERRHQAL AFSTLPLLFN FGSIIGPAIG GSKYLTHPRK ENPYHPGETM ALLEEGLSFY ERFIRKYPYA LSNIVVSCFL WFSLICGFLF LEETHEVCKY RRDYGVDLGD WLLLKLGFHP PVRPWHHHKH TRNSSSDNNQ DNASISSETV DEDDDDDIQS VRPYISRRMS QAIVHRPSYA NAFTSRVITV ITGNFIISLH NVTYNEFLPV FLASRFRKDG LKFPFRIEGG FGLDVSYIGT LFSSTGIMGM LIVLLIFPMI DRNLGTINGY RLSVSIFPFV YFMVPLAIFT LHDYNPAFPK WVTPVILYTF TSLKTLASAT GMPQVMLLNH RAAAKEHRAY VNSATMSIIA LARCTGPIVF GYLMSLGDKL STGELIWWVM SLLAGFGMIQ SYWMEDYDDD EDEKVQPEPA ADLFEAEALG EEDMQQDFTT IEVSDNEQSS LSTN
|
| |