Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31966 |
Symbol | |
ID | 4839631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 335340 |
End bp | 336950 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390946 |
Product | predicted protein |
Protein accession | XP_001384724 |
Protein GI | 150865488 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGGGT TGACAGTACG AGAACAGTTG GAAGATTTCC CCGTCTTCCA GATGTCCATC ATCTCCATCA TCCGGATAGC CGAACCAGTT GCTTTTACCT CCATGTTCCC CTATATATAC TTTATGATTA AGCAATTCGG AGTAGCAGCC AAGGAAGCCG ATATTTCGCG GTACAGTGGG TACTTGGCAT CGGCTTTTTC TTTCAGTCAG TTTCTCTGTT CGGTACTGTG GGGTAGGGCG TCGGAACGTT TTGGGCGCAA ACCAGTTCTC CTAGTGGGTC TAATGGGTAC AGCAATATCG ATGATTGTGT TTGGATTCAG CACCAATTTC TATATTGCGT TTTTAGCCCG GTTGATGATG GGATCCTTGA ATGGAAATGT CTCAGTTATC AGAACAACCA TTGGTGAAAT TGCTGTAGAA AGAAGACACC AGTCTATAGC CTTTAGCTCT CTTACTTTAT TATGGAGTAC AGGAGCAATT GTTGGATCTT GGTTGGGTGG AGTTTTGACA GACACAGAAA ACTTACCAGA GCAAATTGGA CAGGGTCCCA AGGGCTCTAG TCTATTAGAA AGATACCCGT TTGCACTTTC CAACATCGTA GTAGCAGGAG TTTTGTGCAC TAGCATTGTA ATTGGGTGGC TTTTCTTCGA AGAAACCCAC GAACATAAAC GATTTGATAG AGACAGAGGT CTTGAAGTGG GGGATTATAT TAGATCGAAA CTTGGTCTAG AACAGCCTTT AAGACCATGG AGGAAATATT CCAATGTCTA CGACCAACGT CGCCGTCCAG AAAGACTAAT GAGTGATTAT GATGGAAATG AAATGGACAG AAGTAGCAAT TCTTCGGAAA GTATAGAGCT ACAATTGTAT TCGCTGATAG ATCCAGACGC AGAAAGAGCA GGAGCAATTC CTGGCTCAAA GCCTACTCAT GTAGATTATG TCGGCGCCTT TACCTGGCCA GTCATAAATA CCATCCTCAG TCATTTCATA CTATCTTTCC ATAACTTGGT GTATTCCGAA TTCCTTCCTG TACTTCTTGC AGGCAAGATT CAGCTTAAGG ATTTGCAATT TCCATTCAAG ATCAAGGGTG GCTTTGGGTT CTCCTCTGAT ACAATTGGGA TGATCCTTTC GCTCACTGGG ATAGTTGGGA TCTTGGTGGT AATATTTGTC TTCCCCATTA TAAACACCTA CTTCAGTACT ATAAATGGGT ACCGAGTGGC ACTTATATCT TTCCCCATTT CACTTGTAAT TCTACCACTA TTGGTGTTTA CACTTCCAGA ATACAACTCT CATATTCCGA ACAAGTTCTT TACAGGAGTT TGTTTGTATA TGATTACAGG TTTGAATACG TTTTCTGGTG CTACAGCATT CTCCCAGATC ATCATCTTAA TCCATAGAGC TCTGCCCAAG AAGTACCGTG CACTCATCAA CGGCTACACG TTGAGCATCA CAGCACTAGC TAGATGTCTT GCACCCATTA TCTGGGGCTG GATCATCTCC AAGTTTGACC AGATGGGCTA CAGCGGAGTA TCGTGGTGGC TCTTGTCGTG TATAGCCATA GGTGGCTTCT TCCATTCGTT CGTACTAGAG GACTACCAGG AGGAGATTTA G
|
Protein sequence | MQGLTVREQL EDFPVFQMSI ISIIRIAEPV AFTSMFPYIY FMIKQFGVAA KEADISRYSG YLASAFSFSQ FLCSVSWGRA SERFGRKPVL LVGLMGTAIS MIVFGFSTNF YIAFLARLMM GSLNGNVSVI RTTIGEIAVE RRHQSIAFSS LTLLWSTGAI VGSWLGGVLT DTENLPEQIG QGPKGSSLLE RYPFALSNIV VAGVLCTSIV IGWLFFEETH EHKRFDRDRG LEVGDYIRSK LGLEQPLRPW RKYSNVYDQR RRPERLMSDY DGNEMDRSSN SSESIELQLY SSIDPDAERA GAIPGSKPTH VDYVGAFTWP VINTILSHFI LSFHNLVYSE FLPVLLAGKI QLKDLQFPFK IKGGFGFSSD TIGMILSLTG IVGILVVIFV FPIINTYFST INGYRVALIS FPISLVILPL LVFTLPEYNS HIPNKFFTGV CLYMITGLNT FSGATAFSQI IILIHRASPK KYRALINGYT LSITALARCL APIIWGWIIS KFDQMGYSGV SWWLLSCIAI GGFFHSFVLE DYQEEI
|
| |