Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33002 |
Symbol | |
ID | 4839949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 1187668 |
End bp | 1190769 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640391264 |
Product | predicted protein |
Protein accession | XP_001385580 |
Protein GI | 150866097 |
COG category | [V] Defense mechanisms |
COG ID | [COG1131] ABC-type multidrug transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCCT GGGGAACACT CGCTCTAGCT ACTACGCTAG TTTTGCCATG GTTTCCTGTG GTTCTGGGAG CAGTACGAGT TGACTCTGAA GCTGAACTAT ATAACTCCTT TGATAGGATA AAGCGGGGAG CTGTTATAAA CGATGTTTTT GAAATCAAGC AGGCTCTAAT TGACCAACAA AAAGAGAAGA ACGGTGATAA GGACGAATGT CCACCCTGTT TCAACTGTAA TTTGCCTAAT TTCGAGTGTG TTCAGTTTTC TGAATGTAAC ATATTCACTG GTAATTGCGG CTGTAGAGAT GGTTTTGGTG GGGTTGACTG CAGTGAGCCA TTGTGTGGGG CTCTTTCAGA CGGGAACAAC AATAGACCCG TAAGAAAGAA GGCTACTTGC GAATGCAAAG AGGGCTGGAA GGGTATAAAT TGTAACATGT GTACTGATGA TTCTGTTTGT GATGCTTTCA TGCCCGATGG TTTGAAGGGG ACGTGCTATA AGACTGGTAT TATCGTTAAC AAGTTCCACC AGATGTGTGA TGTAACGAAT CCTAAGATCA TTCTGATTTT GGGCGGAAAG AAGCCTCAAG CCACTTTCAG CTGTAACAAG ACAGCAGAAA ACTGTAACTT CCAGTTCTGG ATTGACGAGC GGGAGTCCTT CTACTGTGAC TTGAACAAAT GTGTTTTAGA CTATGACCTA GTAGCAAACA CTACGAAGTA CAACTGTGAA GAAGTCGCCT GTAAATGTTT GCCTGACAGA ATGTTGTGTG GAGAAGCTGG ATCTATTGAT ATTTCTGACT TCTTGACAGA AACCATTCGT GGACCAGGTG ACTTCACCTG TGATGTTGCT GACAGGAAGT GCCGTTTCTC AGAGCCCAGT ATGAACGACT TGATCCAAAG TGTCTTTGGT GATCCGTACA TCACGTTGAA GTGCAAATCT GGTGAGTGTG TTCATAAAAG TGAGATTCCG GGCTATGAAG TTCCTGATAG AAATAAGCTT ACTTTGAACA ACATCTTGCT ATTAGCAGGA GTTGTTTTGG TTACAGCATT ATTGGTAGCA ACTACTATAC ATAACATCCG TCAATCTCCT TTGTTTAAAG CTGGAATTGG TTCCTTCGAG CCATTAGATG GAGACTCGAG TGCTTTGAAC AATAACTTTA CTCCTACCAA TATTGCGTTT GAAGGTATCA GCTACAGAGT TAGGAGTGGA CAGCAGGTTT TGAATAATGT CTCTGGTAAA ATTGAGCCAG GTGAATGTTT GGCCATTATG GGAGGCTCTG GAGCAGGTAA AACGACTCTC TTAGATATCT TGGCTGGTAA AAACAAGGGC GGTGAAGTTT ATGGAAGCAT CTATGTCAAC GGCAACATAT TGAACCCAGA CGACTACAAG AAAATTGTTG GTTTTGTAGA CCAAGAAGAC CATTTGATTC CTACCTTGAC AGTTTATGAA ACTGTTTTGA ACAGTGCCTT GCTTAGACTT CCAAGGAACA TGACCTTGAG ACAAAAGGAA TCTAGAGTTA TTGAAGTGTT GAACGAGTTA AGAATTTTAA GTATCAAAGA TAGAGTCATT GGCTCCAACT TCAAAAGAGG TATTTCGGGA GGTGAAAAGA GAAGAGTTTC TATTGCTTGT GAAATGGTTA CCTCTCCTTC TATCTTGTTC TTGGACGAGC CCACTTCTGG TCTTGATTCC TATAATGCCA GAAATGTTGT AGAGTGTTTG GTGAAGTTAT CTAGAGACTT CAACCGCACT ATTGTATTCA CTATTCATCA ACCAAGAAGT AATATCGTTT CTTTGTTTGA CAAGTTAATT TTGTTGTCTG AAGGTGATTT GATTTACTCG GGGGATATGA TCAAGTGCAA CGACTTCTTT GCTAAATATG GTTACCAGTG CCCTTTGGGT TACAATATTG CTGACTACTT GATTGACATC ACCATCGACC ATAAAAAGAT TGTTAGAGTA CCATCCGAAG ATGAGATTGC AGAAGAAGGA AGCTCAGAAG GACACGAAGA TATTCACCAA GCTTTTGTTG AAGATACTGC GGGAGAGGTC GACACTACCA GAGAATGGGA ACACTTTGCT GTTCACAGAG ATGAGTACAA CTACGCACCC TTAACTAAAA AAGGATCGAA GGATCAAAGC AAGTATATTC AGATCAAGAA TAAGCTTCCT CAAATCTTTG CTGATTCAGT ATTGGCAATC GAATTACAGA CTGAAATCGA TGAGGCAAAG AATAACCCTG TTCCTCTTGA CTTAAAGAAT CATATGATGA AGAAGGCTAG CTTCCTTAAT CAGATCCTTA TCTTGTCTTC CAGAACATTC AAAAACTTGT ACAGAAATCC TAGATTGTTG TTGACTAACT ATGTCTTGTC TTTGGTGGTT GGTGCATTCT GTGGATATTT GTACTACAGC GTAGCCAATG ATATTAGTGG ATTCCAGAAT AGATTGGGTC TCTTCTTCTT CGTATTGGCT TTCTTTGGTT TTTCGGCATT GACTGGTTTA CATTCATTTT CTTCAGAAAG AATCATTTTC ATCCGTGAAA GAGCAAATAA TTATTACCAT CCATTTGCGT ATTACATCAG TAAGATTGTA TGTGATATTC TTCCTTTGAG AGTTCTTCCT CCCATCTTGT TGATTAGTAT TGCTTATCCA TTAGTTGGTT TGACAATGGA ACATAACGGA TTCTTGAAGG CTATGGTCGT GCTAATCTTG TTCAACGTAG CTGTTGCTGT AGAGATGTTA ATTGTTGGTA TCTTGATCAA AGAGCCAGGT ACTTCGACTA TGATTGGTGT GTTGATCTTA TTGTTGTCGT TGTTGTTCGC TGGTTTGTTC ATCAACAGCG AGGATTTGAA GGTTCAAATC AAATGGCTTG AATGGATTTC GCTTTTCCAT TATGCCTACG AAGCTTTGTC GATCAACGAA GTAAAGGACT TGATATTAAA AGAAAAGAAG TACGGTTTAT CTATTGAAGT TCCAGGTGCT GTAATCTTGA GTACTTTTGG ATTTGATGTT GGTGCCTTCT GGAAGGATGT TGCGTTCTTG GGTGGGTTAT CTGGAGCCTT CTTAGTATTG GGATATCTTT TCTTACACAA TTTTACCATA GAGAAAAGGT AA
|
Protein sequence | MKPWGTLALA TTLVLPWFPV VSGAVRVDSE AELYNSFDRI KRGAVINDVF EIKQALIDQQ KEKNGDKDEC PPCFNCNLPN FECVQFSECN IFTGNCGCRD GFGGVDCSEP LCGALSDGNN NRPVRKKATC ECKEGWKGIN CNMCTDDSVC DAFMPDGLKG TCYKTGIIVN KFHQMCDVTN PKIISILGGK KPQATFSCNK TAENCNFQFW IDERESFYCD LNKCVLDYDL VANTTKYNCE EVACKCLPDR MLCGEAGSID ISDFLTETIR GPGDFTCDVA DRKCRFSEPS MNDLIQSVFG DPYITLKCKS GECVHKSEIP GYEVPDRNKL TLNNILLLAG VVLVTALLVA TTIHNIRQSP LFKAGIGSFE PLDGDSSALN NNFTPTNIAF EGISYRVRSG QQVLNNVSGK IEPGECLAIM GGSGAGKTTL LDILAGKNKG GEVYGSIYVN GNILNPDDYK KIVGFVDQED HLIPTLTVYE TVLNSALLRL PRNMTLRQKE SRVIEVLNEL RILSIKDRVI GSNFKRGISG GEKRRVSIAC EMVTSPSILF LDEPTSGLDS YNARNVVECL VKLSRDFNRT IVFTIHQPRS NIVSLFDKLI LLSEGDLIYS GDMIKCNDFF AKYGYQCPLG YNIADYLIDI TIDHKKIVRV PSEDEIAEEG SSEGHEDIHQ AFVEDTAGEV DTTREWEHFA VHRDEYNYAP LTKKGSKDQS KYIQIKNKLP QIFADSVLAI ELQTEIDEAK NNPVPLDLKN HMMKKASFLN QILILSSRTF KNLYRNPRLL LTNYVLSLVV GAFCGYLYYS VANDISGFQN RLGLFFFVLA FFGFSALTGL HSFSSERIIF IRERANNYYH PFAYYISKIV CDILPLRVLP PILLISIAYP LVGLTMEHNG FLKAMVVLIL FNVAVAVEML IVGILIKEPG TSTMIGVLIL LLSLLFAGLF INSEDLKVQI KWLEWISLFH YAYEALSINE VKDLILKEKK YGLSIEVPGA VILSTFGFDV GAFWKDVAFL GGLSGAFLVL GYLFLHNFTI EKR
|
| |