Gene Information |
Locus tag | PICST_36940 |
Symbol | |
ID | 4840472 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 333467 |
End bp | 337027 |
Gene Length | 3561 bp |
Protein Length | 1157 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640391787 |
Product | predicted protein |
Protein accession | XP_001386266 |
Protein GI | 150866611 |
COG category | [V] Defense mechanisms |
COG ID | [COG1132] ABC-type multidrug transport system, ATPase and permease components [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain |
TIGRFAM ID | |
|
|
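The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the coordinates are 1-based and inclusive, that the coding sequence is the protein's codons plus one stop codon, and that the gene span contains no untranslated region. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 333467
end_bp = 337027
protein_length_aa = 1157

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding sequence = one codon per residue plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# Assumption: no UTR inside the gene span, so the remainder is intronic.
non_coding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, non_coding_bp)  # 3561 3474 87
```

Under these assumptions the computed span matches the listed gene length of 3561 bp, and the 87 bp left over after accounting for the 1157 aa protein is consistent with the gene being marked as spliced.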
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.86553 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTCAG TTGCTTTCAT ACAATCAATC TTTTTCAATG AATACTTGTT GAAGAATCTT GAATTGGGTT TGGGTGTTCG AGCATCCCTC ACGTCTTTGA TTTACCAGAA ATCTTTGAAA CTATCTTCTG AAGCTCGCCT CAAGGTCTCT TCGGGTGATA TCATAAACTT GATGTCTGTA GATGTGAACA GAGTTCAGAG TGTCAGTCAA AATATCAGTA CTTTGGTCCT TGCTCCAGCT GATATTGTCA TGTGTATTAT CTCCTTGTGG CCATTGTTAG GAAAGGCCAC AATGGCAGGT GTATTTACTA TGATCTTATT GATTCCTCTT AACAGTGTTA TTATCAAATA TCTGAGAAGA TTGAATAAAA CCCAGATGAA ATTGAAAGAC AACAGAAGCA GAATCATAAA TGAAATACTT GTTTCAATCA AAAGTATCAA GTTGTATGCT TGGGAAAAGC CGATGTTAGC TAAATTGAGA GAAGCTAGAA ATGAGAAAGA ATTGAAAAAC TTACGCAAGA TTAGAATTGT AAACCAATGT GCACTGTTGG TTTGGAATTT GATTCCATTT CTTGTTTCGT TTACCTCTTT TGCTACTTTT GCATTGACTC AAAACATTCC CTTGACATCA GAAATAGTCT TTCCTGCTTT GGCAATTTTG AATTTGTTGT CTTCGCCTTT GCTTCAGCTT CCTGCCACCA TAACAAACAT TATTGAAGGA TCTGTGGCTA TTGATAGAAT AAAAACATTC TTAACAAGCT CAGAAGTTGA CGAGTCCTTG TTGAACCACA TGCCGCATCC AGCTAAAGAA AATGAAGTTG CTATCAGTAT CGAAAATACT TCTTTCTTGT GGTCGCAGGG AACGTATTCC GATGACACAA CTGATACCCG TAGGTTTGCC TTGAAGGACA TTAACTTTTC TGTTCGCAGA GGAGAGTTAT CTTGTATTGT AGGCAAAGTT GGAAGTGGGA AATCGTCACT TCTTTACTCT CTACTTGGGC AGCTAATTAT GGTTAATGGA GAAGGCAACG GCGTTCCTGC TGTTAATATT AAAGGAACTA TTGCCTATTG TGCACAACTG CCTTGGATAA TGAATGCCTC TGTTAAGGAG AATATTCTAT TTGGATGTAG GTATGAGAAA GACTTTTATG AGAGAACCTT AGATGCTTGT CAGCTATTGC CAGACTTAGA AGTGTTACCT GATGGTGACG ATACGCAAGT TGGAGAGAAG GGTGTTTCTT TATCAGGGGG TCAGAAAGCA AGATTGGCTC TAGCACGAGC AGTATATGCT CGTGCAGACA TCTATTTATT TGATGATATT CTTAGTGCCG TTGACTCTCA TGTTGGGAAG AAGATAATCC AGAAAGTTTT GAGCAAATCA GAAGGATTGT TGGCTCATAG TACAATAATT CTTTGCACGA ACTCTATTTC TGTATTGAGT TATTCTGACA ACGTTACTTT AATTGAAAAG GGCCATATCA TCGAAACCAC AAGCTACGAA GATATTAAGT TAGGAAACCA CCCTAAATTG TTCGACTTGA TTTCCGAATT TGGAAATAGT GATATTTCCA AAACTCCCTC AGTTTCCGAA AGCAATTTCA ATGTGTTACG GGAGGGAGAT CTGCCTTTAT TAGATATTGA CAAAGAGCCA GAAAATGATT ACTCACGGGA AAATCTATCA TTGCTTACAA GAGCTGCTAG TATTGAGACC TTAAGGTGGG ACCCACTTAA GAAGTTATTA CCCAACTTGC GTAGTGGTCA AATCACTGAA GAATCACAAA AAGGTAAGGT TAAATGGTCC 
GTCTACCATG CCTATGCGAG AGCCTGTTCT ATTCCAGGAG TTGCTGCTTG GTTTGGCTTA TTGATCCTTG CATCATTTGT TTCTGTTGGC GGAAACTATT GGCTCAAGTA CTGGACCGAA AAGAATTCTC AATCAGGAAA GAACGTCAGT GTTTGGAAGT TTATCACCGT CTACGCTATA TTTGGCTTTG GTGCCAGTAC AATGTCGGTT TTGAGAAGCT CTGTCATGAT GTTATGGTTG GCTATAAATG CATCAAGGGA GATTCACGAT ATGATGGCGA CGAGAATTCT TCGTGCTCCA ATGGACTTCT TCGAGAGAAC ACCCGTAGGA AGAATAATGA ATCGATTCAC TAATGATATG AATAGAGTAG ATGATTCTAT TCCAGGTGTT TTCCAAGGTT TTGTTGTTCA ATCAATAAGC GCGTTGATTA CTTTCGGTGT AATAGGCTTT GTAATGCCAT TCTATATCAT TGTCATCGCA GTTCTTTCTT TGGGATACGT TTACTATGAT GTCTATTACA TTGCCTTATC AAGAGAATTA AAGAGGTTAG TCTCCATATC AAGATCTCCA ATCTATGGTC ATTTGGGTGA AAGTTTGAAT GGACTTGACA CAATTAGGGC ATACAATCAA GGTGTGAGAT TTGATTTCAT TAATAATGCC AATGTGGATT GTAATCTTCA AACGCAGTAT ATGTTGAGAT CGATTAACAG ATGGTTGATG TTCAGATTGC AGTTGATCGG GTCTCTAGGA GTGCTCGGTG CTGGATTGTT GGCGTTAATG ACAATCTTTA CAGCTTCTCC GTTAACATCT TCTATGGCAG GGTTCATAAT GACCTACGCC TTGGAGGTGA CTGTCTCATT GAAGATGATG GTGAGACAGT CGGCAGAAGT GGAAACAAGT ATTGTTGCTG TGGAGAGATG TTTGGAATAT AGTACTCTCC CTGTTGAAGA AGATATAGAA AATAAGACCT TGATTGTCCC ACCAATTCAG TGGCCAAATC GAGGTTCCAT AGAATTTGTC AATTATTCTA CCAGATATAG AGCAAATCTT GATCTTGTCT TGCGAAACAT ATCCATGATT ATAAATTCCG GAGAAAAGGT TGGTATTGTC GGTAGAACCG GAGCTGGTAA GAGTTCGCTT GCCTTGTCTA TTTTCAGAAT TATCGAGGCT GTAGAAGGGA ACATCAATAT TGATGATATA GACACTGGCT CAATTTCATT GTACGACCTA AGACATAGAC TAAGCATTAT TCCCCAAGAT TCGCAGTTGC TAGAAGGTAC TGTCAGACAG AACCTTGATC CCTTCAACTA CTATACTGAC GAAGAAGTGT GGAAGGCATT GAAATTGGCG CACCTTAAGG ATCATATCGT TAACTTGAAA GAGACTGAAG GTGAAACACC AGAATCAAAG TTGGATTGTA AAGTGTATGA AGGAGGATCC AATTTTTCAC TGGGCCAAAG ACAATTGATG TCTCTTGCCA GAGTTTTATT GAAGATGACC AACTCGAAGG TGTTGGTGTT AGACGAAGCT ACAGCAGCCG TCGATGTCCA AACAGATAAG ATTATTCAGG AGACGATCAG AGCTGAATTC AAGGACAAAA CGATCATCAC TATCGCGCAC AGATTAGAAA CAGTTATGGA CTGTGACAGA ATTGTAAGTT TGGACAAAGG AGAGCTCAAA GAATATGATA GTCCACAGAA TCTCTTGAAG AACGAAAAAA GTATATTCCA TAGTCTCTGT AAGCAGGGTG GATATATATA A
|
Protein sequence | MFSVAFIQSI FFNEYLLKNL ELGLGVRASL TSLIYQKSLK LSSEARLKVS SGDIINLMSV DVNRVQSVSQ NISTLVLAPA DIVMCIISLW PLLGKATMAG VFTMILLIPL NSVIIKYSRR LNKTQMKLKD NRSRIINEIL VSIKSIKLYA WEKPMLAKLR EARNEKELKN LRKIRIVNQC ASLVWNLIPF LVSFTSFATF ALTQNIPLTS EIVFPALAIL NLLSSPLLQL PATITNIIEG SVAIDRIKTF LTSSEVDESL LNHMPHPAKE NEVAISIENT SFLWSQGTYS DDTTDTRRFA LKDINFSVRR GELSCIVGKV GSGKSSLLYS LLGQLIMVNG EGNGVPAVNI KGTIAYCAQS PWIMNASVKE NILFGCRYEK DFYERTLDAC QLLPDLEVLP DGDDTQVGEK GVSLSGGQKA RLALARAVYA RADIYLFDDI LSAVDSHVGK KIIQKVLSKS EGLLAHSTII LCTNSISVLS YSDNVTLIEK GHIIETTSYE DIKLGNHPKL FDLISEFGNS DISKTPSVSE SNFNVAASIE TLRWDPLKKL LPNLRSGQIT EESQKGKVKW SVYHAYARAC SIPGVAAWFG LLILASFVSV GGNYWLKYWT EKNSQSGKNV SVWKFITVYA IFGFGASTMS VLRSSVMMLW LAINASREIH DMMATRILRA PMDFFERTPV GRIMNRFTND MNRVDDSIPG VFQGFVVQSI SALITFGVIG FVMPFYIIVI AVLSLGYVYY DVYYIALSRE LKRLVSISRS PIYGHLGESL NGLDTIRAYN QGVRFDFINN ANVDCNLQTQ YMLRSINRWL MFRLQLIGSL GVLGAGLLAL MTIFTASPLT SSMAGFIMTY ALEVTVSLKM MVRQSAEVET SIVAVERCLE YSTLPVEEDI ENKTLIVPPI QWPNRGSIEF VNYSTRYRAN LDLVLRNISM IINSGEKVGI VGRTGAGKSS LALSIFRIIE AVEGNINIDD IDTGSISLYD LRHRLSIIPQ DSQLLEGTVR QNLDPFNYYT DEEVWKALKL AHLKDHIVNL KETEGETPES KLDCKVYEGG SNFSSGQRQL MSLARVLLKM TNSKVLVLDE ATAAVDVQTD KIIQETIRAE FKDKTIITIA HRLETVMDCD RIVSLDKGEL KEYDSPQNLL KNEKSIFHSL CKQGGYI
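The record lists translation table 12 (the alternative yeast nuclear code), which differs from the standard code in the sense codons by reading CTG as serine rather than leucine. The following minimal sketch shows the effect on two short fragments copied from the gene sequence above. The helper `translate` and the table names are hypothetical and written for illustration; this is not the database's own translation pipeline, and it does not handle introns.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

STANDARD = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD_AA)}
# Table 12: identical to the standard code except CTG encodes serine.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna, table):
    """Translate an in-frame DNA string; spaces are ignored."""
    dna = dna.replace(" ", "").upper()
    return "".join(table[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First 30 bases of the gene sequence.
first_ten = translate("ATGTTTTCAG TTGCTTTCAT ACAATCAATC", TABLE_12)

# In-frame fragment of the gene sequence containing a CTG codon.
with_standard = translate("AAATATCTGAGAAGA", STANDARD)
with_table_12 = translate("AAATATCTGAGAAGA", TABLE_12)

print(first_ten, with_standard, with_table_12)  # MFSVAFIQSI KYLRR KYSRR
```

The table 12 result, KYSRR, agrees with the listed protein sequence at that position, whereas the standard code would place a leucine there.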
|
| |