Gene Information |
Locus tag | PICST_88728 |
Symbol | |
ID | 4837938 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1267661 |
End bp | 1271344 |
Gene Length | 3684 bp |
Protein Length | 1141 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640389253 |
Product | predicted protein |
Protein accession | XP_001383527 |
Protein GI | 150864628 |
COG category | [R] General function prediction only |
COG ID | [COG0488] ATPase components of ABC transporters with duplicated ATPase domains |
TIGRFAM ID | |
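The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (an assumption, though it reproduces the listed gene length) and compares the gene span with the bases needed to encode the listed protein plus a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1267661
end_bp = 1271344
protein_length_aa = 1141

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Bases needed to encode the protein plus one stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span that are not needed for the listed protein.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)
```

The span works out to 3684 bp, matching the Gene Length field, while the protein needs only 3426 bp. The remaining 258 bp are consistent with the "Is gene spliced | Yes" flag, although the record itself does not give exon boundaries.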
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.690875 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.12515 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGCAAT CGTTTGATGA CTTCTTGAAA CAGCAGCAGG GCCAAAAAGG CTCTGCTGGC CGTGGTAATT CTAGATTCAC CAATAACCAG TATAATAGCC AGCACCAAAA TAACCAGTAC CCAAATAATC AGTACCAGAA TAACCAGTAT AATAATCAAT ACAATAACCA GTACAATGCT GGAAATCAGG GTTATGCTTC TAACTACAAC CAATTGAACC AAGGTTTTGG CAACAATCAG TACAACCGTG GAGGTTTCAG AGGCAACTAC CAGGCCGGTG GATCTTTTCC ACAGACCCCC AACGAGTCTT TGCCCAATTC TGGAACTGCC ACCCCTAAGA CTCCTAATGC AAACGCTTCA AGTACTTCGT TGAATTCTCT TTCGTCTGCC TTAAGTAAGC TCAATGTAGG AGACGTTCCC TTCCAGGAAA ACTTGGCCAA TCTTGACAAG GCAGCAAAGA TTGCCGACGT CAGACCAGAA GTGGATGCTA TCACTGAGAC ATTTGTTGTC GAATCTATAT CCTCCGTCAA TGAGTACAAG CTCAACAATA TTGTCAAGTC TTTGGCCAAG CCCAAGAACT CAGCTTTTGT CAGAGAAGCT TCGCTTTTGA TTATCCAGCA ATTGGCTGTG AAGCTTGGGG GTCAAACTCC TAAGGAGTCG TACTTGGTCC AGTTTTTCCT GACCGCATAC GACTTATTTG CTGATAAAGA CAAAAATGTT GTCAAAGCTG CAAAGTCTGC CACCGACACT TTATACGGGA TCTTTCCCGT AGAAGCTCTT GGTACAATTG TGCTTGACGA ATTTTTAAAA TACTTGTCGT CGTCGGCTAA GTGGAATTCG AAGGTCGCCG CTTTGGCCAC TTTTGATAGA TTGGTAGAGG AAGTTCCTGC TGATTTATTG GAATTGACAT TTGTTAGAGC CGTTCCTGTC TTAACAGACT TGGCTACGGA TTTCAAACCT GAATTAGCCA AGCACGGTTT GTCCTCTTTG AAGAAATTTG TCAAAGTATT AGACAACTTA GATTTGCAGA ACAAGTACGA TTTGATTGTA GAGACCTTGG CTGATCCCTC TAAGGTCACC GAGTGTATCA AGAACTTGTC TTCTGTCACG TTTGTAGCTG AAGTGACAGA ACCTGCTTTG TCATTGTTGG TTCCGATTTT GGACAAGTCT TTGAAGATGT CATCCTCTTC CAACGATCAG TTGAGGCAAA CTGTTACTGT TACAGAGAAC TTGACCAGAT TGGTTAACAA CAAGAGAGAA ATCGAAACGT TCATCCCCAT CTTGTTGCCT GGTGTAGAAA AAGTCGTCAA CAATGCATCT TTGCCTGAAG TGAGAGACTT GGGTGCCAAA GCCTTGAAGG TGTTGAAGGA TGCTGAAAGT GAGCAGACAG ACGGAAAGTT CCATGGCAGA ATCACTTTGG AACAAGCCCA AAAATTCTAT ATTGAAAACC TCGACTCTGA GTCTGCAGCC ATTGTACATA CTTTGGACTT TTCTGACGAC ATCATAGCTC AATACTTGAG CAAGGTCTTG CAAGCCGATG CTCATGTCAA TGATTGGAAG CGTTTGAAGG AATACTTAGA GTTACTTGTA AACACTTCTG AATCCATTTC TGAAGAGCAA AAGAATCAAT ATGTCGACAA GGTTATTGAA AACTTGAGAA ACTTGTACGA CGCCGACACT GGCAAGTCCA ACGAAGACGA CGATGGTGCC ATAGAGATCG TCAACGCCGA CTTCTCGTTG GCATATGGGT CTCGTATGTT GTTGAACAAG ACAACTTTGC GTCTTTTGAA GGGTCACCGT 
TATGGTTTAT GTGGAAGAAA CGGTGCTGGT AAGTCAACTT TGATGAGAGC TATTTCCAAG GGACAATTAG AAGGATTCCC ATCAGCTGAC GAGTTGCGAA CTTGTTTTGT TGAACACAAA TTACAGGGCT CCGAAGCCGA TATGGACTTA GTGAGCTTCA TTGCTTCGGA TCCTGAGTTA GCTGGTATTG AAAGAAGCCA AATCTCAGAA GCTTTGATCA ATGTTGGTTT CACTCAAGAA AGATTGGAGC AACAAGTCGG TTCTCTTTCT GGTGGTTGGA AGATGAAGTT GGAGTTGGCC AGGGCAATGT TGATGAAGGC AGATGTGCTC TTATTGGATG AACCAACCAA TCACTTGGAT GTTGCTAATG TCAAATGGTT GGAAGATTAC TTGGTGGAAC ACACTGAGAT TACGTCCTTA ATTGTTTCGC ATGACTCCGG TTTCTTGGAT GCTGTTTGTA CTGACATTAT CCACTACGAA AACAAGAAAT TGGCTTACTA CAAGGGTAAC TTGTCTGAGT TCGTCAAGGT CAAGCCAGAA GGAAAGTCTT ATTACACTTT ATCTGACTCT GTAGTCCAGA TGCACTTCCC ACCACCTGGT ATTTTAACCG GGGTTAAGTC CAACACTCGT TCTGTGGCCC GTATGTCTAA CGTTACTTTC TGCTATCCTG GTGCTGCTAA GCCTCAGATG AAGAACGTTT CCTGTTCGTT GTCGTTGTCG TCTAGAGTTG GTATCATTGG TCCTAATGGT GCTGGTAAGT CCACTTTGAT CAAGTTATTG ACTGGGGAGT TGGTTCCTAA CGAAGGTAAG GTTGAGAAGC ATCCTAACTT GAGAATCGGA TATATTGCCC AGCACGCTTT GCAACACGTA GAACAGCACA AGGAAAAGAC TGCCAATCAG TACTTGCAAT GGCGTTACCG TTTCGGGGAC GATCGTGAAG TCTTGTTGAA GGACTCGCGT AAGATTTCTG AAGAGGAAAA GGAGCAGATG GCTAAGGAAA TAGATGTCGA TGACGGAAGG GGACCAAGAG CCATCGAAGC TATTGTAGGT AGACAGAAGT TGAAGAAGTC GTTCCAGTAC GAAGTCAAGT GGAAGTTCTG GTTGCCCAAA TACAACTCGT GGGTACCCAG AGAAGTTTTG TTGGAACACG GATTCGACAA GTTGATCCAG AACTTTGACG ATCACGAAGC TTCTAGGGAA GGTTTGGGTT ACAGAGAGCT CACACCATCT GTGATTAGAA AGCATTTCGA AGACGTCGGT TTGGATGGTG ACATTGCTGA CCACACTCCT ATGGGCTCAT TGTCTGGAGG TCAATTGGTG AAGGTTGTCA TTGCTGGGGC CATGTGGAAT AACCCCCACT TGTTAGTCTT GGATGAACCT ACCAATTATT TGGACAGAGA CTCCTTGGGA GGTTTGGCCA TGGCTATCAG AGAATGGGCT GGAGGTGTGG TGATGATTTC ACACAACAAT GAATTCGTCG GAGCCTTGTG TCCTGAACAG TGGCACGTAG CAAACGGTGA GTTTATCCAG AAGGGCACTG TAGCTGTAGA CAATGCTCGT TTCGAAGATC AGGGAGGAGA CTCTACTGGT ACTTCTCCTG CCATCTCTAG ATCGTCAACT CCTAGACCAG ATGATGATGA TTCTCCTGCC AACATCAAGG TTAGGCAGAG AAAGAAGAAG ATGACAAGAA ACGAAAAGAA GTTGCAGGCT GAAAGAAGAA GATTGAGATA CATTGAATGG TTGTCGTCTC CAAAGGGTAC TCCAAAGCCT ATTGATACTG ATGACGAAGA TGAGTAGACA GTTTGTTTAC 
TATGATAGAA TAGCATAGCA GAATTTATTC CTTATTTAAT TTGACAAAGT CGGATAATAG ATTATTAATG TTCA
Protein sequence | MSQSFDDFLK QQQGQKGSAG RGNSRFTNNQ GNYQAGGSFP QTPNESLPNS GTATPKTPNA NASSTSLNSL SSALSKLNVG DVPFQENLAN LDKAAKIADV RPEVDAITET FVVESISSVN EYKLNNIVKS LAKPKNSAFV REASLLIIQQ LAVKLGGQTP KESYLVQFFS TAYDLFADKD KNVVKAAKSA TDTLYGIFPV EALGTIVLDE FLKYLSSSAK WNSKVAALAT FDRLVEEVPA DLLELTFVRA VPVLTDLATD FKPELAKHGL SSLKKFVKVL DNLDLQNKYD LIVETLADPS KVTECIKNLS SVTFVAEVTE PALSLLVPIL DKSLKMSSSS NDQLRQTVTV TENLTRLVNN KREIETFIPI LLPGVEKVVN NASLPEVRDL GAKALKVLKD AESEQTDGKF HGRITLEQAQ KFYIENLDSE SAAIVHTLDF SDDIIAQYLS KVLQADAHVN DWKRLKEYLE LLVNTSESIS EEQKNQYVDK VIENLRNLYD ADTGKSNEDD DGAIEIVNAD FSLAYGSRML LNKTTLRLLK GHRYGLCGRN GAGKSTLMRA ISKGQLEGFP SADELRTCFV EHKLQGSEAD MDLVSFIASD PELAGIERSQ ISEALINVGF TQERLEQQVG SLSGGWKMKL ELARAMLMKA DVLLLDEPTN HLDVANVKWL EDYLVEHTEI TSLIVSHDSG FLDAVCTDII HYENKKLAYY KGNLSEFVKV KPEGKSYYTL SDSVVQMHFP PPGILTGVKS NTRSVARMSN VTFCYPGAAK PQMKNVSCSL SLSSRVGIIG PNGAGKSTLI KLLTGELVPN EGKVEKHPNL RIGYIAQHAL QHVEQHKEKT ANQYLQWRYR FGDDREVLLK DSRKISEEEK EQMAKEIDVD DGRGPRAIEA IVGRQKLKKS FQYEVKWKFW LPKYNSWVPR EVLLEHGFDK LIQNFDDHEA SREGLGYREL TPSVIRKHFE DVGLDGDIAD HTPMGSLSGG QLVKVVIAGA MWNNPHLLVL DEPTNYLDRD SLGGLAMAIR EWAGGVVMIS HNNEFVGALC PEQWHVANGE FIQKGTVAVD NARFEDQGGD STGTSPAISR SSTPRPDDDD SPANIKVRQR KKKMTRNEKK LQAERRRLRY IEWLSSPKGT PKPIDTDDED E
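The record lists translation table 12, the alternative yeast nuclear code, in which CTG encodes serine rather than leucine. The following minimal sketch shows how the block-formatted gene sequence could be cleaned and how short in-frame fragments translate under that table. The helper names (`codon_table`, `clean`, `translate`) are hypothetical, the two fragments are copied from the gene sequence above, and the code does not attempt to handle splicing.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def codon_table(amino_acids):
    """Build a codon -> amino acid mapping from a 64-letter string."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


TABLE_1 = codon_table(STANDARD_AAS)
# Table 12 differs from the standard code only in CTG, which encodes Ser.
TABLE_12 = dict(TABLE_1, CTG="S")


def clean(seq):
    """Remove the spaces and line breaks used for 10-base block formatting."""
    return "".join(seq.split()).upper()


def translate(seq, table):
    """Translate an in-frame DNA fragment; trailing partial codons are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence, as formatted in the record.
first_codons = "ATGTCGCAAT CGTTTGATGA CTTCTTGAAA"
# An in-frame fragment of the gene that contains a CTG codon.
ctg_fragment = "GTCCAGTTTTTCCTGACCGCATAC"

print(translate(first_codons, TABLE_12))
print(translate(ctg_fragment, TABLE_12))
print(translate(ctg_fragment, TABLE_1))
```

Under table 12 the second fragment reads VQFFSTAY, which agrees with the listed protein sequence. The standard code would give a leucine at the CTG position instead.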