Gene Information |
Locus tag | PICST_30690 |
Symbol | |
ID | 4838151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 685816 |
End bp | 690564 |
Gene Length | 4749 bp |
Protein Length | 1582 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389466 |
Product | predicted protein |
Protein accession | XP_001383415 |
Protein GI | 150864553 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
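The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length of this unspliced CDS includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(685816, 690564)
print(length, protein_length(length))
```

Under these assumptions the sketch gives 4749 bp and 1582 aa, which agrees with the Gene Length and Protein Length fields.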
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.781261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.243077 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTAG GTTCGGGAGC TGGATCCGGA ACAAGGTCCC GGTCGGCTAC TTTAACTGGA AGATCGGGCC GAGGGAATAT CACAGCAGAC TCCACGATGA TTTCTGCGAC CAGCGTAGGC GCTGGGGCTC CATCAAATAT CGCTGGGGCT CCCGGAGGAA TTCTGGGAAG CTTTCCACAT GACATACCTC CGGATTTGGT GCTATTACTA CAGAAACTCG AAGAGGATTT CACAGTAAAT AAGACAATCG ACCAATGGAC ATATATCAAC CGTCGTGAAG AGATCATGCA ATCGGTTACA AAACTTCACC AACAACAACA GGAGTTGCAG GAACAGCAAA TAGACCCTGA TCTGCTATCA AAGCCATTGT TTCCTCGAAA ACTGGCAGAA TTCTTGAAGG AACATAAATT TCTGCTGGAG AAACTAGTAG ACGTTTTGGC CGGTAGGGCC AATACATATA CTGCAGAGCA GGCTTTTTTA GTGCTTGACT CAAAGGGAAA GGAAGTCAAT TCTATCTCCT GGGAAAAACT CTACTTGAAA TCTGTGAAAA TAGCTTCTGA AATCCGGCAC AAGTCTACAT TGAAAAATGG AGACACTGTC GTGTTGCTTT ATAAGGATGG AGAAGTAGTA GAATTCGTTG TAGCTTTTCT CGGTTGTATC ATGTCAGGTG TAACAGCTAT TCCCATTCAT CAGGACATAT CTATTCATGA AGTTTTGGAC ATTATCAACT TAACTGGCAC CAAACTTGTG CTCTATTCAG AAGTAGTAGC TAAAGAACTT GATCGTCTCA ATGCTCAAAG TCAGAGAATA AACTGGCCCC CTAAACTTTT ACGATGGAGG ACGACTGAAT TTGGGTCCGC AAAGAAGTCT GAACTTTCCC ACTGGCTTTC GCGAGAGAAC CAGAAGAAAA TCGACTTGTC CAAGACGCAG CTAGCATACG TAGAATTCTC CAGATCTCCG GTAGGAGAAT TGCGAGGAAT TGCGCTCAGT TACAGAACTA TATTGCATCA GATGAACTGC CTTAATGTGT CATTACTGAG TTTGCCTGAT TCTGGAGGCG GTCTTCAGCG AAGTTATAAA GAATACAAGA GAAACAAGAA AGTAGTTTTA GCTACTCTTG ATATCCGATT TTCCATCGGA TTAATTCTTG GAGTCTTATT TACAATCTAC TCTGGTAATG TTCTCATATG GGCACCTCAG AGAGTGATGG AAATTCAGGG TTTATATGCC AACATTATTA CCAAGTGTCG TGCTTCGTTG TTGCTAGCTG ACTATATTGG TTTGAAAAGA GTTACCTATG ATTACCAGCA ATCTCCCAAC GCCACGAGGT ATTTTTCAAA AACACAGAGA GTTGATTTTT CCAGTGTAAA GTGGGTCTTG GTCAATGCAT TAACCATTGA TGGAGAATTC ATGGAAATCT TGGCTCTGAG ATACTTGAGG CCTTTGGGAT GTCAGCATCC GCAGAATGCC ATTATACCTA TGTTGACTTT AAGTGAATAT GGTGGAATGG TGATATCATT AAGAGACTGG ATTGGAGGAA GAGATAAAAT GGGCTTGGCT AATACTTCAG AGGAAGATGA CGCAAACTTC AACGACTTAT CTGCTGTTCT AATTGACAAA GAAGCGTTAT CCAGAAATCT TGTCACTATT CTAGATACAA ATCCTCTAGC TTCAGACGAA GTTCCACACA ACGCATTGCG TGTTGATGCC TTTGGATACC CTTTGCCAGA CGCTACACTA GCAGTAGTGA ACCCAGAACT GTCTATCTTA GTACGTAAAG GAGAACTAGG AGAAATATGG 
ATAGATAGTC CTTGTTTATC AGGAGGGTTT TATGGATTGA GGAACGAGTC GAAACTGATT TTCCACGCCA AATGTCGTGA TGCTAACGGT ATGCTTGAGA TGGATTTCTT GAGAACAGGT TTACTTGGAT TCGCTCACAA CGGTAAGGTG TTTGTTCTTG GGTTGTATGA AGATAGAATT AGGCAAAGAG TCAGTTGGAT TGACCAAAAA TTGTATCAGA AGTTGAACAG AGACTTGGTC ATTGGGAACG GTTCGAGATA CCATTATTCT TCTCATTTAT TGGCTACGTT GGCCAGTGAA GTGAGACAAG TCTATGACTG TACGATTTTT GATGTTTTCA TTGGCAATGA ATACTTACCT GTAGCTATCG TGGAAGCTGA AGTTGTACGT AAGCTAGTGG ATGACTCAGG TAATATTGAA CCAAATGCCA ATAGTGGTTC GTCTAGTACA AACAAGCCAG GAGAAGGCGA AGTTCAATTC AATGCCCTTC CACATAACGA ACCTGTTTTG AACACGATTG CCCAGAACTG TTTCGATACA TTGTACAAGA GACACTACTT GAGGTTGTAT TGTGTTGTTG TGGTGGATAT CGATACGTTA CCGAAGATCA TGAGGAGTGG AGGCAGAGAA ATTGCCAACA TGCTTTGTAA GAAGAGGTTC TTAGAAGGAA GTTTGAAAGC TGAATTCGTG AAATTCTTTG TACGCAAATC AATAAGTATG GTTCCTCATG GTGAAGACGT CATTGGTGGC ATTTGGTCGC CATATGTTTC TGAGTTGAGA AATGTAGCAT TACAAAAGTT TCCAAACCAA TTTTCGACTA TTGACTACCG TGAGAAATCT CTCGATGACA AGACTGGAGC TCCTTTGACG GATTTTAAAA CAATTGTAGA TATCTTGAAA TTCAGAGTTG CCAGTTCTGG AGACTCTGTT GCGTTCCAGA ATACTGATAG TTCAGGAAAG GGTTCTAAAC CGTTGACTTG GAAAAAGTTC GAACATCGCG TCTACGCCGT CTGCAGTTAC CTTATTGAGA AGACCAATGT GAAGCCAGGT CAATACGTCA TATTGATGTA TTCATTGTCG GAAGAGTTCG TTGTTGCCGT ATACGCCTGT TTGATTTGTG GTATCATTCC AATTCCCATG TTGCCATTTG ATTCCAACAG GATCGGCGAA GATTTCCCTG CCTTTGTTGG TGTTATTCGT GATTTCGATG TGAGTGAGAT TTTAGTAAAT GATGAAGTGG AAAAATTCTT GAAGAATGGC CCAGTTGCCG ATTCATTGAA GAAGATAGCT CATCGCAAGT CCAAGATTGT CATCAAGAAT ACATTGAAAT TAACCAAGGT ATCCAACATA GCTTCCTTAA ATTCTAAGGT GGCCAAGTAT CAGGCAGCAG TAAACTTCAG AGATGAGAAT ACTACAGCTT TGGTTTGGTT GAATTTCACA TCTGATCACT ATAGAGTGGG TGCCACGTTG AGTCACAAGA ACATTATTGG TATCTGTAAA GTTTTTAAAG AAACGTGTAA CTTATCTTCC AAATCAGCTA TTGTTGGATG TGTTCGTCAT TCTTCGGGAA TTGGCTTTAT TCAATCTGCT TTACTTGGAG TGTTTTTGGG TACCACAACT TATCTCACGT CACCATTCAG TTATGCTGCC AATCCACTTG CTTTTTTCTT ATCCCTTGCT CGTTGCAAGG TTAAAGACGT TTTTGTGACG GAACAGATGT TGAAGTATGC CGCAGTCAAG TTTACACCCA AGGGTTTCAA TTTGACAAAC TTGAAGAACA TGATGATCAG TACTGAAAAT CGAGTTGAAA 
TAGACTTACT TAGAAAGATT GCTAAGGTAT TTCAACCTAC AAAGTTGAGT GCTGCATCGA TGTCAACTGT GTACAACCAT TACTTCAATC CTATCATTGC AACTAGATCA TACATGACAG TAGCTCCGGT GGACCTTTTC TTAGACCCAA TAGCTTTAAG GCAGGGGTAT GTTTCTGTGG TGAATCAAGC CGAAGTTCCA AACGCATTAC ATATACAAGA CTCTGGAATG GTTCCAGTTT GTACTGAAAT CGCCATTGTG AATCCAGAAA CAAAGAGAAT ATGCAAGGAA GGCGAATTTG GAGAAATATG GGTTTGTTCT GAAGCCAACT TGACCTCTTT CACCAATGGT CCCAAGGGCC CTGTAGACAA GTTTAGCGAT ATCCAGTTCA ACGGAATTAT AGCTGATGGA GACCCCAACG TAGTTTATTT GAGAACGGGA GATTTGGGGT TCTTACACCA TATATCAATT ACCAAGAATG GAGGTGACCC CAACGACAAT AATGATATCA CATCGTTCCA GCCACTTTTT GTATTGGGCA AGATAGCAGA TACTTTTGAA GTTATGGGTT TGCACTACTT CCCGATCGAT ATTGAAAACA CTATCGAATC TTGTCATGCA GACATGTTCA AGAATGGCTC ATGTATCTTC AAGTGTGCCG ATTACACTGT GGTTGTATGT GAATCCAAGA GACATAGATA TTTTGCCTCT CTTGTTCCCT TGATTGTCAA CACTGTTTTG AGTAAACATC ACATAGTAGT TGATATTGTG GCCTTCATCA GAAAGGGCGA ATTTCCTATT TCCAGGTTGG GTACCAAACA AAGAGCCAGA ATTGTCGATG CGTGGGTTCA AGGAGTTATT CCTATTATTG CTTCGTATGG TGTAAATTAT GGAGAGAATA GTATGATCAA GTTGGTGAAA GAAATCGACG TTGTTGCCAG AGACCACCCA ATCACGGGTT TAAAGAATCC AGCTTTATCG TACTACGATA CTGATCTTGA AGAGATAGAG GATGTATTTG ACGACAATCG GGAAGGTTTA GTTTTGAATG ATGGCAAATT TCCCGAGGGG GTTGACAGAA GAGCAGAGTT TTCTGTGGGG AACTACAGCA CCAGCACGAA ATCAGAAAGT ACTAGCTAG
|
Protein sequence | MSLGSGAGSG TRSRSATLTG RSGRGNITAD STMISATSVG AGAPSNIAGA PGGISGSFPH DIPPDLVLLL QKLEEDFTVN KTIDQWTYIN RREEIMQSVT KLHQQQQELQ EQQIDPDSLS KPLFPRKSAE FLKEHKFSSE KLVDVLAGRA NTYTAEQAFL VLDSKGKEVN SISWEKLYLK SVKIASEIRH KSTLKNGDTV VLLYKDGEVV EFVVAFLGCI MSGVTAIPIH QDISIHEVLD IINLTGTKLV LYSEVVAKEL DRLNAQSQRI NWPPKLLRWR TTEFGSAKKS ELSHWLSREN QKKIDLSKTQ LAYVEFSRSP VGELRGIALS YRTILHQMNC LNVSLSSLPD SGGGLQRSYK EYKRNKKVVL ATLDIRFSIG LILGVLFTIY SGNVLIWAPQ RVMEIQGLYA NIITKCRASL LLADYIGLKR VTYDYQQSPN ATRYFSKTQR VDFSSVKWVL VNALTIDGEF MEILASRYLR PLGCQHPQNA IIPMLTLSEY GGMVISLRDW IGGRDKMGLA NTSEEDDANF NDLSAVLIDK EALSRNLVTI LDTNPLASDE VPHNALRVDA FGYPLPDATL AVVNPESSIL VRKGELGEIW IDSPCLSGGF YGLRNESKSI FHAKCRDANG MLEMDFLRTG LLGFAHNGKV FVLGLYEDRI RQRVSWIDQK LYQKLNRDLV IGNGSRYHYS SHLLATLASE VRQVYDCTIF DVFIGNEYLP VAIVEAEVVR KLVDDSGNIE PNANSGSSST NKPGEGEVQF NALPHNEPVL NTIAQNCFDT LYKRHYLRLY CVVVVDIDTL PKIMRSGGRE IANMLCKKRF LEGSLKAEFV KFFVRKSISM VPHGEDVIGG IWSPYVSELR NVALQKFPNQ FSTIDYREKS LDDKTGAPLT DFKTIVDILK FRVASSGDSV AFQNTDSSGK GSKPLTWKKF EHRVYAVCSY LIEKTNVKPG QYVILMYSLS EEFVVAVYAC LICGIIPIPM LPFDSNRIGE DFPAFVGVIR DFDVSEILVN DEVEKFLKNG PVADSLKKIA HRKSKIVIKN TLKLTKVSNI ASLNSKVAKY QAAVNFRDEN TTALVWLNFT SDHYRVGATL SHKNIIGICK VFKETCNLSS KSAIVGCVRH SSGIGFIQSA LLGVFLGTTT YLTSPFSYAA NPLAFFLSLA RCKVKDVFVT EQMLKYAAVK FTPKGFNLTN LKNMMISTEN RVEIDLLRKI AKVFQPTKLS AASMSTVYNH YFNPIIATRS YMTVAPVDLF LDPIALRQGY VSVVNQAEVP NALHIQDSGM VPVCTEIAIV NPETKRICKE GEFGEIWVCS EANLTSFTNG PKGPVDKFSD IQFNGIIADG DPNVVYLRTG DLGFLHHISI TKNGGDPNDN NDITSFQPLF VLGKIADTFE VMGLHYFPID IENTIESCHA DMFKNGSCIF KCADYTVVVC ESKRHRYFAS LVPLIVNTVL SKHHIVVDIV AFIRKGEFPI SRLGTKQRAR IVDAWVQGVI PIIASYGVNY GENSMIKLVK EIDVVARDHP ITGLKNPALS YYDTDLEEIE DVFDDNREGL VLNDGKFPEG VDRRAEFSVG NYSTSTKSES TS
|
| |
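The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how the first residues of the protein sequence can be reproduced from the gene sequence. It assumes the standard NCBI layout of a 64-character amino acid string indexed by bases in TCAG order; the `translate` helper is illustrative only.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 12, codons ordered TTT, TTC, TTA, ...
# It differs from the standard code only at CTG (Ser instead of Leu).
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; whitespace is ignored."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        b1, b2, b3 = (BASES.index(b) for b in dna[i:i + 3])
        protein.append(TABLE_12[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTCGTTAG GTTCGGGAGC TGGATCCGGA"))
# A stretch containing CTG, which table 12 reads as serine.
print(translate("ATTCTGGGAAGC"))
```

The first call yields MSLGSGAGSG, the first block of the protein sequence. The second yields ISGS, where the standard code would give ILGS.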