Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_78107 |
Symbol | |
ID | 4839603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 248169 |
End bp | 250364 |
Gene Length | 2196 bp |
Protein Length | 653 aa |
Translation table | 12 |
GC content | 48% |
IMG OID | 640390918 |
Product | predicted protein |
Protein accession | XP_001385064 |
Protein GI | 150865729 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGTCTATATT GTCCAACTGT TCTCTACGGC CGGCAATTAT ACGTTTCCCA CTACATTACC GATATCGCTG TCACCTTGCA TTGACTCCGA TAAGCTCTGT GATTATTTTT TCAGATTCTT GACTTTTTCA TTTTCAAAAT CTTGACTTTC ATTTTCAGAT TTAGTTTCAA CTTTTCAGAA ATCTCGTAGA CCATGTCAGG ACTTGAATTG CTTGCTGCCG GAATCTTGGG CACGCTGTTG TTGGAGGCCA AGTACCACTT GCTGGAAGAC TTGTCGATTT TGGCCAAGAT CATCCCCAAT CTTCCCTACT TGTACCATGC TAAGCACGGA AAGGCCAGTT ATTGGTACAA GTTTGAAAAA ACGGCCTTGT CCAAGCCAAA CAACACGGCA ATTGCCTTCC CCAGGCCCAA GGCCAACCCT CCGCCAATCA AGCTTGACTC AGAAGGCTTT AAAATTTATG ACGACCAGTT CACCACAGAA ACCTACACCT ACAAGGAGTT GTACAACATC ATCTTGAGGT TTTCGCACAT CTTGAAGTAC GACTATGGGG TCACCGCTCA AGACACCATC GGTGTTGACT GTATGAACAA GCCATTGTTT ATCTTCCTCT GGTTCGCGTT GTGGAACATC GGTGCCACGC CTGCGTTCTT GAACTTCAAC ACGAAGGACA AGCCATTGGT ACACTGTTTG AAGATCGCCA ATGTCTCGCA GGTCTTCATT GACCCCGACT GTGCGGGTCC TATTAGAGAC ACCGAGGAAC TCATCAAGCA GGATGCGCCA ACTTGTAAGC TACACTACAT GAACGAACCG GAGTTGTTGA AGGTCTTGAC AGACCCTTCC ACACCCAAGT ACAGAGCTCC TGACAACACC AGAAACCCAC AACACCAGGA CTACGACTGT TGTGCCTTGA TCTACACCTC CGGTACCACC GGTTTACCCA AGGCCGGGAT CATGTCGTGG AGGAAAGCTT TTTTGGCTGG TGTGATGTTT GGGCACATTG TCAAGATCAA AGATCTGTCC AACGTCTTGA CGGCGATGCC ATTATACCAC TCGACTGCTG CTATGTTGGG AGTGTGTCCC ACACTTTTGG TTGGTGCTAC GGTTTCCATC TCCCAGAAGT TCTCAGCAAC TTCGTTCTGG ACGCAGGCTC GTTTGGTCGG AGCCACGCAT ATCCAGTACG TGGGTGAAGT GTGTCGTTAT TTGTTGCACG CTAAACCACA CCCTGATCAG GATAGACACA ATGTAAGAGT TGCATACGGT AACGGTTTGC GTCGTGACAT CTGGCAGGAG TTCAAGAAAC GGTTCCATGT GGAAGCTGTG GGTGAGTTCT ACGCCTCCAC TGAGTCGCCC ATTGCTACTA CCAACATGCA ATACGGTGAG TATGGAGTTG GTGCCTGCCG TAAATTTGGA ACTATTGCCA GTGCACTCTT GAGCACCCAG CAGACGTTGA TCAAAATGGA ACCTGATGAC GAAGAGGAGG TCTACAGAAA CCCGAAAACT GGATTCTGTG AAGTGGTTGG GATCAACCAG CCAGGTGAGT TGTTGATGAA GATCCAGAAC CCGCAGGACA CCAAGGCTTC GTTCCAGGGC TATTACGGTA ACAAAGGTGC CACATCAAAG AAGATCCTCA GAGACGTATT CAAGAAGGGC GACGCCTGGT TCAGAAGCGG AGACTTGTTG AAGTGGGACG AGGACAGCAT GTTGTACTTC GTAGACCGTT TGGGAGATAC ATTCCGTTGG AAATCGGAAA ACGTCTCTGC AACTGAGGTT GAAAACGAGT TGATGGGATC CAAGACCATC AAGCAGAGTG TCGTCGTCGG TGTCAGAGTC CCTAACCACG AAGGTAGAGC TTGTTTTGCT GTGTTGGAGC CTCTTGACGA GTTTGCTGAC GAATCCAAGC ACGCCGAAGC CTTGAAAAAG ATTTACAATC ATGTAATCCA CACGTTGCCC AAGTACGCCA TTCCACAGTT CATCAAGATC AGTGGAATAG AGGCTTCTCA CAACCACAAG GTTCCTAAGA ACCAATTTAA GAACCAGAAG TTGCCCAAGG GTGAGTCTGG CACGGAACTC ATTTACTGGT TAAATGGCGA CAACTACGAA GAGTTGACGG AAGAGGCCTG GGCTACAATT ATCAGTGGAG ACTCCAAGTT ATAGATGTTT ATATATGTTA TAGAAACTAT ATGTCTATTT TGTTAA
|
Protein sequence | MSGLELLAAG ILGTSLLEAK YHLSEDLSIL AKIIPNLPYL YHAKHGKASY WYKFEKTALS KPNNTAIAFP RPKANPPPIK LDSEGFKIYD DQFTTETYTY KELYNIILRF SHILKYDYGV TAQDTIGVDC MNKPLFIFLW FALWNIGATP AFLNFNTKDK PLVHCLKIAN VSQVFIDPDC AGPIRDTEEL IKQDAPTCKL HYMNEPELLK VLTDPSTPKY RAPDNTRNPQ HQDYDCCALI YTSGTTGLPK AGIMSWRKAF LAGVMFGHIV KIKDSSNVLT AMPLYHSTAA MLGVCPTLLV GATVSISQKF SATSFWTQAR LVGATHIQYV GEVCRYLLHA KPHPDQDRHN VRVAYGNGLR RDIWQEFKKR FHVEAVGEFY ASTESPIATT NMQYGEYGVG ACRKFGTIAS ALLSTQQTLI KMEPDDEEEV YRNPKTGFCE VVGINQPGEL LMKIQNPQDT KASFQGYYGN KGATSKKILR DVFKKGDAWF RSGDLLKWDE DSMLYFVDRL GDTFRWKSEN VSATEVENEL MGSKTIKQSV VVGVRVPNHE GRACFAVLEP LDEFADESKH AEALKKIYNH VIHTLPKYAI PQFIKISGIE ASHNHKVPKN QFKNQKLPKG ESGTELIYWL NGDNYEELTE EAWATIISGD SKL
|
| |