Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32574 |
Symbol | |
ID | 4839882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 135370 |
End bp | 138315 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640391197 |
Product | predicted protein |
Protein accession | XP_001385355 |
Protein GI | 150865937 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGGGT TGAATCAATT CATCGTAGAT ATCCGCAATT CCAAAGATAT CGAAGAAGAA AAGAAGCGAA TCAATTTGGA GCTTAATAAC ATTCAGTCCA AATTCAACTC CAACATCAAC AGCTACCAGA AGAAAAAGTA TGTCTGCAAG TTGATCTACA TCTACTTGAG TGGCTATGCA GATTCAGTCG ATTTTGGTTT AAAGGAGTCT TTCCAGCTTG TGTGCTCCTC CAGCCATTCC GAAAAGCAGT TGGGCTATTT GGCACTTCTG GTTCTCATCA ATAACGACAA ATCCACGCAA TCCACGCGTG ACTTCTTGGA CTCTTTGTTG GACCAAGTGC ATCTGTATTT GATCAAGGAT CTTCAATCGT CCAACGAAGA CACGAACTGT CTTGCTGTTC AATTCATTGC TTCCAACTTC AACTTACCAG AATCCACGAC TGTCAGAGTT AACGAAGCTG ACGAGTCGGC TCCAAAATGG TTGGAGTTGA TAGATATCGT CTACTCGTTT GTCACGTCTC CTATCCACAA GTCTGTTATC AAAAAGAAAG CAGCTATTGC ATTGTACTCC TTGTTGAAGT TATATCCCCA GGTTTTGATC TCTAACAACA ACTGGATTCC TAGATTGCTA TCTCTTGCTG ACGACAAGGA TTACGGCGTA TCCATTGCCA GCATCCCCTT GATACAATTC GTAGTCAAGC TGAAGCCTCA GTTTGTGAAA GCGATTATTC CGGCTATCTC TTTGAAATTG TACAATATCA TCATAGAGAA TAAGTGTCCA GAAGAATACT ACTATTACAA GTCCCCAGCT CCTTGGTTGG TAGTCAAGCT CTTACAGTTG ATAGAATACT TCTTTTTCTT GAGTGACACA AACGACTACG CTGTTTTGTC AATAGCGGAT TTAGACGAAC AAACTCTTAA CAACTTGAGA CTGGTGGTAG CCCAATCGAT TCAAAATGCA TCTCAGCCTA TAAAGGGTTT GCCCAATAGA AACTCGCAAA GTTCAACCTT GTTTCAAGCA GTGTCGTTGG CAGTTTTCTT GGACGCCTCT TCAGATGCCA TCAATGGAGC TATCAATGCT CTTATGATGT TGCTTACTTC AAACGAGACA AATACAAGAT ACCTTGCTCT AGACGCTCTC ATCAAACTCA CAGCAAGACT GACTTCAAAC AATTTGTCAG CTTCACCCTC CATCGACGAA AAGTACACCA AGATATTCAA ATTATTGTAT GACAGAGACA TTTCTGTTAG AAGAAAGTCG TTGGACTTAC TCTACACCAT TACTAATGCT TCGAGTTACA GTATGGTTAT AACTAAATTG CTAGACTACT TTCCTTTATG TGACTTCACC TTAAAACCCG AATTGGCTAT TAAAATCGCC GTTTTGGCCG AAAAATTTGC TACAGACTCC ACCTGGTATG TCACCACCAT GTTGAAATTA TTATCCATTA GTGGAGGCGT CAATTCCAAT GGAACGAATT ACATTGGGAA TGAAGTATGG GAGAGGATTG TTCAGATTAT CGTCAACAAC GAAGACTTAC AAAAGAAAAC ATCAAAATTG ATTATTAACT TATTGAAAAA ACCATTTTCT TCTACTGACA ACACTCCAAT TGCTCTTTCT GAAAATCTTA TTAAAGTAGC CGCTTTTGTA CTTGGGGAAT ATGGAGACCA AGTTACTTAT ATGTCTGAAC TTAGTACCAA ATTACAGTTT CATTTGCTTT ATGATGCCTA CTTTAAGGTG TCTTTGACTA CAAGGGCCAT GTTGTTAACA ACTTTCTTAA AGTTCTTCGT TAAGTATCCA GATGAAGATT TCATTCCTGA AATTATGGAT TTATTTGAGA TTGAAGAACT TTCATTGGAT TTAGAAATAC AGACCAGAGC TCATGAATAT CTTACATTGG CTACGCATAA GTATAGTCAG CAACTTTTCA AAGAAGTTTT GAAACCAATG CCTATTTTTG TTAAGAAAGA AAGTCATTTG ATGGACAGAA TAGGCAGTGT TAGTCACATT GTAGGTGTTA ACAGATCCAA GTCACTAGTT TTAGCTAAAA ACATTAGCAG TAACAAGTCA AAAGCAGCTA GAGGAATTGA TTCTAGTCCT ATTCTTGACG AGAACTCTGA TGGATCGAAT CCATTTGAAG AAGAATCGAA GCCAGTTGTT CTTTCTCCCA ACTGGTATTC AGGCTACCAC AGAATGTTGC ATTACGATGC AGGTATCTTT TATGAAGATC AGCTCATCAA GATCACTTAT AGAGTTATCA AGGAAGGCTG TGCCTTGACA TTAAAACTTA CGATCATCAA CAATTCTGCC AAAACTGCAG GTACAGATAT TACAGGGTTA ACAGTATTGA ATCTAGAGAG TTTAACTGAT GACCATGACC CAAATTACGT TCTCAACTTA AAGCAACTCC CTGAATCCAC ATTTCACGAT AAAGCCAACA TGGAGATCTC AGTCAAAATA AGAAACGTAG TGGAAAACCA CGAGAGTCCA ATCTTATCGA TCACATTCAT GTGTGGTGGA TCATTTAACA CCCTAAATTT GAAGTTCCCT GTATTATTGT TGAAGACATT AACTTCAACG GCCTTAAACG GGTTGGATGA ATTCAACAGA CGTTGGGCTC AAATCGGAGA GTTATTGGGC CCTCAAGGAG AGTCTTCACA AGCTGTCAAT CTTACTCACA GGTACAACTC TTCAAATATA GTTAGACTTT TGTCCAGATT AGGCTTTGCA GTCGTACATG CAACACTGGA TGAAACCGAT AACACTATTC TAGTGATGGG CGCAGGTATC TTGCATACGC AGAAGACTAA CTACGGGGTT TTGGCTACAT TGAAAAGTAC AGATCAAGTG GGAAAAGAGT TTGAGGTTGC AATCAGATGT TCAGGCGGGG GAGTTGCCGA GGTTGTGGCT ATTACGATGA AGGAGATTTT AGAAGGGAAG TTCTGA
|
Protein sequence | MKGLNQFIVD IRNSKDIEEE KKRINLELNN IQSKFNSNIN SYQKKKYVCK LIYIYLSGYA DSVDFGLKES FQLVCSSSHS EKQLGYLALS VLINNDKSTQ STRDFLDSLL DQVHSYLIKD LQSSNEDTNC LAVQFIASNF NLPESTTVRV NEADESAPKW LELIDIVYSF VTSPIHKSVI KKKAAIALYS LLKLYPQVLI SNNNWIPRLL SLADDKDYGV SIASIPLIQF VVKSKPQFVK AIIPAISLKL YNIIIENKCP EEYYYYKSPA PWLVVKLLQL IEYFFFLSDT NDYAVLSIAD LDEQTLNNLR SVVAQSIQNA SQPIKGLPNR NSQSSTLFQA VSLAVFLDAS SDAINGAINA LMMLLTSNET NTRYLALDAL IKLTARSTSN NLSASPSIDE KYTKIFKLLY DRDISVRRKS LDLLYTITNA SSYSMVITKL LDYFPLCDFT LKPELAIKIA VLAEKFATDS TWYVTTMLKL LSISGGVNSN GTNYIGNEVW ERIVQIIVNN EDLQKKTSKL IINLLKKPFS STDNTPIALS ENLIKVAAFV LGEYGDQVTY MSELSTKLQF HLLYDAYFKV SLTTRAMLLT TFLKFFVKYP DEDFIPEIMD LFEIEELSLD LEIQTRAHEY LTLATHKYSQ QLFKEVLKPM PIFVKKESHL MDRIGSVSHI VGVNRSKSLV LAKNISSNKS KAARGIDSSP ILDENSDGSN PFEEESKPVV LSPNWYSGYH RMLHYDAGIF YEDQLIKITY RVIKEGCALT LKLTIINNSA KTAGTDITGL TVLNLESLTD DHDPNYVLNL KQLPESTFHD KANMEISVKI RNVVENHESP ILSITFMCGG SFNTLNLKFP VLLLKTLTST ALNGLDEFNR RWAQIGELLG PQGESSQAVN LTHRYNSSNI VRLLSRLGFA VVHATSDETD NTILVMGAGI LHTQKTNYGV LATLKSTDQV GKEFEVAIRC SGGGVAEVVA ITMKEILEGK F
|
| |