Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_40725 |
Symbol | |
ID | 4836987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1334987 |
End bp | 1337863 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388302 |
Product | predicted protein |
Protein accession | XP_001382479 |
Protein GI | 150863858 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTATTG AAGAATACGA ACTACTTGAT CAAGAGCCAA GAGACTTGGA GCTGCAACAG TCGTCACCGC CAGAATCTGC CTCGACACCT AGCCCCTACA ACGAAGTCGA AGAAGAGCCA GAGTTACGAG ATTCGCTACA ATCAGATTCG TCTCAACTAT TTGACGATAT AGACGTTTAC ATGTCACGTA ATAGTGCTGA AATCGATGAT TTCAGCAATA GCCCTTTGTT TCAGTCCGTT TTTCTGAAGT ACCAGGGAGC TAACGTGTGG ATGAAAAGAG TTTGTATGGG ATTATGCATT TTTTCTATAT TATTATGGCT AGCCGGTCTT CTTGTGTATT CACAGATGTC CCTATCTTCA GCCGTCAAGA GTATAACCTG GCAGACAGAC GTAGAAGTTA GTGGTAAGAA TATCACTTTG AACAAATACA GCCCTAAATA CGCCAATTTG ACCATTGATC AGATGAGAAA GTCGAAGTAC GCAGCATACA AAACGACCAT AAAATGGCTA GAACCACAAC AATACCCCAA AGATACGGCT CCAGTACCTC GTGGATCGGG ATTCTACTTA GAAAGAGATT CTGACCGATA CAATATAAGG CAGATGAACA CCGTTTTTAC TGCCCCTTTT ATAGAAAGAA CCCAATTTGC ATACAGAAAC AATTTCTTCT ACATAACGGA TGTCATATTG AACCCACACA AGCCAATTGA CGATCCAGAC AACTATCATA TAGTTGTTAC AGACAAACTC GAACAATGGA GGAGTCTGAG TTTCGCATTG TACTGGATAT ACAATCCATT AACAGCACAG TATATACCGA TACAACCACC CCAGAATTTA AAGAAACTCC AGAACGATAA CCAGTTGCAG GAAGAGATTT TGGACAAGTT GCACTTCGCT GAATTTTCTC CAAAGGGTGA TTTCGTCGTT TTTGGCTTCA ATCACGATAT CTTCTTGCAA GATGTAGTCT CGAACGAAAT TCAGAGAATC ACTAATACAG GTTCTACCAG CATTTTCAAC GGGAAACCGG ATTGGGTGTA TGAAGAAGAA GTGGCTGCTG ATTACAAGTT AATTTGGTGG TCACCAGACC AAGAGAACTT GGTGTTTGCT TCTTTAAATG ATACGCTTGT CCAAGAATTT GAATTGGACT ACTATATCAA AGACAGCACT GAAGTAGGTA CTCAGTACAA AGAACTGCTG GAAAATAAAT TCGAAGATGT CAACCAGTAT CCTATAAAGA CGTCAATTAA GTACCCCAAA CCTGGAACTT CGAACCCTAT ATTGTCATTA TTTAATTACA GACTTTCTGA CAAGTCTATC AAAGAAATCA CCAAGTTACA GGATGGCTTG GGAGAAGATT TCATCTTGTA TAAAGCTGCA TGGGTAGACA GCAAGAACTT TCTCATGAAG CTCACAGACA GAACAAGTGC CATTCTCAAG AAAAAGGTAT TCCAGCCAGC TATATCATCT GAAGTTATTG AAGTCAATTC TATGAATGTG ACTCAGGAGT ATGGAGGATG GGTTGATAAA CTTTCGCAAA TTGCCATTGT AGAAACAGAT GACGACAAGG AAAATCTGTA CATTGACAAA GTAGTGGTCA ATGGTTTCAC TCATATAGCA CTCTTCGAAT CAGCCACATC CAAAGACTAT GCCAGATTAT TGACATCCTC CAATACGTGG GAAGTTCCAC TTAGCTCTCC ATTAGTTCAC GACAAGCAGT TTAACGTTGT CTACTTCTTG ACCACTATAA GAAGTTCCAT GGACGCCCAT CTATATGCTG TTGATCTTTC TACTGATGAC AACAAGTTGA TACCTATCAC AAGCTCTGAA GTAGACGGGT TGTACCAAGT TGAGTTCGAC CAGGCGGGCC AACATTTGAA CTTGTTCTAC AAAGGCCCAA AACAGCCATG GCAGAGACTA GTGAATATGG CCGAAGTTCA TGAATTCATT TCTTCTGGAG ATTTCAAGGG AAATGGTGTA GACGAACTCA TCTTGAAGAG TGAAGTCATC AATCATTTTG ACGTTACTGA AGGTAACTTA AAGGATACCA ACATTCCTAC AAAAGTGTAC AAGACTATTC AAGTAGGCAA ATATGACGAC GGAAGCCCAC TTCGGTTGAA TGTGATTGAA ATCTTTCCCC CTAACTTCAA CCCTCACAGG GCAAAAAAGT ACCCTTTACT AGTGTATGCT TATGGAGGTC CTGGCTCTCA AACTGTAGAC AAGTCGTTTG ACATAGATTT CCAGCACATT GCTAGTGCTT CGTTGGATGC CTTGGTGTTG GTTATAGATC CTAGAGGTAC AGGTGGGCAA GGCTGGAAAT TCAGTAGCAC CGCTAAGAAT AGACTAGGCT ATTGGGAACC CAGAGATATC ACCACTATAA CTTCAGAGTA TATAACCGTA AACAAGAAGT TTATCGACCA GTCCAGAACT GCAATCTGGG GCTGGTCCTA CGGCGGCTTC ACTTCCTTGA AGACATTAGA GTTTGACCGT GGAAAGACCT TTAAATATGG TATGGCAGTG GCACCAGTTA CAAATTGGCT ATTCTATGAT TCGGTGTATA CTGAAAGGTA CATGAATCCA CCAAAAGTAA ATGGAAACTA CGAGAAGTAC GGCCGAATCA GCGATTACAA GAATTTCAAG TCGTTGAAGA GGTTCTTGCT AATGCATGGA ACATCTGATG ACAACGTCCA CCTTCAGAAT CTGCTCTGGC TACTTGACAA GTTCAATCTT GGAGAAGTTG AGAACTACGA TGTCCATTTC TTCCCTGACA GTGACCATGG AATCTATTAC CACAATGCCA ACTCGATAGT TTTTGACAAG TTGCTTCACT GGTTGAGAGA TGCATTTATG GGCAAGTTCG ACGGGTTGTA TAGATAG
|
Protein sequence | MPIEEYELLD QEPRDLESQQ SSPPESASTP SPYNEVEEEP ELRDSLQSDS SQLFDDIDVY MSRNSAEIDD FSNSPLFQSV FSKYQGANVW MKRVCMGLCI FSILLWLAGL LVYSQMSLSS AVKSITWQTD VEVSGKNITL NKYSPKYANL TIDQMRKSKY AAYKTTIKWL EPQQYPKDTA PVPRGSGFYL ERDSDRYNIR QMNTVFTAPF IERTQFAYRN NFFYITDVIL NPHKPIDDPD NYHIVVTDKL EQWRSSSFAL YWIYNPLTAQ YIPIQPPQNL KKLQNDNQLQ EEILDKLHFA EFSPKGDFVV FGFNHDIFLQ DVVSNEIQRI TNTGSTSIFN GKPDWVYEEE VAADYKLIWW SPDQENLVFA SLNDTLVQEF ELDYYIKDST EVGTQYKESS ENKFEDVNQY PIKTSIKYPK PGTSNPILSL FNYRLSDKSI KEITKLQDGL GEDFILYKAA WVDSKNFLMK LTDRTSAILK KKVFQPAISS EVIEVNSMNV TQEYGGWVDK LSQIAIVETD DDKENSYIDK VVVNGFTHIA LFESATSKDY ARLLTSSNTW EVPLSSPLVH DKQFNVVYFL TTIRSSMDAH LYAVDLSTDD NKLIPITSSE VDGLYQVEFD QAGQHLNLFY KGPKQPWQRL VNMAEVHEFI SSGDFKGNGV DELILKSEVI NHFDVTEGNL KDTNIPTKVY KTIQVGKYDD GSPLRLNVIE IFPPNFNPHR AKKYPLLVYA YGGPGSQTVD KSFDIDFQHI ASASLDALVL VIDPRGTGGQ GWKFSSTAKN RLGYWEPRDI TTITSEYITV NKKFIDQSRT AIWGWSYGGF TSLKTLEFDR GKTFKYGMAV APVTNWLFYD SVYTERYMNP PKVNGNYEKY GRISDYKNFK SLKRFLLMHG TSDDNVHLQN SLWLLDKFNL GEVENYDVHF FPDSDHGIYY HNANSIVFDK LLHWLRDAFM GKFDGLYR
|
| |