Gene Information |
Locus tag | PICST_84147 |
Symbol | PUT1 |
ID | 4839238 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 928617 |
End bp | 930268 |
Gene Length | 1652 bp |
Protein Length | 460 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390553 |
Product | proline oxidase |
Protein accession | XP_001385187 |
Protein GI | 150865817 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0506] Proline dehydrogenase |
TIGRFAM ID | |
|
|
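As a quick consistency check on the coordinate fields above, the listed gene length follows from treating Start bp and End bp as 1-based inclusive positions: End bp - Start bp + 1. The sketch below assumes that convention, which reproduces the listed 1652 bp. The function name `span_length` is illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record above.
gene_length = span_length(928617, 930268)
print(gene_length)  # 1652
```

The span is longer than the 1383 nt (461 codons) needed to encode 460 residues plus a stop codon, which is consistent with the record marking the gene as spliced.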
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0171947 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGACTT CTAGATACTT TCTGGGCTCG CCTAATAAGC AGGCCACCAC TGTCGATGAC GTTGCTTCAT CTTTAGCCAA GGCAAGCACG CCCGCTTCCA CTGTTGCCTC CAAGGTTTCT TCAGTCGCTG CTTCTGCAGT TTCTGCTGCT TCGTCTGCTG TTCCCTCTCC TGCTTCTTCT ACAACTATCG TGGACACTGT AGCCGACTCG GGTTCGATCA CTGTTGGCAC GTCTGGTCCC GAATCTTCGA TGGTGAAAAA GACTGATTCC ACTGCTTACT TGAAACTGAT GAAGTTCTCT GAGGTCTTCT CTTACTTTGT TATGGGCTTG TGCACTTTGA ACAAACCCAT TCTTAATCTC TGCATCAAGC TCTTTCCCTA CACGCCTATG CCACTTATCA GAGCTGTTGT CTACAAGATC TACTGTGGGG GTGAGACTAT TGACGAAGTG AAGAAGACTG GTGATCGCTT GGTTAGCAGA GGCATCAACA ACATGATGAT TTCTTTAACT ATTGAAGCAT GCAACGGTAA CGATAACATT GATCCAGAGT ATATTGTCAA CGAAACTGTG AAGTCTGTCT CTGACATTTT GGTTCCTCAC ACTGTTAAAG TTATCGAAGA ATCGGGTAAA GAAATCAACG CTATTTCGCC TGGTTACGTA GCTTTGAAGC CTACTGGATT CTCCAAAAAC TCTGCCATTG TATTGAAACA CTTTAACGAG CCTCAATATG CTGCTGAGTT CGAAGAGTTG GTTGAAAGAG CCGCCAAAGT GTGCCAAACT GTTTACGATG CTAATATCAA CTTGGCCAAG CAATTTCCTA GTAGAAGTAC CCCCTTTGTT GTTGCTGTTG TCGATGCTGA AAAGCATGAA CTTCAGGAAG GTGTTTATGA ATTGCAGAGA AGATTGTATG CCAAATTCAA CAAACCCAAC ATGCCTGTTT CCATCGTGGG AACTTTACAA ATGTACTTGT CTCAATCTGC TGATTTGCTT GCCTTGGAAG AAAAGTTGGC CATGGAAAAC AACTACAGAT TGGGCTTGAA GTTAGTCAGA GGTGCCTACA TTCACACCGA AGCTGAAAGA AAATCAATCA TCCACAGTAC CAAAGAAGAC ACCGACAAGA ACTACAACCA AGGTATCTCG TACTGTATCG AATCCATCTT GGAACGCAGG GGCAACGAAT CTACAATTGG TCACTTGGTT GTAGCTTCTC ACAATGCTGA CTCCTTGAAG TTGGCAACCA CCAAAGTTTT CAACGAAACG GCTGGTGCTA ACAACAATCA GCATAATGTT GTCTTGGGCC AATTGCTCGG TATGGCAGAT GCTATCACTT ACGACTTGAT CAAAACATAC AAGATCGACA ACGTCATCAA GTACGTTCCA TGGGGTCCCC CATTGGAAAC CAAGGAATAC TTGTTGAGAA GATTAGAAGA AAACGGTGAT GCCGTAAAGA ACGATAACGG TTTCCCATTG GTGAAGGCAG CAGTTGGGGA GATGTTCAAG AGAGTCTTCC GCCTTGCATA AACTCACCAT CGATGGTTTA TGGCCACTAC TAGTATCTAC AAAATTACAT GCATCACCAT CAATACTAAC GATGGTACCC TGTATTTCTA TGTTAGTTTA ATAATGACTT ATGAATATAA AAGATGAAAT CT
|
Protein sequence | MVTSRYFSGS PNKQATTVDD VASSLAKAST PASTVASKVS STDSTAYLKS MKFSEVFSYF VMGLCTLNKP ILNLCIKLFP YTPMPLIRAV VYKIYCGGET IDEVKKTGDR LVSRGINNMM ISLTIEACNG NDNIDPEYIV NETVKSVSDI LVPHTVKVIE ESGKEINAIS PGYVALKPTG FSKNSAIVLK HFNEPQYAAE FEELVERAAK VCQTVYDANI NLAKQFPSRS TPFVVAVVDA EKHELQEGVY ELQRRLYAKF NKPNMPVSIV GTLQMYLSQS ADLLALEEKL AMENNYRLGL KLVRGAYIHT EAERKSIIHS TKEDTDKNYN QGISYCIESI LERRGNESTI GHLVVASHNA DSLKLATTKV FNETAGANNN QHNVVLGQLL GMADAITYDL IKTYKIDNVI KYVPWGPPLE TKEYLLRRLE ENGDAVKNDN GFPLVKAAVG EMFKRVFRLA
|
| |