Gene PICST_84147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_84147 
SymbolPUT1 
ID4839238 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp928617 
End bp930268 
Gene Length1652 bp 
Protein Length460 aa 
Translation table12 
GC content43% 
IMG OID640390553 
Productproline oxidase 
Protein accessionXP_001385187 
Protein GI150865817 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0506] Proline dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0171947 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGACTT CTAGATACTT TCTGGGCTCG CCTAATAAGC AGGCCACCAC TGTCGATGAC 
GTTGCTTCAT CTTTAGCCAA GGCAAGCACG CCCGCTTCCA CTGTTGCCTC CAAGGTTTCT
TCAGTCGCTG CTTCTGCAGT TTCTGCTGCT TCGTCTGCTG TTCCCTCTCC TGCTTCTTCT
ACAACTATCG TGGACACTGT AGCCGACTCG GGTTCGATCA CTGTTGGCAC GTCTGGTCCC
GAATCTTCGA TGGTGAAAAA GACTGATTCC ACTGCTTACT TGAAACTGAT GAAGTTCTCT
GAGGTCTTCT CTTACTTTGT TATGGGCTTG TGCACTTTGA ACAAACCCAT TCTTAATCTC
TGCATCAAGC TCTTTCCCTA CACGCCTATG CCACTTATCA GAGCTGTTGT CTACAAGATC
TACTGTGGGG GTGAGACTAT TGACGAAGTG AAGAAGACTG GTGATCGCTT GGTTAGCAGA
GGCATCAACA ACATGATGAT TTCTTTAACT ATTGAAGCAT GCAACGGTAA CGATAACATT
GATCCAGAGT ATATTGTCAA CGAAACTGTG AAGTCTGTCT CTGACATTTT GGTTCCTCAC
ACTGTTAAAG TTATCGAAGA ATCGGGTAAA GAAATCAACG CTATTTCGCC TGGTTACGTA
GCTTTGAAGC CTACTGGATT CTCCAAAAAC TCTGCCATTG TATTGAAACA CTTTAACGAG
CCTCAATATG CTGCTGAGTT CGAAGAGTTG GTTGAAAGAG CCGCCAAAGT GTGCCAAACT
GTTTACGATG CTAATATCAA CTTGGCCAAG CAATTTCCTA GTAGAAGTAC CCCCTTTGTT
GTTGCTGTTG TCGATGCTGA AAAGCATGAA CTTCAGGAAG GTGTTTATGA ATTGCAGAGA
AGATTGTATG CCAAATTCAA CAAACCCAAC ATGCCTGTTT CCATCGTGGG AACTTTACAA
ATGTACTTGT CTCAATCTGC TGATTTGCTT GCCTTGGAAG AAAAGTTGGC CATGGAAAAC
AACTACAGAT TGGGCTTGAA GTTAGTCAGA GGTGCCTACA TTCACACCGA AGCTGAAAGA
AAATCAATCA TCCACAGTAC CAAAGAAGAC ACCGACAAGA ACTACAACCA AGGTATCTCG
TACTGTATCG AATCCATCTT GGAACGCAGG GGCAACGAAT CTACAATTGG TCACTTGGTT
GTAGCTTCTC ACAATGCTGA CTCCTTGAAG TTGGCAACCA CCAAAGTTTT CAACGAAACG
GCTGGTGCTA ACAACAATCA GCATAATGTT GTCTTGGGCC AATTGCTCGG TATGGCAGAT
GCTATCACTT ACGACTTGAT CAAAACATAC AAGATCGACA ACGTCATCAA GTACGTTCCA
TGGGGTCCCC CATTGGAAAC CAAGGAATAC TTGTTGAGAA GATTAGAAGA AAACGGTGAT
GCCGTAAAGA ACGATAACGG TTTCCCATTG GTGAAGGCAG CAGTTGGGGA GATGTTCAAG
AGAGTCTTCC GCCTTGCATA AACTCACCAT CGATGGTTTA TGGCCACTAC TAGTATCTAC
AAAATTACAT GCATCACCAT CAATACTAAC GATGGTACCC TGTATTTCTA TGTTAGTTTA
ATAATGACTT ATGAATATAA AAGATGAAAT CT
 
Protein sequence
MVTSRYFSGS PNKQATTVDD VASSLAKAST PASTVASKVS STDSTAYLKS MKFSEVFSYF 
VMGLCTLNKP ILNLCIKLFP YTPMPLIRAV VYKIYCGGET IDEVKKTGDR LVSRGINNMM
ISLTIEACNG NDNIDPEYIV NETVKSVSDI LVPHTVKVIE ESGKEINAIS PGYVALKPTG
FSKNSAIVLK HFNEPQYAAE FEELVERAAK VCQTVYDANI NLAKQFPSRS TPFVVAVVDA
EKHELQEGVY ELQRRLYAKF NKPNMPVSIV GTLQMYLSQS ADLLALEEKL AMENNYRLGL
KLVRGAYIHT EAERKSIIHS TKEDTDKNYN QGISYCIESI LERRGNESTI GHLVVASHNA
DSLKLATTKV FNETAGANNN QHNVVLGQLL GMADAITYDL IKTYKIDNVI KYVPWGPPLE
TKEYLLRRLE ENGDAVKNDN GFPLVKAAVG EMFKRVFRLA