Gene PICST_36233 details


Gene Information

Locus tag: PICST_36233
Symbol: PTR2
ID: 4839188
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 491714
End bp: 493627
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 12
GC content: 43%
IMG OID: 640390503
Product: proton-dependent oligopeptide transporter, POT family
Protein accession: XP_001384757
Protein GI: 126136467
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3104] Dipeptide/tripeptide permease
TIGRFAM ID:
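
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the coding sequence ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information fields.
start_bp, end_bp = 491714, 493627
gene_length_bp, protein_length_aa = 1914, 637

print(span_length(start_bp, end_bp) == gene_length_bp)                 # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)   # True
```

Both checks agree with the record: the range 491714 to 493627 spans 1914 bp, and 1914 / 3 - 1 gives 637 residues.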


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.20752
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGACA TTAAGAAGGC CAGCTCCTCG GAATCTCTCC AGGCTGTCGA CGAAAAAATT 
GGCCACATCG CCATTAACGA TATTGACAAA GAAATCTCCT CTGGTGACGA CTACGATTTC
AATGATGCAA ACAACTACTC AACCCACTAC GTTGATGAAT TCAACCCAAA GGGTTTGAGA
ATCCCAACTG ACGAAGAATC TGAAACCCTT AGAAGAATCT TGGGTAGAGC TTCTTACGCC
TCTTACTTGA TCTGTTTGTG TGAGTTGGCC GAAAGAGCCT CTTACTATTC GGTTACTGGT
ATCTTGTCTA ACTTTATTCA AAGACCAATG CCAAAAGACT CTCCTCACGG ATGGGGTGCT
CCAGCTGATA GAAACTCGAG TGTTTCTGCT GGTGCTTTGG GCCAAGGTCT TCAAGCTGCT
AACGCTCTTA CCCTTTTGCT TACTTTCCTC GCTTACTGTG TTCCATTGTA TGGTGGTTTC
GTTGCTGATA CCAGAATTGG TAAGTTCAAG GCTATTTGGG TTGGTGTTAT TGCTGGTTTT
GTTTCCCACG TTTTGTTCGT TATTGCTGCC ATCCCATCTG TTCTTAAGCA CGGTGATGCT
GCTATGGCTC CAACCGTTCT TGCTATCATT ACCTTAGCTT TCGGTACTGG TTTCATCAAG
CCAAACTTGT TGCCTCTTCT TATGGACCAA TACAGAGAAA AGACCGATGT CGTCAAAGTC
TTGCCATCTG GTGAAAAGGT CATTGTCGAT CGTCAAAAGA CCTTGGAAAG AATGACTTTG
ATCTTCTACT GGGCTATTAA CATTGGTGCT TTCTTCCAAT TGGCCACTTC TTACTGTGAA
AGAGACGTTG GTTTCTGGTT GGCTTTCTTC GTTCCTATTA TTATGTACTT GGTCTTGCCA
GTTGTCTTGG TCTTCTTGCA AAGCAGATTG GTCAGAGACT CTCCAAAGGG TTCTGTTCTT
GAAACTGCTT GGAAGGTTAC CAGAGTTACT TTCTCTAAAG GCTGGATCAG CAGATGGAGA
AACAACACCT TGTGGGAATA CGCCAAGCCA TCTAACATGC TCGAAAGAGG TAGAGAATTC
TACAACGAAA AGAAGAAGAC ACCAATCACC TGGGATGACC AATGGGTTTT GGACATTAAA
CAAACTGTCA ACTCTTGTAA GATTTTCATC TACTTCCCAA TCTTTAACTT GGCTGATGGT
GGTATTGGAT CTGTTCAAAC CTCTCAAGCT GGTGCCATGA CTACTAACGG TGTTCCAAAC
GATTTGTTCA ACAATTTCAA CCCATTGACG ATTATTATCT TGATTCCAAT TCTTGACTAC
ATTGTATACC CAATCTTGAG AAAGTATAGA ATTGAATTCC GTCCAGTTTG GAGAATTTGG
CTTGGTTTCA TTTTGGCTGG TTCTTCTCAA ATCGCTGGTG CCATCATTCA ATGGAAAGTT
TACAAAACCT CGCCATGCGG TTACTACGCT ACTACATGTG ATGAATTTTC TCCATTGTCT
GCTTGGCAAG ATGTTTCTTT GTACATTCTT TCCGCTGCTG GTGAATGTTT TGCTATGACT
ACTGCTTACG AATTGGCCTA CACTCGTTCT CCTCCACACA TGAAGGGTCT TGTTATGGCT
TTGTTCTTAT TTACCTCTGC CATCTCTGCT GCCATTTCGC AGGCTATTAC TCCAGCTTTG
ATTGACCCAC ACTTAATCTG GCCTTTCGCT GGTATTGCCA TTGCTACTTT CATTGCTGCT
TTCATGTTCG TTTACCAATT CAGAAACTTG CACAAGGAAA TGGAAGAAGA AAGAATTATC
AGAGAAGCTT TGGACCATTC TGAAAGAGAC AGAGATCTCA TCTCGCACGG TGGAATTGAC
AACGATAACA ACTTGCAAGC CGTTACCTCC ATCAAGTCTG CTGTTGGCAA ATAA
 
Protein sequence
MSDIKKASSS ESLQAVDEKI GHIAINDIDK EISSGDDYDF NDANNYSTHY VDEFNPKGLR 
IPTDEESETL RRILGRASYA SYLICLCELA ERASYYSVTG ILSNFIQRPM PKDSPHGWGA
PADRNSSVSA GALGQGLQAA NALTLLLTFL AYCVPLYGGF VADTRIGKFK AIWVGVIAGF
VSHVLFVIAA IPSVLKHGDA AMAPTVLAII TLAFGTGFIK PNLLPLLMDQ YREKTDVVKV
LPSGEKVIVD RQKTLERMTL IFYWAINIGA FFQLATSYCE RDVGFWLAFF VPIIMYLVLP
VVLVFLQSRL VRDSPKGSVL ETAWKVTRVT FSKGWISRWR NNTLWEYAKP SNMLERGREF
YNEKKKTPIT WDDQWVLDIK QTVNSCKIFI YFPIFNLADG GIGSVQTSQA GAMTTNGVPN
DLFNNFNPLT IIILIPILDY IVYPILRKYR IEFRPVWRIW LGFILAGSSQ IAGAIIQWKV
YKTSPCGYYA TTCDEFSPLS AWQDVSLYIL SAAGECFAMT TAYELAYTRS PPHMKGLVMA
LFLFTSAISA AISQAITPAL IDPHLIWPFA GIAIATFIAA FMFVYQFRNL HKEMEEERII
REALDHSERD RDLISHGGID NDNNLQAVTS IKSAVGK
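
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. As a minimal sketch of how the gene sequence relates to the protein sequence, the code below builds that codon table from the compact NCBI-style amino acid string and translates the first 30 bases of the gene. The names `CODON_TABLE_12` and `translate_table_12` are illustrative assumptions, not part of any library. The function expects only A, C, G, and T and raises `KeyError` on anything else.

```python
# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"

# Amino acids for translation table 12; it differs from the standard
# code only at CTG (index 19), which encodes S instead of L.
AMINO_ACIDS_TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE_12 = {
    b1 + b2 + b3: AMINO_ACIDS_TABLE_12[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_table_12(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE_12[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGTCAGACA TTAAGAAGGC CAGCTCCTCG"
print(translate_table_12(first_blocks))  # MSDIKKASSS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence through the same function is one way to compare it against the listed 637-residue protein.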