Gene PICST_52197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_52197 
SymbolDIP5.5 
ID4851474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1916428 
End bp1918152 
Gene Length1725 bp 
Protein Length574 aa 
Translation table 
GC content44% 
IMG OID640393182 
ProductProline specific permease 
Protein accessionXP_001387605 
Protein GI126274602 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0833] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0650572 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGAAA AGTCTGACTT GGCTAATGTG AAGTCTTCTA GCTCTCATGT TCATATTCAC 
AGAGCAGATA CTTCCTTGCT TGACCTCTCA ATTTCCAACT CAAGAGTGGT CGTAGAGGAC
GGAGGAAGAA TCTTGACCCC AAGACACTCC AACCCCGACT CGTTTGAGAC TTTTCAACTG
AACCAGTTGG AAAGAGGCTT GAAAGCTCGT CATGTTTCTC TTCTTTCTTT GGGTGGTGCT
ATCGGAACTG GTCTTTTTGT GGGGTCCGGT TCCGCTCTCT CCACTTGTGG TCCTGCTAGT
TTGGTGTTGA GTTATGCTAT CATGTCGTCG GTAGTGTACT TTGTCATGCA AATGTTGGCT
GAAATGACCA CATTCTTACC ATTACCTGGT AGTGGTGCAC AATCGTTTGT TAACGATTAC
CTTTCTGAAT CCTTTGGTTT TGCTATTGGA TGGAACTACT GGTACGCATT TTCTATATTG
GTCGCTGCTG AGGTTACTGC AGCTGCTATT GTTGTACAAT ATTGGATCAC TTCTGTCAAT
ATTGCTGTTT GGATCACCAT CTTCTTGGTT CTCATTATCT TGTTGAACAT CATCTCGGTC
AGATTTTTCG GTGAAGCAGA ATTCTGGTTC GCTTCCATTA AATTGATCAC TCTTACTGGT
TTAATCATTT TGGGTGTTGT TCTTTTCTTC GGTGGTGGTC CAAGCCACGA CAGATTGGGA
TTCCGTTACT GGAAGCACAG CCCTTTTAAA GAACATATTG TTGGCGGATC TACAGGCCGT
TTCCTCGGTA TATGGACAGC CATTGTCAAG TCCGGTTTTG CTTTTATTTG TTCTCCAGAA
TTGGTTGCCG CTGCTGGTGG TGAGTGTAGA AAGCCAAGAC GTAACATTCC AAAGGCTGCC
CGTCGTTTCA TTTACAGATT GGTATTCTTC TATATCCTCG GTACATTGGT CATCAGTGTC
ATTGTCAGCT CAAAAAACCC AAGATTGTTG TCTGGCTCCT CCGACGCTTC TGCTTCTCCT
TTCGTTATCG GTATCCAAAA CGCAGGTATC CCAGTATTGA ACCACATCAT TAATGCTGCT
ATCTTGACTT CGGCTGCATC TGCTGGTAAC TCTTTCCTTT ACTCGGCCTC TAGATCTTTG
TATTCTATTT CTTGCAGAGG ATTAGCTCCA AAGATCTTCA GCAAGGTGAA CAGATTTGGT
GTTCCAGTTT ATGCTGTCGC ATTGTCATCT GCATTGGGCT TCTTGGCTTA CCTTAACGTC
TCATCTTCTT CTGCAAACGC TTTCAACTGG TTTTCTAACC TTACAACAAT CAGTGGGTTC
ATTTCTTGGA TTTTGGTTGC CTTTGCCTAT TTGAGATGGA GACGTGCCAT TGCATACCAT
GGCTTATCCG ATAGAGTAAC ATACAAATCT CCATTCCAAC CTTTCGGTGC TTACTATGTG
ATATTCTTTA TTTCGCTCCT TTCAATCACT AACGGTTACG CTGTATTTTT CAACTTCAAC
GGCCCTGACT TCGTTGCCGC TTACATTACC CTTCCAATTG TTGTTTTCCT TTACGTTGGC
CACAGAGCCT GGAGCTACTT CACCAAGGGC CAACAGAACT GGCTCAGACC AATCAAGCAG
ATTGACGTAA TAACCGGCTT AGATTTGATA GAAGAGGAGG ATGCGAACGA GCCCGAACCA
GTGCCTAAGA ACCTTTTGGA AAAGATCTGG TTCTGGGTTG CCTAA
 
Protein sequence
MMEKSDLANV KSSSSHVHIH RADTSLLDLS ISNSRVVVED GGRILTPRHS NPDSFETFQL 
NQLERGLKAR HVSLLSLGGA IGTGLFVGSG SALSTCGPAS LVLSYAIMSS VVYFVMQMLA
EMTTFLPLPG SGAQSFVNDY LSESFGFAIG WNYWYAFSIL VAAEVTAAAI VVQYWITSVN
IAVWITIFLV LIILLNIISV RFFGEAEFWF ASIKLITLTG LIILGVVLFF GGGPSHDRLG
FRYWKHSPFK EHIVGGSTGR FLGIWTAIVK SGFAFICSPE LVAAAGGECR KPRRNIPKAA
RRFIYRLVFF YILGTLVISV IVSSKNPRLL SGSSDASASP FVIGIQNAGI PVLNHIINAA
ILTSAASAGN SFLYSASRSL YSISCRGLAP KIFSKVNRFG VPVYAVALSS ALGFLAYLNV
SSSSANAFNW FSNLTTISGF ISWILVAFAY LRWRRAIAYH GLSDRVTYKS PFQPFGAYYV
IFFISLLSIT NGYAVFFNFN GPDFVAAYIT LPIVVFLYVG HRAWSYFTKG QQNWLRPIKQ
IDVITGLDLI EEEDANEPEP VPKNLLEKIW FWVA