Gene PICST_31020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31020 
SymbolNUP2 
ID4837863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1539031 
End bp1540239 
Gene Length1209 bp 
Protein Length402 aa 
Translation table12 
GC content44% 
IMG OID640389178 
Productpurine nucleoside permease 
Protein accessionXP_001383913 
Protein GI150864907 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG5042] Purine nucleoside permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.171142 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.23144 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTT TCAAATTTTT TGCAATCGTT ATCGCAACTT CTGCAGTTGC AGCGCAACCA 
GTTTTCGTCT CCAAGAGAGA AGTCACCGTA GGAGAAACAA AAGAGTTGCC TCTGTCGACC
GACAATGACC TCCATCTGAG CTACGGTAAG CCCTATGCTA TTTTCCAGCC AAAGGTGTTT
ATCGTTTCTA TGTTTGAGTT GGAAAGAGAT CCTTGGCTAG AAGCGTTGGA CTTTGTTCAC
AACATTTCGC TTCCGGGCTT GTCGCCCATA TACTCTACTA TCTACTGTAC TACTAACTAC
AGTATCTGTC AGGCTACTGC TGGAGAAGGC GAGATCAATG CCGCTTCGTC GTTGACTGCT
TTGACTTTGA GTCCCTTGTT TGATCTCACC AAGACCTATT GGTTGTTGGC GGGTATTTCT
GGCGGAGAAC CTACTCAGGT TACTACAGGA TCAGCTACAT TTGCGAAATA CGCCATTCAG
GTCGGGTTGC AATATCAAAT AGACTACCGT GAGTATATAA ACACGAATCC AGATTGGATT
AGCGGCTACA TTCCTTACGG AACCGATAAC CCGTATACTT ATCCAGGCAA TGTCTACGGA
ACTGAAGTTT TCGAGCTTAA CGAAAAGTTG AGAAATAGAG CACTTGAGTT AGCCTCTAAC
GTCCAATTGG ATAACGGAAC TGAAAAAAAT GCCGAGTTTA GAGCTCTCTA CGAAGTTGAA
CCCGCAATTA GCCCCCCTAC AGTGGTAGGC TGTGATGTCT TGACCTCGGA CAACTACTTC
ACAGGAAATG TCTTAAACGA CTACTTTGCA AACCTCACGA AGCTTATGAC TAACGGTAGC
GCTACCTATT GTTCTACAGC ACAAGAGGAC AATGCTTCGC TAGAAGTTTT CACAAGAATG
CAGAAATACG GCTTAGTCGA CTACGAGAGA ATTGTAGTAT TGAGAACTAT CTCCAACTTT
TCCAGGCCGC CGCCTTCTAT GGCCAATAAT ACAGTGAAAT TTTTCACCGA TACCGACAAA
GGCGGAATTG GTCATTCTCT TGCAAACTTG GTCAACGCTG GTTTTCCATT TATTCACGAT
GTTCTCACCA ACTGGGAAAA CGTATACGAG AGTGGAGAAA CCTACGAGGC TGACAACTAC
GTAGGCGACA TCTTCGGGAG TGTAGGTGGA AAGCCAGACT TTGGTAAAGA TAGTTTCGAA
ATAGCTTAG
 
Protein sequence
MKIFKFFAIV IATSAVAAQP VFVSKREVTV GETKELPSST DNDLHSSYGK PYAIFQPKVF 
IVSMFELERD PWLEALDFVH NISLPGLSPI YSTIYCTTNY SICQATAGEG EINAASSLTA
LTLSPLFDLT KTYWLLAGIS GGEPTQVTTG SATFAKYAIQ VGLQYQIDYR EYINTNPDWI
SGYIPYGTDN PYTYPGNVYG TEVFELNEKL RNRALELASN VQLDNGTEKN AEFRALYEVE
PAISPPTVVG CDVLTSDNYF TGNVLNDYFA NLTKLMTNGS ATYCSTAQED NASLEVFTRM
QKYGLVDYER IVVLRTISNF SRPPPSMANN TVKFFTDTDK GGIGHSLANL VNAGFPFIHD
VLTNWENVYE SGETYEADNY VGDIFGSVGG KPDFGKDSFE IA