Gene PICST_55023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_55023 
SymbolDUR4 
ID4837662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp201284 
End bp203263 
Gene Length1980 bp 
Protein Length659 aa 
Translation table12 
GC content42% 
IMG OID640388977 
Producturea permease 
Protein accessionXP_001382273 
Protein GI150863711 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.410596 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCAA CCTTATCTCA AGGTGTAGCA TATGGTGTCA TCATAGGTGG CGGCGCATTC 
TTTGCCATAG TGATGAACTA TTTCACTCAC CTCCAGAACC GTTACAGTCG TTACAATTCC
AACAAGATTG ATGAATTCGT TTCAGGAAGC AGATCCATAG GGTTCGGATT GCTTCTCAGT
GGTATCTTAT CCAATTGGAC ATGGAGTTTA ACTTTACTTG AATCGGCTGT CAAGAGCTAC
AATATGGGCT TCAGTGGAAG TTATTGGTAC GGTATTGGAG GTTTGTTGCA AGTTTCTGTC
TTCTCTGTCA TCTCTAGTAA GATCAAGAAG AATGCAAATT TGGTCACCAC GTTTCCTGAA
ATGGGCTATT TCAGGTTTGG AAGAGCTGGC CATTTAGCAT TCTTGTGGTG TGGATTCATT
TGTAATGCCA TTGTCAGTTC GTGTATACTA CTTGGAGGTA GTGCAGTATT TCATGCTATA
ACCGGTATTA ATCAATATGC TGCTCTCTTT CTTATTCCAT TTGGAGTTGC CGTATACGTG
TCCTTCGGTG GATTGCGTGC TACATTCATT TCAGATGCTA CTCATACTTG TATAATTCTT
GTCTTCTTAA TCGTTTTCAT GTTCGAAGTT TACGTTACCA ATCCAAAGAT CGGATCTCCT
GAAAAGATGT GGGAATTGCT TGAACTGCTT TCTCCAGTCG ACGGAAACTA TAGCGGTTCC
TACTTGACTT TTAGATCCCA ACAAGGTGCA ATATTTGCTG TAGTTAGTAT TATCACTGGA
TTTGGCTTGG TTGTTAACGA CCAGGCGTAT TTGTCCAGAG CTGTTGCAGC AGACCCGAGA
TTTACATCCA GAGCATACTT TTTTGCTTCA GTTTGTTGGT TTGTCATCCC TTTCTCAATA
GGAACATCGT TGGGTCTTGC AGCTAGAGCT CTCACGGTAT ACCCTGATTT CCCTGCTTTG
TCTGATTTCG AAGTCGGAGA AGGCTTACCA GCTGTAGCTG CTGCCACTTA TTTAATGGGT
AAATCTGGAC TGGCAATGAT GATTGTGATG ATTTTCTTTT CAGTTACGTC GTCCTTTGCT
GGCGAGTTGA TTGGTACTTC TACTTTACTT TCTTATGATG TCTATAAGAG GTATTACAAA
CCAGATGCTA CTCCTAAAGA AGTTGTCACA GCAGCCAAAA TTTTTGTCTT CCTTTGGGCC
ATATTTGCTT CATCTTTAGC TTCTATATTT TACGGTGCAG CAAAAATTTC CATGGGGTGG
TTATTCAATT TCTTGGGAGT TGCTACTGCT TCTGGTGTCT TCCCCATTGC TCTTACATTC
ACCTGGAAAA GATTGAATAA ATCAGGTGCT GTTGGTGGAT CTGTAGGAGG CATGGTATTG
GCCCTAGTTG TCTGGCTCGT CACATGTAAA GCTAGCAAGG GTGAAATCAA TGTCACCAAC
TTGTCAGATC AATGGGTCTC GTTTGCCGGT AATGTCACAG CCCTTATTCT GGGAGGCGTT
ATTTCAATAG GATCATCTCT AATTTGGCCA TCTACATTCG AATTTGAAGA AACCAGAAAC
AGAACAAGTT TGATTTCTGC ACCTGTTAAG AGTGAACCAG CGTTGAACGA AACCAAGGAA
CAAAACGAAA AGAGCTCTGA CCTCAAAATC ACTGAAAGCG ATAAAGACAT TGAACTGGCT
TCAGTAGATA CCGACTTGGA CATGGACCTT CACCAAGTGA TTGACCACCA GCATTTAGAT
AGACAGTTCA AGAAGTACTG TGGTTTGGTT GCAATTCTTG CGGTTATCAT GACATTTATA
ATCCCTGTTC CATTAGGAGC AAGCCCATAT GTTTTCTCGC CCGGCTTTTT AAAGGGCTGT
GTCATAATTA TTATCGCCTG GCTATTCTTC TCATTTTCTT TCGTTGTTCT TCTTCCAATA
TTTGAAGCTA GGAAAGAAGT ATGGAGAATT ACCAAGCTGG TTCTCTCTTT TGGACTGTAA
 
Protein sequence
MEPTLSQGVA YGVIIGGGAF FAIVMNYFTH LQNRYSRYNS NKIDEFVSGS RSIGFGLLLS 
GILSNWTWSL TLLESAVKSY NMGFSGSYWY GIGGLLQVSV FSVISSKIKK NANLVTTFPE
MGYFRFGRAG HLAFLWCGFI CNAIVSSCIL LGGSAVFHAI TGINQYAALF LIPFGVAVYV
SFGGLRATFI SDATHTCIIL VFLIVFMFEV YVTNPKIGSP EKMWELLESL SPVDGNYSGS
YLTFRSQQGA IFAVVSIITG FGLVVNDQAY LSRAVAADPR FTSRAYFFAS VCWFVIPFSI
GTSLGLAARA LTVYPDFPAL SDFEVGEGLP AVAAATYLMG KSGSAMMIVM IFFSVTSSFA
GELIGTSTLL SYDVYKRYYK PDATPKEVVT AAKIFVFLWA IFASSLASIF YGAAKISMGW
LFNFLGVATA SGVFPIALTF TWKRLNKSGA VGGSVGGMVL ALVVWLVTCK ASKGEINVTN
LSDQWVSFAG NVTALISGGV ISIGSSLIWP STFEFEETRN RTSLISAPVK SEPALNETKE
QNEKSSDLKI TESDKDIESA SVDTDLDMDL HQVIDHQHLD RQFKKYCGLV AILAVIMTFI
IPVPLGASPY VFSPGFLKGC VIIIIAWLFF SFSFVVLLPI FEARKEVWRI TKSVLSFGS