Gene PICST_57505 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_57505 
SymbolDUR5.3 
ID4837703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp889075 
End bp891081 
Gene Length2007 bp 
Protein Length668 aa 
Translation table12 
GC content41% 
IMG OID640389018 
Producturea transport protein 
Protein accessionXP_001383804 
Protein GI150864823 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.20443 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.667471 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGACA CTGATACTGT ATTTTTGCTT CCAAAAGGTG CCGGTTATGG GGTGTTGCTC 
GGTGTTGGTG CCGTTTTTGC AATCGGTATG ATCTTAACTA CCAAATTCTT GCAGAAGTAT
CTCAACGAAA ATGCAACTTC TACGGAAACT TTCTCTGTTG CTGATAGAAG TGTGAAGCGT
TTCTTGGCGT GTTCTTCTGT TTACGCTTCT TGGTCTTGGG CCGATGAAAT TTTGCAGACA
GTGTCTATGA TCTACAACTA TGGTGTTCAA GCCTCGTTTT ACTATGGGGC CGGTTTATCG
GTGCAGATGT GCGTTATGGC TCTTATTGGT ATCAGTGCTA AAAAGAGAGC TCCTCAAGCT
CACACTTCGC TTGAAATTGT CGGTTTGAGA TATGGAAAGG CTACTCATAT ACTTTTCTTG
TTCCTCTGTT TGGTTACCAA CTTGATTTCA TGTTCCTCTA TGCTTTTATC TGCTAGTGGT
GCCATTTCGA TCATTTCTGG AAACCTTTCA ATTGTTGCAA GTACATTACT TATTCCATTC
GGAGTTTTAT TGTATACTAC CTTTGGTGGG TTGAAAGCAA CATTCTTGAC TGATTACGTT
CACTCTTTCG TCTTGTTGTT GATTTTGATT GTCATCAACA CTAAGGTTCT TGCTTCCAAG
GAAATCGGTG GTTTGAACGG CCTTTATTCT CAATTGTTAG AACACTCTCA AGATAGATAT
ATCGAAGGCA ATTATCAAGG TTCTATTCTT ACTGGTAAGT CTCAAGGTTC TATCATCTTC
GGTTTGGTTT TGACTTGTGG TAACTTTGGT TTGACTGTCA TGGACTCTTC TTTCTGGCAA
AAGTCGTTCT CTGCTGAAGT AAAGGCTACT GTTCCTAGTT ACTTGGGTTC AGCCGTCTTG
ATTTTTGCAA ACACTTGGCC AATTGGTGCT ATTATCGGAG GTGCCAGTAT CATCTTGCAA
GGCCATCCTA GCTTTCCAAC CTTCCCAAGA AAGATGACTC AGTTCGAAAT CGACTCTGGC
TTTGTTCTTC CTTATACTGT CAAAGCTGTT TTGGGTAATA GTGGTGTTGG TGCTGTCTTG
TTGACTGTCT ACCTTGCTGT AACATCTACC TCGAGTGCTC AAATGATTTC TGTTTCGTCG
ATCTTATCCT TCGATATCTA CAAGAAGTAC ATTAACCCTC AAGCTAACAA CAAGCAGATG
ATCCGAGTTG CTCATTTCGG TGTTGTCTTT TTCGGCTTGT TTGCTGCTGG TTTCACACTT
ATGCTTCACT ACGTTAATGT TAACATGACA TGGATGGGCT ACTTCATGTC CATCGTCATC
TGTCCAGGTG TGTTCCCACT TATTTTCACT GTTACTTGGG ATAGACAAAC CACAATCGCT
GCCTTTGTCG CTCCTATTAC CGGATTGGTC TTCGGTTTCG CCGTATGGAT CACTACCACC
AATAAACTCT ATGGAGAAAT TACTATCGAC ACTCTTGGTA TGCAAATCCC TTGTCTCTAC
GCTTCCTTGA CTGCCTTGTT CCTTCCTGCC GTTGTAAGTA TTATTCTCAG TTTGACTGTT
TTCCCAAAGA AATTTGACTG GAAAGAATTG CTGGAAGCTA AGCTTTTGAT CAAGGCTACG
GGATCTGAAT CTGAATCTGA ATCTGAAAGT GAAGGTGAAA AGTCTGCCAT CAAAGAAAAG
TCCACCATCG AAAATGTTCA GGTTTTCACA GTGGAAGAAG ACTTAGGAGT TCGTGCAGCT
GATCCAGCCG AGTTGAATTT CTATTCCAAA GTTGCCAAAA TTGGTGTTGT TGTTGGTTTG
TTGCTTACAT GGGTGTTATG GCCATTGCCA TTGTACCGTG ATTGGATTTG GTCTGCTGCA
TACTACAAAG GTTATGTTGT AGTTGGGTTA ATTTGGTTAT ACGTTGCCTT TATCATCATT
GGGTTGGCTC CTATTTGGGA AGGTCGTCAT GCTATCAAGA CAGTCAGTAA TGGAATCTAT
AGAGATTACA TCAAACGATC TAAGTAA
 
Protein sequence
MSDTDTVFLL PKGAGYGVLL GVGAVFAIGM ILTTKFLQKY LNENATSTET FSVADRSVKR 
FLACSSVYAS WSWADEILQT VSMIYNYGVQ ASFYYGAGLS VQMCVMALIG ISAKKRAPQA
HTSLEIVGLR YGKATHILFL FLCLVTNLIS CSSMLLSASG AISIISGNLS IVASTLLIPF
GVLLYTTFGG LKATFLTDYV HSFVLLLILI VINTKVLASK EIGGLNGLYS QLLEHSQDRY
IEGNYQGSIL TGKSQGSIIF GLVLTCGNFG LTVMDSSFWQ KSFSAEVKAT VPSYLGSAVL
IFANTWPIGA IIGGASIILQ GHPSFPTFPR KMTQFEIDSG FVLPYTVKAV LGNSGVGAVL
LTVYLAVTST SSAQMISVSS ILSFDIYKKY INPQANNKQM IRVAHFGVVF FGLFAAGFTL
MLHYVNVNMT WMGYFMSIVI CPGVFPLIFT VTWDRQTTIA AFVAPITGLV FGFAVWITTT
NKLYGEITID TLGMQIPCLY ASLTALFLPA VVSIILSLTV FPKKFDWKEL SEAKLLIKAT
GSESESESES EGEKSAIKEK STIENVQVFT VEEDLGVRAA DPAELNFYSK VAKIGVVVGL
LLTWVLWPLP LYRDWIWSAA YYKGYVVVGL IWLYVAFIII GLAPIWEGRH AIKTVSNGIY
RDYIKRSK