Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_29213 |
Symbol | DUR5.2 |
ID | 4851942 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 3258410 |
End bp | 3260577 |
Gene Length | 2168 bp |
Protein Length | 668 aa |
Translation table | |
GC content | 40% |
IMG OID | 640393650 |
Product | urea transport protein |
Protein accession | XP_001386945 |
Protein GI | 126276093 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0581449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCTG CTTCCCAATC TGTTGAATTA TTGGGCCAAG GTGCCGGGTA TGGAATTTTG GTCGGTGTCG GTGCCCTTTT CGCCGGCGGT ATGATCTTGA CCACAAAATT ATTACAAAGA TATTTACATG AGAATTCCAA TTCCACCGAA ACTTTTGCTG TGGCTAATAG AAGTGTTGGA GTATGTACTT GCAGAATTTT CAGATTTTCA CTCGCATATT TTTTCATTGA TAAGATTGCG ATCTAGTTGC AAAATTTAAT ATTTTATTGA ACCATTTTAC TAACTATTTG CTCTAGACAT TTCTTTCAGC TTCGGCAGTC TACTCATCCT GGTCTTGGGC CACGGAATTG TTGTGGACGT CTACAATGGT GTACAACTAT GGTATCCAAG CTTCATATTA CTATGGTGCT GGTTTAGCTG TACAAATTGC CGTGATGTCT GTGATCGGGA TTCACGCTAA GAAGAAGGCT CCTAGTGCAC ACACATCGTT AGAAATAGTC GAATTGAGGT ATGGAAGAGT TGGACATATA CTCTACTTAT TCCTTGGATT GGCAAACAAC TTGCTTTCGT GTGCTTCCAT GATTCTTGGT GCATCTGGTG CCATTTCCAT TATTGCTGGC AATTTGCACC CAGTCGCATC CACCATGCTT ATTCCATTTG GTGTGTTACT TTACACTGTG GTTGGTGGAT TAAAGGCCAC TTTCTTGACT GATTTTGTCC ATACATTAAT CTTACTTTTG GTGTTATGTT ATCTTAATAC CTCGGTTCTC ACTTCTGACC AAATCGGTGG CTTGGACGGC TTGTACAACG CTTTAGTCAA GAAAGACGGT GAACGTTACA TAGAAGGTAA CTTCGATGGT TCCATATTGA CAGGAAAGTC TAAGGGTGCC TTATTTTTCG GTTTGATTTT AACTGCTGGT AACTTCGGCT TGACTGTGTT GGATTCTTCT TTCTGGCAAA AGACTTTTTC AGCAAGTCCA AGAGCAACAG TTCCAGCTTA CTTAATTGCT TGTTTCTTCA TTTTTGCAAA TGTCTGGCCC TTGGGTTCAA TTATCGGTGG AGCTTCTATT GTTTTAGAAG GTACACCAGG CTGGCCAACA TACCCAAGAG AGATGACTCA ATATGAAATC GATTCTGGTT TCGTTTTGCC ATATGTATTG AAGCAAGTGT TGGGTAACGG AGGAGTTGGA GCTCTTCTTT TGGTCATCTA TTTGGCTGTT ACATCCACCG TCAGTGCTCA AATGATTTCC GTGAGTAGTA TTCTTTCCTT TGATATCTAC AAGAAGTACA TCAATCCAAG TGCTCAGAAC AAACAAATGA TCAGAATCTC ACACATTAGT GTCGTATTCT TTGGTCTTTT CTGTGCTGCA TTCTCTGTGA TGTTGCATTA CGTTGGTGTG AATATGACCT GGTTAGGTTA CTTTATTCCA ATGATTATCT GTCCTGGTGT TCTTCCATTG ATTTTTACCA TTACTTGGGA TAGACAAACT ACTCTTGCTG TTGTTGCTGC CCCTATATCT GGATTTATAT TCGGCATTTC AATCTGGTTG TCTACTGCTT ATCACTACTA TGGAGAAGTT ACAATTACAA GTACCGGTGG CCAATTGCCA GCTTTGTTCG GTTCTTTGAC TTCTTTGTTT CTTCCAGGTG TTGTATCCAT AGTCATCAGT TTGTTCTCCT CACAAAAGTT CGACTGGAGC CAATTGCAAC AAGCTGATCT TATTATTGCA GACGACACTT CAATTGAGGA GATAGTCAAG AATGATGGAG AAGTGTCTAG CGACATAAAC GAAAAGACCA GCCCCGTTCA TACCACCGTC AATGATACAG GCTCCAAGCA AGAGAGTTCT ACACAAGAAA GTGAAAAGAG TGTTATTGGT GAAGAATTAG AACCTCATCA ATTGTCTGAA CGTGAACTTG ATTTCTGGAT TAAGATTTCT ACAGGTGCTG CTATATTTGT TCTTTTGATC ACTTGGGTTC TCTGGCCATT ACCGTTATAC AGAGATTGGA AGTTTACAAG AGCTTATTTT AAGGGTTATG TCACTGTTTC TCTTATTTGG CTCTACTCTG CATTAATCGT CATCGGTATT ATGCCTCTCT ACGGTGGAAG GCATTCAATT GCTAGAATCT TTAAAGGAAT CTATAGAGAC TTCATCAAGA GAAGCTAG
|
Protein sequence | MSSASQSVEL LGQGAGYGIL VGVGALFAGG MILTTKLLQR YLHENSNSTE TFAVANRSVG TFLSASAVYS SWSWATELLW TSTMVYNYGI QASYYYGAGL AVQIAVMSVI GIHAKKKAPS AHTSLEIVEL RYGRVGHILY LFLGLANNLL SCASMILGAS GAISIIAGNL HPVASTMLIP FGVLLYTVVG GLKATFLTDF VHTLILLLVL CYLNTSVLTS DQIGGLDGLY NALVKKDGER YIEGNFDGSI LTGKSKGALF FGLILTAGNF GLTVLDSSFW QKTFSASPRA TVPAYLIACF FIFANVWPLG SIIGGASIVL EGTPGWPTYP REMTQYEIDS GFVLPYVLKQ VLGNGGVGAL LLVIYLAVTS TVSAQMISVS SILSFDIYKK YINPSAQNKQ MIRISHISVV FFGLFCAAFS VMLHYVGVNM TWLGYFIPMI ICPGVLPLIF TITWDRQTTL AVVAAPISGF IFGISIWLST AYHYYGEVTI TSTGGQLPAL FGSLTSLFLP GVVSIVISLF SSQKFDWSQL QQADLIIADD TSIEEIVKND GEVSSDINEK TSPVHTTVND TGSKQEKPHQ LSERELDFWI KISTGAAIFV LLITWVLWPL PLYRDWKFTR AYFKGYVTVS LIWLYSALIV IGIMPLYGGR HSIARIFKGI YRDFIKRS
|
| |