Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_30685 |
Symbol | DAL4.1 |
ID | 4837719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 673488 |
End bp | 675215 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389034 |
Product | allantoin transport |
Protein accession | XP_001383412 |
Protein GI | 126133775 |
COG category | [F] Nucleotide transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG1953] Cytosine/uracil/thiamine/allantoin permeases |
TIGRFAM ID | [TIGR00800] NCS1 nucleoside transporter family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.22109 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCTT TAAAGAAACT AGACAAGTGG ATAGCTGTCG AGGATACCTC CACAGAAAGA GGTGAAGACC AAATTAGAAG TAATGAGGAT TTGGATCCTA CTCCCTCTGA CCGAAGAACC TGGAAAATGT ACAACTACAT TTTGATCTGG GCACAATCTG CATTCAACGT TAATGAATGG AATACTGGTG CTAGTTTGAT GAAAGCTTCG GGTCTTCCAT ATGGTCAGAC CATTGGAAGT GCCATTTTCT CCATTTTTGT TGCCGTTATC TTTACTATTG CCAATGCCAG AGCTGGCTCT ACATATCACA TTGGCTACCC TACCCTCGCT AGAGCAACAT TCGGAGTTTA TGGTGCTTAC TTTTTCGTAG CTGCTAGAGG TTTTGTGGCC ATTATTTGGT TCAGTGTTCA ATCATACTAT GGCTCCATGT GTTTGGATGT TGCTTTGAGA TGTATGTTTG GACACAAATG GCTTGATTTG AAAAACCACT TGCCTGCTAG TGCTGATGTG CAATCCAGGA TTTTACTTGC TTTCTTCTTG TTTTGGTTGA TTCAATTCCC TTTGATGTTT GTACATCCAA GACAGATCAG ACATTTTTTC ACTGTAAAAT CTTTTGTTTT GCCTTGTGCA ACCATTGGTT TGCTAATATT CTGTGTCAAG AAGGGTCATG GTCCCGGTAA TTACGACTTG GGCTTACCTA TTAGCACTTC CTCTTCTGCT ATTGGTTGGG GATGGATGTC CGTCATGAAC TCTATTTTCG GTACAATTTC CCCCATGATA ATCAACCAGC CTGATATTGC CAGATATGCA AAAAAACCAA GTGACACCAT TTTGCCTCAA GCTATTGGAT TTGTACTTGC TAAAATCATG ATCATGGTTG TTGGCATGGT TGCTACAGCT TCTATCTACA GGTCTTATGG AGAGGTTTAC TGGAACATGT GGGACTTGAT GAACGCTATC CTTGATCATT CTTGGAACGC CGGGGCCAGA ACAGGTGTCT TCTTTGTGGC AGTTTCTTTT GGAATTGGTA CTGCTGGAAC TAATATCTTT GGTAACTCTA TCCCATTTGC TTGCGATATC ACCGGTTTAT TACCAAAGTA CTTCACTATT TTAAGAGGAC AGATAGTAGT TGCCATTCTA GCATGGGCCA TCGTTCCATG GAAGTTCTTA ACTGATGCTG CAAAGTTCTT AACTTTCTTG GGAAGTTACT CTATCTTTGT TGGACCCATC CTTGGTTGTA TGTTGGCTGA TTACTACTTT GTTAAGAGAG GTAACATCCA TGTACCTTCT TTGTTTACAA AGAAAAGTTC AGGGGTATAC CATTATGTCT ACGGATGGAA CCTTTGGGCT TGTTTTGCTT GGGCTGGAGC TGCCAGTATT TGTATTCCTG GTTTGTACAG AGCTTATTAT CCAGAATCTC TTAGCATTAG TGCTACTAGA ATGTACCAAA TGGGATACAT TCTAACGACT ATTAGTAGTA TGGTCTTCTA CTACTGCTTG AGTTTGATCT TCAAACCACA AATTTATCCA GAAGCTCACA GGGATACTCC AAAGACATGG GAATACATGA GAACCACCGA TGGATTCTTT GAAGATGACT CTCCAATAGG CAAGGTTGGC TACTTTGGTT CTGTCGATGT TTTCACAGGT GAGAAAGTTG ACACATCTGA AGGTTCTAGT GTCAAAACTA AGAGCGAGAA GATATTGGAA ACTGTCTCTA TTGTTTAA
|
Protein sequence | MDALKKLDKW IAVEDTSTER GEDQIRSNED LDPTPSDRRT WKMYNYILIW AQSAFNVNEW NTGASLMKAS GLPYGQTIGS AIFSIFVAVI FTIANARAGS TYHIGYPTLA RATFGVYGAY FFVAARGFVA IIWFSVQSYY GSMCLDVALR CMFGHKWLDL KNHLPASADV QSRILLAFFL FWLIQFPLMF VHPRQIRHFF TVKSFVLPCA TIGLLIFCVK KGHGPGNYDL GLPISTSSSA IGWGWMSVMN SIFGTISPMI INQPDIARYA KKPSDTILPQ AIGFVLAKIM IMVVGMVATA SIYRSYGEVY WNMWDLMNAI LDHSWNAGAR TGVFFVAVSF GIGTAGTNIF GNSIPFACDI TGLLPKYFTI LRGQIVVAIL AWAIVPWKFL TDAAKFLTFL GSYSIFVGPI LGCMLADYYF VKRGNIHVPS LFTKKSSGVY HYVYGWNLWA CFAWAGAASI CIPGLYRAYY PESLSISATR MYQMGYILTT ISSMVFYYCL SLIFKPQIYP EAHRDTPKTW EYMRTTDGFF EDDSPIGKVG YFGSVDVFTG EKVDTSEGSS VKTKSEKILE TVSIV
|
| |