Gene PICST_30685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30685 
SymbolDAL4.1 
ID4837719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp673488 
End bp675215 
Gene Length1728 bp 
Protein Length575 aa 
Translation table12 
GC content41% 
IMG OID640389034 
Productallantoin transport 
Protein accessionXP_001383412 
Protein GI126133775 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.22109 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCTT TAAAGAAACT AGACAAGTGG ATAGCTGTCG AGGATACCTC CACAGAAAGA 
GGTGAAGACC AAATTAGAAG TAATGAGGAT TTGGATCCTA CTCCCTCTGA CCGAAGAACC
TGGAAAATGT ACAACTACAT TTTGATCTGG GCACAATCTG CATTCAACGT TAATGAATGG
AATACTGGTG CTAGTTTGAT GAAAGCTTCG GGTCTTCCAT ATGGTCAGAC CATTGGAAGT
GCCATTTTCT CCATTTTTGT TGCCGTTATC TTTACTATTG CCAATGCCAG AGCTGGCTCT
ACATATCACA TTGGCTACCC TACCCTCGCT AGAGCAACAT TCGGAGTTTA TGGTGCTTAC
TTTTTCGTAG CTGCTAGAGG TTTTGTGGCC ATTATTTGGT TCAGTGTTCA ATCATACTAT
GGCTCCATGT GTTTGGATGT TGCTTTGAGA TGTATGTTTG GACACAAATG GCTTGATTTG
AAAAACCACT TGCCTGCTAG TGCTGATGTG CAATCCAGGA TTTTACTTGC TTTCTTCTTG
TTTTGGTTGA TTCAATTCCC TTTGATGTTT GTACATCCAA GACAGATCAG ACATTTTTTC
ACTGTAAAAT CTTTTGTTTT GCCTTGTGCA ACCATTGGTT TGCTAATATT CTGTGTCAAG
AAGGGTCATG GTCCCGGTAA TTACGACTTG GGCTTACCTA TTAGCACTTC CTCTTCTGCT
ATTGGTTGGG GATGGATGTC CGTCATGAAC TCTATTTTCG GTACAATTTC CCCCATGATA
ATCAACCAGC CTGATATTGC CAGATATGCA AAAAAACCAA GTGACACCAT TTTGCCTCAA
GCTATTGGAT TTGTACTTGC TAAAATCATG ATCATGGTTG TTGGCATGGT TGCTACAGCT
TCTATCTACA GGTCTTATGG AGAGGTTTAC TGGAACATGT GGGACTTGAT GAACGCTATC
CTTGATCATT CTTGGAACGC CGGGGCCAGA ACAGGTGTCT TCTTTGTGGC AGTTTCTTTT
GGAATTGGTA CTGCTGGAAC TAATATCTTT GGTAACTCTA TCCCATTTGC TTGCGATATC
ACCGGTTTAT TACCAAAGTA CTTCACTATT TTAAGAGGAC AGATAGTAGT TGCCATTCTA
GCATGGGCCA TCGTTCCATG GAAGTTCTTA ACTGATGCTG CAAAGTTCTT AACTTTCTTG
GGAAGTTACT CTATCTTTGT TGGACCCATC CTTGGTTGTA TGTTGGCTGA TTACTACTTT
GTTAAGAGAG GTAACATCCA TGTACCTTCT TTGTTTACAA AGAAAAGTTC AGGGGTATAC
CATTATGTCT ACGGATGGAA CCTTTGGGCT TGTTTTGCTT GGGCTGGAGC TGCCAGTATT
TGTATTCCTG GTTTGTACAG AGCTTATTAT CCAGAATCTC TTAGCATTAG TGCTACTAGA
ATGTACCAAA TGGGATACAT TCTAACGACT ATTAGTAGTA TGGTCTTCTA CTACTGCTTG
AGTTTGATCT TCAAACCACA AATTTATCCA GAAGCTCACA GGGATACTCC AAAGACATGG
GAATACATGA GAACCACCGA TGGATTCTTT GAAGATGACT CTCCAATAGG CAAGGTTGGC
TACTTTGGTT CTGTCGATGT TTTCACAGGT GAGAAAGTTG ACACATCTGA AGGTTCTAGT
GTCAAAACTA AGAGCGAGAA GATATTGGAA ACTGTCTCTA TTGTTTAA
 
Protein sequence
MDALKKLDKW IAVEDTSTER GEDQIRSNED LDPTPSDRRT WKMYNYILIW AQSAFNVNEW 
NTGASLMKAS GLPYGQTIGS AIFSIFVAVI FTIANARAGS TYHIGYPTLA RATFGVYGAY
FFVAARGFVA IIWFSVQSYY GSMCLDVALR CMFGHKWLDL KNHLPASADV QSRILLAFFL
FWLIQFPLMF VHPRQIRHFF TVKSFVLPCA TIGLLIFCVK KGHGPGNYDL GLPISTSSSA
IGWGWMSVMN SIFGTISPMI INQPDIARYA KKPSDTILPQ AIGFVLAKIM IMVVGMVATA
SIYRSYGEVY WNMWDLMNAI LDHSWNAGAR TGVFFVAVSF GIGTAGTNIF GNSIPFACDI
TGLLPKYFTI LRGQIVVAIL AWAIVPWKFL TDAAKFLTFL GSYSIFVGPI LGCMLADYYF
VKRGNIHVPS LFTKKSSGVY HYVYGWNLWA CFAWAGAASI CIPGLYRAYY PESLSISATR
MYQMGYILTT ISSMVFYYCL SLIFKPQIYP EAHRDTPKTW EYMRTTDGFF EDDSPIGKVG
YFGSVDVFTG EKVDTSEGSS VKTKSEKILE TVSIV