Gene PICST_33169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33169 
SymbolDAL4.2 
ID4840168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1589619 
End bp1591400 
Gene Length1782 bp 
Protein Length593 aa 
Translation table12 
GC content42% 
IMG OID640391483 
Productallantoin permease 
Protein accessionXP_001386006 
Protein GI150866414 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0477743 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.268616 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCTA AAGACGAAAA AGTTAACACA GTCGTCTCCG TAGAGAGAGG CTCTGTGGAA 
GATACTGAAG CTGGCTTCAA AAATGAAACC TTCTTACAGA AGCTTATAAG ATGGGTAGAA
GTCCAACCCA AGGGTGAATT GACTCACTCT CAGATGTTCC TTTACAATCA TGATTTGCGT
CCAGTTGAAG CTGAAAGACG TCAATGGGCT TGGTACAACT ACGTCTTCTT CTGGATTGCT
GATTCCTTCA ATATTAATAC ATGGCAGATT GCAGCAACTG GAATTCAACT GGGAGGTATG
AATTGGTGGC AGACATGGAT TTCTGTTTGG ATAGGTTATA CAATTACTGG ATTATTTGTA
TCACAGGCTG CCAGAGTTGG TATATTCTAT CACATTTCCT TCCCTGTTTC TGTACGATCT
GCGTTTGGAA TTTATGGCTC CCTCTGGCCA GTCCTCAACA GAGTGGTTAT GTCTGCTGTT
TGGTATGCAG TTCAATGTTC AGTGGCTGGT CCATGCTTTG AGGTCATGTT GAGATCAATA
TTTGGCCAAA ATCTAGACAA GACGATGGCT AATGGCATTA GTGATCCTGA CTTGACTACC
TTTAAATTCT TAAGTTTCTT CTTGTTCTGG CTTTTCTCCT TGCCATTCCT TTGGTTTCCA
CCTCATAAGG TTAGACACTT ATTCACAGTC AAAGCCTATG TTGTTCCAGT TGCAGGTATC
GCATTCTTGG TATGGACTAT TGTCAAGGCA GGAGGCATCG GTCCCGTGGT TCATACTCCA
GCTACAGCTC AAGGAAGTAA GTTGGGTTGG GCATTTGTTA CATCCACAAT GAACTGCTTG
GCTAATTTCG CAACGTTAAT CACTAATGCG CCTGACTTCT CTCGTTTTGC TACGAAGCCA
TCCTTCAGTA TGAAATATTT AGTCTACTCT CTTTCTATTC CATTGTGCTT CTCTCTAACA
TCCTTAATCG GAATCTTGGT CACCTCAGCA TCTCAGTCCA TGTACGGGGA GGCATACTGG
TCTCCAATAG ATGTTCTAGG AAGATTCTTG GACAACTACA CATCTGGTAA CAGAGCTGGT
GTATTTTTGA TTGGACTTGC ATTTGCATTG GCCCAATTAG GAACCAACAT TAGTGCCAAT
TCACTTTCTT TTGGTACCGA TGTCACAGCT TTATTGCCTA GATTCATGAG TATTAGACGG
GGAAGCTACT TGTGTGCTGC TGTTGCACTT TGTATCTGTC CATGGAAATT GACTTCTTCT
TCTTCCATGT TCACGACTTA TTTATCTGCT TACTCCGTTT TCCTTTCATC TATTGCTGGT
GTAGTAGCTT GCGACTATTA CTATCTCAGA AGAGGTCGTA TTTTCTTGAC TCACTTATAC
TCGATGACGG CACCAGAATC TGCACTTCCT TCTGTCTACA AGTACAATTT CATTGGGTGC
AATTGGAGAG CTTATGCTGC CTATATAGGC GGCATCATGC CCAACATTGT TGGTTTTGTT
GGTGCCACTG AGACTCATAA GGTCCCCATT GGAGCCACCA GAGTTTATGA CTTAAACTTC
TTTGCTGGCT TCTTTTCAGC ATTCATTTTG TACTACTTAT TGTCTTATTA CTTTCCAGTT
TCTGGAGTTC CTGCTGTTGG TCCATTCGAG AAAGGATGGT TCGAGATAAA TGCCCATGTA
GAGGATTTCG AGGAAGAGTT GGCAGGCAAC ATTATTAATC CTGAAGATAC TCCTGAAGGT
GCATTTACTT CTATTTCATA TGCTTCTGGT AGTATGAAGT GA
 
Protein sequence
MDSKDEKVNT VVSVERGSVE DTEAGFKNET FLQKLIRWVE VQPKGELTHS QMFLYNHDLR 
PVEAERRQWA WYNYVFFWIA DSFNINTWQI AATGIQSGGM NWWQTWISVW IGYTITGLFV
SQAARVGIFY HISFPVSVRS AFGIYGSLWP VLNRVVMSAV WYAVQCSVAG PCFEVMLRSI
FGQNLDKTMA NGISDPDLTT FKFLSFFLFW LFSLPFLWFP PHKVRHLFTV KAYVVPVAGI
AFLVWTIVKA GGIGPVVHTP ATAQGSKLGW AFVTSTMNCL ANFATLITNA PDFSRFATKP
SFSMKYLVYS LSIPLCFSLT SLIGILVTSA SQSMYGEAYW SPIDVLGRFL DNYTSGNRAG
VFLIGLAFAL AQLGTNISAN SLSFGTDVTA LLPRFMSIRR GSYLCAAVAL CICPWKLTSS
SSMFTTYLSA YSVFLSSIAG VVACDYYYLR RGRIFLTHLY SMTAPESALP SVYKYNFIGC
NWRAYAAYIG GIMPNIVGFV GATETHKVPI GATRVYDLNF FAGFFSAFIL YYLLSYYFPV
SGVPAVGPFE KGWFEINAHV EDFEEELAGN IINPEDTPEG AFTSISYASG SMK