Gene PICST_31639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31639 
SymbolDAL8 
ID4838776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1324576 
End bp1326135 
Gene Length1560 bp 
Protein Length519 aa 
Translation table12 
GC content41% 
IMG OID640390091 
Productallantoate permease 
Protein accessionXP_001384207 
Protein GI150865120 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.338334 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000117277 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCTAAGG TTGTTGAAGT TGAAGATTCT ACAAACGATT CTGCTTCGTT CGGCTCAGAT 
AAGAAGGGCA AAACTGTCGA AGTTAAAGAG GAGCTTTTAA CTGAAAGTGA GCTCGCTGGG
TATAATTTAT ATGAAAAAGC CCAAGAAATC AATTCGGAAG AAGAGCAGGC AATCAGCAAA
AAGTTACTTT GGAAGGTAGA TAGAAGAATC GTCCCCTTAT TATGTATCAC TTATACATTG
CAATTCTTGG ATAAGTTGTC TCTTAACTAT GCTGCCGCCT ATTCTCTAAA AGAGGATTTG
AACTTGATTG GCCAACGTTA TTCGTGGGTT GCTGCTATCT TCAATTTTGG GTACTTGTTC
TGGGCTCTTC CTGGCAATTA TATCATTCAG AGAGTTCCTG TAGCCAAATA TACTGGCTTC
ATGTTGTTCT CCTGGTCTAT TATCTTGATT GGTCACATCG GTTTGAAAAA CTATGGGGGA
GCTTTGGTTA TCAGATTCAT TCTTGGGATG TTTGAAGCGC TGATTAGTCC TTCTTGTATG
AACATCTGCA GTTCGTTCTA CACCGTTAAA CACCAGCCAA TCAGAATGTG TATCTTCCTC
TCGTTCAACG GTGTAGCTAC CATGGTTGGA GCTCTCTTGG GTTTTGCCTT GGGTCATGCC
ACCAACTCTA GCTTGAAACC ATGGAAGCTT ATATTTATGG TCATTGGACT CATGAACTTT
GTGTGGTCCT TGATCTTCCT CTGGTTGTGT CCTGATTCTC CAGATAAAGC CAAATTCTTG
ACTGAAGAGG AGAGAGCAAT CTTAGTCAAA GAAGTAGCCT CCAATAACCA GGGTCTTAGA
GATGTTAAAT TCAAGAAACA CCAGGCTATA GAAGCTATTA GTGATGTTGG GGTTTGGATA
TTGGCATTTG TTGGTTTGGC TTGTGGAGTG ATTAACGGAG GAAGTTCCAA CTTCTCTTCT
GCTTTGATTA AAGGGTTCGG TTTCTCTGGT TTGCAAGCAA CTGCGCTTCA ATTACCAACA
GGTGCGATTG AATTAGTAGT AGTGGCCGCT ACTGGTTTTG CTGTATTCAG TTTTAAGAAT
ACTAGAACTG TTGCCTTGTT CCTCATTTGT ATTCCTCCAT TGGGTGGTTT AATAGGAATT
CACGTCATTT CTTTGGAACA TAAGTGGTCT TTGGTTGGTT GTACTTGGCT TCAATTCATC
ATTGGAGGTC CAGTCATCTT GTGTTGGATC TTGTTAAATG CAAATGTTTC AGGTTCTTCA
AAGAAGACAA TAGCAAATGG CTTATGGTTT GCTTTCTACG CTTCAGGGAA CATCATTGGT
GCCAATGTTT TTTATACTTA CGAGGCTCCA AAATATCGTA GTGGTATGAT TGCCTTGATG
ACATGCTACT GTGGTATAAT GGTTTTGGCT GTGGCGTACA GAGGTTTGCT TACGTTCAGA
AACAAGAAGA AAATGGAAGA ACAGGGTGAA ATGACACCGG AAATGGAAGA ACAAGCTATT
CTTGACGGGT TCAAGGGCTT GACTGATTTC GAAAACTCTG GTTTCCGTTA TGTATTATGA
 
Protein sequence
MSKVVEVEDS TNDSASFGSD KKGKTVEVKE ELLTESELAG YNLYEKAQEI NSEEEQAISK 
KLLWKVDRRI VPLLCITYTL QFLDKLSLNY AAAYSLKEDL NLIGQRYSWV AAIFNFGYLF
WALPGNYIIQ RVPVAKYTGF MLFSWSIILI GHIGLKNYGG ALVIRFILGM FEASISPSCM
NICSSFYTVK HQPIRMCIFL SFNGVATMVG ALLGFALGHA TNSSLKPWKL IFMVIGLMNF
VWSLIFLWLC PDSPDKAKFL TEEERAILVK EVASNNQGLR DVKFKKHQAI EAISDVGVWI
LAFVGLACGV INGGSSNFSS ALIKGFGFSG LQATALQLPT GAIELVVVAA TGFAVFSFKN
TRTVALFLIC IPPLGGLIGI HVISLEHKWS LVGCTWLQFI IGGPVILCWI LLNANVSGSS
KKTIANGLWF AFYASGNIIG ANVFYTYEAP KYRSGMIALM TCYCGIMVLA VAYRGLLTFR
NKKKMEEQGE MTPEMEEQAI LDGFKGLTDF ENSGFRYVL