Gene PICST_72069 details


Gene Information

Locus tag: PICST_72069
Symbol: DAL7
ID: 4838665
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 947951
End bp: 949892
Gene Length: 1942 bp
Protein Length: 520 aa
Translation table: 12
GC content: 41%
IMG OID: 640389980
Product: putative MFS allantoate transporter
Protein accession: XP_001384136
Protein GI: 150865072
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
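
As a quick consistency check on the fields above, the gene length should equal End bp minus Start bp plus one, assuming the coordinates are 1-based and inclusive (the listed values are consistent with that). An unspliced CDS encoding a 520 aa protein needs 3 x (520 + 1) bp once the stop codon is counted. The Python sketch below is only illustrative; the function names are hypothetical and are not part of the database record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int) -> int:
    """Coding length in bp for an unspliced CDS, counting one stop codon."""
    return 3 * (protein_aa + 1)


gene_len = span_length(947951, 949892)   # Start bp / End bp from the record
coding_len = cds_length(520)             # Protein Length from the record
flanking = gene_len - coding_len         # bases in the span that are not coding

print(gene_len, coding_len, flanking)
```

The span length reproduces the reported 1942 bp. The coding portion accounts for 1563 bp, which suggests the listed gene span also includes untranslated flanking sequence.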


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.145167
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TAACGCTAAA ACATTGTCCG CTGCAACTCT TCATCACCCC AAGACTTCCC GCCCCTTAAG 
CCGTTCATTT TTTCTTCACT AAATGAAATT TCACTTAGGC ACTGTCTAAC TACTATTTAG
TATCACCTGA ACACCGAGAT TTCTCGGAAA GGCAGCAAAT AATTTCTGCT AAAGTATCTC
CTTTTCTTTC AGATTTGATA GTGTGTGTTC TTTCGTCGTT TCCGCCATCC TTCTATACGT
TTCTCATTGA TAGACCATTG TCTCTTGGGT CTTACGTACT TATTTAGGTG CAGAACCAGA
ATTTTGTCTT CCTATAGCTG TTAAATACTT CAGAATGGGT GGATGGACGA TTGTTGGAGA
CTCTTTCAAG GGAGGTGATG TGAAGCTAGT GACTGAACAT TTAATTGAAC TGTCCCGGAA
GTCGAATGTA GACTATGGTG CCGAATTTCT CGCTGAAAAC GAACACCAAT ACCCTCCGGC
GACCGAAGAT GAAGAAAGAA GAATTATAAA AAAACTTGAT TTCATCTTGG TACCGATGCT
TTTCTTCACA GCGACGATGG GAGCAGTTGA CAAGGTTTCC CAGGGTACAG CGGCGATTTA
TGGCTACATT CCGGACAATA ATTTAACAGG ATCTCAGTAT TCCTGGCTAG GATCAATTCT
TTTCCTTGGT TCCTTAGTCG GGATGTTCCC CATGTCCTTT TTCTTGCAGA GGTTTCCATT
GGGAAAAGTT CTAGTAACCG CTTCACTTTT CTGGAGTAGT TTAACACTTC TATTGTGTGT
TGGTAGAAGT TTCGCTGGGT TGGCTGCTAT TCGGTTTCTT ATGGGGTTTG TCGAATGTGC
TATTGTCCCT GGGTGTACTC TTGTCTGCGG AAGATTTTAT TCCAAGGGAG AAATTGCTAC
TCGTTTGGCT TTTGTTTTTG CCTTTGCTTC TTCAGTTATT AATGGGTTTT TGTCATGGTT
GGTTGGTTAT TTTCATCATT CCACAGTCCC AGCCTGGAAG TTTCTCTACA TCTTGGTGGG
TTCTATTTCA TTTCTTTGGG GTTGTCTCAT GTGGGTATAT TTGCCAGATT CTCCCTTGAA
CGCCAAATTT CTTACCAACC AAGAAAAGGT CTACGTCGTG AGACGGATTA TCAGAAAAAG
CAATGGCGGT GTTCAGAATA ATAATTGGGA TTGGCAACAA GTCAAGGAGG CAGTTCTCGA
CAGCAAAACT TATGTCATAT TCTTTTTCAA CATTGGTATA AATATTTGCA ATGGTGGTCT
CTCAACGTTT TCTTCCATAA TCATTTTTAA CCTTGGATTT AATGCAATGA AAGCATCGTT
GATGGGTATT CCAACAGGTG TCATTGCAAC CCTTGCTACC ATTTTCTTCA CGTTTTTATG
TAACAAATTC AACAACAAGC GTTGCTTGAT TGCAATTATT TCACTTATAC CTCCGGTTGT
TGGGTCAGCT ATCATATATG CCGTGGACCG GCTGAACGTG GCACCGCAAT TGGTTGGTCT
CTACTTGCTT TATTTCTACT TTGCTCCGTA CGTCGTGATG ATGTCCCTTG CCCAAGCTAA
CACTTCTGGA AACACCAAGA AATCTGTCAC CTATTCTATC AATTATTTGG GTTATTGTGT
GGGAGCTCTT ATTGGCCCTC AAACTTTTAG GGCTAACCAG GCTCCAAGAT ACACTGGAGG
TTTTATCGCC TTGCTTTGTT CTTTCCTTAT TTGCATGATG TTTGCTGGCA TATATTGGGC
GATATGTATT TGGGAGAATT CCAAGAAATC GAGGAAGTAC GACGAAAACG AAGTGTATCT
GGAAAAGCCG GTGTCCAGAG ATGAAAAGGA GATTGACGAT GAGGAATATT ACGATCTATC
TGATTCCCAG CGAAAGCATT TCCGTTACAC TACATAGTAA TTAGAGTACA ATTAATTTAA
TTCAACGACT TATAAGACTT TC
 
Protein sequence
MGGWTIVGDS FKGGDVKLVT EHLIESSRKS NVDYGAEFLA ENEHQYPPAT EDEERRIIKK 
LDFILVPMLF FTATMGAVDK VSQGTAAIYG YIPDNNLTGS QYSWLGSILF LGSLVGMFPM
SFFLQRFPLG KVLVTASLFW SSLTLLLCVG RSFAGLAAIR FLMGFVECAI VPGCTLVCGR
FYSKGEIATR LAFVFAFASS VINGFLSWLV GYFHHSTVPA WKFLYILVGS ISFLWGCLMW
VYLPDSPLNA KFLTNQEKVY VVRRIIRKSN GGVQNNNWDW QQVKEAVLDS KTYVIFFFNI
GINICNGGLS TFSSIIIFNL GFNAMKASLM GIPTGVIATL ATIFFTFLCN KFNNKRCLIA
IISLIPPVVG SAIIYAVDRS NVAPQLVGLY LLYFYFAPYV VMMSLAQANT SGNTKKSVTY
SINYLGYCVG ALIGPQTFRA NQAPRYTGGF IALLCSFLIC MMFAGIYWAI CIWENSKKSR
KYDENEVYSE KPVSRDEKEI DDEEYYDLSD SQRKHFRYTT
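
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The Python sketch below shows the effect on the first 96 coding bases of the gene sequence above, starting at the ATG that matches the protein's N-terminus. The helper names are hypothetical, and the table is built by hand from the standard code with the single CTG override, not taken from any particular library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODONS = ["".join(c) for c in product(BASES, repeat=3)]
TABLE_1 = dict(zip(CODONS, STANDARD_AA))

# Table 12 differs from the standard code in the meaning of CTG (Ser, not Leu).
TABLE_12 = dict(TABLE_1)
TABLE_12["CTG"] = "S"


def translate(dna: str, table: dict) -> str:
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 96 coding bases, copied from the gene sequence in this record.
fragment = (
    "ATGGGT" "GGATGGACGA" "TTGTTGGAGA" "CTCTTTCAAG" "GGAGGTGATG"
    "TGAAGCTAGT" "GACTGAACAT" "TTAATTGAAC" "TGTCCCGGAA" "GTCGAATGTA"
)

print(translate(fragment, TABLE_12))  # MGGWTIVGDSFKGGDVKLVTEHLIESSRKSNV
print(translate(fragment, TABLE_1))   # MGGWTIVGDSFKGGDVKLVTEHLIELSRKSNV
```

Only the table 12 output agrees with the listed protein sequence (residues 1 to 32). The standard code places a leucine at position 26, where the record has a serine.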