Gene PICST_41479 details


Gene Information

Locus tag: PICST_41479
Symbol:
ID: 4837120
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 2737590
End bp: 2739395
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 12
GC content: 42%
IMG OID: 640388435
Product: predicted protein
Protein accession: XP_001382759
Protein GI: 126132468
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3104] Dipeptide/tripeptide permease
TIGRFAM ID:
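
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and assuming the final codon is a stop codon (consistent with the gene sequence ending in TAA). The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2737590
end_bp = 2739395

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1806 601"
```

Both results agree with the Gene Length (1806 bp) and Protein Length (601 aa) fields.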


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GGTGATTACG ACTACAATGA CGCAAACAAC TACTCCACCC ACTATGTTGA TGAATACAAC 
CCAAAGGGTT TGAGAGTCCC AACTGACGAA GAATCTCAAT CCCTCAGAAG GATTTTGGGT
AGAGCTTCTT ATGCTTCTTA CTTGATCTGT TTGTGTGAGT TGGCTGAAAG AGCCTCTTAC
TATTCGGTCC AGGGTATCTT GTCTAACTTT ATTCAAAGAC CTATGCCTGA AAATTCCCCT
CACGGATGGG GTGCACCAGC TGACAGAAAC TCGAATGTTT CTGCCGGTGC TTTGGACCAA
GGTCTTCAAG CTGCTAACGC CCTTACCCTT TTGCTTACTT TCCTTGCTTA CGTTGTACCA
TTATATGGTG GTTTCATTGC CGATACCAAG ATTGGTAAGT TCAAAGCTAT TTGGGTTGGT
GTTATCGCTG GTTTTGTTTC TCACGTTTTG TTCGTTATCG CAGCTATCCC ATCTGTCCTT
AAGAACGGCG GTGCTGCTTT GGCTCCAACT GTGCTTGGTA TCATTACTTT AGCTTTCGGT
ACTGGTTTCA TCAAGCCAAA CTTGTTACCT CTTCTTATGG ACCAATATAG AGAACAGACT
GATGTTGTCA AAGTCTTGCC ATCTGGTGAA AATGTTATTA TCGATAGACA AAAGACTTTG
GAAAGAATGA CTTTGATTTT CTATTGGGCG ATTAACATTG GTGCTTTTTT CCAATTGGCC
ACTTCTTATA TCGAAAGAGA TGTTGGTTTC TGGTTGGCTT TCTTCATTCC CATAATCATA
TACTTGGTTT TGCCAATTGT CTTGGTTTTC TTGCAATCTA GATTGGTCAG AGATACTCCA
CAGGGTTCCG TCCTTGAAAA CGCTTGGAGA GTTACAAGAG TCACTTTCTC TAAAGGGTGG
ATCGGTAGAT GGAGGAATAA CACCTTGTGG GAGTACGCGA GGCCATCCGT CATGCTTGAA
AGAGGAAGAG AATTTTACAA TGAAAATACA AAATCTCCAA TCACTTGGGG TGATCAATGG
GTGTTGGACA TCAAGCAAAC TGTCAACTCT TGTAAGATTT TCATCTACTT CCCAATCTTT
AACTTGGCTG ATAGTGGTCT TGGTTCTGTC GAAACTTCTC AAGCTGGTGC CATGACCACT
AACGGTGTTC CAAACGATTT GTTCAACAAC TTTAACCCAT TGACCATTAT TATCTTGATT
CCAATTCTTG ACTACCTTGT CTACCCTATG TTGAGAAAGT ATAGAATTGA ATTCCGTCCA
GTTTGGAGAA TTTTCCTTGG TTTCATTTTG GCTGGTTCTT CTCAAATTGC CGGTGCAATC
ATTCAATGGA AAATTTACAA GACTTCACCA TGTGGTTACC AAGCTACTAC TTGCTCTGAA
GTGTCTCCAT TGTCGGCTTG GCAAGATGTT TCTTTGTACA TTCTTTCTGC TGCAGGTGAA
TGTTTTGCTA ATACTACTGC TTACGAATTG GCCTACACTC GTTCTCCTCC TCACATGAAG
GGCCTTGTTT TGGCTTTGTT CTTGTTCACT TCTGCCATCT CTGCTGCTCT TTCACAAGCA
ATCACTCCAG CTTTGAGCGA CCCACACTTG ATCTGGCCAT TCGCTGGTAT TGCCATCGCA
ACTTTTGTTG CCGCATTCGT ATTCGTCTAT CAATTCAGAA ACTTGCACAA GGAAATGGAA
GAGGAAAGGA TTCTCAGAGA AGCCTTTGAT AAATCTGAGA GAAGTAATCT CATATCGCAC
GGTGGAATTG AAGATGACAA CAACTTGCAA GCAGTTACAT CCATCAAGTC TGCCGTTGGT
AAGTAA
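
The GC content field can be recomputed from a gene sequence laid out in space-separated blocks of ten bases, as above. This is a minimal sketch: the helper name `gc_content` is hypothetical, and it assumes the sequence contains only A, C, G and T once whitespace is removed.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks used for layout) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First ten-base block of the gene sequence above.
print(gc_content("GGTGATTACG"))  # prints 50.0
```

Passing the full gene sequence to the same function gives the value to compare against the reported GC content.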
 
Protein sequence
GDYDYNDANN YSTHYVDEYN PKGLRVPTDE ESQSLRRILG RASYASYLIC LCELAERASY 
YSVQGILSNF IQRPMPENSP HGWGAPADRN SNVSAGALDQ GLQAANALTL LLTFLAYVVP
LYGGFIADTK IGKFKAIWVG VIAGFVSHVL FVIAAIPSVL KNGGAALAPT VLGIITLAFG
TGFIKPNLLP LLMDQYREQT DVVKVLPSGE NVIIDRQKTL ERMTLIFYWA INIGAFFQLA
TSYIERDVGF WLAFFIPIII YLVLPIVLVF LQSRLVRDTP QGSVLENAWR VTRVTFSKGW
IGRWRNNTLW EYARPSVMLE RGREFYNENT KSPITWGDQW VLDIKQTVNS CKIFIYFPIF
NLADSGLGSV ETSQAGAMTT NGVPNDLFNN FNPLTIIILI PILDYLVYPM LRKYRIEFRP
VWRIFLGFIL AGSSQIAGAI IQWKIYKTSP CGYQATTCSE VSPLSAWQDV SLYILSAAGE
CFANTTAYEL AYTRSPPHMK GLVLALFLFT SAISAALSQA ITPALSDPHL IWPFAGIAIA
TFVAAFVFVY QFRNLHKEME EERILREAFD KSERSNLISH GGIEDDNNLQ AVTSIKSAVG
K
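
The record lists translation table 12, the alternative yeast nuclear code, in which CTG encodes serine rather than leucine. The sketch below shows how the protein sequence could be derived from the gene sequence under that table. The codon table is built from the compact NCBI amino-acid string for table 12; the function name `translate_table12` is hypothetical, and reading starts at the first base in frame 0, which is an assumption.

```python
from itertools import product

BASES = "TCAG"
# NCBI translation table 12: identical to the standard code except CTG -> S.
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate DNA in frame 0 with table 12, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop layout spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE_12.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence above.
print(translate_table12("GGTGATTACG ACTACAATGA CGCAAACAAC"))  # prints GDYDYNDANN
```

The output matches the first block of the protein sequence, and the final six bases of the gene (AAGTAA) translate to K followed by a stop, matching the last residue listed.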