Gene PICST_80841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_80841 
Symbol 
ID4851077 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp881296 
End bp882792 
Gene Length1497 bp 
Protein Length488 aa 
Translation table 
GC content42% 
IMG OID640392785 
Productconserved hypothetical protein 
Protein accessionXP_001387805 
Protein GI126274063 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.801804 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATCA CAAATCTATC TGTAGATCAT TCTCTGACTG ACAGCGATAA GCATGCTTCC 
AAGTCCAAGA TCATATTTGT CACCTTCGTG TTTATCCTAT CGCTTGTTTC GTTTGTAACA
CAGACAGAGT TCACTTCCCA AGCTTATCAA TTAGGATTCA GCGAACCGGT AGTATTGCTT
TTGGTTACCC ATGGCTCCTG GTGGATTCTC TGGCCATTAC AGGCAATAGG CGTATCTTTG
TATAGAACCG TAAACAAATA CAGACAAAAC CAGAGCCAGA GTCAAATCTT GCAAGACCAG
AACCAGACCC AGAGACAGCA TCGATACAGT GCTAGTTCTG ACAGACAATA CCATCGTTTA
GCCTCCACTT CAGAATTGCT CGAATCTCAC ATCGATTCTG AACATCCTTC TGACGCTATA
CCAGGACTAA CTAGACCCGT GAACTATGTC TCATACTTCA AGAAATGTTT GGTAAAACAA
TTTCACAATG TGTACCACAC CTCTATCTTG ATCTTTGAAA GTAATGTCAA CGACGACAGA
ACCACCGAAA ACTTGAATTC ACTTATAGAG AAGAATCCTC ACGTTTCGTA CTCAAATTCC
ATCACTGAAT GTGTCAAAAC ATTCTTTGCT ACTCCTTCTA TTCAATACGT AGTTAAAAAG
GCTCTCCTTA TCACATGTTT ACTTACCGTC GCTGGCTCGA CTTGGTACGG TGCCATGGCA
ATGACATATG CTTCAGATGT TACAGCTATC TATAACTGTT CTGCATTCAC TGCTTATGCA
TTTGCCATTC CCATCTTGAA AGAGAAGTTT TCTTGGCTCA AGGCCAGCTC TGTAGTCATT
GCAGTACTGG GAGTCTTCAT TGTTGCCTAC TCTGGAAGCG ATGCAGACTC GCTGTCCAGC
GAAGATTACC CCTACAGATT CTGGGGAAAC TTGATCATCT TGATTGGAGC CATCTTGTAT
GGTTACTATG AAGTTCTTTA CAAGAGATAC TTGTGTATTC CTCCTCACTT AACTGCCATC
ATAACTCCAC GTCGTCAGCT GACATTCGCC AACTTCGTCA TGGGATTCTT TGGTTTTTTC
ACCTGCTTGA TTGTTCTCAC AATAATCTTG ATCGCTGAAG TTTTCCGCAT TCATAGCTTC
AATTTCTTCA ACTATGGCGA AGACACTACA CTCATCTGGA AGTATATAGT AGGCTCTATC
TTCCTGAACT TGATCTTCAG TGCCTCTTTC TTGACATTGA TGGCACTTAC CAGTCCTGTT
CTTTCATCTG TCAGTTCGCT CCTCACAATC TTCTTGATTG GTTTGGTTGA ATGGGTCATG
TTTGGCAATG TTTTGGATTT CCAGCAATTG TTGGGAGACT TCTTGGTTAT TGTAGGGTTT
GTTCTCTTAA CAATTGCATC CTGGAAGGAA ATCAGTGAAG GACAAGACGA TGACGATGAT
ATGGACGTCG TCAGTACATA TTCATTTGCT GTCAGTACTG AAAGCAGCGG CAACTAG
 
Protein sequence
MAITNLSVDH SLTDSDKHAS KSKIIFVTFV FILSLVSFVT QTEFTSQAYQ LGFSEPVVLL 
LVTHGSWWIL WPLQAIGVSL YRTSQILQDQ NQTQRQHRYS ASSDRQYHRL ASTSELLESH
IDSEHPSDAI PGLTRPVNYV SYFKKCLVKQ FHNVYHTSIL IFESNVNDDR TTENLNSLIE
KNPHVSYSNS ITECVKTFFA TPSIQYVVKK ALLITCLLTV AGSTWYGAMA MTYASDVTAI
YNCSAFTAYA FAIPILKEKF SWLKASSVVI AVLGVFIVAY SGSDADSLSS EDYPYRFWGN
LIILIGAILY GYYEVLYKRY LCIPPHLTAI ITPRRQLTFA NFVMGFFGFF TCLIVLTIIL
IAEVFRIHSF NFFNYGEDTT LIWKYIVGSI FLNLIFSASF LTLMALTSPV LSSVSSLLTI
FLIGLVEWVM FGNVLDFQQL LGDFLVIVGF VLLTIASWKE ISEGQDDDDD MDVVSTYSFA
VSTESSGN