Gene PICST_66905 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_66905 
Symbol 
ID4837310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1024510 
End bp1026351 
Gene Length1842 bp 
Protein Length577 aa 
Translation table12 
GC content41% 
IMG OID640388625 
Productpredicted protein 
Protein accessionXP_001382970 
Protein GI126132890 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.425094 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.705777 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAACT TATCGGACGA GAAACTCCCC ACAGATTCTG AATTGCAAAA ACAGGATAGT 
GACATCCTCC GCGACCCCAG TGTGATCTCC AATGACATTG ATGATGAGGG TAGAGAATTG
CCATCTGAAG AGGAAATGAA GACCTTGAGA CATGTCTCTG GCAACATCCC CTTAAGATGT
TGGTTAGTTG CAATTGTCGA ATTGGCAGAA AGATTCTCCT ACTATGGTTT ATCTGCTCCA
TTCCAAAACT ATATGCAAAA CACTCCAGAA GATTCACCAA AGGGTATCTT GGGTTTGAAT
CAGCAAGGTG CTACAGCATT ATCATACTTC TTCCAATTTT GGTGTTACGT TACCCCAATC
TTTGGTGGTT GGTTGGCTGA TACTTACTTG GGAAAATTCA ATACCATCTT TGTTTTCTGT
ATTGTCTACA TCATTGGTAT CTTCATTTTG TTCATTACAT CCATTCCTGC CATCACCTCT
AAGACGACTG CTACTGGTGG TTTTATTGCT GCTATTATCA TAATTGGTTT TGCAACCGGT
GGTGTCAAGT CTAACGTTTC CCCATTAATT GCCGATCAAG TTCCAAAGGT AAAACCACAC
ATCAAGGTTT TGAAATCTGG AGAAAGAGTC ATTGTCGACC CTCACATCAC TATCCAGAAT
GTTTTCATGT TCTTCTACCT TATGATTAAT GTTGGCTCTT TGTCAGTCAT CGCTACCACT
CAATTGGAAC ATCACGTTGG ATTCTGGGCT GCCTACTTGT TGCCATTTTG TTTCTTCTTC
ATCGCTCTTG CTGCTCTTGC CTTGGGAAGA AACCAATACA TTAAGACCCC TGTCAGTGAC
AAGATCGTCA ACAAGACCTT CAAGTGTGCC TGGATTGGTT TGAGAAACGG TTTTAACTTG
GAAGCTGCCA AGCCATCCAA CAACCCAGAG AAGAATTACC CATGGAGTGA CAAGTTTGTT
GAAGAAGTCA GAAGAGCCAT TTACGGTTGT AAGGTGTTTG TCTTTTACCC TATCTACTGG
GTCACCTATG GACAAATGAC TAACAATTTC ATTTCTCAAG CTGGTCAAAT GGAATTGCAT
GGCTTGCCAA ACGATATTTT GCAGGCAATT AACTCGATGT CGATTATTGT ATTTATCCCT
ATTTGTGAAA GATTTGTTTA CCCATTCATC AGAAGATTCA CTCCTTTCAA GGCTATCACA
AAGATCTTCT TTGGTTTCAT GTTCGCTACA GGTGCTATGG TCTATGCCGC CGTCTTGCAA
CATTACATCT ACCAGGCTGG TCCATGTTAC AACTTTCCAA AAGCTTGTGC ACCTGAGTTC
AAGACTGTTC CAAACCACAT TCACGTTGCC ATTCAAGCTC CTGCTTACTT CTTGATTGCC
ATGTCAGAAA TTTTTGCCTC CGTTACTGGT TTGGAATATG CCTACACAAA GGCTCCAGTT
TCCATGAAGT CGTTTATCAC TTCTCTCTTT TTGGTTACAA ACGCTTTCGG ATCTGCTCTT
GGTATTGCTT TGTCATCCAC TTCTGAAGAT CCAAAGATGG TCTGGACCTA CACTGGTTTG
GCAACTGCCT GTTTCATTGC TGGGTGGATC TTTTGGTTCT GCTTCAAGCA CTACAACTAC
AAGGAAGATG AATTCAACAG GTTGGAATAC GCAACAGAAG AAGAATACAA AAAGCCTACC
CTCGATGGTC TTCAGCCAAT TCCTTCTGCT AATTCATACA AGGGACTTGC TTAGTACTTC
CCGCATGCGT CGTACTTATT TATACACATA TATACTATAT TCGTAACACC GACTTGTGTT
TAATAGTCAG AGATCTTAAT GCATTCATGA TTGAAATTTT AG
 
Protein sequence
MSNLSDEKLP TDSELQKQDS DILRDPSVIS NDIDDEGREL PSEEEMKTLR HVSGNIPLRC 
WLVAIVELAE RFSYYGLSAP FQNYMQNTPE DSPKGILGLN QQGATALSYF FQFWCYVTPI
FGGWLADTYL GKFNTIFVFC IVYIIGIFIL FITSIPAITS KTTATGGFIA AIIIIGFATG
GVKSNVSPLI ADQVPKVKPH IKVLKSGERV IVDPHITIQN VFMFFYLMIN VGSLSVIATT
QLEHHVGFWA AYLLPFCFFF IALAALALGR NQYIKTPVSD KIVNKTFKCA WIGLRNGFNL
EAAKPSNNPE KNYPWSDKFV EEVRRAIYGC KVFVFYPIYW VTYGQMTNNF ISQAGQMELH
GLPNDILQAI NSMSIIVFIP ICERFVYPFI RRFTPFKAIT KIFFGFMFAT GAMVYAAVLQ
HYIYQAGPCY NFPKACAPEF KTVPNHIHVA IQAPAYFLIA MSEIFASVTG LEYAYTKAPV
SMKSFITSLF LVTNAFGSAL GIALSSTSED PKMVWTYTGL ATACFIAGWI FWFCFKHYNY
KEDEFNRLEY ATEEEYKKPT LDGLQPIPSA NSYKGLA