Gene PICST_70645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_70645 
Symbol 
ID4836654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1027188 
End bp1029097 
Gene Length1910 bp 
Protein Length570 aa 
Translation table12 
GC content41% 
IMG OID640387969 
Productpredicted protein 
Protein accessionXP_001382971 
Protein GI126132892 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0413957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.317652 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTACAATCCT TATTTTATTG TAATGTCCGT GAGTGCTCAA GACGAAAAGG TGGAATCCTT 
AGACAAGCAA TTCTCCAGTT CCGAGACTGA TCTCCAAGCT GCTACCAACC CTAGCGTTGA
GGATGAAGGC AGAACTCCAA CTGAAGATGA AATGAAGACT TTGAGACATG TCTCTGAATC
TATTCCTATT TCTTGTTGGT TAGTTGCAAT TGTCGAATTG GCAGAAAGAT TCTCCTACTA
TGGTTTATCT GCTCCATTCC AAAACTATAT GCAAAATACC CCTCAAGATT CACCAAAGGG
TGTCTTGGGT TTGAACCAAC AAGGTGCTAC AGCCTTATCT TACTTCTTCC AATTTTGGTG
TTACGTTACC CCAATTTTGG GTGGTTGGAT TGCTGATACT TACTGGGGAA AGTACAAGAC
TATCTTTGTT TTCTGTGTTA TCTACATCGT CGGTATTTTC ATTCTTTTCA TCACTTCTCT
TCCTTCAATT ACTAGCCGTA CTACTGCTCT TGGTGGTTAC GTGGCTGCTA TCATCATTAT
CGGTCTTGCT ACCGGTGGTG TCAAGTCAAA CGTCTCTCCT TTGATCGCCG ATCAAATTCC
AAAAACTCAC CCTGTTATTA AGGTATTGAA GTCTGGTGAA AGAGTCATTC AAGACCCTAA
TATTACCATT CAAAATGTTT TCATGTTCTT CTATCTTATG ATTAACATTG GTTCCATGTC
TGTCATTGCT ACCACTCAAT TAGAAGCTCA CGTTGGTTTC TGGGCTGCTT ACTTATTGCC
ATTTTGTTTC TTCTTTATTG CCCTTCTTGC CCTTGTTCTC GGCAGAAACC AATATGTCAA
GGTTCCAGTT GGTGACAAAG TGATCAACAA AACCTTCAAG TGTGCCTGGA TTGGTTTGAG
AAACGGTTTC AACATGGACG CTGCTAGACC ATCCATGAAC CCAGAAAAAG AATTCCCATG
GAACGACCAT TTCGTTGATG AAGTTGTCAG ATCTGTTTAC GCTTGTAAGG TTTTTGTTTT
CTACCCTATC TACTGGGTTG TTTACGGTCA AATGTTGAAC AACTTCGTCT CCCAAGCAGG
TCAAATGGAA TTGCACGGTT TGCCTAACGA TATCTTGCAA GCTATCGATT CGCTTGTCAT
TATTATCTTC ATTCCTATCT TTGAAAGACT TGTATACCCA TTCATCAGAA AATTCACTCC
TTTCAAGGCT ATCACTAAGA TTTTCTGGGG TTTCATGTTT GGTGCTGGTG CCATGGTATA
CGCCGCCGTC TTGCAACACT ACATTTACAA GACCGGCCCA TGTTACGACC ATCCAAAGGC
TTGTGCTCCA CAATACCTTA ACGTTCCAAA CCGTGTTCAC GTTGCCATTC AAGCTCCAGC
TTACTTCTTG ATTGCTATTT CTGAGATTTT GGCTTCTATT ACTGGTTTGG AATATGCCTA
CACAAAGGCT CCAGTTTCCA TGAAGTCGTT CATTATGTCC CTTTTCTTGT TGATGAATGC
TTTCGGATCT GCTCTTGGTA TTGCTTTGTC ATCCACTTCT GAGGACCCCA AGATGGTCTG
GACCTACAGT GGATTGGCAG TTTCTTGTTT CATTGCCGGT ATTGCTTTCT GGTTGTGTTT
CAAGCACTAC AACTACAAGG AAGACGAATT GAACGCTTTA GAATACGATG ACGAAGAAGA
AAGAAACATT GTGCCTGTTA CTTCATTGTC ACACTCCGTC AAGAGTCTTG CATAAGGAAA
TTTGACATTT CTTTCTACTT TACGACTATG CAAATCCGAG AGGACTACGC AATCGAGCGT
TTTCATTGTA TTTATAAGGA CAGGCTTTCT TCATTCATAA CATTTCATTC ATAAAACTTC
TATAGAATGT CCTCACATCT CTGTATAATA ATTAATAAAA TCAAGCATAA
 
Protein sequence
MSVSAQDEKV ESLDKQFSSS ETDLQAATNP SVEDEGRTPT EDEMKTLRHV SESIPISCWL 
VAIVELAERF SYYGLSAPFQ NYMQNTPQDS PKGVLGLNQQ GATALSYFFQ FWCYVTPILG
GWIADTYWGK YKTIFVFCVI YIVGIFILFI TSLPSITSRT TALGGYVAAI IIIGLATGGV
KSNVSPLIAD QIPKTHPVIK VLKSGERVIQ DPNITIQNVF MFFYLMINIG SMSVIATTQL
EAHVGFWAAY LLPFCFFFIA LLALVLGRNQ YVKVPVGDKV INKTFKCAWI GLRNGFNMDA
ARPSMNPEKE FPWNDHFVDE VVRSVYACKV FVFYPIYWVV YGQMLNNFVS QAGQMELHGL
PNDILQAIDS LVIIIFIPIF ERLVYPFIRK FTPFKAITKI FWGFMFGAGA MVYAAVLQHY
IYKTGPCYDH PKACAPQYLN VPNRVHVAIQ APAYFLIAIS EILASITGLE YAYTKAPVSM
KSFIMSLFLL MNAFGSALGI ALSSTSEDPK MVWTYSGLAV SCFIAGIAFW LCFKHYNYKE
DELNALEYDD EEERNIVPVT SLSHSVKSLA