Gene PICST_64420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_64420 
Symbol 
ID4841097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp248922 
End bp251363 
Gene Length2442 bp 
Protein Length813 aa 
Translation table12 
GC content37% 
IMG OID640392412 
Productpredicted protein 
Protein accessionXP_001386446 
Protein GI150866749 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0169] Shikimate 5-dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0116008 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATATATG AGAGATGTGA GAACCTGCTC CAACATAAGT TTCAATCGGT TGTAATTGTA 
GGTTTGCGAG GAGTGGGAAA GTCAACTTTG GCCTTGATGG CTCTGGCTAC TTTGGGTCTT
GAATATGTAG ATCTTGAAAG ATGTCTAGTC GACTATACTG GAGTGTCTGA TGCAACTTTT
ATCAAAAGCG TTTCTAAGGA AGAATTTATA CATTTGCAGT ACAAGTTGAT TGTAAGATCA
TTTAGAGCTA ACAAGAATAA GAGAGCCATT TATGTATTGC CAGCAAGTTC GATCAACAAT
TCCGCCGTGA TGGAATATTT AAGAAATAAT TGCAATTGTC ACTGCGTTAT CAATATCGAA
TGTGACGAAG ATAGAATATT GAAGTATGTC AACTACACTG GTGAATATCA AAAGGGAATC
CTGTCAATAC AGTCCGGAAT ATCCCAATAT AGATCTGTTG CAAACTATAA TTTCTTCAAC
TTGGAATCAA ATTTAGATGT TTGGAAAAAG TATTCGTTCT CTAAGGTGGA TGATTCCAAG
CAAATTGAAG TTACACCACA TCTAATATTG AAACCCGTAG AATTGGAATT CATCAATTTT
ATGTCATTTA TTCTTTGGAA TCCAATACCG GAACCCTCTG ATGTTCTTTT GAAGCATTAT
CAGAGAAGCA ATTTAAGAAG CTTTTCTAAC TGTTTACAGC TCACATTCCC TTATGATGCC
AGAAACATTC ACGTGAACAA TTTTGGCGAC GTCTTGAATG GAGTAGATGC AATCGAAGTC
GGAGTTGATC TTATCCAGTT AATTAGAACA AAAATCATGC ATGTCAATTT GCGGCTAGAT
GAGTATATCG CGAGAATAAG AAGAAGTACT CGAGGATCCA TACCAATACT TGTTGGTATT
AAGAATACTA TCCCTGAGCT CAACAACTTC ATTATGGAGA GTACTGTTGA CTCTGTCATT
AGTACATCTC AAATTAAACA GGATTTTAGA AGCTTTTACT TCAGTATCTT GTACTCGATT
ATCAAAGTAG CTGCTGACTA TGTGGTTCTC AATTTGGAAA TTTTTCTTTT TGATGAAGTC
AATTTCCTAA ACGATATTAT CGTGAATGAT TCTTTCTATA TCATGGACCA ACTTAGACAT
ATGCAAGGTA ACAGTCTGTT TCTAGGAACA TATAACTCGA ACTGCGATGA ATTCTGGGCC
ATTCAAAAAA CAATAGGCAA AGTACGGTGC CTTGATATTA TCGATTTGAC GAATGATTTA
CAGATATCCA TGGTAAGAGT CACTTCTACT GCTCGTTGTG TATCAGATAA TTACAAGATA
CAAACATTCT TGGAATATTG CATGAAGAAA TATCCAGAAA CTACGGTATC AGCTTATAAT
CAAGGAACAA ATGGAAAAAT ATCCAAGATC TTAAACAAAG TGTTGACACC TGTATGTTCT
CCAAGTGCTG ATCCTCTGCA AGGTGAGCTC ACGTCATATG CTCTTAACCA GTCGAGGTTT
TCATGTTTTT TGCAGCCAAC GCTACGATTT TTTGCTGTGG GCAGAAGTGA TTCAAGTATT
CTCTATCAAT TTGTCTATAG ACTGGTGTTT GAGAAATTGG GACTTTCATA TTTTTTCAAA
ATTCTCGAAG ATGTATCAAT TGATGAATTA TTGAAGTCTC CCGATTTTGG AGGGGCGATA
TTAGCAACTC CTATAGAAAT TAAAGCGAAC GAATTTGCTG GTAAATCATC AGCACATGCC
GCAGAGATTG GATTAGTGGA TTCTATAATT GCAGAAAGAT CTCTTGATGA TCCATCTAAG
TTCCTACTTC GAGGGGAAAA TGCGGATTGC TTAGCAATCA AGGTCTATAT CTCTGACAAT
GTTGCTCCCA TAAATGCTGT AAGCCACAAT AAGAGTGTTC TCGTCATAGG TTCAGGTTTC
AAAAGTCGTG CTGCTATTTA TTCGTTGATG AAACTTGGCT ATAAGAACAT TTTATTGTAT
AGCCCTATGT CCATTGCTAG ACAGACTGAG AAGGATGTAT CTCTATCGCA CAATTTGGAT
TCTTCCAGGA AATTGGATTC CCACAATCTA TTGGCTAAGA TTACAATAAT TACTGAAGAA
CAATTCCAAA ATGGTATCCT CCCTGATGAT CTTCTATATC CAACAATAAT TATTAATTGC
ATGAGTGATG AGGATGTTCC TATCGATGGT CAGGTCAAGC TATCTGCAAA TTGGCTCAAG
AGTCCTTCTG GTGGAATATT CTTGGACACA CATATTGCAA ACAAAGAAAT TACAACCCTA
AATGAGAGCC TGGAATGGGA AAAAGGATGG ATCAAGACTA ATGGACTTGA ATTCTTGCTT
GCCAAAACAT TGATCCAGTT TGAGTTGTTT GTGGGTAAAC CAGCACCAAG AGAGCTTATA
AAATCCATTC TAATAGAGCA TTATCCTAAT GAAGTTCAAT AG
 
Protein sequence
MIYERCENSL QHKFQSVVIV GLRGVGKSTL ALMASATLGL EYVDLERCLV DYTGVSDATF 
IKSVSKEEFI HLQYKLIVRS FRANKNKRAI YVLPASSINN SAVMEYLRNN CNCHCVINIE
CDEDRILKYV NYTGEYQKGI SSIQSGISQY RSVANYNFFN LESNLDVWKK YSFSKVDDSK
QIEVTPHLIL KPVELEFINF MSFILWNPIP EPSDVLLKHY QRSNLRSFSN CLQLTFPYDA
RNIHVNNFGD VLNGVDAIEV GVDLIQLIRT KIMHVNLRLD EYIARIRRST RGSIPILVGI
KNTIPELNNF IMESTVDSVI STSQIKQDFR SFYFSILYSI IKVAADYVVL NLEIFLFDEV
NFLNDIIVND SFYIMDQLRH MQGNSSFLGT YNSNCDEFWA IQKTIGKVRC LDIIDLTNDL
QISMVRVTST ARCVSDNYKI QTFLEYCMKK YPETTVSAYN QGTNGKISKI LNKVLTPVCS
PSADPSQGEL TSYALNQSRF SCFLQPTLRF FAVGRSDSSI LYQFVYRSVF EKLGLSYFFK
ILEDVSIDEL LKSPDFGGAI LATPIEIKAN EFAGKSSAHA AEIGLVDSII AERSLDDPSK
FLLRGENADC LAIKVYISDN VAPINAVSHN KSVLVIGSGF KSRAAIYSLM KLGYKNILLY
SPMSIARQTE KDVSLSHNLD SSRKLDSHNL LAKITIITEE QFQNGILPDD LLYPTIIINC
MSDEDVPIDG QVKLSANWLK SPSGGIFLDT HIANKEITTL NESSEWEKGW IKTNGLEFLL
AKTLIQFELF VGKPAPRELI KSILIEHYPN EVQ