Gene PICST_64463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_64463 
Symbol 
ID4840951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp529427 
End bp532234 
Gene Length2808 bp 
Protein Length935 aa 
Translation table12 
GC content42% 
IMG OID640392266 
Productpredicted protein 
Protein accessionXP_001386694 
Protein GI150866932 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.658129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AGCAATAACA ATAATAACAA TAACAACAAC AATAACGATA ACGATGAGCC TAACGAGATA 
GTAACGAACG CCGGTACAGG CTTGTATCCT CCGCTATTGG CCATTCCAAT GAAGGATCGT
CCCCCTCTTC CCGGACGTCC TTTTGCTATC AATATTACGG ATCCAGAAGT GATCAGGTCG
ATCTACACCA TCATCGACAA GAGGGAGCCC TACTTTGTAT TGTTCCACGT CAAAGATCCA
AATGAAGGAG ATACTGATGT CATCAATAGT AAGGATTCAG TGTACAACAT TGGTGTACAC
TGTCAAATTA TCAGACACAC GACTCCAAGA CCAGGAGTGT TCAATGTCTT GGGGTATCCT
CTTGAAAGAT GTCTGTTGGC TGACCTTAGC ACTCCCAGTG AGAAGAAGGG CGAAACTGAG
ACCAGAAAGG AGGGAGAAAA CTTTCCTACT TCTTATTTGA AAGGTCTCAA AGTGTCCTAT
GCTACCGTGA AACCTGTCAA AGATGAGCCT TTCGATAAGA CTTCGACCGA TATCAAGTCA
TTGGTAGAAT CCTTAAAGGC TCTTTTGTCG AAGATGGGCG CAAAGAATCC CCTTGAAAAG
CTCCAGATCA AAGAAGGTAC AGAATTGGTG AACGATCCAC CCAGATTTGC CGATTTTGTA
GGCTCCACTA TTCATGGAGA CCCCAAGAAG ATCCAAGAGA TCTTGGAATC ATTGAACATC
CAGACAAGAT TATCAAAGGC TTTGGAATTG TTGAAAGTTG AACTCAAGGC AAGCTTAATT
AAAGAGAACA CCATCCATAA CTTGAGTACC AAGGCCGATG AATACCAAAC GAGACTCTTC
ATAAAGGAAT TTATCAAGGA ATTGCAAAAG CGTGCTGGAA TTGTAGAGTC TGATGACAAA
AAGACGTCGA AATTTGATGA GCGTCTCAAA CATTTGAAGA TGACAGAAGA GGCTCTTGAA
GCATACAATG CCGAAAAGGC AAAAATGGAA AGTCAGAACG AACACTCGAG TGAGCTTGGT
GTTAGTGAGA GATACTTGGA TTGGTTGACT TCGATTCCCT GGGGAATCTA TTCTAAGGAT
CGCTTTAATA TCAAGCAGGC CAGAGAGATC TTGGACAGGG ACCACTATGG GTTGAAAGAT
GTCAAGGACA GAATCTTAGA GTTCATCTCT ATGGGCAGAG TTTCAGGAAA AGTCGATGGG
AAGATATTGT GTTTGACAGG CCCACCCGGT ACTGGTAAAA CATCCATAGC CAAGTCTATT
GCCGAGTCAT TGAACCGTAA GTATGTTAGA ATCGCCATGG GTGGTATCCA GGATGTTCAC
GAAGTTAAAG GTCATAGAAG AACATATGTT GGATCAATTC CTGGTCGTAT CATTTCTGCG
TTGAAGCAAG CCAAAACGTC CAATCCATTG ATGTTGATTG ATGAAATTGA CAAGTTGGAC
TTAAGTCGTA GTGGGGGTGC CTCTTCAGCC TTTTTGGAGA TCTTGGACCC TGAACAGAAT
AATGCCTTTG TTGACAACTA CATTGATGTC AAGGTCGATT TGTCCAAGGT GTTGTTTGTT
TGTACTGCTA ATTATTTGGG CAACATTTCT CCTCCGTTGA GAGACCGTAT GGAAATCATT
GAAGTCAATG GTTACACCAA CAATGAGAAA ATTGAGATTG CCAAAAGACA CTTGATTCCA
GATGCAGCCA AAAAAGCTGG ATTGGAAGGT GGGCATGTTG TAATTGAGAC GAAGACCATT
TCTAGATTGA TAGAGAAGTA CTGTCGTGAA AGTGGATTGA GAAACATCAA AAAGCTTATC
ACCAGAATCT TCAGCAAGGC CTCTCTCAAG ATCGTGGAAG AAGTTGAAGC TAGAGAAGGC
GAATCAAAAT CGAAATCTGA AGAAGCTAAG TCAGAAGCTA TCACTGGTTC TGTTACTGAG
ATTTCTGTTG AAGATGCTAC AGTAAAGGCC CAGTCCATTG AAGAACCTAG TGTAGAATCT
GCTTCTCAGA AGGTTGACGA AGCCAAACCT GTCGAATCAG AAGAACTTAA ATCAGATGAA
GAAGAAGAGG AAGTCGTGAA GTTGGAAATT CCAGATGACA TAAAGTTGGA AATCACTTCT
GCCAACTTGA AGGATTATGT TGGACCAGAG ATTTATACTA GGGACCGTGT CTACGACATC
CCTCCTCCTG GTGTTGCTAC TGGTCTTTCG TATAGTACTT CTGGTAATGG AGATGCATTG
TACATTGAAT CTATCTTAAC ACACTCTATT GGATCAGGTT CGGGACATGC TAGTATTCAT
GTTACTGGTA GCCTCAAGGA TGTCATGAAG GAATCTGCTT CCATCGCTTA TTCTTTTGCC
AAACTGTACA TGGTCAAGAA CTACCCAGAA AACAGATTCT TTGAAGCTGC TGAGATTCAT
GTTCACTGTC CTGACGGTGC TATTCCAAAG GATGGTCCTT CCGCTGGTAT TTCCTTCACA
TCTTCATTGA TTTCATTAGC TCTTCAAAAG CCTTTGCCTC CTACAATTGC CATGACAGGT
GAGATCACTG TTACTGGTAG GGTATTGGCC GTTGGAGGTT TAAGAGAAAA GATCTTAGGT
GCTAAGAGAT ACGGATGTAA CACCATTATC TTCCCCAAGG ATATTGAAAA CGAACTTGAA
GAAATCCCTG AAGAAGTAAA GGAGGGTGTT AAATTTATCC CCGTCGAATG GTACCAGGAT
GTATTTGACG AAATATTCCC CAACTTGTCT AGTGATGAAG GTAACGAGGT ATGGAAGGAA
GAGTTCAACA AATTGGATAA GAAGAAGGCT AGCAACAAGA AGAAATGA
 
Protein sequence
SNNNNNNNNN NNDNDEPNEI VTNAGTGLYP PLLAIPMKDR PPLPGRPFAI NITDPEVIRS 
IYTIIDKREP YFVLFHVKDP NEGDTDVINS KDSVYNIGVH CQIIRHTTPR PGVFNVLGYP
LERCSLADLS TPSEKKGETE TRKEGENFPT SYLKGLKVSY ATVKPVKDEP FDKTSTDIKS
LVESLKALLS KMGAKNPLEK LQIKEGTELV NDPPRFADFV GSTIHGDPKK IQEILESLNI
QTRLSKALEL LKVELKASLI KENTIHNLST KADEYQTRLF IKEFIKELQK RAGIVESDDK
KTSKFDERLK HLKMTEEALE AYNAEKAKME SQNEHSSELG VSERYLDWLT SIPWGIYSKD
RFNIKQAREI LDRDHYGLKD VKDRILEFIS MGRVSGKVDG KILCLTGPPG TGKTSIAKSI
AESLNRKYVR IAMGGIQDVH EVKGHRRTYV GSIPGRIISA LKQAKTSNPL MLIDEIDKLD
LSRSGGASSA FLEILDPEQN NAFVDNYIDV KVDLSKVLFV CTANYLGNIS PPLRDRMEII
EVNGYTNNEK IEIAKRHLIP DAAKKAGLEG GHVVIETKTI SRLIEKYCRE SGLRNIKKLI
TRIFSKASLK IVEEVEAREG ESKSKSEEAK SEAITGSVTE ISVEDATVKA QSIEEPSVES
ASQKVDEAKP VESEELKSDE EEEEVVKLEI PDDIKLEITS ANLKDYVGPE IYTRDRVYDI
PPPGVATGLS YSTSGNGDAL YIESILTHSI GSGSGHASIH VTGSLKDVMK ESASIAYSFA
KSYMVKNYPE NRFFEAAEIH VHCPDGAIPK DGPSAGISFT SSLISLALQK PLPPTIAMTG
EITVTGRVLA VGGLREKILG AKRYGCNTII FPKDIENELE EIPEEVKEGV KFIPVEWYQD
VFDEIFPNLS SDEGNEVWKE EFNKLDKKKA SNKKK