Gene PICST_29522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29522 
Symbol 
ID4837215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp478194 
End bp479615 
Gene Length1422 bp 
Protein Length473 aa 
Translation table12 
GC content41% 
IMG OID640388530 
Productpredicted protein 
Protein accessionXP_001382327 
Protein GI150863752 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.182383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGGTA GAAGACGATA TAGTAACGAC AGAGATGGCA CCAAGAGGCA GAAACTAGAA 
CTGAAAAATG CACGATTCCA TTCAAACAAC CAACCAAACA AACAAACAAA CGACAAATCA
GCAAGTTCAG TGCGACTCGG AGCAGTGAGT TCGTCTGAGG CAGGAGTTTC ATCTCAAGCA
TCTTCTGAAG GAAAAGGTGG ACTCAATGTA GAAATACATC CTCTTCTCAG AGCTACCGTT
CCTACTCCTG CCCTTCCTAA GAACCATAAT CCTCTTTCAC AGAATGTCCG GAAGTGGTTT
GATCCCTTTG CCATCAATCC ATACTTGAAA CAGACAGACT CGTCTGTTCC TCAGCATAAA
CCAAGACAGT TAGTTTTCAA CCCCAAGGGA AGGTATATAG CACAAGGAGA TGCCCTTCGT
GAAAAGATAG CTCTTGAACA ACAGCATAAA AAGGAACTTG AAGAAAAAAA GGCCCGAGGT
CTAGCTCCTG ATGAAAGTCT TGGGGAGCAA TTGTACAAAC CCGAACATCC TCCTCTGCTT
GAATGGTGGG ATAAGCCATT TGTATTGGAC AGAGACTATT CACAAATTGA AGATTCAACC
AGATTGAATT TGAATGATGA AGAACAGCCA GTTTCCATAT ATATCCAGCA TCCTGTTTTC
ATTCCACCAC CATGGGAAAA ACTCAATCCT GAAGCCAAAC CCATGTACTT GACCAAGAAA
GAGAGAAAGA GAATCAGAAA GAACGAAAGA CTGGAAAAGC ACAAAGACAA ACAAGATAGA
ATTAAGCTTG GTCTTGATGC ACCTCCCCCA CCAAAAGTCA AACTCTCTAA CTTAATGAAT
ATTCTCACCA ACGAAGCCAT CAAAGATCCT ACTGCTATAG AAATGCGTGT ACGACAAGAA
GTAGAAGAAA GACTCCAGAA TCATTTAGCA ACAAACGAAG CTAGGAAGCT TACTAAAGAA
CAGAGGCACG AAAAGATACA GGAACAAAGA GAAAAAGACC TCTCTAAGGG ATACTTTACT
TCTGTTTACA GAGTAGATAA TCTCAGCAAC CCCCAACACT TCTTCAAGGT GAACAAGAAT
GCCCAGCAGC TCGATTTGGT GGGTATCTGT TTGAGAAATC CCAAGTTCAA CTTGATAGTA
GTAGAAGGAG GACACAAAAG CATCAAATTC TTTAATAAGC TTCTTACTAA AAGAATCAAA
TGGACGGAGA ACGTTGTTCC TAAGCATAGT AATGAGTCTA CACAAGAACT CCAAGACTTG
TCTGCTAACA AATGCTATTT GGTGTGGGAG GGCCAGGTCA AAGAGCTTAG TTTCCAGAAA
TGGAGTGTGA TGTACTCACG TGATGAATAT GAAGCATTTG ACGTGCTCAA TAGATTTAGG
ATCGAGAATT ATTGGAGAGA AGCTTTAGTT GTAGAAAACT AA
 
Protein sequence
MSGRRRYSND RDGTKRQKLE SKNARFHSNN QPNKQTNDKS ASSVRLGAVS SSEAGVSSQA 
SSEGKGGLNV EIHPLLRATV PTPALPKNHN PLSQNVRKWF DPFAINPYLK QTDSSVPQHK
PRQLVFNPKG RYIAQGDALR EKIALEQQHK KELEEKKARG LAPDESLGEQ LYKPEHPPSL
EWWDKPFVLD RDYSQIEDST RLNLNDEEQP VSIYIQHPVF IPPPWEKLNP EAKPMYLTKK
ERKRIRKNER SEKHKDKQDR IKLGLDAPPP PKVKLSNLMN ILTNEAIKDP TAIEMRVRQE
VEERLQNHLA TNEARKLTKE QRHEKIQEQR EKDLSKGYFT SVYRVDNLSN PQHFFKVNKN
AQQLDLVGIC LRNPKFNLIV VEGGHKSIKF FNKLLTKRIK WTENVVPKHS NESTQELQDL
SANKCYLVWE GQVKELSFQK WSVMYSRDEY EAFDVLNRFR IENYWREALV VEN