Gene PICST_82025 details


Gene Information

Locus tag: PICST_82025
Symbol:
ID: 4837009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 675060
End bp: 676531
Gene Length: 1472 bp
Protein Length: 460 aa
Translation table: 12
GC content: 46%
IMG OID: 640388324
Product: predicted protein
Protein accession: XP_001382893
Protein GI: 150864175
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
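
The listed gene length agrees with the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption, but it reproduces the 1472 bp shown above. The sketch below works through the arithmetic and also compares the span with the length needed to encode a 460 aa protein, assuming a single stop codon. The name `span_length` is illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_bp = span_length(675060, 676531)

# 460 codons for the protein plus one stop codon (assumption: a single stop).
coding_bp = (460 + 1) * 3

print(gene_bp)              # 1472
print(coding_bp)            # 1383
print(gene_bp - coding_bp)  # 89 bp of the span lie outside the coding region
```

Under these assumptions the reported span is 89 bp longer than the coding region itself, so the gene sequence is expected to carry some sequence before the start codon or after the stop codon.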


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.830676
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CCACTGCAGG CCGGATGAAG ATCAAGACCA TTAAACGTTC GTCCGAGACG TATGTTCCGG 
TTAGAAACAC TCAGGAATCT GCGCTTCCGC GGAACTTAAA CCCTGCTCTC CATCCTTTTG
AAAGGGCCAG AGAATATACC AAAGCCTTAA CTGCTACCAA GATGGAACGG ATGTTTGCTC
AGCCATTTGT AGGGCAGCTA GGAGACGGGC ACAGAGATGG GGTTTATTGT ATAGCTAAGA
ACTTCTCGAC TACAAACCAA GTAGCTTCTG GATCAGGCGA TGGTGTTATC AAATACTGGA
ACATGACTTC AAGACAAGAG TCAGTCAGCT TCAGAGCCCA CTACGGGATG GTCGGTGGCC
TTTGTGTGAC ACCAAAGCAA CAGCACATGT TATCTTGCGG TGACGACAAG ACGATCAAGT
TGTGGTCAGT GAAAAGTGAC GAATTTGAAA CTGGATTTGG AGACGAAGAG GTGTACAGCA
ACAAAAGCAT GGGTTTGGTC AAGACATTTT TGGGCGAGCA TGCCTTCAAA GGCTTGGACC
ACCACCGTGA TGACGATCTT TTTGTTACAG GCGGTGCCAC CATCCAGTTG TGGGACATGC
ACCGTTCAAA GTACATCTCA GACTTGCTGT GGGGAGCAGA CAATATCACC ACAGTCAAGT
TCAACCAAAC CGAAACCAGC ATTATCGCTT CAGCTGGCTC TGACAACTCG ATTGTTTTAT
ACGATGTCAG AACCAACTCC CCTATACAGA AAGTTGTGAC TTCCCTTAGA ACTAATGCCA
TAGCCTGGAA TCCCATGGAA GCGTTTAACT TTGCATCAGC CTGTGAAGAC CACAATGGGT
ATCTCTGGGA TATGCGTAAA TTGGACAGGT CTCTCAATGT GTACAAGGAT CATGTAGCAG
CCGTTATGGA CATCGATTTC TCACCCACGG GTGAGGAAGT CGTTACTGGA TCTTACGACA
AGACGATAAG AATCTTCCGT GCCAGAGAGG GCCATTCCCG TGATATCTAC CATACTAAGA
GAATGCAGAG AGTGTTTTGC ACCAAGTTCA CTACTGACGC CAGATACATT TTGAGTGGTT
CTGACGACAC CAATATACGT TTGTGGCGTG CTAATGCTGC TGACAGATCG AACATCAAAT
CGTCCAGACA GAGGGCCAAG TTGGAATACG ACGCTGCCTT GAAGGAAAGA TACAAGCATA
TGCCAGAAAT CAAGAGAATA TCAAGACACC GTCATGTGCC CAAGACCATC AAAAAGGCTG
GAGAAATCAA ACGTGTTGAG ATTGACAGCT TGAAGAAGAG AGAAGACAAT GAGAGAAGAC
ACAGCAAGCC AGGTTCCAAA CCTTTCAAGT CAGAGAGAGA AAAGCATATC AGGGGAACAG
CAATCAAAGA AGATTAGACC CCTACAGACC TCAAGGGCCA TGTACATTAG TAATAGCATA
AATAATGAAA AGCGACATCA TAGAAGCATG GT
 
Protein sequence
MKIKTIKRSS ETYVPVRNTQ ESALPRNLNP ALHPFERARE YTKALTATKM ERMFAQPFVG 
QLGDGHRDGV YCIAKNFSTT NQVASGSGDG VIKYWNMTSR QESVSFRAHY GMVGGLCVTP
KQQHMLSCGD DKTIKLWSVK SDEFETGFGD EEVYSNKSMG LVKTFLGEHA FKGLDHHRDD
DLFVTGGATI QLWDMHRSKY ISDLSWGADN ITTVKFNQTE TSIIASAGSD NSIVLYDVRT
NSPIQKVVTS LRTNAIAWNP MEAFNFASAC EDHNGYLWDM RKLDRSLNVY KDHVAAVMDI
DFSPTGEEVV TGSYDKTIRI FRAREGHSRD IYHTKRMQRV FCTKFTTDAR YILSGSDDTN
IRLWRANAAD RSNIKSSRQR AKLEYDAALK ERYKHMPEIK RISRHRHVPK TIKKAGEIKR
VEIDSLKKRE DNERRHSKPG SKPFKSEREK HIRGTAIKED
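
The record lists translation table 12, in which the codon CTG is read as serine rather than leucine. The sketch below is a minimal, standard-library-only illustration of how the first codons of the gene sequence map onto the start of the protein sequence, and how a CTG codon from the middle of the gene is translated under table 12 versus the standard table. The helper names (`codon_table`, `translate`) are hypothetical. Treating the first ATG in the fragment as the start codon is an assumption that happens to reproduce the listed protein, and only the CTG reassignment is modelled here.

```python
BASES = "TCAG"
# Standard genetic code (table 1), with codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(amino_acids: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid ('*' = stop)."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


TABLE_1 = codon_table(STANDARD)
# Table 12 differs from the standard code in reading CTG as serine.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = table[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence above.
head = ("CCACTGCAGG CCGGATGAAG ATCAAGACCA "
        "TTAAACGTTC GTCCGAGACG TATGTTCCGG").replace(" ", "")
start = head.find("ATG")  # assumption: the first ATG is the start codon
print(translate(head[start:], TABLE_12))  # MKIKTIKRSSETYVP

# In-frame fragment from the middle of the gene that contains a CTG codon.
print(translate("GACTTGCTGTGG", TABLE_12))  # DLSW
print(translate("GACTTGCTGTGG", TABLE_1))   # DLLW
```

The first output matches the opening residues of the protein sequence (MKIKTIKRSS ETYVP), and the DLSW fragment matches the ISDLSWGADN stretch, which would not be reproduced with the standard code.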