Gene PICST_15419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_15419 
Symbol 
ID4840786 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp723959 
End bp726199 
Gene Length2241 bp 
Protein Length714 aa 
Translation table12 
GC content40% 
IMG OID640392101 
Productpredicted protein 
Protein accessionXP_001386334 
Protein GI150866666 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0198581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.726976 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGGACTAAAA CAGCCTGCGA CTACTGTAGG AAAAGGAAGT CCAAGTGTAA CGGAGAAAAC 
CCCTGTTCAA ACTGCTTGAC TCACAGCAAA GACTGTACAT ATACTGTTGT GCCCAAAGTG
AGAAAGAAGA GGGTGTCCAA GTCTCTGTCT AATTCCAAAG ATAAGAAGCA TAACCCCAAT
TCCATACAAG GACTCAATAG TAGACTTTCT ACTTTGGAAG GTCTTCTTTC TAAACTAGTG
AAAAGACTCG ACTCCAAAAG TCTATTGGAC CTTGGGAATT CTAGATCAGG TGATAAAGAT
GACGATTCGT CAGATACAGT GTCTTTAAAC TCGTTACCTT CTGAAAGTAA CAGTGAGGAC
GAACAGGAGG AAGAAGATTT TTCCAAAGAA TTGGTCCTCA ACGAAAAATC GTCAGTGCTG
AGCCCTGAAA AGCTTAAGCA GATGCTTCAG CCCTGTGAAC CGAGGTCCTG TCAAATAGTG
ACTAGCGCCA GAGACAGAAT CTTACAGTAC TTTGGTTCTC ATTCGATGTT CTGTATCTTT
TCAGCAAAGT CGATAAAATG GATGAAAGGC AGAATAGAAG GTAAGGGCGA TGACTCCCTT
CTTTTGCCAA TAAGAAACTT GCCTTACGCT CTTAACAGTG TAGTCCAGTC TAACATGAAA
GTATGGGCTC AGTCTTTGCC AACTTCACCG TCCAATAGCA AGTTCTTCTT CAACAAGAAC
GAAAAGAATC TCATTTTTGA GCTTCTTGAA TACTATTATG ACGATATCAA TATCGCACCC
TACTTGTGCA GTTTACACAC AATTCGTGAA TTGTTCCAAC TCTACTTCTA CGCTTTGTCC
AGTCATGATG TAGATATTCT TAACGGAATA TCACAGAGTG AATTCTTGAT TATGAATGTC
TCTATAGCTT TGTGTTTGAC CAACAAGTCT GGAGATGTGA AGGATAATCA TAATTTCCCA
GCTTTGTCAG CAGCATCCAG TTCTGATCTC GCCCAATTCA AGCAGAAAAT GTTCTCCAAT
GCTGTTGCAT GTTATGAAAG AGTCTCTGTG GTCTGTGAGG GTATAAGAAC CATCCAAGGT
ATAGCATTGG TCACTTTGTA TATCGAAGCT AGTTTTATTA CTGATTTCCA GATTAACCAC
ATGTTGGTTT CTGTGATGGT CAGGTTTGCT TGCGATTTGG GACTTCACAG CACTGTTTCC
GTTTCCAAAT ACGATGCTGA AGAACATGCC CATTTGAAAA GAAGGTTGTG GTGGTTTTGT
GAATACATGG ATAGTGAAGT TTGCTACAGA AGTGGGAAAG CCACGTTGAT TAATCGTGCA
AATGTGACAA TCTTAACCGA AGAAGATGAC TACTTCCTTT CTGTGCCTTT GGATCCATTC
AAAAACGACG TCTGTAAGAA AAACTCATCT GAACTAGTAG CAAACTGCCG TAAGTGGGGG
TACCAAAATT ACTACATCTA CTACACTTTG ATGTTATGCA GACTTAAAGA AAAGAGTTAC
AACAACTTGT TCAAACCTCA GGTTGCTCAT CAAAGCCAAG AGGAATTATT GAAATCATTG
CAAGATATCA ATGAGGATAT GTTCCGTATG GCGAGATTGA TGGAACCCGA AATTAGACCA
ACATTATATT ATGTCAAAAG ACCAGAGTCT CCTAGCAGAA GTTGTTTCGC TGAACTTTCT
GAAAGCAACA ACAACTTTTT CCAGTACAGT TCTCTCTTGT TGCAGTTGTC TTTCTTTGCT
CATTTACTCT CAATCAACAG GGTGCCCTTT ATGAACAACA TGTTTGAAAG CAATGAAAAA
ACTATCAAGT TTGGTAACTT GTCTTTGGAA AGTGCCAGAA CCATATTGCA TCTCGTGGTA
GATTTGGATA GAACAAAGGT TCCTAATTCG ATCTTGAATT GGGTCACTTT CTATCCATTT
ATGGCATATT GCAGTCTTAT TGGCCACTGT TTGTCTTTCC CACAAGAAAA CAGTACTCAT
ATGGATTGTA CGTTGTTGAT TCGTGTATCT TTGAACTTTT TTGCATACAG AGGTCTTAAT
GAAGAAGACA TAAGAACGTT TGGTGAATCA AAAACATATG ACAATAAGAG TATGATGTAC
GACTTGATCA CACGTTTACT TTTGAGAGTG TTGGTCAATT TGATGGACAA AGAATCCGAA
CATCACTATG CAAATGAGAT CAAGGGTTTG AGCGACCACA TGGAAGCATG TGCCAATATA
TACCCTGACT TGTTCAAGAA G
 
Protein sequence
RTKTACDYCR KRKSKCNGEN PCSNCLTHSK DCTYTKRVSK SSSNSKDKKH NPNSIQGLNS 
RLSTLEGLLS KLVKRLDSKS LLDLGNSRSD TVSLNSLPSE SNSEDEQEEE DFSKELVLNE
KSSCQIVTSA RDRILQYFGS HSMFCIFSAK SIKWMKGRIE GKGDDSLLLP IRNLPYALNS
VVQSNMKVWA QSLPTSPSNS KFFFNKNEKN LIFELLEYYY DDINIAPYLC SLHTIRELFQ
LYFYALSSHD VDILNGISQS EFLIMNVSIA LCLTNKSGDV KDNHNFPALS AASSSDLAQF
KQKMFSNAVA CYERVSVVCE GIRTIQGIAL VTLYIEASFI TDFQINHMLV SVMVRFACDL
GLHSTVSVSK YDAEEHAHLK RRLWWFCEYM DSEVCYRSGK ATLINRANVT ILTEEDDYFL
SVPLDPFKND VCKKNSSELV ANCRKWGYQN YYIYYTLMLC RLKEKSYNNL FKPQVAHQSQ
EELLKSLQDI NEDMFRMARL MEPEIRPTLY YVKRPESPSR SCFAELSESN NNFFQYSSLL
LQLSFFAHLL SINRVPFMNN MFESNEKTIK FGNLSLESAR TILHLVVDLD RTKVPNSILN
WVTFYPFMAY CSLIGHCLSF PQENSTHMDC TLLIRVSLNF FAYRGLNEED IRTFGESKTY
DNKSMMYDLI TRLLLRVLVN LMDKESEHHY ANEIKGLSDH MEACANIYPD LFKK