Gene PICST_19003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_19003 
Symbol 
ID4838389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1586818 
End bp1588053 
Gene Length1236 bp 
Protein Length412 aa 
Translation table12 
GC content41% 
IMG OID640389704 
Productpredicted protein 
Protein accessionXP_001384608 
Protein GI150865405 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0445647 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0373954 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TACACCATTT TTCTCTTGCT CCACTCTCTT TTCACATTAG TTGCCCCAAC ATGGTACTGG 
AAAGAGCACA AAGTATGTGA TCCTAGTCTG CTGAACATGG TACCTTCTAC TGGGTTCAAG
ATCAACAGAC AATACAAGAT CAGTGGTTTA TTTCCTCCTT TCAGTATGCA TTTCATACAA
ATTTCACATT GTGACCATGG GCAAGTTGAC TACCAGAAAG TAAAAAGAAC CATGTCCAGC
ACAGACATAG ACTGGGATAA GCCCTTCTGT ATGCTCAAGC CTAAGTTGGG AGATGTACTA
TCTTTACCAC CCGAAGACAC AGTAGAGAAC TTGTTGCCTG GAAATAATGG TACGATTTCT
GGATATTCAG AAATTCCTAA TAGTCGTTTA GAGCGTTTTG GTCAGTTTGT GAGGAATAAA
TTTGACAAGG CATTGGAATC GTTGACTGCG GGCAAGAGAA TCCAGTATAA GGGGAATATA
GCTTACCACC ATTTAGAGGA TATCAAACAA GCTGATCCGC AATCTTCCTT TGAATGGGAA
TCACGAAACC TAGTTTGCTA CAAGATGGCT CGACGAAAGA AGTATGCCAA GAGAGATGTT
TCTTTCTACA TGCCAAATAC TTTTGTAGGA GGACTTTTTG AATGTCCTGT TCTGGCACAA
CAGAAGAAGA CTTTGTTCCA ACAGTTTGGC AATGAAGATT TTCTCAAAAG CCTTGATTTT
GATTGTGATA GTTCTTTGGC CAGACCAATT CTACCCTTGA TTGCTCAAGA TAGTACTCAA
CAATGGATAG ATACCAATTT CAAAGATTTT CTTAACCCTT CGATCGTCCA GACGATACCT
TACTTGGCCA CAGCTTCTCC ATGGAATCTA CGGTTTTCCC TTAGTAACAC AGCTGACATG
AATAGCGACA CATGGGCAAC CAAAATCGAT GTTAATGAAC AGTATGTCTA TTCTGAGCTG
GAGGTGCATG TTATATCTCG TGCAGTGGCT GTTTCTGCCG ATTACTTGAA CAGAATCATC
TCTCCTATTG ATATCAACAA GTTTCTGCAG AGATATAATA ACTGGCTCTC TAAGTCCCAG
CAATGGGAGA ATCCTATTTT AGAAGAAGGT GATATGACCA AGATCATGCT GAGCCTCTCA
TTTGACCAAA GGTTGCAGCT CATTATCTCA AGCAAGACCA GGAAGGTTGC CATGGAACAA
ATACAGAAGC TTCAGGACTT GTGGGATCAA CTACAA
 
Protein sequence
YTIFLLLHSL FTLVAPTWYW KEHKVCDPSS SNMVPSTGFK INRQYKISGL FPPFSMHFIQ 
ISHCDHGQVD YQKVKRTMSS TDIDWDKPFC MLKPKLGDVL SLPPEDTVEN LLPGNNGTIS
GYSEIPNSRL ERFGQFVRNK FDKALESLTA GKRIQYKGNI AYHHLEDIKQ ADPQSSFEWE
SRNLVCYKMA RRKKYAKRDV SFYMPNTFVG GLFECPVSAQ QKKTLFQQFG NEDFLKSLDF
DCDSSLARPI LPLIAQDSTQ QWIDTNFKDF LNPSIVQTIP YLATASPWNL RFSLSNTADM
NSDTWATKID VNEQYVYSES EVHVISRAVA VSADYLNRII SPIDINKFSQ RYNNWLSKSQ
QWENPILEEG DMTKIMSSLS FDQRLQLIIS SKTRKVAMEQ IQKLQDLWDQ LQ