Gene PICST_52835 details

Gene Information

Locus tag: PICST_52835
Symbol:
ID: 4851483
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1945283
End bp: 1946653
Gene Length: 1371 bp
Protein Length: 429 aa
Translation table:
GC content: 43%
IMG OID: 640393191
Product: predicted protein
Protein accession: XP_001387999
Protein GI: 126274611
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATCATGTTAC AGTACAAGAA ATGGCAAACA GAAGATGTAA TTAACTCGTA CTTTGACGAT 
CAACAGAAAT TCTACGAGAG CTGTGGATTA CCATTCGGAA AGCCTAGTAA AAATACTTTT
GCAATAAAAC AGAATTACTA CTGTGTAATT TGCTGTGAGA CCCGTGTCTC CACCCCGGTA
TATTCGTTGA CATGTGGCCA TGAGTTTTGT ATCAATTGCT ACTACCACTA CATCAATAAT
GAGATCAGCA ACAGTAAACT CATAACCTGT ATAATTCCGG AGTGTCCCTA TACAATTCCA
CATAGAGACA TCGACGAGAT AATTCTCGTT GTAGAGCTGG CAAATTCGGT CAAGGTGCGC
AAGGCTCTCA GCTCGAACCC GCTCTTGATT GCCACAGCCA AGGTCTACAT CGATTCACAC
GAAAACTTCA AGTGGTGCCC GGCCACTGAT TGTACACATT TCACAGAAAT CGTGTCGCCA
CGAAGGCTAG AGGAAGATGA AGTGGAAAAT AAGGATAAAA AGCCCATCGA CATCTCCATA
GTACCCATTG TAGGATGTGC GGACCATCAC GAGTTTTGCT TTGAGTGTAA TTACGAGAAT
CATTTGCCGT GTCCCTGTTG GATTGTACGC TTGTGGATCA AGAAGTGCGA AGACGACTCG
GAGACCGCCA ACTGGATAGA CGCAAATACC AATGCCTGTC CCAAATGTCA GGCCTCCATC
GAAAAAAACG GAGGGTGTAA CCATATGACA TGTCGAAAAT GTCAATTCAA CTTCTGCTGG
ATCTGTTTGG GAGACTGGAA GGACCACAAC AATAGCTACT ACTCGTGTAA CAAATTCAAG
CCAGACAGCG AAGATTCAGA GGTGGCTAAT CGTAAAATCA AGAGTAAAGT CTCGTTGCAA
AGATATCTTC ATTTCTATAA GAGATTCTCA ATTCACGAAA GCTCAATGCA AGGAGACCAA
TCTACACTCT CTAAGTTGCA TGACCTAACC ATGTTATACA TGGAGAACAG AAAAGAACAT
GAGACGAACT TGTCTTGGAC AGACATCCAG TTCTTGCCAG ATGCCTTCAT AGCCCTTGCG
AATGGACGTA AGACGTTGAA GTGGACATAT TGCTTTGCTT ATTATTTGGC AGATTCCAAT
TTCTCTGAGA TCTTTGAAAG TAACCAGGAC TATTTGAACA AGACTGTAGA GGACTTATCA
GGAATCTTTC AGAACATGTT GGACAAACAC AACAAGAACA AGGTGGCGTC CATCTTGAAG
CATCGTAGTC AAATCATCAA CTTGAGCGAG TTGATCACTT CCAGGAGAAA AATGTTGATC
AGCGGAGCAG AAATGAACTT GAAAGAGCAC TTGCTTCGGT TTGAAGCGTG A
 
Protein sequence
IMLQYKKWQT EDVINSYFDD QQKFYESCGL PFGKPSKNTF AIKQNYYCVI CCETRVSTPV 
YSLTCGHEFC INCYYHYINN EISNSKLITC IIPECPYTIP HRDIDEIILV VELANSVKVR
KALSSNPLLI ATAKVYIDSH ENFKWCPATD CTHFTEIDKK PIDISIVPIV GCADHHEFCF
ECNYENHLPC PCWIVRLWIK KCEDDSETAN WIDANTNACP KCQASIEKNG GCNHMTCRKC
QFNFCWICLG DWKDHNNSYY SCNKFKPDSE DSEVANRKIK SKVSLQRYLH FYKRFSIHES
SMQGDQSTLS KLHDLTIKEH ETNLSWTDIQ FLPDAFIALA NGRKTLKWTY CFAYYLADSN
FSEIFESNQD YLNKTVEDLS GIFQNMLDKH NKNKVAQIIN LSELITSRRK MLISGAEMNL
KEHLLRFEA
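
Consistency check (illustrative sketch)

The Gene Length and GC content fields can be cross-checked against the coordinates and the gene sequence listed above. The following is a minimal sketch, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, which agrees with the values shown (1946653 - 1945283 + 1 = 1371). The function names span_length, clean, and gc_fraction are hypothetical helpers introduced only for this example.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of ten."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) nucleotide sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates taken from the Gene Information fields above.
print(span_length(1945283, 1946653))  # 1371
```

Passing the full text under "Gene sequence" to clean() and taking len() of the result should reproduce the Gene Length field, and gc_fraction() on the same text gives a value to compare with the GC content field.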