Gene PICST_30337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30337 
Symbol 
ID4837604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2553594 
End bp2554922 
Gene Length1329 bp 
Protein Length442 aa 
Translation table12 
GC content43% 
IMG OID640388919 
Productconserved hypothetical protein 
Protein accessionXP_001382723 
Protein GI150864042 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.18618 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACC CAGGTTATGT TATACTCTTC TTGGGGTTAT CGATTTCCTT ATTGTTGTCC 
CGTAAGCATG TTGTTTCATT AATTTCATCG TTCAGAAGCA AAACTGCAGC TGTCGATGAA
GAAAAGGCAA GAAGAAATCT GTATGGAAGC GATGAACCAT TAAAGCCTCC TACTCCCTTG
ATGATTACTC CAGAACAAGT TCTGAACTTT GACGATAGAC CATGGAGACC ATTCAGATGG
CCATATCACC AGACTATGTC TATCTTCAAG TTGGATATGA ACCACTGGTT GGACATGGAC
AAGTACTACG TTCACTACAT CGAAGAAAAG AAGAGAATTA TCCAAAAGTA TGGCAAGGAA
AACATCGACT GGCTACCTGA CAGTGAGGAT GCCACTTTTG AACTCATGCA AACTGTTGTG
GATCACCTCA TTGTTAGATA TCCATTGTTG TTCACTGTTT TGAAGGACGG GGACTTCTAC
GAAGGTAAGG GAAAGATTAT CAAAAACGAG ATCACGAAAG AGATCTTGGA CATGACTTTA
CCTTTGAAGG AACATCCTTT GATGTATGTG ACAAAGTTGG CCAAGGAAGA TTTCTACATT
GTGAAGAAGA ACCCTGTGGA TGATTTACAT TACTTGGTTG CAGCTGCCGT CCCATTCCCT
GGTGGATCTT TCGGAGTTGA CCACAAGATT GGTAAGACAT TGGATGTGAT TCACCTGGAC
GTTCCCTACT ACAAGGAAAA GTTGAAGAAA TCGATGGAAA GATGGTTTGA CAGAATGAAG
CCCAACGATC CTGTGGAAAG AGCTAGCTGG TATATCTCTT GGGATCACAA GTTGAAGGTC
AACAATGTGT ACCAATTACC AAAATACGTA CCTAATTTGG TTGCAGACTT GGAATCCACC
GACCCTCGTG AATTTAATGT TAGAGTTGAA AGACAGACGT TGAGAAGACT TCCAAGGTCG
AACGCCATCA TCTTCACCAA CCACCCCATC TTCTACTCGA TTGAAGAAAT GAAGGACGAA
CCTCTTGTTC CATCGTTGAT TAAAAAGATC ATATACGAGG GTCCCAAGGA TATCATCAAG
TACAAGAACT TCGAAGTGTT CAGAGACCAC ATTGCTTCTT ACCTTGACGG CTTGATAAAG
AGACAGATAG ACAAGGGTAT TATCAAGGAA GACACTCCAT TAAAGACGTT GCCCTCGTAT
CCTTTTGCAC ACTGGGCCAA AACTGACTTT GACTTTGTCA ATGGCTGGAA CAACCCCAGT
CCTGCGTACG ACAAGTCTGC CAACTACAGC GAGAAGGCCA AGAAGGAGTT GGTACATCAG
AATGATTAG
 
Protein sequence
MIDPGYVILF LGLSISLLLS RKHVVSLISS FRSKTAAVDE EKARRNSYGS DEPLKPPTPL 
MITPEQVSNF DDRPWRPFRW PYHQTMSIFK LDMNHWLDMD KYYVHYIEEK KRIIQKYGKE
NIDWLPDSED ATFELMQTVV DHLIVRYPLL FTVLKDGDFY EGKGKIIKNE ITKEILDMTL
PLKEHPLMYV TKLAKEDFYI VKKNPVDDLH YLVAAAVPFP GGSFGVDHKI GKTLDVIHSD
VPYYKEKLKK SMERWFDRMK PNDPVERASW YISWDHKLKV NNVYQLPKYV PNLVADLEST
DPREFNVRVE RQTLRRLPRS NAIIFTNHPI FYSIEEMKDE PLVPSLIKKI IYEGPKDIIK
YKNFEVFRDH IASYLDGLIK RQIDKGIIKE DTPLKTLPSY PFAHWAKTDF DFVNGWNNPS
PAYDKSANYS EKAKKELVHQ ND