Gene PICST_33945 details


Gene Information

Locus tag: PICST_33945
Symbol:
ID: 4840849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009048
Strand:
Start bp: 841990
End bp: 843219
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 12
GC content: 43%
IMG OID: 640392164
Product: predicted protein
Protein accession: XP_001386575
Protein GI: 126140106
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3064] Membrane protein involved in colicin uptake
TIGRFAM ID:
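
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 841990
END_BP = 843219
GENE_LENGTH_BP = 1230
PROTEIN_LENGTH_AA = 409


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING it includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1230
    print(expected_protein_length(GENE_LENGTH_BP))  # 409
```

Under these assumptions both results agree with the listed Gene Length (1230 bp) and Protein Length (409 aa).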


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.913132
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGCCA TTAATCCATA CTTCCAGGCT AGCCAACAAG GTTCAACCTA CGACTTCGAT 
TCTTTGATCA ACTACTTGAA TAATGTCCAG AGACAACAAT TCCACGAGCA GTCCAAGCCA
AAGATAGTGA AAAAGATAGA AACGGAAGAT GAATACCAAA TCCAAGTCTA CAAGAAGACT
GGAGACTTTA ACAACTACGA AGTCAGGGCC ATCAAAGTTC CAAAGCAATT CTCCCAACAA
CCTCAACTCA TTAACGTCAT CTTGCAATCG GCCAAGGATA GTTTCAAAAA GACATTCCAG
TTCAAGGAAC AAGATATCAA CGTTGAGGAC ATCAACTGGG AATGGTACAA GCAAGAAAAC
ATCTTGGTGT TGAACGTCCC CAAGAAGGTT CATGTCTGCC ACTCCAACAG TTTCGAGGAC
TTGGCATCAA TGCTCGGATT CCCATTTGGT GCATTTGGAT TGCAACAGCA GACACAAAAG
AAGCCAACAC CAGTTGCTAT GGAAAGGTCC AGGTCAGAGC AAGCAAGATT GCAGGCTGAA
GCAGAGGAAA TTGCAAGAAA AGAAGCCGAA GAACAGTTGA GGCAACAACA AGAAGAAGAA
AGAAAAGCTG CCATTTCTAG AAGAGAGGCT GAAGAAAGAA AAGCAGCTGA GGCTTTGGCA
AGAGCTGAAG TCGAAAGGGC CAGAATAGAA GCAGAAGAGA GAGCTCAAAG AGAAGCAAAA
GAGAAGGCTA GAAGGGAAGC CCAAGAAAAA GAAAGAAGAG AAGCCCAAGA AAAGGCAAGA
AGAGAAGCCC AAGAAAAAGC TAGAAAGATT GCTGAAGCAA AGATTGCTGA ATCAAAGGCT
GCTGAAGCTA AGAGAAAGAC TGCAGAACAG GAAAGAATCA AGCAACAGAG AGAAGCTTAT
GAAAACATGC TCAGACAACA ACAAGAATTC ATGAATAACT TCTTTGGTCC TTACTTGTCG
CACAATTTCG GAACAGGATC AGTACCAGTA TCTCCACCAA CATCAAGGCC CACTTCAACA
CAAGCTACTT CGGAAAAGAC TGAATCCCCT GCTAAGGCTA ACTACGATTC TGACAGTGAA
TCAATAACAT CTGAGCCGGA AACTTCCTCT GAAAAGTCAC ATCCAAAGGA ATCTGAAGAA
ATGCACAGAT TGCATAAGCA TCCATCTTTA GAGGAAGTTG ATGATGAAGA GTTCGTGTTG
TTCAACAAAA AGTTCGGAGA CCAGAAGTGA
 
Protein sequence
MFAINPYFQA SQQGSTYDFD SLINYLNNVQ RQQFHEQSKP KIVKKIETED EYQIQVYKKT 
GDFNNYEVRA IKVPKQFSQQ PQLINVILQS AKDSFKKTFQ FKEQDINVED INWEWYKQEN
ILVLNVPKKV HVCHSNSFED LASMLGFPFG AFGLQQQTQK KPTPVAMERS RSEQARLQAE
AEEIARKEAE EQLRQQQEEE RKAAISRREA EERKAAEALA RAEVERARIE AEERAQREAK
EKARREAQEK ERREAQEKAR REAQEKARKI AEAKIAESKA AEAKRKTAEQ ERIKQQREAY
ENMLRQQQEF MNNFFGPYLS HNFGTGSVPV SPPTSRPTST QATSEKTESP AKANYDSDSE
SITSEPETSS EKSHPKESEE MHRLHKHPSL EEVDDEEFVL FNKKFGDQK
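
The record lists translation table 12, so the gene sequence should be translated with that code and not the standard one. The sketch below is one way to do that and to compute GC content in plain Python. The amino-acid string is a hand transcription of NCBI table 12, which is assumed here to differ from the standard code only in reading CTG as serine; check it against the NCBI genetic code listing before relying on it. All names (`clean`, `gc_fraction`, `translate`, `FIRST_LINE`) are hypothetical.

```python
BASES = "TCAG"
# ASSUMPTION: amino acids for NCBI translation table 12, codons in TCAG order.
# Identical to the standard code except CTG -> S (serine).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE_12 = {
    b1 + b2 + b3: TABLE_12_AAS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 with table 12, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE_12[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 nt).
FIRST_LINE = "ATGTTCGCCA TTAATCCATA CTTCCAGGCT AGCCAACAAG GTTCAACCTA CGACTTCGAT"

if __name__ == "__main__":
    print(translate(clean(FIRST_LINE)))  # MFAINPYFQASQQGSTYDFD
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` gives a value to compare with the listed GC content.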