Gene PICST_41366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_41366 
Symbol 
ID4836678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp851569 
End bp852771 
Gene Length1203 bp 
Protein Length400 aa 
Translation table12 
GC content42% 
IMG OID640387993 
Productpredicted protein 
Protein accessionXP_001382936 
Protein GI150864204 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.122227 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.125571 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGT ACCACGACCT TCCGGATATC GACACCAACG CCCCGGATGT GTTTGAGACA 
TCGGACGTAG AAAGTGATCT CGAGTTGTCT CACCATCATA GTGAATCTAG TCCAACTCTA
GAAGAAGATG ATTCGGAGAT CAAACACCAG CAATTGAATG CTGAAACTGC CAGAACTCGC
TTTTCTCATA GTCTGTTGGT GCCCCAAGAT GGTGCAGACT TTCTGGGTAG CGTTTCTTAT
CCCAAGCTTG GGAAATCGGG GTATTTGATC GAGACAACGG TCGAAACAAG ACAACAGAAG
TTGGCAAGGA TAGCGCGAGA GCTAGAGGAA TTGAAGCAAG AAGGTGAAAA TGATGTGGAG
AAAATATTTT CAGAAAATGT AGACGGTTTA CAAACTCAGC TCCAAGAAGT TCTTGAAAAG
ACTGTTTCAG GAAGTAACTC CAAAATTCGC CAACTTGATG TATATAGTCA GAGGATTAAC
CACCTTTTCG AAGCGATATC TAGTAATATT ATTAAAGGTG AAGTTTATGA AAAATCGGCG
CCCCAGAATG AGGCTAGCTT TCAGAAATCG AGCACCTCGA CAAGTCCCAG TGAGATCCTT
TCGCTTGAGA ATAGAATCAA CGAGTTGGAA AAGTTAATAG GTGTAGATAT GGTCCAGAGT
TTGAGTTCCA AGCCCACTGG AACTACGGCA TCTCTTCAGA GCTATGTCAA TGACTTGAGT
CGAAAGATCA ACATAGTGCA TAATCCTGAA TATCACATCG CCGCCGTGAA ACTGGAAGTA
GAACTGCTCA TAGCGAAGAT GGATGAGTTG GAGACGAAGA GAAGGATTGC AGAAATACGG
GAAACTACGC TTGGAAAACC TCAACAAAGT ATCAGTGAGT CTACACCCTT ACAGAAAAAG
ATAGATGACT TATACAAAAA CTTACCAGAA TTTGAGAGAG CGAACAAGTT GGTGCCATCT
GTAATTTCAA GACTTAAGTC TCTCAGCGTA GTACATTCAG ACTTGGCTGG TTGTACTCAG
ACAGTAGGGG AGTTGGATAG TATTCTAGGT GATTTAAGGG ACGATATGAA ACGATGGGAC
GACAGCTTAA ACGACGTTAA TGCAAAGATA GACAACTATG AGACCATTTT TGGGGAAAAT
AGAAAGGTGG TGACGGCACA GATTGAGGAG TTGGAGAGCA AGATAGATAA AGTATTCAAA
TAG
 
Protein sequence
MEKYHDLPDI DTNAPDVFET SDVESDLELS HHHSESSPTL EEDDSEIKHQ QLNAETARTR 
FSHSSLVPQD GADFSGSVSY PKLGKSGYLI ETTVETRQQK LARIARELEE LKQEGENDVE
KIFSENVDGL QTQLQEVLEK TVSGSNSKIR QLDVYSQRIN HLFEAISSNI IKGEVYEKSA
PQNEASFQKS STSTSPSEIL SLENRINELE KLIGVDMVQS LSSKPTGTTA SLQSYVNDLS
RKINIVHNPE YHIAAVKSEV ESLIAKMDEL ETKRRIAEIR ETTLGKPQQS ISESTPLQKK
IDDLYKNLPE FERANKLVPS VISRLKSLSV VHSDLAGCTQ TVGELDSILG DLRDDMKRWD
DSLNDVNAKI DNYETIFGEN RKVVTAQIEE LESKIDKVFK