Gene PICST_33418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33418 
Symbol 
ID4840441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp589050 
End bp590234 
Gene Length1185 bp 
Protein Length394 aa 
Translation table12 
GC content41% 
IMG OID640391756 
Productpredicted protein 
Protein accessionXP_001386308 
Protein GI150866642 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.265783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTA TACGACGACT CCTTTCTACT TCCAATTGGA AACCACCAGA GTCATATTTT 
TCTCATTCTC CACTTAACTA TGAGTCGTAC TCAAGAAGAT TGAAAGGTGC AATCCACTAC
ATCGCTCAGA ATGGAAGATT CACGGAAAGT ATTCTTATAG ATTGTATCAG AGCCAATAGA
CAGTTACAAC AGCAGAACTG GAATTCGAGC CCAATAATCC AGAAGACAAG ATCCAGAAAC
GACTTTCTCA ATCTCAAATT GAGTCCCAGC AATAGTACAC TTGAAGATGA ACTATTCAGT
TTTGTATTCA ACAGACACCA AGAACGTTCG TCGAGTCCTG AAATTGTCCG CTCATATCTA
ATTACTGAAC CTCTCCCTTC CAATACTGCG CGAGTTATAG ACGTAGGAGT GAAAGGGTTT
GAATACAGTT TTTTGAAGCA GAAAGTTGAA CCTTCATTGG TTTTCACAGC TTTGCGGTTG
TTGTTAGACA GAAAAGATTA CCAAAATAGT TTCAAATTGA TAGACTCTAC ATTCAACTGC
GACGCCTACA AGGAGCTACA AAGACATCAA ATTGGTAGAA ACTTGTTTGG TTGGTTTCTG
TACATTGCAG TAGCAACTGT AGTACAAGCA ATATTATTTC CTTTGGTTTC AATATTGGCC
CTCTTCTCGG TGAATACAGC AACTGCTGGC ATTTTGATGT ACGGACTTCT AAGGTTAGAC
ACGGCTGAAA ATTTGGGTAG AATCAGCTGG AGACCCTACG TCTCAATGCT CTACAAATTT
ACACATCGCG ATGAATTGCT TGCAATAAAT ACTGTTATCA CTCATTTCGA AGAACATAAC
GAAGTGAATA TCAAAAACTA CCACCACAGC CGAGTTCGCA AGCTCTCTAA CCTAAAGTTG
TTTGACCAGG ACGAGTATGT TTTGGAGCTA CCTAACGACA GTGTAGAACT CGCATCTGTA
GAGTATAGTG GACAGACATT CCAAGAGGAA AAGGGTATAG TTGAACTTCA GCAGTACTTC
AAACACGAGT TGAACAGCCG GAAGATGGTC TTCAACGATT TGCCGGAGGA GTTGATTTTC
CTCGAGTTCT GGTTTACCCA CGGTGAAAAT TTCGAGTGGG TAGAGCCAGA TCAGGATCCG
GCAGAGATCA TAAAGTTAGA TATCCAAAAC CAGAAAACCG ATTAA
 
Protein sequence
MTIIRRLLST SNWKPPESYF SHSPLNYESY SRRLKGAIHY IAQNGRFTES ILIDCIRANR 
QLQQQNWNSS PIIQKTRSRN DFLNLKLSPS NSTLEDELFS FVFNRHQERS SSPEIVRSYL
ITEPLPSNTA RVIDVGVKGF EYSFLKQKVE PSLVFTALRL LLDRKDYQNS FKLIDSTFNC
DAYKELQRHQ IGRNLFGWFS YIAVATVVQA ILFPLVSILA LFSVNTATAG ILMYGLLRLD
TAENLGRISW RPYVSMLYKF THRDELLAIN TVITHFEEHN EVNIKNYHHS RVRKLSNLKL
FDQDEYVLEL PNDSVELASV EYSGQTFQEE KGIVELQQYF KHELNSRKMV FNDLPEELIF
LEFWFTHGEN FEWVEPDQDP AEIIKLDIQN QKTD