Gene PICST_33170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33170 
Symbol 
ID4840241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1593416 
End bp1595332 
Gene Length1917 bp 
Protein Length638 aa 
Translation table12 
GC content42% 
IMG OID640391556 
Productpredicted protein 
Protein accessionXP_001385663 
Protein GI150866163 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.121339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.2488 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTTG GGAAACTCAA AAAGGGGTTG ACGGAGTTCG GGTCCAGTGT AAAGGACACA 
GTCGTATCTG TCTCAGATAC TGTGACTACG GTCAGAATTC ACAGGGACTA CGACAAGGAC
GACGAGCTCA TTGAACACTA CAAACATGAC TTGAGCAAGG CAAAGCTGGG ATTAAGCTAC
ATCGCCTCAC AACAGAAGAA AATGGCTCTG AGTCACTGGG GAAAGCTTTT TAAGCTCAAC
ATACGAATTG TAGAACACTT TATTCAGCTT TTAGGAACAG ACTCTCTCAG CTTCAAAGGT
ATTGAGGATT ACTACCATGA CTTCGACAAG TTCCAGGCCA CAGAAGAAAT ACCCATGGTA
CATCCCAAAG AGAGGCAGTT TCTCATAGAA AGTGTTCATC TGGAGTTGGT CAACTACATG
AGCTCGTTGC AACAGGGAAA GTTTAAAATC ACTCAGGACT GGGACATTCA TGAAAAGAGT
CTCAAGCTTA GAATTACAGA AATGAACAAA CATATCAGCG ATACGCTAAA GTTGATCAAA
AAGAGAAACA AGAAGAAAAC TAACTATATC AAGACTGAGC ACAAGATTTC CAAGTTAATG
AAGAAGACGG CACCGCTCGA GACAAAGGAG CAGGACCAGT TGAACACTCT CGAGTCTTCA
TTAAAGGAAG AAGAAAAGGA ATACACAAAG ATCAACGACA AACTCAAGTC CATCTTGCCT
CATGTAATTT CGTTTTTGGA TGAGTTCGTA GAGAATATCA CCAAGATCAT ATTGTGCAAG
CAGGTGGAAA CATACAAGGA AATTGCGCAA ATGCTCGATT ACTTCTGTAC TTTCCACGGC
TTTTTGGACA CTTCCGGAGA TCCGCATAAC AAAATCCAGT CTTACGAGGA TATAATCAGT
AAATGGGAAG AGGCTACGAC TCCAACTAGA TTGCAAATCG AATCCTTTAT CTCCATCATC
TACGACAAGA AACCAGAGTT GATAGATACT GAGATCGACG AAAAGGATAA AACGCTGTCT
GCTGCAAAGA TGTGGAACAA GATCACAGAC AAGGTTGTAG AAAAGAAACA CACTGTCAAA
ACGAAGGATC ATGTGAACGG AATCTTCAAC GACTATTTGT CTGTTGATCC ACTCCAGGCT
TTTCTTCAGA ACAATGACCC CAACAGTAAC ATCTCGGAGA CGTATCATCC GTCCAAGGTG
GTCGATGTAG ATGATGTCTA TATTCCTAAA CCTGTTACTG CGCCAGTAAT TTCACCTAAA
CTACCACCAA GAGTCAACAC CGCTCACAGC TCTAAACCTT TGCCTACCGT CGCGGCCAAT
AAAGTCACTA CCCCATTGCC ACCCCTTCCA CCTGACAGAT TTGTTTACAA CAGTAACTTC
TCGCGCAGTG ATTCCTTGGA CTCAATTCAT TCTGATAACG AGTCAATCAT ATCTGATTCT
TCTTCCCACA GTACTACTTC TCTTGTCAGT GACATACTCC TTCACAATGC CTCCGCTGAT
GTTGTGAACA AGCACTTGAA GAAGGTCTAT AATTCGTCTA AGAATGACAT CAAGTATTCT
CCTATTCCAG AGAGATTTGC TGATTTGGAT ATACCTCCCG CAACAGATGA TCTTATATTC
CAAAAGACCA CTACTGTAAC CTATAAGTTG CACGAGTTCA ACAAGTTCTT CGACAAAATT
ATCGCATTGT CAGATTCGAT GCAATTGGAT CGACGTGTTT TGGAGGCTAA ATATGATTTT
CCTGGTATCG AGCCGGGTGA CTTGTCTTTC AAAACAGGCG ACAAGATCGA AATCATCTTT
GACTTTCAGT CCATCGACAC TTTATATAGT AATGACCAGA AGAACTGGTT GATTGGCGCC
TCCAAGTTCG GCCAAGACCA TTTCCGGATT GGATTTGTTC CAAGCAATTA CTTCTAG
 
Protein sequence
MSVGKLKKGL TEFGSSVKDT VVSVSDTVTT VRIHRDYDKD DELIEHYKHD LSKAKSGLSY 
IASQQKKMAS SHWGKLFKLN IRIVEHFIQL LGTDSLSFKG IEDYYHDFDK FQATEEIPMV
HPKERQFLIE SVHSELVNYM SSLQQGKFKI TQDWDIHEKS LKLRITEMNK HISDTLKLIK
KRNKKKTNYI KTEHKISKLM KKTAPLETKE QDQLNTLESS LKEEEKEYTK INDKLKSILP
HVISFLDEFV ENITKIILCK QVETYKEIAQ MLDYFCTFHG FLDTSGDPHN KIQSYEDIIS
KWEEATTPTR LQIESFISII YDKKPELIDT EIDEKDKTSS AAKMWNKITD KVVEKKHTVK
TKDHVNGIFN DYLSVDPLQA FLQNNDPNSN ISETYHPSKV VDVDDVYIPK PVTAPVISPK
LPPRVNTAHS SKPLPTVAAN KVTTPLPPLP PDRFVYNSNF SRSDSLDSIH SDNESIISDS
SSHSTTSLVS DILLHNASAD VVNKHLKKVY NSSKNDIKYS PIPERFADLD IPPATDDLIF
QKTTTVTYKL HEFNKFFDKI IALSDSMQLD RRVLEAKYDF PGIEPGDLSF KTGDKIEIIF
DFQSIDTLYS NDQKNWLIGA SKFGQDHFRI GFVPSNYF