Gene PICST_67553 details


Gene Information

Locus tag: PICST_67553
Symbol:
ID: 4838892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 730725
End bp: 733952
Gene Length: 3228 bp
Protein Length: 850 aa
Translation table: 12
GC content: 42%
IMG OID: 640390207
Product: hypothetical protein
Protein accession: XP_001384437
Protein GI: 150865287
COG category:
COG ID:
TIGRFAM ID:
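The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding region ends with a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 730725
end_bp = 733952
gene_length_bp = 3228
protein_length_aa = 850

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the coding region is 3 nt per residue plus one stop codon.
cds_bp = 3 * (protein_length_aa + 1)

# Bases in the listed span that are not accounted for by the coding region.
non_coding_bp = gene_length_bp - cds_bp

print(span_bp == gene_length_bp, cds_bp, non_coding_bp)
```

Under these assumptions the span matches the listed gene length, and the coding region is shorter than the listed gene, which suggests the gene sequence in this record includes flanking bases on either side of the open reading frame.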


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00659474
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TGCTATTACA GTTGCTGTGT TGACTACGTC TATTGAAACT TCTTCTTCTT TACTACAAAT 
TTCCAATACT ACTACTTGAA CTGATTCACT TTTTTATTCG CAACCACTTC TTACCTGATT
CTCCAGTGTC TAGCAATGCC TAACAAGTTA GAATTAACTG CCAGCGAAAT CAAGGCTATT
CAAACTTCGT GGGAACGGTT ACAATCCAGT AACAGAAAGC ACAAGGACCA CTTCACTTCG
AGATTGTACT CGAATTTGGT AAGATCCCAC CCACAATTCA AGACTATCTT CGATGATTCC
AGCGTCTTGA AGGAACACTC TACTTTATTC GGCGAAATCT TGAGTTTCAT CGTCTTGTAC
AAACATGACC CTGAAATCTT GAATGACTTC ATGTACCAGT TTATTCATGA AAACCAGAGA
TTTGCCAACG TTACTGTTCT TTACTTGGAG CCTATGGGAG AATCTTTGAT TGATACTTTC
AAGCAATGGT TAGGTGACTC TGTTTTCACA AATGAATTTG AATCTGTCTG GATCAAGACT
TACGTCTTTG TTGCAAACTC TTTGTTGCAA TACTCCGATA GTGACGACGA AGCCAGCGAA
GTGGAAAGCA CCTTCAGCCC ACAGCAGGCT GGAAGTGAAG CTGACGTTTC TGACAACGAA
GACATTCAGC CTTTGAATAT CAAAAGAAAC GATCGCTCGT TGTACTCGAC TCCTGAGCCT
GTTTCTGAGC CTGTTTCTGA GCCTGTTGCT ACTCCCGTTG TTGCACCTGT TGTTGCACCT
GTTGTTGCAC CTGTTACTCC AAAGACTGCC ACTGAGGCTC GTGCCTCAGT TTTGGACAAG
TCTAACTCTA TCCAATTCAC ATTGGGTGGC AACGACAAGT ACAGAGGCTT CAGAAGATCG
GTTCATGATA ACATTCCCAA GAATGAACCA ATTTCTGTCA AGATTCCACA GTCCAGCAAC
TTTCTGAAAA CGCATTCCCA TATTCCTACT TCTAACATTT CCTCTTCGTT GAAGTCTGTC
TTGGAATCGT CTCCTACTTC CTCTAACTCA TCTATAACTT CAGCTCCTAC ATTCGATCCA
AGAAGGCAAA GAAGATCTCG TTCTGTTGTG AGTTCTTCTG CATCTTCAAT CAACGAAGAA
TTTACTTCTG CACCTGAATC ATTTGAACAA CCTATCATAA CTCCTCGTTC TGCCAGAAGA
AATTCTAATG CTTCTGAAAA GCCACTTCCA GCTGCACCTG TACAACAAAA GAAGACTTTG
CCATATCAAC CTTCTTTACT CAGAAAATTG GAACAGAGAC AAGCTTCCCC ATCTTCAGAA
GAATACTCTG ATGAAGAAGT TGAAGTTAAG TCGAGGTTTG ACCCAAGAAA ATCTAGGGGC
CACAAGAGAA CAAACTCGAT ACACATTCCA TCCCCAGAAT CTTCTGAAGT GGACGAAACT
GACGAGACTG AACTTTTCAA TCCATACAGA AGCACTTTTG CCGAGGAATT TGACGAAGAA
AAACAAATTG ACATCGACCC ACAAATGCCA TCTGCGTCAT CTTCTATTCC CGAGAGATCT
TTGTCTAGAG GAGGATTGAC TGGTGGTACT TTTGACTACA ATAGTTTTGG ACTTAAGGGT
TTAGCTCCTA TTGTCGAAAA TGAAATTGAT GATGGTGCTT CTTCTAAGTA CAACAGTGAT
GAAGATAATG TTTCTTCGAG CTATTGTGCT GACGAAAGTA ACAAATCATT ATCTGACCAA
GGCTCTTCTG ACCAAGGGTC TTCTTCCAGA GCCTCGTCTT TGTCGTTACA CAATAGTGAT
TACAAATCTT CTATCTCGTC TGGCACTGAA TCTGCCAGCA ATTCGCCATT TATGAACGGC
AAGTTCCAGC CGGGCCATCA AATGAGACAT TCTTCTGGAT CGAGTGAAAT TAGTTACATG
AAGTCATTAT CTGATGATAA TACCTTCGCT TACAAGACTT ACAGTTCATC TACACCCTCC
TTGTTGTTAG GAAAGGATCC TACGCTGTTC AGTAAGAGAG CTTCGATTGG CTTCATGAGA
AGTTCCTTTG TGTTGAAGAA GGAAATGGAA GAGTTTGGTT TGGCTAGACA AGACTCGTTG
ACCAAGACTC TCTCCGGTCT GGGATCTACT CCAAACTTGG CTCACCTTCC TCAAAACTAC
AATAGAAGGG CTGCTACTAC CATTTCAATT CCAGAAAGCG ATGGTGACGA TGAATTTGAC
ATGTTCAGAT CCTTTGGTCA AGCTCCTCCT ATTCAGACGA AGCCTATAAT TCACAACAAG
TATGAGTCCA TGAACTTTAC TGCCATTGCT GAAAAGGCAA AGGCAAAGGC TCCAGTTCAA
CAAGCTCCAC CTGCTGTTAA GGAAAAGAAG TCGTTCAAGA AGAGAATTTC GTCGATGTTC
TCATCCAAGT CTTCTGATTC TACACCATCA GTTCCTCCTA TTTCTGTTGC TGCCCCTAGT
GTTTACACTA GTACCAGCTC CAAGAAGACT TCTAAGCGTT CTGGATCCAC CTATGACTTG
GCTTCGATCA ACACCAACTC CACCAAGGGT ACCACAGTTT CTGGCTTCTC CTTCTTCAAG
CAAAAGAAGA ACGATAGCAA GTATAACGTG GTACACAGAA CTACTCCAAG AAAGGGTAAC
AAGTACCAGG TTAAGGCTGC TGCTTACGAC TTGGATTTGT TCAAGTAAAC AGCCCCAACC
CTAGTGCCTG TACTATAGAC CGACTTTCCT ATGAAAGACA AAAACACTAG AAAACACGAT
GATACCCAGA TAGCCTGATT GGTGCTGTCT CTTATTTAAA GTTGTGAAAT AACTACGAAG
CTGGCCGTTC CAAAATTTTC CTTTATTTTC TTTTCTTCTC TCGTTGCCGT TTTGAATACT
ACTTGTTCTA CTGGCTGGAA CGTTTGGTGC TGTTCTACAG AGGTCTCGAA ACATGAGTCG
GAGTCAAACA TCGTCTGAAA GACGTCAATC GACTACCCCC CATGTAATGC ATTGGGTCAA
ACAAAAGGAT TGTTGCTAGA CATCGATAAA CTTAGACTTG TTTGATGATT CTGAATGCAT
GATCACGAGT ACCTGGTTCA CATAGCTCAG GCGGATCCTA CACCTGGCTG CAAATAGCAT
CTTTTGATTT GTATGATCTG CACTTTGTTC CTTTAGCTGG AATTTTTTTC ATATCAGCTT
CGTCTATTTC CTTTGTGAAT GTATAGTAGA TTAATGAACA TTTGAATT
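The gene sequence above is laid out in groups of 10 bases, so the spacing has to be removed before computing length or GC content. A minimal sketch follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the input shown is only the final line of the listing, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace from a grouped sequence listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Final line of the gene sequence listing, copied from this record.
last_line = "CGTCTATTTC CTTTGTGAAT GTATAGTAGA TTAATGAACA TTTGAATT"
fragment = clean_sequence(last_line)

print(len(fragment), round(gc_percent(fragment), 1))
```

Applying the same two helpers to the full listing would give a length and GC percentage to compare with the Gene Length and GC content fields.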
 
Protein sequence
MPNKLELTAS EIKAIQTSWE RLQSSNRKHK DHFTSRLYSN LVRSHPQFKT IFDDSSVLKE 
HSTLFGEILS FIVLYKHDPE ILNDFMYQFI HENQRFANVT VLYLEPMGES LIDTFKQWLG
DSVFTNEFES VWIKTYVFVA NSLLQYSDSD DEASEVESTF SPQQAGSEAD VSDNEDIQPL
NIKRNDRSLY STPEPVSEPV SEPVATPVVA PVVAPVVAPV TPKTATEARA SVLDKSNSIQ
FTLGGNDKYR GFRRSVHDNI PKNEPISVKI PQSSNFSKTH SHIPTSNISS SLKSVLESSP
TSSNSSITSA PTFDPRRQRR SRSVVSSSAS SINEEFTSAP ESFEQPIITP RSARRNSNAS
EKPLPAAPVQ QKKTLPYQPS LLRKLEQRQA SPSSEEYSDE EVEVKSRFDP RKSRGHKRTN
SIHIPSPESS EVDETDETEL FNPYRSTFAE EFDEEKQIDI DPQMPSASSS IPERSLSRGG
LTGGTFDYNS FGLKGLAPIV ENEIDDGASS KYNSDEDNVS SSYCADESNK SLSDQGSSDQ
GSSSRASSLS LHNSDYKSSI SSGTESASNS PFMNGKFQPG HQMRHSSGSS EISYMKSLSD
DNTFAYKTYS SSTPSLLLGK DPTSFSKRAS IGFMRSSFVL KKEMEEFGLA RQDSLTKTLS
GSGSTPNLAH LPQNYNRRAA TTISIPESDG DDEFDMFRSF GQAPPIQTKP IIHNKYESMN
FTAIAEKAKA KAPVQQAPPA VKEKKSFKKR ISSMFSSKSS DSTPSVPPIS VAAPSVYTST
SSKKTSKRSG STYDLASINT NSTKGTTVSG FSFFKQKKND SKYNVVHRTT PRKGNKYQVK
AAAYDLDLFK
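The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below builds the standard codon table and overrides that single codon, then translates two short fragments copied from the gene sequence above. The function names are illustrative, and this is a plain-Python sketch rather than the tool used to produce the record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(overrides=None):
    """Build a codon -> amino acid map, optionally overriding some codons."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    table.update(overrides or {})
    return table


STANDARD = codon_table()
TABLE_12 = codon_table({"CTG": "S"})  # the only sense-codon change in table 12


def translate(dna, table):
    """Translate in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Start of the open reading frame, copied from the gene sequence.
orf_start = "ATGCC TAACAAGTTA GAATTAACTG CCAGCGAAAT CAAGGCTATT"
# In-frame fragment containing a CTG codon, copied from the gene sequence.
ctg_fragment = "AACTTTCTGAAAACGCAT"

print(translate(orf_start, TABLE_12))
print(translate(ctg_fragment, TABLE_12), translate(ctg_fragment, STANDARD))
```

The second fragment differs between the two tables only at the CTG codon, and the table 12 reading is the one that agrees with the protein sequence listed above.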