Gene PICST_69865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_69865 
Symbol 
ID4837444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2133916 
End bp2137063 
Gene Length3148 bp 
Protein Length1021 aa 
Translation table12 
GC content44% 
IMG OID640388759 
Productpredicted protein 
Protein accessionXP_001382635 
Protein GI150863974 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID[TIGR01408] ubiquitin-activating enzyme E1 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.91244 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CATGGCCGAC AGCATGCTTG TAGACTCGCC TGTACCTCAG GAAATCGACG AGGGCCTTTA 
CTCTCGTCAA TTGTACGTTT TGGGCAAAGA AGCCATGTTG AAGATGCAGA ACGCATCTGT
GTTAATTATA GGCTTGAAAG GCTTGGGTGT AGAAATCGCC AAAAACGTTG CTCTTGCCGG
AGTCAAGCTG TTGAGCTTGT ACGATCCCGA GCCCGTAGCC ATCTCTGACT TGTCGTCGCA
GTTCTTTTTG GACGAATCGG CTGTTGGCAG CAACCGTGCT GAAGTCACTG CTCCAAGACT
CGCTGAACTC AATGCCTATG TTCCCATCTC CGTCATTACT GACTTGTCTG AGGCAACTTT
GTCAAACTAC AAATGTATTG TTGCCACCAA CTTGCTGTTA GAAGAACAAG TCCGAATCAA
CACCTTTACA CACGAACGGG ACATTGGTTT TATTGCCGCC GACAATCGAG GCTTGTTTGG
CCAGTTGTTT GTAGATTTCG GATCCAGTTT CACGATTATC GACCAAACGG GCGAAGAACC
ACATACGGGG ATTGTTTCCG ACATTGAGGC TGACGGAACT GTCACCATGC TTGACGATAA
CAGACACGGC TTGGAAGATG GAGACTACGT CAAGTTTACG GAAGTCGAAG GAATGCCCAA
GTTGAATGAT GGAAACCCTC ACAAGATTGA AGTTTTGGGC CCATATGCCT TTAGAATTAA
TATTGATGAA TCTTACGGCA AGTATGTCAA GAACGGTTTA TATACGCAAG TCAAGGTTCC
CAAAGAAATC CACTTTGAGC TGCTTTCATC GCAATTGGCT AATCCTGAGT ACATCATTTC
CGACTACGCC AAGTTCGATA GACCTCCCCA GTTACATTTA GGCTTCCAGG CTTTACAAGC
TTTCCAGACG AGACACTCCG GGAAATTGCC CAGACCATAC AACTCTGAGG ATGCTTCGGA
ATTGTTCCAA TTATCCCAAG AAATCGCAGT ACAATTTCCT ACCATCTTGG GAGAAGCTTC
TGTAGACGAG AAGTTGATCA AGGAATTGGC CCACCAGGCT ACTGGGGACA TTCCAGGAAT
TGTAGCCTTC TTTGGTGGTT TAATTGCCCA GGAAGTGTTG AAGAACTGTT CTTCTAAGTT
TGGCCCAGTC AAGCAATGGT TGTACTTCGA TTCGTTGGAG TCGTTACCTT CCAACGAAGA
GTTTCCCAGA ACTGCTGAGA CAACCAAACC ACTTGGTTCG AGATACGACG GCCAGATTGC
CGTCTTTGGT AAAGCTTTCC AGGAAAAGAT CGCCAATTTG AAAGTCTTCC TTGTTGGTTC
TGGTGCCATT GGTTGTGAAA TGTTAAAGAA CTGGGCCATG ATGGGGTTGG GCAGTGGACC
TGATGGTAAG ATTATAATCA CTGACAACGA CTCAATTGAA AAGTCCAACT TGAACCGTCA
ATTCCTTTTC AGACCAAAGG ACGTTGGTAA GAACAAGTCT GATGTAGCTG CCGTAGCTGT
TCAGCACATG AATCCAGACT TGACGGGTAA GATCGAGTCA AAACTTGACA AAGTGGGCCA
CGAGACTGAA AATATCTTCG ACGATGATTT CTGGAAAGGT TTGGATTTTG TTACCAACGC
TTTGGATAAC GTCGAAGCCA GAACCTATGT AGACCGTCGT TGTATCTTCT ACAAAAAGCC
TTTGTTGGAG TCGGGAACAT TGGGAACTAA AGGTAACACG CAGGTAGTTA TCCCTAACTT
GACTGAATCA TACTCTTCTT CACAAGATCC ACCTGAAAAG TCCATTCCCT TATGTACTTT
GAGATCGTTC CCAAATAAGA TCGACCACAC AATTGCTTGG GCCAAGTCAT TGTTCCAGGG
TTATTTCTTT GACTCTCCTG AAAGTGTCAA CTTATACTTG AGTCAACCAA ATTATGTAGA
ACAGACTTTA AAGCAGAACC CGGACATCAA GGGCACGTTG GAAAACATCT CCAACTTGTT
GAATAAACGG CCATACAGCT TCGACGATTG CATCAAGTGG GCACGTTTGG AATTCGAAAC
CAAGTTTAAT GGTGACATCA AGCAACTTTT GTACAACTTC CCAAAGGATG CAAAGACGTC
TACTGGTGCT CCGTTCTGGT CCGGTCCAAA GAGGGCTCCA GAGCCCTTGG TATTTGACAT
CAACAACTCT GACCATTTCA ACTTTGTAGT TGGTGGAGCC AACCTCTTGG CTTACATCTA
TGGCTTGAAG GCTCCTGTTG CTAGTTTCGA CGATTACAAG AAGGTCATTG ATCTGACTAA
AATTCCAGAG TTCACTCCTA GATCAGGTAT TAATATTGCT GCAACTGAGA ACGAAGCCGA
AGAACAGGAA AAGACTTTGG CTGGATCGAT CGACGACAAC GAAATCAGAA AGATCGCCGC
TTCTTTACCT GAGCCAAGCA CTTTAGCCGG CTATAGATTA AATCCTATTG AATTCGAAAA
GGACGACGAC ACCAACCATC ATATTGAGTT CATTTCGGCT GCTTCTAACT GTAGAGCTTT
GAACTATGCC ATCGAGACTG CCGATGCCCA CAAGACTAAG TTCATTGCTG GTAAGATTAT
CCCAGCCATT GCCACCACCA CAGCATTGGT GACTGGTTTG GTCTGCCTTG AGTTATACAA
GGTTGTTGAC GGAAAGGACG ACATTGAGCA ATACAAGAAT GGTTTCATCA ATTTGGCCTT
ACCTTTCATT GGATTCTCGG AACCAATCAA GTCAGCTCGT GGAAAGTACA ACAAGAAGGA
TTTCGACCAG ATCTGGGATA GATTCGACTT AGAAGGTAAC ATCACCTTGC AGCAGTTGCT
TGACCACTTC AAGGAAAAGG AGGGTTTGGA AATCTCGATG TTGTCATATG GAGTGTCTTT
ATTATATGCC TCATTCTTCC CACCCAAGAA AATCAAGGAA AGATTATCAT TGGACTTGAC
GACTTTAATC AAACAAGTCA GCAAGAAGGA CATCCCAGAC CACGTGAACT ACTTGATCTT
TGAAGTTTGC TGCGACGACG AAGAAGGTGA AGATGTTGAA GTTCCTTACA TCTGCGTGAG
TATCTAGATT AAATGATTTA TAAAGTTACG ATTACAATAG AAATAACATA AAGATATTAT
ATCTCGATTT TGAATTATAG CGACCATT
 
Protein sequence
MADSMLVDSP VPQEIDEGLY SRQLYVLGKE AMLKMQNASV LIIGLKGLGV EIAKNVALAG 
VKSLSLYDPE PVAISDLSSQ FFLDESAVGS NRAEVTAPRL AELNAYVPIS VITDLSEATL
SNYKCIVATN LSLEEQVRIN TFTHERDIGF IAADNRGLFG QLFVDFGSSF TIIDQTGEEP
HTGIVSDIEA DGTVTMLDDN RHGLEDGDYV KFTEVEGMPK LNDGNPHKIE VLGPYAFRIN
IDESYGKYVK NGLYTQVKVP KEIHFESLSS QLANPEYIIS DYAKFDRPPQ LHLGFQALQA
FQTRHSGKLP RPYNSEDASE LFQLSQEIAV QFPTILGEAS VDEKLIKELA HQATGDIPGI
VAFFGGLIAQ EVLKNCSSKF GPVKQWLYFD SLESLPSNEE FPRTAETTKP LGSRYDGQIA
VFGKAFQEKI ANLKVFLVGS GAIGCEMLKN WAMMGLGSGP DGKIIITDND SIEKSNLNRQ
FLFRPKDVGK NKSDVAAVAV QHMNPDLTGK IESKLDKVGH ETENIFDDDF WKGLDFVTNA
LDNVEARTYV DRRCIFYKKP LLESGTLGTK GNTQVVIPNL TESYSSSQDP PEKSIPLCTL
RSFPNKIDHT IAWAKSLFQG YFFDSPESVN LYLSQPNYVE QTLKQNPDIK GTLENISNLL
NKRPYSFDDC IKWARLEFET KFNGDIKQLL YNFPKDAKTS TGAPFWSGPK RAPEPLVFDI
NNSDHFNFVV GGANLLAYIY GLKAPVASFD DYKKVIDSTK IPEFTPRSGI NIAATENEAE
EQEKTLAGSI DDNEIRKIAA SLPEPSTLAG YRLNPIEFEK DDDTNHHIEF ISAASNCRAL
NYAIETADAH KTKFIAGKII PAIATTTALV TGLVCLELYK VVDGKDDIEQ YKNGFINLAL
PFIGFSEPIK SARGKYNKKD FDQIWDRFDL EGNITLQQLL DHFKEKEGLE ISMLSYGVSL
LYASFFPPKK IKERLSLDLT TLIKQVSKKD IPDHVNYLIF EVCCDDEEGE DVEVPYICVS
I