Gene PICST_32486 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32486 
Symbol 
ID4839165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1628564 
End bp1630375 
Gene Length1812 bp 
Protein Length603 aa 
Translation table12 
GC content42% 
IMG OID640390480 
Productpredicted protein 
Protein accessionXP_001385009 
Protein GI150865686 
COG category[S] Function unknown 
COG ID[COG3538] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.209404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.436632 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGTTG CTGGAAGAAG AGTTCATCCC AGACGGTTTA CTCTAGTAAT ATTACTAGTC 
GGGATACTCT TCTTTTTGTA TTTATTCTTT TCACTGGATA GCGACTTAGC TGAAGACGAT
GAATATGAAA ATGATGATAT TATCAAGCTG GTTCACAACC GACACCGTTT TGCGGGTAAG
GGTAAATGCC CCGATTACGT AGAATACTCA CAGAAGCCAC ATCCTCCATT CACACAAGGA
AAGTATAATT TCCCTTTCAT GCGGCCATCG CTTGAGTGTC GAACTTTTAC TTCGAAGGCT
GTAGAATACT TGATTCTGGA TTTGAAATCC AAAGTGAATT CTGTGGACTT GGGCCGTTTG
ATTGAAAACT GTTTGCCCAA CACGTTGGAT ACCACCATCT TGTGGCACAA ATCGTCATTG
ACCAATCTCA ATGCTGCCAG AAGACAGGAT TACCCGCAAA CATTTGTAGT TACTGGTGAT
ATCCACGCCG AATGGTTGAG GGATGCAGCC AGACAGTTGT CGACATACCA GCCTTTGATC
AAATATGATC CAGAATTACG TGAGATGATT AAGGGAGCCA TCAGCACCCA GGCTTTCTAT
GTCATCAACT CACCATATTG TAATGCTTTT CATCCTCCTC CAGGCTCTGG TGTAAAGCGA
GGAAATACAG CTATGGACTA CGTATTTCCA CACCCGGACT GGAGGCAAGT ATTTGAGTGT
AAATACGAGT TGGACTCATT GGCATCGTTT TTGACATTGA GTAACGATTT CTATGAGAAT
TCAGATGGCG ATATTTCTTT CGTTAACCAC TACTGGTTGA GAGCGTTGGA AAAGTTGCTA
ATAGTATTCA AACGAGAATC CCAACCATCA TTTGACGAAG AAACTGGTGC TGCCATCAGA
TTCTACTATG CCTTCAAGAG GCAAACAGAC ATTGGAACTG AAACGTTGCC CTTGGGAGGT
GTAGGAAATC CTGTCAACTA TGGCACTGGT CTCATACGAA GTGCTTTCAG ACCCAGTGAC
GATGCATCGA TTCTCCAGTT CTTTATTCCA GCGAACATAC ATGCGCTCAC GGAGTTGCAA
AGACTTCGTA AGAACTTCTT GAATGAAGAT GTTATCACGG ATAAAGATAT TTCTGTCTTG
ATAGAGACGG TGGACCATTT CATTGACAGC TTGACAAAGG GAATTGAAAA GTATGGAATT
GTAGAGCATC CAAAATTCGG AAAAGTGTAC GCTTATGAAG TAGACGGATA TGGTAGTGCT
GTTTTCATGG ACGACGCCAA TATTCCTTCG TTGTTGTCGA TTCCAGATAT GGGCTTTAAA
CCAAGAGATG ACCCAATCTA CCAGAATACA AGAAAGATGA TTCTTCTGAA ATCTGGAAAT
CCTTACTACT TGAAGGGAAG ATACTTTGAA GGTATTGGAG GTCCTCATAT CGGTATTCAC
AATGCCTGGC CAATGTCACT TCTAATGAGA ATTAGAACTT CAGACGATGA CGAGGAGATC
ATGGAGAACC TCAAGGCTGT CATGGATACC ACTGCTGGTT TAGGTTTGAT TCACGAAAGT
GTGAATGTCA ATTCTGCCTA CGGTAGAGAG TATACCAGGT CATGGTTTGC CTGGTGCAAC
TCTGAGTTCG GAAAGACCAT CTTGCATTTG GCCAAGCACA AACCATACCT AATTTTCAAG
GAAGAGTTCC AGTCTGAGGC ATACAAGATC GATGAGGTAT TGGCTCCTTT GTTATCAAAT
GTACCAAAAA TTGATAAAGA TCCATCCGAC AACCAAAAAC ATGTCGAGGG TGAACCTCGA
AATGATGAAT AG
 
Protein sequence
MHVAGRRVHP RRFTLVILLV GILFFLYLFF SSDSDLAEDD EYENDDIIKS VHNRHRFAGK 
GKCPDYVEYS QKPHPPFTQG KYNFPFMRPS LECRTFTSKA VEYLISDLKS KVNSVDLGRL
IENCLPNTLD TTILWHKSSL TNLNAARRQD YPQTFVVTGD IHAEWLRDAA RQLSTYQPLI
KYDPELREMI KGAISTQAFY VINSPYCNAF HPPPGSGVKR GNTAMDYVFP HPDWRQVFEC
KYELDSLASF LTLSNDFYEN SDGDISFVNH YWLRALEKLL IVFKRESQPS FDEETGAAIR
FYYAFKRQTD IGTETLPLGG VGNPVNYGTG LIRSAFRPSD DASILQFFIP ANIHALTELQ
RLRKNFLNED VITDKDISVL IETVDHFIDS LTKGIEKYGI VEHPKFGKVY AYEVDGYGSA
VFMDDANIPS LLSIPDMGFK PRDDPIYQNT RKMILSKSGN PYYLKGRYFE GIGGPHIGIH
NAWPMSLLMR IRTSDDDEEI MENLKAVMDT TAGLGLIHES VNVNSAYGRE YTRSWFAWCN
SEFGKTILHL AKHKPYLIFK EEFQSEAYKI DEVLAPLLSN VPKIDKDPSD NQKHVEGEPR
NDE