Gene PICST_83827 details


Gene Information

Locus tag: PICST_83827
Symbol:
ID: 4839240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 937150
End bp: 939542
Gene Length: 2393 bp
Protein Length: 789 aa
Translation table: 12
GC content: 41%
IMG OID: 640390555
Product: predicted protein
Protein accession: XP_001384848
Protein GI: 150865577
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
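
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding region ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 937150
end_bp = 939542
protein_length_aa = 789

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the coding region is one codon per residue plus one stop codon.
cds_length = (protein_length_aa + 1) * 3

# Bases in the listed gene span that are not part of the coding region.
flanking = gene_length - cds_length

print(gene_length)  # 2393, matching the Gene Length field
print(cds_length)   # 2370
print(flanking)     # 23
```

Under these assumptions the 23 remaining bases are consistent with the 23 bases that precede the first ATG in the gene sequence listed in this record.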


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0497188
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CAGACCTTTG CCCTTCCTCA AGCATGGATA CCTTGAAAAC TACCTATAGC CACCGGGATA 
TAGAACCTAT TTATGTGGGT GGAACCTCGG CTACCATTTC TGCTGACGGG CTGATCTTGG
CCACGCCTTT GAACGAGGAC GTAGTTATCA CCAGTTTAGA TACTAACGAG ATCCTCCACA
AGATCGAGGG AGATGGTGAA ACCATTACAA ATCTCGTTAT TACACCCGAT GGATCCAAGT
TGGCAATTTT GTCGCAGTCC CAGCAATTGC GTATATTTGA CTTGGACAAG CTGGAAATCA
CTAAAACCTT TAAAATGCCA TCTCCAGTAT ATATCTCTTC AGTCGATTCC ACATCTTCCC
TATTTGCTTT CGGAGGTTCG GATGGTGTTA TCACTGTTTG GGATATCGAT GGTGGATATG
TGACGCATTC TCTCAAAGGA CACGGAACCA CCATCTGTTC GTTAATATTC CACGGCCAGT
TGAACTCTAC CGAATGGAGA TTGGCTTCTG GTGATACTAT GGGTACAGTT AAAGTATGGG
ACTTGGTAAA AAGAAAATGT CTCACAACAG TTAATGAACA TAATACAGCA GTCAGAGGCG
TTGGCTTCGA CAGTTTGGGC CAACATTTCA TCACTGGTGG AAGAGACAAT GTAGCAATCA
TCTATAACAC GAAAAACTAT AAGCCAGTGA ACACCTTTCC AATCAACGAA CAGATCGAAT
GTGCTGGTTT CATCACCATT TATGACAGGG AATTCTTCTA CACCGCTGGT TCTGAAAATA
TCTTGCGACT CTGGAGCATT GCTACTGGTA CATTGGTGGC CAGCTCAAAA GCTTCGTTAA
AGACAAATGA AGAATTGATC ATAATAGATG TCTTAAAGTT GGAAAACAAT GACTTGGTCT
TAGTAGTAAG TGACCAGACG TTGATTCATT TGGATTTACA AGAGCTTGAT TTCGACAATG
GTGAAACTGT AGAAATTCCT GTAGCCAAAA GAATTGCAGG AAATCATGGT ATCATTGCCG
ATATCAGATA CGTGGGAGAA AAGTTCAACT TGATAGCATT ATCCACCAAT TCGCCAGCAT
TGAGAATAGT GGACCCATTA AAGCCATTGG AATTGAGATT ATACGAGGGC CATACCGATT
TGCTCAATGC TATGGATGTC TCTACTGATG GAAAGTGGAT AGCTACTGCT TCTAAGGATA
ACGAAGCCAG ATTATGGAAA TGGGATGAAG AACAAGATGA CTTTGTTCCT TTTGCAAAAT
TTCAAGGTCA TGCTGGATCA GTCACAGCTG TGGCATTATC CAAAGCAGAA AACACACCCA
AGTTCTTGAT TACTGGATCC AGTGATCTTA CTATCAAAAA ATGGAAGATC CCAGCTACTG
CCGGATCTAC TGTCAAGACA TCCGAATATA CCAGAAGAGC TCACGATAAG GATATCAACT
CTATAGACGT AGCGCCAAAC GATGAGTACT TTGCATCTGC ATCATATGAT AAGTTCGGTA
AAGTATGGAA CACTGCTAGC GGAGAAACTA TAGGTGTTTT GAAAGGTCAC AAGAGAGGAT
TGTGGGATAT TAACTTCTAC AAATTCGACA AGCTCATTGT CACTGCAAGT GGTGACAAGA
CTCTCAAGGT ATGGTCCTTG AATGACTTCA CCTGCGTCAA AACTTTTGAA GGCCATACCA
ACTCCGTGCA GAGAGCCAAA TTTTTCAATA GATTCAGCCC ACAGTTGCTT TCAACTGGTG
CAGATGGTTT AGTTAAGGTT TGGGACTACA AGAGCGGAGA AATCATAAAA ACGCTCGATA
ACCACGAAAA TAGAATTTGG TCTATCGATA TTAAAGAAGA TGGTAATACT TTTGTCACTG
CTGATGCTGA TGGTAAATTG AGTGAGTGGG ACGACAATAC GGCAGAAGAA ATCAGACTTA
GGGAACAACA AGACAAGTTC AAGGTTGAAC AAGAACAAAA CTTGTCCAAT TATATCAGTA
ATAGAGACTG GCCAAATGCT TTTTTGTTGG CATTGACGTT GGACCACTCT ATGAGATTGT
ACAATGTCGT CAAGTCTTGT ATTGAAGCCA ACGAAGATCC TAATTCAGCT ATTGGCTCAG
AGCCATTGGA AGAAACTATC ATTCAGTTAT CTGACGAGCA ATTGTTGAAG TTATTCAAGA
AAGTCAGAGA TTGGAACACC AACTTCAAGT TCTTTGAAAT TAGTCAAAAA TTGATTTCTG
TTCTCATGTC CAACATCGAA ACTGAAAGAT TGATAGAAAT ACCAGGCTTG ATGAAAATCA
TCGAGGCACT CATTCCATAC AACGAACGTC ATTACAATAG AATCGACGAC TTAATAGAGC
AAAGTTACAT CTTGGATTAC GCTGTGGAGG AGATGAACAA ATTGATAGCA TAG
 
Protein sequence
MDTLKTTYSH RDIEPIYVGG TSATISADGS ILATPLNEDV VITSLDTNEI LHKIEGDGET 
ITNLVITPDG SKLAILSQSQ QLRIFDLDKS EITKTFKMPS PVYISSVDST SSLFAFGGSD
GVITVWDIDG GYVTHSLKGH GTTICSLIFH GQLNSTEWRL ASGDTMGTVK VWDLVKRKCL
TTVNEHNTAV RGVGFDSLGQ HFITGGRDNV AIIYNTKNYK PVNTFPINEQ IECAGFITIY
DREFFYTAGS ENILRLWSIA TGTLVASSKA SLKTNEELII IDVLKLENND LVLVVSDQTL
IHLDLQELDF DNGETVEIPV AKRIAGNHGI IADIRYVGEK FNLIALSTNS PALRIVDPLK
PLELRLYEGH TDLLNAMDVS TDGKWIATAS KDNEARLWKW DEEQDDFVPF AKFQGHAGSV
TAVALSKAEN TPKFLITGSS DLTIKKWKIP ATAGSTVKTS EYTRRAHDKD INSIDVAPND
EYFASASYDK FGKVWNTASG ETIGVLKGHK RGLWDINFYK FDKLIVTASG DKTLKVWSLN
DFTCVKTFEG HTNSVQRAKF FNRFSPQLLS TGADGLVKVW DYKSGEIIKT LDNHENRIWS
IDIKEDGNTF VTADADGKLS EWDDNTAEEI RLREQQDKFK VEQEQNLSNY ISNRDWPNAF
LLALTLDHSM RLYNVVKSCI EANEDPNSAI GSEPLEETII QLSDEQLLKL FKKVRDWNTN
FKFFEISQKL ISVLMSNIET ERLIEIPGLM KIIEALIPYN ERHYNRIDDL IEQSYILDYA
VEEMNKLIA
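
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the opening of this gene. It builds the codon table by hand from the standard code instead of using a bioinformatics library; `build_codon_table` and `translate` are hypothetical helper names, and the input is only the first two 60-base lines of the gene sequence above.

```python
# Standard genetic code, codons enumerated in NCBI order over the bases T, C, A, G.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(ctg_as_serine):
    """Return a codon -> amino acid map; optionally apply the table 12 change."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"  # the single sense-codon difference used here
    return table


def translate(dna, table):
    """Translate from the first base, stopping at a stop codon or the last full codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two lines of the gene sequence as listed in this record.
gene_start = (
    "CAGACCTTTG CCCTTCCTCA AGCATGGATA CCTTGAAAAC TACCTATAGC CACCGGGATA "
    "TAGAACCTAT TTATGTGGGT GGAACCTCGG CTACCATTTC TGCTGACGGG CTGATCTTGG"
)
bases = "".join(gene_start.split())
cds_fragment = bases[bases.find("ATG"):]  # start at the first ATG

table12 = translate(cds_fragment, build_codon_table(ctg_as_serine=True))
standard = translate(cds_fragment, build_codon_table(ctg_as_serine=False))

print(table12)   # MDTLKTTYSHRDIEPIYVGGTSATISADGSIL
print(standard)  # MDTLKTTYSHRDIEPIYVGGTSATISADGLIL
```

With CTG read as serine, the output agrees with the first 32 residues of the protein sequence above; with the standard code, residue 30 would be leucine instead.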