Gene PICST_89047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_89047 
Symbol 
ID4838555 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp859594 
End bp862559 
Gene Length2966 bp 
Protein Length837 aa 
Translation table12 
GC content43% 
IMG OID640389870 
Productpredicted protein 
Protein accessionXP_001384466 
Protein GI150865309 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TATCCAAGAG TCTTGATAGG GTGCTGTTAC TTATAAATAT ATCTCGTATC CAAGAGAATT 
CTCTCAACAA AAGCAAAACT GAAACTCTCT TGATCTTCTA CTGTCGTTCA GTAAACTACT
GAAAAATCTT GGATCTCGTC TCAGATCTTC TCCACAACGA TTCGAAGGCA TTCACGAAAC
CCAATACTGA AGATATTCCA CAACTATCAC TGAAATTTTC CAGTTAAGAA TCTAAGCAAC
GGTTTATCTT TCACACTATG GAAACATTGG AGATCCATTC CAAGGACTTC TTGGTCAAAT
GGGTCCATGC GCCCGACAAC TGTGTCATTG ACTGGCAAGT TAAACCGCTC AAAAAGTCCA
TCAACTTCGC CATTTACAAG TTGAATGACG AGACGCCCAG TGAAAACTCA GTAGAGTCTT
TCCAGGCTCC ACCTCCTATA GGCCATAACG ATTCTTCAAC CAGTGTAAAT GGTTCTAGCA
GAATCAGATC AAGTTCGGTC ACTTCGGTCA ACCAGATCAC CAATAACAAC AACCCCTACA
AAACCAAGTC AAGGTCGTCG ACCTTCTCCA CGAATTTGAT CAACTCGGAC TTGACGTTGT
TGAAGAACTA CAACAAGTTG ATTGCGGGGG AGTTGGTCCA CGGCAAGTTT GACGTAGCCA
AAGACGGAAT GTACGCCTTT GTCTTTGACA ACTCCTTCTC TAAAACGACT GGAAAGAAGG
TCTTTTTCAG TAGCAAAATC GTTTCTGACA ATGCTGCTGT TTCCAGAAGA AAATCCGTCG
CAAGACTGTC TAGTTTCCGA GGCAATGGTG TTGATCCAGG TGCCCCTTCT ACTATACCTG
CGCTGACGAT TCAGTTTCAA ACACCATTTG AAGCTGACGA AAATGAAGTT GAGGGCGATG
TTCCTCTTGA TAAGAGAGGC AACATTCTTC GTCCCAAGAA CGGAGAGTTG TTACAAAGTA
TCTTGTTGAA AAAGAGAAGA AAGAAGCTTC AGGGCTTCAC CAAGAGGTAC TTTGTTCTTA
ACTTCAAGTA CGGCACCTTA TCGTACTTCC GAGTCAAGGA CAATAAGTTG AGAGGTCAAA
TGCCAATCAA ACATTCCATC GTCAGTGCCA ATGCGAAATC AAGAGAAATA TTTATAGATT
CAGGTATGGA AGTGTGGAAT TTAAAGGCTT TGAACGAGAA AGAATTCAAT GCTTGGGTTG
ACGCTTTCAA CCAAATCAAG AAATCCAGCG ACGAAACACC AACTGAAGAA GCCTTTTATG
AAGAAGAAGA ACAGGGCATT CTTGCTCTGG AATTGGAATC GATTTCGACA AAGTTGACCC
AGCTCAAAAT GACTACAGGG GATAACGCTC CAGCTGCTAA ATTGGTCGAT AGCATCTCGT
TAGATATCAA CAGCTTGTTA GCCAGAGTGA TACCAGCCAA CAGAAACTCA CTACATGATC
TAACATCAGT TAAATCGTCT TCTGAGTTCT ACGATGCCCA GGAGTACCTC GATGTAATGA
GCTCTGGTGT TGTTCTTTTG GACACGCCAA TACCACCATT AGAGAGCAAG GTTATCGGCC
AGCTAGAAAC GCCATCTGAT GAAAGCATAG ATGAAAACTT AGATGGCTTG TCGTTGTCGT
CGTCTTCTTC AGAAGAAGAT GAAGACATTG AGCCAACCAA ACCTGTTGAA GTAATCCAGA
AAGTCAAGCT GGCAGATGAC AGTGACGATA CTTTATATCC CTTACCTCAT GACCCAATCG
AGAGAGAATC GGATATTCCC GTGTGTAACC ATACCCCTCC TAGTATATTG GCCTTTGTAC
GTAAGAATGT CGGTAAGGAC TTGTCCACTA TTGCCATGCC GGTGACAATG AACGAGCCTA
TTACTTTCTT GCAAAAGTAT GCCGAAATAT TTGAGTATAG CGATTTGATC AACAACGCTT
TGCAGCCCAG TTTTTCCGAC GAGTCCGGTG AAAAGATCTT GAGAATCGCT GCCTTTGCTC
TCAGTTACCT TTCCAGTGCC AGAGTCAAAG AAAGAAACAA CCGTAAACCC TTCAACCCAT
TGTTAGGAGA AACGTTTGAG TTGGTCAGAG AAGATCGTGG AATCCGTGTA GTCAGTGAAA
AGGTTAGCCA CAGGCCACCT GTATTTGCTT TCTTTGCCGA ATCAGAAAAG TGGGACTTGT
CGTTTAATCC AGCTCCTAAC CAGACTTTCT GGGGTAAGAA TGCTGAAATT GTAACGAAGG
GTACTGCCAA GTTAACCATT AAGTCAACCG GTGAGGTGTT CACTTGGTCT CATCCAGCTA
CTTTGTTAAA GAATATCATC GCTGGTGAAA AGTATTCCGA GCCCTCAGCT CCTATGACGA
TCAAGTCATC TTCTGGTTAC AAGGCTGTTG TAGAGTTTGC TAAGGGAGGT TTGTTCAGCG
GCAGATCTGA GGATTTGACC ATCAAGGCAT TCAACCCCAA CAAGAAGCAA TTAGCATATA
CTGTCAGCGG AAAGTGGACC GAGTCCTTGA CGTTGAAAAC TAACACCACT GAAAAGTTGA
TCTGGGAAGT TGGTGACTTG TTGCCTAACT CCAACAAGAA GTTTGGTTTC ACTGCATTTT
CTGGTACTTT GAACAAAATC TATGCCATTG AAGATGGTAA ATTGCCACAC ACAGATTCTA
GGTTGAGACC AGACATACAT ACCTACGAGA AAGGTGACGT CGACAAGGCT GAAGCACAAA
AGGTTGAATT GGAAGAGAAG CAGAGAGAAA GAAGAAAAGA ATTGGAAGAA AGCGGGAAGT
CTCATGTACC CAACTTCTTT ACCCAAGTTA GTGGCGACAC TCCTGACTCG GGTGAATGGG
CTTACATCAG AGGAAAGAAG AGTTATTGGA ATAGAAGAAA GCATGGCGAT TGGGATGACA
TCACCAGACT CTGGTAGTCA TGAATTAGCT TGGAATGTCT AAAAGTTATC ATATTTATGA
AGTTGTACAA TTCTATATAT CATAGC
 
Protein sequence
METLEIHSKD FLVKWVHAPD NCVIDWQVKP LKKSINFAIY KLNDETPSEN SVESFQAPPP 
IGHNDSSTSV NGSSRIRSSS VTSVNQITNN NNPYKTKSRS STFSTNLINS DLTLLKNYNK
LIAGELVHGK FDVAKDGMYA FVFDNSFSKT TGKKVFFSSK IVSDNAAVSR RKSVARSSSN
ILRPKNGELL QSILLKKRRK KLQGFTKRYF VLNFKYGTLS YFRVKDNKLR GQMPIKHSIV
SANAKSREIF IDSGMEVWNL KALNEKEFNA WVDAFNQIKK SSDETPTEEA FYEEEEQGIL
ASELESISTK LTQLKMTTGD NAPAAKLVDS ISLDINSLLA RVIPANRNSL HDLTSVKSSS
EFYDAQEYLD VMSSGVVLLD TPIPPLESKV IGQLETPSDE SIDENLDGLS LSSSSSEEDE
DIEPTKPVEV IQKVKSADDS DDTLYPLPHD PIERESDIPV CNHTPPSILA FVRKNVGKDL
STIAMPVTMN EPITFLQKYA EIFEYSDLIN NALQPSFSDE SGEKILRIAA FALSYLSSAR
VKERNNRKPF NPLLGETFEL VREDRGIRVV SEKVSHRPPV FAFFAESEKW DLSFNPAPNQ
TFWGKNAEIV TKGTAKLTIK STGEVFTWSH PATLLKNIIA GEKYSEPSAP MTIKSSSGYK
AVVEFAKGGL FSGRSEDLTI KAFNPNKKQL AYTVSGKWTE SLTLKTNTTE KLIWEVGDLL
PNSNKKFGFT AFSGTLNKIY AIEDGKLPHT DSRLRPDIHT YEKGDVDKAE AQKVELEEKQ
RERRKELEES GKSHVPNFFT QVSGDTPDSG EWAYIRGKKS YWNRRKHGDW DDITRLW