Gene PICST_31875 details


Gene Information

Locus tag: PICST_31875
Symbol:
ID: 4839580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 110204
End bp: 112492
Gene Length: 2289 bp
Protein Length: 762 aa
Translation table: 12
GC content: 47%
IMG OID: 640390895
Product: predicted protein
Protein accession: XP_001385042
Protein GI: 150865711
COG category:
COG ID:
TIGRFAM ID:
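
The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes 1-based inclusive coordinates and a gene length that counts the stop codon; the record states neither convention. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 110204
END_BP = 112492
GENE_LENGTH_BP = 2289
PROTEIN_LENGTH_AA = 762


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


print(span_length(START_BP, END_BP))    # 2289
print(protein_length(GENE_LENGTH_BP))   # 762
```

Under these assumptions the span 110204 to 112492 covers 2289 bp, which is 763 codons: 762 residues plus one stop codon.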


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGACC AACAGCTTCC AGATCCCGTG GTTCAGCATG CATTTGTCTC AGCTCTTTCC 
AAGCACTTGG AATTGGTAGA AACTTCGTTA AAATCGCAGA GCTTTCCCGC TGGCCAGACG
GAAGAGGAAA CCATCCAGTT CCAGGATTTG GCCAACGACC TCAACCAGGT TAGAACAGCT
AGTCTTTCCA AGATATGCTT GTACAATGAC TACTTATACT GGAAGAAACA TCCCAAATAC
CCCCAGAACG CCACAGCTAC GATGGCTGAC TATGCCCGCC TGTCCCCCTC TGGAATTTCA
GTCGAACTCG CTAAGAACCT CTTGGGCCCA GAACAAGCCG CTTACGATGT ACCATGGAAA
ACTGAAACAA TTGATATAAT TAGCAGTATC GAGGGAATTG GCTCTGAAAC GAGCAGACCC
AATACCACAG CACCGTTTGA AATTGGTACT TCAGCTAACG GTAGCTCTGC TAACACCGAG
TCAGGGATTG CCTCTAATGT CAGCGAATCA CCATTCATAT CCGAGATCTC ACCATCTCGA
ACCTCCCAGC TCAAAAATAA GTTGTATCTC ATCTTGTGTC AAGCTGTATC GACCGTCAAG
TTGGCAGATC CTCATGTTAG CTCTCTTCCC GATTCTCAAC TCCTCTCATA CTGTGCTAGT
AAGTTTCTCA GCGAGGCACC TGTTGTCACA GGAATTCCAT TTTCATCACA AATGGAATCG
AAAATCACAG TAGTGCAGCA GTTTCTTGAA AAAGACAACT CGAAAGGTCT TGCTCCTCGT
ATTATCTATA CTGCCTTCAA GGGATTGCTT CATTACACAA TCAACTACGT CGTTACTGAT
TCGTCAGCTA CAGTATCGAA CACGGAGTTG ACAAACAAGC TTCGTGAACA AGGTAATAAC
TTGATGTCGA ATCTGGCCTT TGCCCAGGCC ATCAAAGTGT ACACCAATGC CTTGGATGTA
GCTCGTCTAT CATCTCACGA ATCAATTCCT CAGTTACTCA CCAATAGAGC CATAGCATAT
ATTGGTTTGT ATTGCTTCAT TGAAGCCATC GAGGACTTGA ACCGAGCCGT TTGTTTCGAC
AGAACGTTCA CCCCAGCATG GACTCAATTG GGATATTGCC ACCTCTATAT GGGTAGTGGA
CTAACAGCGT TGAAATGCTA TCTTCTCGCT TTGAAATGTA CGGCTGGAGA AATATTGCCT
GTGGATTTCC CAGCGAATAA CGCAGAATTG AGAACTGCTT ATCGCGCTTC CAAAATCAAG
GCAGTACTTC CTCAATTTGT CCTGAGGCTC TGTCAATCAA TCGCGTTAAC AGAAAAGAGA
GCTTACCAAC TGTACGAGTC ATCAGCTCAG ATAAGACAAA CTGCTAGCGA AGTACGTAAA
ACTCTCGCTC TCTTGCGCGC AGAATGTTCT GAAGAGGACC GTGATTACTT TGCGTATTTC
CCCAACTTGC GAGACTCCAA TTTGCGGAAC ATGGCTGATA GAGCCAACCG TGCTCGTCCC
AATATCTTGA GTCCAGAAGT GGCCCAGAAC ATGATGTCTA GTACAGGTGT AGAAACTGTT
CAAATACCTA GAGAAACTGT GGAAGGTCGA GCTGTTAGCA ATGATGCTGC TCAACTGGCT
GCCGTAGCGG CCGCAGCAGC CGCGGCAGCA GGAGCTATAC CAACTGTAAC ACCTAGAAGA
CCAATGCCCC AAACATTGAG AACCAATGCT AGTGCAGTTG TAAATAATGC TACAACATCC
GGTGCTACTG GTGCTAGTTC TGGAGCAAGA CCTAGTGCAA ACCCTGCAGT GAACACTGTC
ATAACAACTC CTGATGGTAG AATTTCTACT ACCAACAATT TTGCATCAGC ACAGTTTCCG
CCAGTGCGTG ACTTCTTTAA CAATTTTGGC GCATACATGG AAGACGATAA TGAACACAGT
AGACCAACAC AAGAGCTGCC ACAAACAGGG GACCGAGCAC CGAGTCCTTC TGTTAGAGCT
TACACAATGC ATACATCTGG CACTCCAGAA AACATGTTCA GGGATGTCAT ACAGGGACTT
GGAACCGTAA TTTCGACATT CACACAAGAA CAATCACCGC CTGCTTCTGT ATTACAGCAC
CCGAGTGTTT CAGCAGCGCA GAGAGCAGCA CGAGCAGCAC AGTCGGCCGC ACAAGCTGCA
CATCGTGTTG GACAAGTTTT AACTCGTGAT AGAAGCGGAA ACACCTCTGG AACTCCTGCC
ACGCGAGCCA ATCAACCAAG TCAATCTCCT ACTGACGAAG ACATAGATAT GCCGGATGAT
TTAGACTAG
 
Protein sequence
MSDQQLPDPV VQHAFVSALS KHLELVETSL KSQSFPAGQT EEETIQFQDL ANDLNQVRTA 
SLSKICLYND YLYWKKHPKY PQNATATMAD YARSSPSGIS VELAKNLLGP EQAAYDVPWK
TETIDIISSI EGIGSETSRP NTTAPFEIGT SANGSSANTE SGIASNVSES PFISEISPSR
TSQLKNKLYL ILCQAVSTVK LADPHVSSLP DSQLLSYCAS KFLSEAPVVT GIPFSSQMES
KITVVQQFLE KDNSKGLAPR IIYTAFKGLL HYTINYVVTD SSATVSNTEL TNKLREQGNN
LMSNSAFAQA IKVYTNALDV ARLSSHESIP QLLTNRAIAY IGLYCFIEAI EDLNRAVCFD
RTFTPAWTQL GYCHLYMGSG LTALKCYLLA LKCTAGEILP VDFPANNAEL RTAYRASKIK
AVLPQFVSRL CQSIALTEKR AYQSYESSAQ IRQTASEVRK TLALLRAECS EEDRDYFAYF
PNLRDSNLRN MADRANRARP NILSPEVAQN MMSSTGVETV QIPRETVEGR AVSNDAAQSA
AVAAAAAAAA GAIPTVTPRR PMPQTLRTNA SAVVNNATTS GATGASSGAR PSANPAVNTV
ITTPDGRIST TNNFASAQFP PVRDFFNNFG AYMEDDNEHS RPTQESPQTG DRAPSPSVRA
YTMHTSGTPE NMFRDVIQGL GTVISTFTQE QSPPASVLQH PSVSAAQRAA RAAQSAAQAA
HRVGQVLTRD RSGNTSGTPA TRANQPSQSP TDEDIDMPDD LD
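
The record lists translation table 12, NCBI's alternative yeast nuclear code, in which the codon CTG encodes serine rather than leucine. The sketch below shows how the protein sequence above follows from the gene sequence under that table. It is a minimal illustration, not the pipeline used to produce this record. The names `translate`, `TABLE_1` and `TABLE_12` are hypothetical, and the two DNA fragments are copied from the gene sequence (its first 60 bases, and bases 271 to 285).

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, first base varying slowest.
BASES = "TCAG"
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = ["".join(c) for c in product(BASES, repeat=3)]

TABLE_1 = dict(zip(CODONS, STANDARD_AAS))  # standard code
TABLE_12 = dict(TABLE_1, CTG="S")          # table 12: CTG reads as Ser


def translate(dna, table):
    """Translate in frame 1, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence.
first_60 = "ATGTCTGACC AACAGCTTCC AGATCCCGTG GTTCAGCATG CATTTGTCTC AGCTCTTTCC"
# In-frame fragment containing a CTG codon (bases 271-285).
ctg_fragment = "TATGCCCGCCTGTCC"

print(translate(first_60, TABLE_12))      # MSDQQLPDPVVQHAFVSALS
print(translate(ctg_fragment, TABLE_12))  # YARSS
print(translate(ctg_fragment, TABLE_1))   # YARLS
```

The second fragment matches the listed protein (YARSS) only under table 12. The standard code would give leucine at that position, so translating this gene with a default codon table would produce a different protein.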