Gene PICST_31745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31745 
Symbol 
ID4838387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1580658 
End bp1582640 
Gene Length1983 bp 
Protein Length660 aa 
Translation table12 
GC content38% 
IMG OID640389702 
Productpredicted protein 
Protein accessionXP_001384258 
Protein GI150865156 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.442814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.849842 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACTC ATTGTGAAGT GAAGGATAAG CTAATCTCGG GATTATTTCA AGTATTACCT 
TCTAGCTACA TACAAGGTAT TATTGATACC ATTTCTAGAC CGGTTCTTCT CGCTGTAGTG
TCTAATGGAA CTTCGCTATA CCATAAATAC TTCCTCGATG CACTATTTAG AAATATCCGT
GTTGGATTGT TAGAAAAACC CTGGCATAAG ATAAAAGTTA GAAGATGTAA GAGAATTGAG
TTGTTAAAAT TCGAAATTCT CCTTGAACAT ACAAATGATC ATGGCATATT GGTAAATGGC
TCCCAAACAT TGCACACCTT ACGCGAAGCA TATCCCCTTC TACGCTTTCG AAATGTCGAG
GTAGAAGTCG AAAACGAAAT AGAAGGATCC CAATATAGAT ACAATACAAT AATCAAGCAG
GAAGAAGATT TGAATAACTT GACGCTGGGC GATAATTTGC AGGCTATTCA TTTCGATTGG
CCACCTGATC GAAGGAACTT CCCATTAAAC TTGAAGAAAC TTCTGTTTGT ATGTCTCGAA
GAGCCCCACA ATTTTGATGC TCTGGAATTC ATAAACAAAT TACCGACAAG ACTTGAGGAT
CTAGATGTTG ATTGTAAAGA TATGCATATA GAATTAGAAC ATCTATCATT ATTGCCATCG
ACATTACGAG TACTTCGCTT GCACAATTGC TTGAAAATAC AAGGAGAAGA TAAATTAATA
GTTGATTTTC CACCTCTTCT CGAAATACTT GAAATTCTGG GTTACGTTGG TAGTGGTGAA
TGCCTAGATA TATCTCATTT GCAGAAGTTG ACTCTGGTGA CACTTCTGAA GCACTTAGTA
TTCCATCTTC CAACCCAAGT ACGGAGACTT AAGCTTGTTT CCGTATGTGA CCTCGTAGGG
TTGGATCAAC TTTCTGAGCT TGTTAACTTA GAGAGTCTCA AGATAAGTAC AACATCCAGA
ATACTTGATA GAATAGTAGT TCCACAAAAT GTAAAACTCC TAAGTATAGA CAATTTCGAT
GATTCTGTGT TTTCGTTGGA ATTTAAAGCA AACGAATATC ACCCAACTAT AAGGCTAGAT
GAGCTCAACA GATTTTTGAG AGTGGGTACA CCTTTTTCAG CTTTTGAGGT TCCTTGTGTT
TTAGGTGGTT TGAGGATTTT GAAAATGTTT AGAGGAACAC GCTTATCAAA ATGCTTTTGG
AATGCAATTG AAAATTTGCC AGAGTTAAAG GAATTAACTA TTAACTACTA CGACATTCAT
TCTTGTCCGA ATCAATTTCC TCCTAAATTG AGCGTCTTAG ACCTTTCTCG CAACAATATT
TCTGAAATTT CAATTTCGAG TCCATTGAAG AAGCTTGTGC TTGAAGGTAA CAACTTCAAC
AGTATCTCTA AGAGAACGTT ACAGCTACCA GCGACTTTAT GCGAACTATA CCTTAACAAC
AACTCGATAA ATTGCTTCGA AAAAGCATAT GAATTTCCTC GTAGTCTTCA GATCCTAGAT
TTGCGTGATA ATAAAGACTA CCTAATCGAA GACATTTTCA GGAACTTACC ACCACAAATA
GTTAAGTTAC TGGCATCATG CTTGAGTAAC AAGTTTGGAC GGACTTTAGT TGAAGTAAGC
AGTAAGACGC TTTGGCATGT TTATTTGGAG GGAGGAGTCG AAGAAAGCTC CATGAAATGG
CAGTTCAATT GGAGCGATTG CTCTAATCTA CAATATATTC AAATAAGAGA TGTAGAATTA
GAGAGTATTC GACTAGATTA TTTTCCATCT TCGTTACAAA AGATTAATTT CACAAATACA
GGAATAAGAG AGATACAAGG GGACTTCGGG AGTTTGCCCA ACTTGATTGA TGCTCTGTTC
GAAAACAACC CTCTACAGGA GTGGCTAGAG AAGAACGAAG ATAAGGTGCC ACCGAGCGTG
GCATTTGAGA TAGAGCGACC TGCTCTTACG ACCTATTTGA TAGATTGGCA GTACAATATT
TAA
 
Protein sequence
MDTHCEVKDK LISGLFQVLP SSYIQGIIDT ISRPVLLAVV SNGTSLYHKY FLDALFRNIR 
VGLLEKPWHK IKVRRCKRIE LLKFEILLEH TNDHGILVNG SQTLHTLREA YPLLRFRNVE
VEVENEIEGS QYRYNTIIKQ EEDLNNLTSG DNLQAIHFDW PPDRRNFPLN LKKLSFVCLE
EPHNFDASEF INKLPTRLED LDVDCKDMHI ELEHLSLLPS TLRVLRLHNC LKIQGEDKLI
VDFPPLLEIL EISGYVGSGE CLDISHLQKL TSVTLSKHLV FHLPTQVRRL KLVSVCDLVG
LDQLSELVNL ESLKISTTSR ILDRIVVPQN VKLLSIDNFD DSVFSLEFKA NEYHPTIRLD
ELNRFLRVGT PFSAFEVPCV LGGLRILKMF RGTRLSKCFW NAIENLPELK ELTINYYDIH
SCPNQFPPKL SVLDLSRNNI SEISISSPLK KLVLEGNNFN SISKRTLQLP ATLCELYLNN
NSINCFEKAY EFPRSLQILD LRDNKDYLIE DIFRNLPPQI VKLSASCLSN KFGRTLVEVS
SKTLWHVYLE GGVEESSMKW QFNWSDCSNL QYIQIRDVEL ESIRLDYFPS SLQKINFTNT
GIREIQGDFG SLPNLIDASF ENNPLQEWLE KNEDKVPPSV AFEIERPALT TYLIDWQYNI