Gene PICST_79644 details

Gene Information

Locus tag: PICST_79644
Symbol:
ID: 4840649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 346758
End bp: 349748
Gene Length: 2991 bp
Protein Length: 772 aa
Translation table: 12
GC content: 44%
IMG OID: 640391964
Product: predicted protein
Protein accession: XP_001386269
Protein GI: 150866614
COG category:
COG ID:
TIGRFAM ID:
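
The coordinates and lengths in the record above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that an unspliced coding region ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein of this length, plus one stop codon.

    Assumes an unspliced gene, as the record states ("Is gene spliced: No").
    """
    return 3 * (protein_aa + 1)


span = gene_span_length(346758, 349748)   # values from the record
coding = coding_length(772)               # Protein Length from the record
print(span)            # 2991, matching "Gene Length"
print(coding)          # 2319
print(span - coding)   # 672 bases of the span are not accounted for by the coding region
```

Under these assumptions the listed gene length agrees with the coordinates, and the gene span is longer than the coding region alone would require.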


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.241607
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.527548
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTCATAGTGC ACTCTGTGCT CAACTCTCGC TTCGCGTCCC ATAGCCTCAA AAGACAAGAA 
AAAGTCTCGA CCACAAAATC AAGCCAAACA AATCAAGCCA ACCAAATCTT CAACTTACAA
AACCATCCCA CGCATTGATT AGAGAAGGAG CTATCCGCGT GTCAGTATCT ATCTTAGACC
GTTATCTATC TTAGACCGTT ATCATAGACC TCATCCTGGC CAACTTGCAT CTCGTAGACC
CGCTTGTCCC ACGATTGTAT CTTGCCATAG ATCTCCGTAT CTTCACAGCT GATATTCTCT
TCTAGAAATA ATTCCATTTC ACCATCTGAC CTGTGGTAGC TGCTATAAAA AGCGCCTTAT
TATTGCCTCG CACTCCCCAG CCATTTTTTC CGCGACTGAG ATCATCTGAA CAGATCATCT
ACAGATATCT TTTTTCCACA CATTAGTATA CACTCGTATT TCCATTTTCC TCTACTTATT
CCCACTACTT TTCTTTCTTC CAATCTGCTG AATCTTCTAC TGCTAGTACT CCGTCTACAG
TTCTTCCAAT TTTCACAATG CGGTTGAACC TCATATACAT CTGGATTTTC AGCTTGCTCC
AGGCAGCTTT GGCTCAGTCG GTTCTTAAGA CCTCGTCCTT GTTGACATGT ATGGACAACT
CGAAGTTCAC TGCCTCCTTC TTCGATGTCA GATACTTTCC CCACAACACA AGTGTCTACT
TTCAAGTGAA TGCGATCTCG TCACTCGACA CAAATGTGAC TGCTCAAATC ACGTTGATTG
CCTACGGACT CAATATTCTT TCGCGTAACG TCTCGCTCTG TAATCTTAAC TACCCCGAGA
TCTGCCCCTT GACTTCCGGC CACTTGGATT TGACATCCCT GTACAATGTG TCGAGAAGCA
TCACTGACCA GATCCCTGGA ATTGCCTTCA CAATCCCGGA CCTCGACGCC AGAGTCCGTG
TCACAATCAC AGAAGATGGT AAGTCAGACC AGTTAGCTTG TGTTGAAGCA GTATTGACCA
ACGGGAAAAC TGTCCAGACA AAATACGCTG CCTGGCCCAT TGCTGCCATT GCTGGTCTCG
GTGTTATTAC ATCGGGAGTC GTCTCAGTGA TTGGCCACTC CAACACAGCA GCACACATTG
CCTCGAACTC GATGTCGCTT TTCGTTTATT TCCAGTCTTT GGCCATTACA GCCATGATGG
CCGTGGCTAA AGTGCCTCCT ATTGCCGCAG CCTGGGCCCA GAACTTCCAG TGGTCGCTCG
GTATCGTCAG GGTCGGATTT GTCCAAAACA TTGCAAATTG GTACCTCCAG GCTACTGGGG
GTACCCCTAC GGATATCTTG GGATCGCAGT ATCTCTCTAT TTCCGTCCAA AAGAAGCTAA
AGAAAAGAGC CTACGAATTG TTCGAGTCCT TCTACAAGCC TCAAGAAAGT GTGGTTTCAG
GCTTATCCAA GCGTGCCTCG ATCACGCTTG ATTCCGACGA CTTTGGCTAC AGCGACTCTC
TCAACTCCAC GTTGTACTCT TTAAACGAAA AGGACAAAGA CTTGTCTTCC AAGATCTTAG
TTCTTCGTGG TATCCAGAGA GTTGCTTTCT TGACCAGAAT AGAAATCACT GACCTCTTCA
TGACCGGTAT CATTTTCCTC TTGTTCTTTG CTTTTGTCAT GGTGGTGTGT CTAATGTTGT
TCAAGGCAAT CATCGAAATC TTGATCAGAG CCAAGTTGAT GAATGAAGGA AAGTTCAACG
AATACAGACT GCAGTGGTCT CTTGTCATCA AAGGTACCCT CTACAGATTG TTTGTATTGG
CTCTTCCGCA AATCGCCGTG TTATGCCTCT GGGAATTGAC TACGAGAGAC TCAGTGGGTA
CTACCGTGAT TGCTGTCTTC TTGTTTGTCT TATCCGTAGT TTTATTGTTT CAAGCAGCCA
TCAGAGTATT CATGTTTGGT AGAAAGTCTG TGCTGCAATA CAAGAACCCA GCCTATTTGT
TGTATGGCGA CGGTGCCTTT TTGAACAAAT TCGGTTTCCT CTACGTTCAA TTCAGAGCTG
ATTGCTACTA CTTTATTCTC GTCAGTTTGG TATACATGCT TGCTAAGTCA TTGTTTGTGG
CAGTCTTACA AACCCACGGA AAAGTACAAT CTGTCATTGT CTTCGTCATT GAATTGGCCT
ACTGTGTACT TGTTAGTTGG ATTAGACCAT TTATGGACAA GAGAACTAAT GCGTTCAATA
TCACCATTGC TGTCATCAGC ACTCTCAATG CCCTTTTCTT CATGTTCTTC TCCTTCGTCT
TTAGGCAACC GCATGTTGTG GCTTCGGTCA TGGGTGTCGT TTACTTTGTC ATTAATGCCG
TATTTGCCTT GTTCTGTTTA ATCTTCACCG TTGTCACCTG TGTTTTGGCC TTACTTTATA
AGAACCCTGA TGCGAGATAC CAACCAATGA AGGATGACAG AGTCTCGTTC CTTCCTAGAT
TTGACAACCC GAAGCAAGCA CAAAACGGTG AAGAAGATTT GGAGTTGATG GCCTTGGGTG
CTACTGCCAG AAAGGGTCAC GAACACGGAG GCAAACCTGC CAACTTGTAC GATGAAGATG
AATCGATGTA TGAAGAAGAT TCCATGTTTC CTAACAAGGA TTCCAGAAAC GAGCTGAACT
CCAACTCCAA CTTCAATTTC TCCCACGATG CCAATGACTC TAAGCATGAT TCTTACTTAG
AGACAATGGA ACCTACCCAA CCCGGTTCCA CGATTGTTGG TAATCCTGGT GCCATAACAG
GGTATCATAA TAGTGCATAT GTTGGTGGCT CAAGCAGAGG TCCACCTGTC AATCCATATT
CACAATCGAC ATCTTACAAC ACAAGTCAAA GTGGCAGTCG TGTCAACTTC ATATGATAGA
AGTAAAGATC AGAGGACTAC TTGTATCAAT TTATCTTTTC AATTCTTATC TTATTCTTAT
TTATGTATTT TAGCTTAACG ATATATTACC AATTCAATCA GTTTCGAATC G
 
Protein sequence
MRLNLIYIWI FSLLQAALAQ SVLKTSSLLT CMDNSKFTAS FFDVRYFPHN TSVYFQVNAI 
SSLDTNVTAQ ITLIAYGLNI LSRNVSLCNL NYPEICPLTS GHLDLTSSYN VSRSITDQIP
GIAFTIPDLD ARVRVTITED GKSDQLACVE AVLTNGKTVQ TKYAAWPIAA IAGLGVITSG
VVSVIGHSNT AAHIASNSMS LFVYFQSLAI TAMMAVAKVP PIAAAWAQNF QWSLGIVRVG
FVQNIANWYL QATGGTPTDI LGSQYLSISV QKKLKKRAYE LFESFYKPQE SVVSGLSKRA
SITLDSDDFG YSDSLNSTLY SLNEKDKDLS SKILVLRGIQ RVAFLTRIEI TDLFMTGIIF
LLFFAFVMVV CLMLFKAIIE ILIRAKLMNE GKFNEYRSQW SLVIKGTLYR LFVLALPQIA
VLCLWELTTR DSVGTTVIAV FLFVLSVVLL FQAAIRVFMF GRKSVSQYKN PAYLLYGDGA
FLNKFGFLYV QFRADCYYFI LVSLVYMLAK SLFVAVLQTH GKVQSVIVFV IELAYCVLVS
WIRPFMDKRT NAFNITIAVI STLNALFFMF FSFVFRQPHV VASVMGVVYF VINAVFALFC
LIFTVVTCVL ALLYKNPDAR YQPMKDDRVS FLPRFDNPKQ AQNGEEDLEL MALGATARKG
HEHGGKPANL YDEDESMYEE DSMFPNKDSR NESNSNSNFN FSHDANDSKH DSYLETMEPT
QPGSTIVGNP GAITGYHNSA YVGGSSRGPP VNPYSQSTSY NTSQSGSRVN FI
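
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates how that affects translation of this gene, using two short fragments copied from the gene sequence above. It is a minimal illustration and not the database's own pipeline; the names `translate`, `build_codon_map`, `STANDARD` and `TABLE_12` are hypothetical helpers introduced here.

```python
BASES = "TCAG"

# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# The two tables differ only at CTG: L in the standard code, S in table 12.
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_map(amino_acids: str) -> dict:
    """Pair each codon (in NCBI order) with its amino acid letter."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


def translate(dna: str, amino_acids: str = TABLE_12) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace is ignored so blocks can be pasted straight from the record.
    Codons containing unexpected characters are reported as 'X'.
    """
    codon_map = build_codon_map(amino_acids)
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = codon_map.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the coding region, starting at the ATG.
print(translate("ATGCGGTTGA ACCTCATATA CATCTGGATT"))   # MRLNLIYIWI

# An in-frame fragment containing a CTG codon.
print(translate("TTGACATCCCTGTACAAT"))                 # LTSSYN (table 12)
print(translate("TTGACATCCCTGTACAAT", STANDARD))       # LTSLYN (standard code)
```

The table 12 result agrees with the protein sequence listed above (`...GHLDLTSSYN...`), whereas the standard code would place a leucine at that position.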