Gene PICST_41332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_41332 
Symbol 
ID4836749 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1092694 
End bp1095879 
Gene Length3186 bp 
Protein Length951 aa 
Translation table12 
GC content43% 
IMG OID640388064 
Productpredicted protein 
Protein accessionXP_001382988 
Protein GI150864247 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACTC ATCTTCTCGC AGTTCCCTCA AAGCGATCCG AGGAAGTCAA TTGGTTGAAA 
CCCTTGAGCA ACTACTTGCT TTCTACTTAT GGTTCCACCA GTGAATACAC CGAAGACCTC
ACAGCCTTCA ACAAGTTGAG ACAGGACATT CGTGGAGTGA ATGCTGATAA CACGGGAATC
AACCTATACT ACAAGTACTA CAGCCAGCTT GAGCTACTCG ACTTGAGGGT GCCCTTCAAT
GTAGTTAACG CAAGTAAGAA GATCAACTTC ACGTGGCACG ATGCCTTCCA ACCCTCATTG
GTTAATAAGC AAGGAGCCTT GCCGTTTGAG AAGGCCAACG TTCTCTTCAA CCTAGGAGCT
CTTTTGTCAG AGTATGCCAA AGTAAGATAT GAAGAGTCTC AACGCAGTGT TGCTGGAACG
GAGGAGGCTT CTACGAAGGA GGCTATCCAG CTCTTTCAAC AGGCTGCTGG AATCTACCAG
TTTTTGAACG AAAACTTTCT CCATGCTCCC TCCCTAGACT TGAACCAGTC TACAGTTAAG
TTTTTGGTCA AATTGATGTT GGCTCAGGCG CAGGAAGTTT TTGTTTTGAC GGTTATAACC
GGCGATCTTG AAGGAAAGAA GAACTCTTTG GTATCAAAAT TGTGTCGTAG TGCCTCCGTT
CATTACGAGG AATGCCACAA TATGACTTCG TACATATCCA GTTTGGGAAG CAACTTCGAC
GATTTCGCAG TGGTGGACTC GGAAGACTTA GAAGAAGATT TTTTGGATAA ACCGGATGAC
TCTGACGAGA CTACAGAAGC CAGCACTTCC CATGTTCCTG CCAAGCTTGA TGCTTCATGG
ATTGCTACCG TCACCTTGAA AATGCACTAT TACAAGTCGT TGTCGTACTA CTACAATGCC
ATGAACTTGG AGGCCGGAAA AAAATATGGT GATGCATTGG CTTACTATAC GAAATCGCAG
GATATTCTTC ACGAGATCAA CAGCACCTTG TTGAGAAATA TCTCCAAAGC TGGCTCTAAT
GAAGCATACG AGATTTTAGA CAACTACAAG TACCAAAAAG ATGCTGTAGG AATCAAATTA
ACTGATTTGA CAAAGGATAA TGACTTAATC TACCATGAAA TAATACCATC TTTGGTGACC
TTGCCAGACA TCAAGCCATT GGATAGTACA AAGGTCATCC CTATAACTCA GAATACGACG
TTCCAGGAGA TAAATGACCA CAACTATAAC AATTTCATGA GCAATGTTGT TCCCGTGAAT
ATCCACGAAT TGTCCAGTTT TTACTCTGAA GAAAAGTCAC AGTTCCTTAG GAACGAGTTG
GATGCTGTTG ATGTTTCGAA TGAGGAGATT TCGTCTGTTT TGGAATACTT GAAATTGCCT
AAGGCCTTGG TAACTATCAA GGAATTGATA AATAGCACAG AAAACTCTGA CACAGACTCA
AGTGGTAGTT CTATCGACCC CAAAATTGAA GCCATCGCCA ATGAAATCTC GTCGGAGTAT
GCCAATGATC AATTGAATAG GCAAAAAATT TCCCAACTTA GGAAGGAAAT CTATGAAAAT
ATTTCACAGA GCGAAGAAAT AGCTTCCAAG CAGGTTTCAG AGTCGTTGAC TAGTTTTAAG
ATGGATCTTG TGAAGATCAA GAAATCGCTA TATGATGCCA CTAATTCGGA TAACCAACTT
TTCGGCTTAA TCAATGACGA CTCTCAAAGT TTGTATGCTC TTTTGGGAAA GGGTTCGAAT
TCCGAAGAGC TCAAGAATCT CTTCAAGACT TCTTCTGATA AGTCACAAGC TTCGAAGCCG
GACATCAGTT TGCTAGACAT GATAGATACA GAAGTCAAGT CACCTAAAGA CCAGATTCTC
TCGCAAATCA AAGTCTTGGA AGATATATTG CACGACTTAA ATGTGATCAA AGCCAACAAG
ACCAAGTTAG TTGAGACGTT GAAGAAGGAA ATTCATAATG ACGATATTTC AGACATTTTG
ATTTTGAATA GCAAGATGAA GTCTACAAAT GAAATCAAAA CACTTATATT CCCAGAAGAG
TTGAAGAAAT TCCAGCCATA CAACGAAGAG TTAGATAAGT TGATCCAGAA GGAAAAGTCC
TTTGTCAATG ATTTGAGGAC CGAATGGGGC AAACTTTCTT CTGATCCTGA GATCAAAAAT
ATCCAATCAT CGAAGGCATC CAAAGATCAA TTGGTAGCTA GTCAAAGTGC AAGAATCACC
TCTTTCTACA ACGACTCCTG GAAAAAGTAT TCTTTGGGTT TGAAGAGAGG TTCTCAATTC
TATGCTGGTT TACTAGATTC GGCCATTAAT TTGAAAGGAA ATATCCAAAA CGAAGCTGAC
CGAGCTGCTA TTAAACCAAG GTCATCGTTG ACTAGTAGTT TCGATGGTTT AAGTTTGAAC
CAACAGCCAC CTCTACCACC CCAACAACAT TATCAGCAAC AACCTCAGCA GCAAACCCCG
TCAGCAGCTC CGGGTCAGTA TGAGTATTTT GATCGTTACT CAGCACCTCA GCGACAGAAT
ACACAGCCTT TGGGTTCACC TTCTGTCAGC CAGTATTCGA TGCCATCGCA AACCGCACTT
AGCCGTCAGA ATTCTCAACC TATGGTTACA CCTCCATTTG CTAACCAATA TGGCCAACCA
TCGTCTCAGA ATCAGAACCA ACATCAACAG CCTCAACATC CTCAACAAAG ACCACAGTAT
GGTCAACCTA CAAACTCGTA TAATCAACAA AACCAATATT ACAACACTCC TCCAAACGTC
TCTCTGCCGG CTCCAGGCTC ATCCTACGAT ACAGCAGCAC AAGCTCCGCA GCGTAGTTCA
ACTGGAGGAA GTTTTGCTGG TTATTCTGAA GCTTCTACTG GTGGATACCA TAGAGCTCCT
CCCGTTCCAC CAAAGAATAT CGACCAGCAA CCACTGGGTG GACGTTCTGC GCCTCCTTTG
CCTCCACAGA TTCCACACTA TGGCCAGCCA TCACAATTCC ATTCATATGG TCAGCCGCAG
CCTGGCAGCG ACAATGGCCA GGGAAGACCT CCACAGGCTA ACCAAGCTTA TCAACAAGCT
TACCAACAGC AGCCTTCTCA GGGCAACGGA GGAAACGACC CCAACAACCC TAACGGTAGC
AACTTGATCT ACGACCAGCC TTCGAAGTAC CTGCCAAATA TGTACAATTT CTTTTCCAAC
AATTAG
 
Protein sequence
MKTHLLAVPS KRSEEVNWLK PLSNYLLSTY GSTSEYTEDL TAFNKLRQDI RGVNADNTGI 
NLYYKYYSQL ELLDLRVPFN VVNASKKINF TWHDAFQPSL VNKQGALPFE KANVLFNLGA
LLSEYAKVRY EESQRSVAGT EEASTKEAIQ LFQQAAGIYQ FLNENFLHAP SLDLNQSTVK
FLVKLMLAQA QEVFVLTVIT GDLEGKKNSL VSKLCRSASV HYEECHNMTS YISSLGSNFD
DFAVVDSEDL EEDFLDKPDD SDETTEASTS HVPAKLDASW IATVTLKMHY YKSLSYYYNA
MNLEAGKKYG DALAYYTKSQ DILHEINSTL LRNISKAGSN EAYEILDNYK YQKDAVGIKL
TDLTKDNDLI YHEIIPSLVT LPDIKPLDST KVIPITQNTT FQEINDHNYN NFMSNVVPVN
IHELSSFYSE EKSQFLRNEL DAVDVSNEEI SSVLEYLKLP KALVTIKELI NSTENSDTDS
SGSSIDPKIE AIANEISSEY ANDQLNRQKI SQLRKEIYEN ISQSEEIASK QVSESLTSFK
MDLVKIKKSL YDATNSDNQL FGLINDDSQS LYALLGKGSN SEELKNLFKT SSDKSQASKP
DISLLDMIDT EVKSPKDQIL SQIKVLEDIL HDLNVIKANK TKLVETLKKE IHNDDISDIL
ILNSKMKSTN EIKTLIFPEE LKKFQPYNEE LDKLIQKEKS FVNDLRTEWG KLSSDPEIKN
IQSSKASKDQ LVASQSARIT SFYNDSWKKY SLGLKRGSQF YAGLLDSAIN LKGNIQNEAD
RAAIKPRSSL TSSFDGLSLN QQPPLPPQQH YQQQPQQQTP SAAPGQYEYF DRYSAPQRQN
TQPLAAQAPQ RSSTGGSFAG YSEASTGGYH RAPPVPPKNI DQQPSGGRSA PPLPPQIPHY
GQPSQFHSYA YQQQPSQGNG GNDPNNPNGS NLIYDQPSKY SPNMYNFFSN N