Gene PICST_31755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31755 
Symbol 
ID4838484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1610273 
End bp1613533 
Gene Length3261 bp 
Protein Length1086 aa 
Translation table12 
GC content39% 
IMG OID640389799 
Productpredicted protein 
Protein accessionXP_001384263 
Protein GI150865161 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.115021 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAAAG TCAAGACGGA AAACGAAAAA GAAATCAAGG ACAAATCAGC GTCCAACCTC 
AAGTTGATCT CAACCAACAC GGGGACGAGA GTGTCACAGG CTTGTGACCG GTGTCGTATC
AAGAAGATCA AATGTGACGG TCTCTATCCG TGTCACAACT GCAACAAGAT TGGGTTTGAG
TGTAGGACAA GCGACAAACT CACCAGAAGA GCGTTTCCCA AAGGGTATAC AGAAAACTTG
GAGAAGAAAC TTAAAGAGTT GGAGCAAGAG ATGATCGATT TGAAGGCCAA GTATGGCATA
GTAGATGACG CTGGTAACTC TGAAGTTGGA GCTGGAGGGG CTGTTAACAG TCTCAGTGTG
AACGACACCT CTGTAGCGAA TACACCTAAT AACACATTTA CTCCTTCAAC TTCTAACGTT
CATAATCCTA ACTCTTCTGA TGACAAGGTG CTATTGACCA CCACCAACCA GACGGTGCAA
ATCAACAACC CTATTGACCA GATCTTCAAC CTAGACAACA AGGGTATAAT TATCGGAAAC
GATAATCTCA ACTTCGAGTC GCAGTTCAAC CATTTGCTCA TCAACTTAAA TCTCCCATTT
CTCAAGATCA CCAATTCACA TAACTACTTG CTAAACGATC CAAACAGCTA CTTGTACCAT
CCTTCGTACC ATAACTACAA CCAGTTCCAT AACAAAGATT TGGATGTGAT CTACAACCCA
TTAACGGGAA ACAGCAACGT CAACGAGTTC AGCTTGGTCA ACAACCAGTT GCCTACAGAT
ATCTACGACT TGTTCATCAA GCTAATCAAT AACTTCAAAA AGTTGTTCAA CAACAAAAAG
GAGTTGGACA ATCAGATCAT TCAGTTCTTT CTCAACTACA ACATCTTCTT ACCCATCTTT
GACTACAAAC AGTTCATGGA ATCGTACGAT GCTTTTCACA CGATGTATCC CTTCATCTTC
ACCTATGATG ATTCGACTAT CAACGGGTTC AATCTCTCCA ACAGCAATGA CTACCACATT
GTGAACCAGT ACCTCATGAT CATCATCCAG ATATATGCCA TGATCTACAT GAACAACCCT
ACCATCAATT TGAACTTGTT GTTGAACCAT TCAGACCCAA ATTATACCTT CCACAAAAAG
TCTCCTAAAG ATAACTCGCC CAATATTATC AAGTCCTTGT ACGATTTCCT CCCCTATTTC
AATGTATTCC ATGTATCTGT TAACCAACTA CAAACCTATT TATTATTCTT GTATTACTCC
CTCTTGACCA ACAACAAGGA GAAGTCCTTG ATTTTGTCGT CCCTAATTAA TTCATTTATT
GGGATCCTTG GTATCAACCT TAATTCCAAA AACTTGTTCT TCGACGACTT GTCGCTTAAC
ACACTTCAGA GAAGGAACAG AGTGAAAACA TTCTGGGTGT TCAAGGTCTT GTTAAAGTGC
TTCAATTTAA AGTTTGGCTT CAAGCCTAGC TTGAATACCA CTGTTATTAA TCCTGTTACT
ATCGACAAGT ATTTTCAATT GACACCAGAA AAATTGAGTA CCTTGCTCGA CAACAGTGAT
GACTTGTTCA ACACTTTGTT GAAGCCAAGT ATCGAGTTCT TGAACTTGAT GAATATTATC
ATTCCCTCGT CGTTTGCGCC TAACTATTAC GAATATCTTA AAAATAAGAA GAAGAAAAAA
GATCCAAACC ACCAGCATCA TACCAAGCAT CTTGATTGGA TCCTTAAAGA AGATGATGGT
GAAGGCAATG ATGGTAACTT GAACTATAAC TATGCTCAAT TCTTAACTAT TGACAAGAAT
TTGTCCAACT GGAGAAATTC GTTGAAAACC AAGACCATCA ACTTATTGCC CTTGGCAGAT
GAAATGGGCT TACCTAACAT TTCCAACATT TCTTCCAATG ATCTCTTTCA TGAATTAAGA
ACCGGGAAGT TATCATCAGG TATCACCCAT GAAGGTTTAC TCAATTACTA TTCTACTGGG
ATCCCGGATA TCTATACTGC TTCTCAGTTG ATAAAAATTC AGTTGAACTT CCACTACATA
TTGATTAGAT CCATGAATTA TCTTAACTTC ATTGTTGACA AGGAATTAAC TTATCCATAT
TATATCGAGA TTTCTCATAT ATCTCGGGAG GTATTGCATT ACTTCTTGTT TATATTCGAT
CACATAAACA AGTCCAATGA GAAAACGATT GATCCGACAC CTTCTTCGTC CAATATTGTC
ATGGAAAGTT TAGGTCTCGA TGTAGATGAA GATGGCTTTG TCATTAATGA TTTTTCTGTG
AAAAGAAGAA AGACTAATGG TTCCGATAGA CACAATATCG CTAAGAGAAT TACCAAAGAA
ATTCCACCTT CGCCATTCAA CTATATGTTG AATGGTTTAT CTATGACCAT TATCAACTTT
AAGAAATCTA TGATCTTACA ATTGTTGTTT CTTTTAATTT GTCAATTGAA GTTCTTCAAA
CGTAATGATT TTGATTTGAT CAGCGACTCC ATTCCTTTGT TAAACGAAAG TGTTGAATTG
TTCATCAAGA TTTTCATCAA CTACAAGGTC GGAGTCAATA GAAAGGACCC CAAGAAGGTG
AAGGAAGATA AACTCTTCGA AAAACTTATG AATGATCAGT TGCGTGATGA AATCTTGAAG
GACCAAGCCG ATAGCGATTT TGACGATGAT GACGAAGCAA GCAATGCATA TCAAAGCATT
GACTGGGATG ATGAAGAGAT GGACGAAGAT TTGAAATACT TGAAGATTCT CAAATTCGTA
AAATACAAAA GTAACGATAT CTTGAACAAA CTTATTGGGA AGAACTCTTC AAGTGTTCCG
GAACCAAGTC AAGTCACTCA CCATCATCAT GTCGAGCAGC CATCTCAGCT TCATGTCCAT
CCTCACATCG ACAGCAATCG TTCATCTTTG TCAAAGCCAA CAGGGCCTCC AATATTGCCT
CCGCCTGTTA TCAACAACTT CTATGCCAAT TCTCCTAGTG CTGGTTCGCC AGGTTTGGGC
TCATTCAACA AGATCCCATC TATGTCTAAG TTCGATTTTC TCCTTTCAGA CAATGACTAC
AGCAACCAAG CGAAATCGCA ATCGCATACT CCTCAAGATT ATGAGCTAGC CAAGAAAGAG
AAACTAGTCA TAAATGACTT AATGAACTTG CGGCACGGAT CTCAGGGTAG CAATAAGTAT
CCTGGACCAT CTCTGAATCC CAATTGGATT CATTCGTCGT CCAATTCGAA CACAAACGGA
TACAATGGTT CAGGGCACTA A
 
Protein sequence
MVKVKTENEK EIKDKSASNL KLISTNTGTR VSQACDRCRI KKIKCDGLYP CHNCNKIGFE 
CRTSDKLTRR AFPKGYTENL EKKLKELEQE MIDLKAKYGI VDDAGNSEVG AGGAVNSLSV
NDTSVANTPN NTFTPSTSNV HNPNSSDDKV LLTTTNQTVQ INNPIDQIFN LDNKGIIIGN
DNLNFESQFN HLLINLNLPF LKITNSHNYL LNDPNSYLYH PSYHNYNQFH NKDLDVIYNP
LTGNSNVNEF SLVNNQLPTD IYDLFIKLIN NFKKLFNNKK ELDNQIIQFF LNYNIFLPIF
DYKQFMESYD AFHTMYPFIF TYDDSTINGF NLSNSNDYHI VNQYLMIIIQ IYAMIYMNNP
TINLNLLLNH SDPNYTFHKK SPKDNSPNII KSLYDFLPYF NVFHVSVNQL QTYLLFLYYS
LLTNNKEKSL ILSSLINSFI GILGINLNSK NLFFDDLSLN TLQRRNRVKT FWVFKVLLKC
FNLKFGFKPS LNTTVINPVT IDKYFQLTPE KLSTLLDNSD DLFNTLLKPS IEFLNLMNII
IPSSFAPNYY EYLKNKKKKK DPNHQHHTKH LDWILKEDDG EGNDGNLNYN YAQFLTIDKN
LSNWRNSLKT KTINLLPLAD EMGLPNISNI SSNDLFHELR TGKLSSGITH EGLLNYYSTG
IPDIYTASQL IKIQLNFHYI LIRSMNYLNF IVDKELTYPY YIEISHISRE VLHYFLFIFD
HINKSNEKTI DPTPSSSNIV MESLGLDVDE DGFVINDFSV KRRKTNGSDR HNIAKRITKE
IPPSPFNYML NGLSMTIINF KKSMILQLLF LLICQLKFFK RNDFDLISDS IPLLNESVEL
FIKIFINYKV GVNRKDPKKV KEDKLFEKLM NDQLRDEILK DQADSDFDDD DEASNAYQSI
DWDDEEMDED LKYLKILKFV KYKSNDILNK LIGKNSSSVP EPSQVTHHHH VEQPSQLHVH
PHIDSNRSSL SKPTGPPILP PPVINNFYAN SPSAGSPGLG SFNKIPSMSK FDFLLSDNDY
SNQAKSQSHT PQDYELAKKE KLVINDLMNL RHGSQGSNKY PGPSSNPNWI HSSSNSNTNG
YNGSGH