Gene PICST_30785 details

Gene Information

Locus tag: PICST_30785
Symbol:
ID: 4837833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 934607
End bp: 937606
Gene Length: 3000 bp
Protein Length: 999 aa
Translation table: 12
GC content: 43%
IMG OID: 640389148
Product: predicted protein
Protein accession: XP_001383808
Protein GI: 150864827
COG category: [R] General function prediction only
COG ID: [COG1524] Uncharacterized proteins of the AP superfamily
TIGRFAM ID:
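
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are inclusive and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 934607
end_bp = 937606

# Assumption: both coordinates are inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 3000 999
```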


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.778067
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTCAG TTGATATCGA CACTTCAGCG ACCAACCGGC GAGCAATCAA ACAACAGCAA 
TTCCAGCAAC AGCTGCGATC AAAGATCCGA CAGCAGAAGT TAAAAGTCAC CTTTATAGGC
TATCTTGTGG TGCTAGTATT TCTAGCTCTA GTTCAGTTCA TAGGAGTAGG GTTTTTCACA
AAGGGCTTCT TACTTCTGCG TAACGTCTTA CCCAATGTAT CCGAGTGTAC CACTAATGAC
TTCAATACGT GCATGGCGCC AGCCCGGTTC GATAAGGCGA TCTTGTTGGT GATAGATGCG
TTAAGGTTTG ATTTCGCTAT TCCTATAGCC GACTCAAATG AATACTATCA CAACAACTTC
CCTATACTAC ATCAATTGGC TCAGGATGAT CATGGTGTAT TGCTCAAGTT CATTGCCGAC
CCTCCCACAA CGACCTTGCA ACGCTTGAAG GGATTAACCA CAGGTTCGTT GCCAACTTTC
ATCGACGCCG GTTCCAACTT TGACGGGGAT GCCATCGATG AAGATAACTG GTTGCTCCAG
CTCCACAAGA ACAACAAAAG CATTGCATTT ATGGGTGATG ACACCTGGTA TGCCTTGTTC
AACCACTACA TCAATCCTGC GTTGAACTTT CCTTACGACT CCTTAAATGT CTGGGACTTA
CACACTGTTG ATAACGGAGT CATAGAGCAC TTGTACCCAT TGCTTCACAA GGATAATTCG
AGCCAGTGGG ACCTTCTAGT GGGCCATTTT CTCGGAGTAG ATCATGTAGG GCACAGGTAT
GGGCCCCGTC ACTTTCTGAT GAAGGAGAAG TTAAACCAGA TGAACGAGGT TATAGCCAAT
GTAGTCAAAA GTTTAGACGA CAAGACATTA TTGGTAGTGA TAGGTGATCA TGGAATGGAT
TCCACCGGTA ACCACGGAGG CGATTCGCCG GATGAGTTGG AGAGCACCTT GTTCATGTAT
GCCAAAAACA ATAAGTTTTT TAAAAAGGAT TCAAGTCATT ACAACACTAC AGAGCAAGGT
AAGCATTATA GAGCTGTCAA CCAGATTGAC TTGGTCTCCA CCATGTCGTT ATTGCTTGGG
CTACCAATAC CGTTCAACAA CCTTGGTTTC CCCATCGATG AGGCATTTGA AAACCAAATG
GAATTGTCCG TAGCGTCGTA TAAAACTCTA CAACAGATAC AGGGATTCAG AAACAGTACG
CCAAATTTGT CGCCCGAAAT CAACAAACAA TACCACCAGA TCATCAGTAA TTATACTAAC
AATTCTCATG ACTTGTACAC CTTGGTAGAT CTGGCCAAGA CATACCAGTC CCGTTCTTTA
GAAGAATGCA AAGGATTATG GGCGACGTTT GATTTGAAGA TGATTGGCGT TGGAATAACA
ATCCTTTTGC TAGCATTGAC TTTTATCCTA ACCTATGCTA GATCGATTCC TGCGGTCAGA
GTTCTGACGA TGTCTTTCGA GTTCATTGGA TCAGTGATAG CCATGTCATT GCTTGGTTTA
GTGCTAAGTC TTTCAGTGAG TTTGGTATTA AAGCCTGCTG ATTTCAACTT GAAAAAATGC
TTAGCCCTCG GTGCTTCTTT GGGAATAATT GTCGGATTCT GGGCCCCCAT AATGGATAGA
TTCAGCATCA ACTGGCTAGT GCACCAGCTC ATCGATTTCT TCGTGTACAA TTTCAACAGT
TGGTCCTTTT TAGGGTTGGT CTATGTCGTG GCACATTGCT TGATTTTTGC ATCCAACTCA
TACGTGGTAT GGGAAGATAA AATGGTTCTG TTTTTCTTGA TGACTTTTGG TGTTGCTTGT
ATTTTTAACA TTGCTATCAA TTTCGAGCTT CCGCGTTCAC AAAAGATTTT AGGACTTCTG
CATGCCATCA CATTTACCCT GTTAACAAGG TTGGTATCCA CTATTAATCT TTGTCGTGAA
GAACAAAGAC CTTACTGCCA GGCTACCTTC ACTACGTCTT GGTGGTCCAT TGTGTTATTG
CACTTGTGCT CTTACCTTCT TCCAACAATC ATCAAGTCAT TCTATAAGTT GTCTGATTCA
TACCATTCAG CTGCTCCTTT GTGGGTTGGT ACTGGCCTCA AGTTTCTAAT GTTCATGAAT
GCTGTTTATT GGACCTTAGA ATATGTTCTG AACAGCGAGT ATTTCTTATC GACGAGTTTT
GTCTTGAGCT CGCCTTTGAT CAAATCCTTG AAGTTGGCAA TTGCAAGAAT CGTCTTGTTC
ATTACCTTGG TGCTTGCAAA TTTTAGTTGG TCCAAGGGTC CTTTGTGTGT CAAGTTAGAG
CTCTCGGATG CCGTACAAGA AGACTCAGCA GAATCTGAAG ATTCAGACGG GCCACTGAAG
ACAGCCACAA TTTTAGGATA TGGCAACGTT TACGGGTCAT CATATTTTTT GTTGGTTCTC
AACTTTACAG TAGCCATCAT GTTGGTATCC AAACCATTGG GAGCCATTTC CATCAATATG
TTGATCGTAC AAATCTTGTC GTTGTTGGAG CTATACCACA TAATGGACAT ACGTAGAAAC
TTGATTTCGC CATTGATCTT TGGATTGTTG GGTTACCAGC ACTTCTTCAG TACGGGGCAT
CAAGCTACTC TTGCTGCTAT TCAATGGGAT GTGGGCTTCA TGACCACAGA AACCATAACC
TTCCCATTCA CCCACTTGAA TATTGTGTTG AATACATTTG GTCCTTTCTT GATTATTTGC
TTGTCGGTGC CTTTGATCAC GTTGTGGAGA TTGGCTCCTT CAAGCAAGCC TATTACCATC
TTGTCGCAAA TCGTAACCAA TGTAACCACT CTTATTACAT ACCAGTTGTT CACTGGGGTG
TCCAGCTTAA TATTTGCAGC TCATTTTAGA AGACACTTAA TGGTGTGGAA AATCTTTGCA
CCCAGATTCA TGTTGAGCGG ATTGTTGATC ATAACCATAA ACATTTTCGT TATCGTCGTG
ACGTTGTGGT TTGGAACAGG CAGGGTTGTA ACCCAAGTGA ACAGAATCTT TGGGAAGTAG
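
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of how the blocks might be joined and a GC percentage computed; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGGAGTCAG TTGATATCGA CACTTCAGCG ACCAACCGGC GAGCAATCAA ACAACAGCAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applying the same two functions to all lines of the gene sequence would give a whole-gene value to compare with the GC content field in the record.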
 
Protein sequence
MESVDIDTSA TNRRAIKQQQ FQQQSRSKIR QQKLKVTFIG YLVVLVFLAL VQFIGVGFFT 
KGFLLSRNVL PNVSECTTND FNTCMAPARF DKAILLVIDA LRFDFAIPIA DSNEYYHNNF
PILHQLAQDD HGVLLKFIAD PPTTTLQRLK GLTTGSLPTF IDAGSNFDGD AIDEDNWLLQ
LHKNNKSIAF MGDDTWYALF NHYINPALNF PYDSLNVWDL HTVDNGVIEH LYPLLHKDNS
SQWDLLVGHF LGVDHVGHRY GPRHFSMKEK LNQMNEVIAN VVKSLDDKTL LVVIGDHGMD
STGNHGGDSP DELESTLFMY AKNNKFFKKD SSHYNTTEQG KHYRAVNQID LVSTMSLLLG
LPIPFNNLGF PIDEAFENQM ELSVASYKTL QQIQGFRNST PNLSPEINKQ YHQIISNYTN
NSHDLYTLVD SAKTYQSRSL EECKGLWATF DLKMIGVGIT ILLLALTFIL TYARSIPAVR
VSTMSFEFIG SVIAMSLLGL VLSLSVSLVL KPADFNLKKC LALGASLGII VGFWAPIMDR
FSINWLVHQL IDFFVYNFNS WSFLGLVYVV AHCLIFASNS YVVWEDKMVS FFLMTFGVAC
IFNIAINFEL PRSQKILGLS HAITFTSLTR LVSTINLCRE EQRPYCQATF TTSWWSIVLL
HLCSYLLPTI IKSFYKLSDS YHSAAPLWVG TGLKFLMFMN AVYWTLEYVS NSEYFLSTSF
VLSSPLIKSL KLAIARIVLF ITLVLANFSW SKGPLCVKLE LSDAVQEDSA ESEDSDGPSK
TATILGYGNV YGSSYFLLVL NFTVAIMLVS KPLGAISINM LIVQILSLLE LYHIMDIRRN
LISPLIFGLL GYQHFFSTGH QATLAAIQWD VGFMTTETIT FPFTHLNIVL NTFGPFLIIC
LSVPLITLWR LAPSSKPITI LSQIVTNVTT LITYQLFTGV SSLIFAAHFR RHLMVWKIFA
PRFMLSGLLI ITINIFVIVV TLWFGTGRVV TQVNRIFGK
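
The record lists translation table 12, in which CTG is read as serine rather than leucine. The sketch below shows one way the protein sequence could be reproduced from the gene sequence under that table. The amino-acid string follows the NCBI convention of listing codons in TCAG order; `translate_table12` is a hypothetical helper, not a function of any library, and it raises `ValueError` on characters other than A, C, G, and T.

```python
BASES = "TCAG"

# Amino acids for translation table 12, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# The only difference from the standard table is CTG -> S (index 19).
TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate_table12(dna: str) -> str:
    """Translate DNA in frame 1 with table 12; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        a, b, c = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(TABLE_12[16 * a + 4 * b + c])
    return "".join(protein)


# First 90 bases of the gene sequence, copied from the record.
gene_start = (
    "ATGGAGTCAG TTGATATCGA CACTTCAGCG ACCAACCGGC GAGCAATCAA ACAACAGCAA "
    "TTCCAGCAAC AGCTGCGATC AAAGATCCGA"
)
print(translate_table12(gene_start))  # MESVDIDTSATNRRAIKQQQFQQQSRSKIR
```

The 25th codon of the gene is CTG, and the protein sequence shows S at position 25, which is what table 12 predicts.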