Gene PICST_68309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_68309 
Symbol 
ID4840727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp485918 
End bp488831 
Gene Length2914 bp 
Protein Length927 aa 
Translation table12 
GC content42% 
IMG OID640392042 
Productpredicted protein 
Protein accessionXP_001386294 
Protein GI150866630 
COG category[L] Replication, recombination and repair
[R] General function prediction only 
COG ID[COG0494] NTP pyrophosphohydrolases including oxidative damage repair enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00286141 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCAAAC TGAAAATCTA CAATTCTCGA TTCACGAATC ATATTCACTC TCTCAGCTAC 
CTATTCGCCA GCTTCACTAT ACATAGCAGT AGCTTTTTAT CCTTTTCTAC ATTACTGTCC
ATCAGTAACT CACTCATGTC CATCCAATTG CGAGACGGCT TAGCCAACCA GCTGGTTGAT
CTTGTGCTTG AAGACCTATT GGTGAGGTTT CTAGTCAATT GTCCCGAGGA AGATCTTTCG
CTGATCGAAA GAGTGTTTTT CCAAGTTGAA GAAGCGCAAT GGTTCTATAC CGACTTTGTC
CGAGTGCTAA ACCCGGCACT TCCTAACATG AAGATGAAGC TGTTCTGCTC CAAATTTTTG
GAGAAGTGCC CTCTATTCTG GAAATGGGGA GACCCCAACG ATGCGCTATC CCGATTCGGC
AAGTATAAAC TGACGATTCC CGTTCGTGGA GTCGCTCTTT TCAATAGAGA CTTGACGAAA
GTGCTATTGG TGAAGGGGAC AGAGTCTAAC TCGTGGTCGT TTCCCCGAGG AAAAATTTCC
AAAGACGAAT CTGACATCAA CTGTGCCATT CGAGAGGTCG AAGAAGAGAC TGGCTTCAAC
GCCAAAGACC TAATCAATGA AAGCGACGTT ATTGAGAGAA CCTTCAAAGG CAAGAATTAT
AAAATTTACT TGGTGAGAGA CGTGCCTGAA GATTACAACT TTCTGCCTGT AGCTCGTGGG
GAAATCGCTA TGATTGAATG GCATGACATT AAAACTTTGC AAAAGAAGAT TAGAGCTTCT
CCCAACAACT ACTTCATCGT CGAGACTGTT ATAAAACCAA TGATTCAATG GATAAATAAG
AAGAAGGGTG GGTTAAACGA GGCAGAGTTG ATGCTAAAGG CAGAAATCCA GTTGAAGGCT
TTATTGGGTG TAGGAAAGCG CGAAGAAAAC GTAGATGCTG GAAGAGAATT GTTGAACATT
CTCCAGAAAG TCAGCCCCAG CCACTCAGCC AGCGACTCCA ATGTAGTACC TCTCTCATCT
ACTGTGCCTG GTGCACCTCA ACAACAGAAC TATATCCAAT TCGCCTTACC GCAACACTTA
CAGAACCAAA TCCCGTTCTT CTCTTCATCT TCAACCCACC ATCCACAGCC TATGTTGCCA
TTCTTCAATC CCTTCGGCTT CTATCCTGGC GGATCTCCTT TACCTCCTCA TGCCATAACT
CCTGTTCCAA TGCCTGTCCC ACCTCATCAG ATTCCTTTCC TGAATGTCTC CCCTCAAAAA
CAGAGACATC CATATGAAAT ACACCAGCCA ACATCAGAGT CTCTTCAGAA ACCCACAGGT
AAAAACTCCA AGGAATTCTT ATCGATTTTG AATACAAAGT CACTGAAAAT AACAGATGAG
GCTACATCAA AAGATTATGT TAGATCTACG GAAGCAGAGA ATAACCGTAC AAAAGCTCAA
GACTTGCTTA ACTTAGTAGG AAAGCAGAGA AAGGAGTCTG TTACATCTGA GTCAAGATCA
ATTTTGGACC TTGTCAATAG GAAACAAGTA AGTTCATCTC CATCTCCAGA ACAAGACCCG
GGCAAGACTT TATTGAACAT TCTCAACGAG AAGAAACATC CTGAGATTGT GCCTACTAGA
GATTCGAGAT TTTTGGGTAT AAATTCACCT ATAGCATCCA ACGATCTAGA AGAATCGGTT
ATTCACGGTG CTGGTCTTGG ATTACCGGCT CCAGAACCTG GCAAAATAAC TTTGCTAAAA
AGACCAGACG ATGCAACAGA GGGTAGAAGA AAAAAGTCGG CTGATTTACT TAGTTTGTTG
GGTAAAAAAC CTATCGTCCA ACCAAGACAG GAAGTTAAGT CATCGTCCAA CGAGATACTT
GATCTCTTGA AGGGCCCAAA GAAACCGCTT TCAGAAACTA CAAGATCTCC CAATTCTTCT
TCTAAAGAGT TGCTTGAATT ACTTAAACCA AAGAAGGAAC CAATAGCGGT CGACCATACT
TCTTCAAATG AATTATTAGA CTTGCTCATC AAAAACAAGC CAAGCAGTGA ATCTGAGAAG
AAACCAAATA ACCCATTATT GGATATGCTC CATAGTAAGG CACCACGTCA AGTATCACAA
CCAAATGCCG AACATAATCA TACTTCAGCC AATGAACTTT TGGGTTTGTT GAACAAAAAA
CCATCCATTC CATTAAATGA TGCTGACTCT AACTTCATTA GGGAAGAAAA GATTGAAGAG
ACTCAATTCG ACAACTTTGA AGATTTTGAA GACTTTGAAG ACTTTGGAGT CATTGATAAC
CAGCTATTGG GCAAGACCTC CTTCCGCAAC TTCGACATCG CAAGCGATGA AGAAGATGTA
GACCATTTGA TAGATGATCT CGGAGATCCA TATTCCGCTC CAAATACAGC AGTAGAATCG
TTTTCCAATC CACCAGATTT CTTCCTGGAT CCGCAGCCTT CTCTAGAACC AAAGAAGGGA
AAGATCAGGC TTTTGAAGCC AGGAGAAGTA TTGAATGATA TCTTTTCTAC TAATCGTCCC
AATGTTTCAT CGCCTCCTGT GCATGCTTCT AATGCTAATG GACAGAATCT TCTTGCATTG
TTGAATGGAA AAAATCCCTC CAATTCAAAT GGTGCTATTC CTGTAAGCGA CAGCTTTCAA
TCTATTTACG GGAATGCCAA CCCAACGGCG GGTCTTGATT CTAATACTCC TTCTTCTTTG
ACAAATGCTC TTAGTGGTCA GAATAGCCCC AATAAGAATT CGGCTAATTT CCTTAAAGAC
ATTCTTTGGA AACGCGAGCC ATAGTTAGCA CAGCATAAAT TAGCATAGCG TCTACGATAG
TACCGGTGAA GTAGCACTAC AATAGTACGA ATTGCTATTT ATAGCAAATT TAGCATAGTG
TCTAAGATAA CAAAACATAT AAATGATTTA GTCC
 
Protein sequence
MSKSKIYNSR FTNHIHSLSY LFASFTIHSS SFLSFSTLSS ISNSLMSIQL RDGLANQSVD 
LVLEDLLVRF LVNCPEEDLS SIERVFFQVE EAQWFYTDFV RVLNPALPNM KMKSFCSKFL
EKCPLFWKWG DPNDALSRFG KYKSTIPVRG VALFNRDLTK VLLVKGTESN SWSFPRGKIS
KDESDINCAI REVEEETGFN AKDLINESDV IERTFKGKNY KIYLVRDVPE DYNFSPVARG
EIAMIEWHDI KTLQKKIRAS PNNYFIVETV IKPMIQWINK KKGGLNEAEL MLKAEIQLKA
LLGVGKREEN VDAGRELLNI LQKVSPSHSA SDSNVVPLSS TVPGAPQQQN YIQFALPQHL
QNQIPFFSSS STHHPQPMLP FFNPFGFYPG GSPLPPHAIT PVPMPVPPHQ IPFSNVSPQK
QRHPYEIHQP TSESLQKPTG KNSKEFLSIL NTKSSKITDE ATSKDYVRST EAENNRTKAQ
DLLNLVGKQR KESVTSESRS ILDLVNRKQV SSSPSPEQDP GKTLLNILNE KKHPEIVPTR
DSRFLGINSP IASNDLEESV IHGAGLGLPA PEPGKITLLK RPDDATEGRR KKSADLLSLL
GKKPIVQPRQ EVKSSSNEIL DLLKGPKKPL SETTRSPNSS SKELLELLKP KKEPIAVDHT
SSNELLDLLI KNKPSSESEK KPNNPLLDML HSKAPRQVSQ PNAEHNHTSA NELLGLLNKK
PSIPLNDADS NFIREEKIEE TQFDNFEDFE DFEDFGVIDN QLLGKTSFRN FDIASDEEDV
DHLIDDLGDP YSAPNTAVES FSNPPDFFSD PQPSLEPKKG KIRLLKPGEV LNDIFSTNRP
NVSSPPVHAS NANGQNLLAL LNGKNPSNSN GAIPVSDSFQ SIYGNANPTA GLDSNTPSSL
TNALSGQNSP NKNSANFLKD ILWKREP