Gene PICST_67427 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_67427 
Symbol 
ID4837677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1682510 
End bp1685797 
Gene Length3288 bp 
Protein Length501 aa 
Translation table12 
GC content45% 
IMG OID640388992 
Productpredicted protein 
Protein accessionXP_001383609 
Protein GI150864675 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0564378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.625866 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTTGTTCCAA ATCGAAATTG GGTTAGAAAT ACTGGAAGTA ATATATAACT TTGTACCTGT 
TCCATTTCAG TCCTAATTTT TCTGATTTTA TTTTGCCCAA AAATATTTTT GTTCTGCCAC
CGGTGAGATT TAAATCATAA CATATATTTA TATTATATAT ATACGAGAGT ACCAGGAATC
ACAACTTTTC CAGCTCATCT CCATACTTTT GAAAGTGTTG CTCCTCTCGG TACATCAGCA
CAGACCTTCC TAGCATCGGC TATTTTCACC ATTAGCCGCA TATCGTTCTC ACGGCCTCCA
TCTTTGACCA GAACCAAACG TCCACCTTGA GTTCTCGGAA GAATCGTCCC AAAGAACCAA
AATCAACCTT CGAATCACGC CAATTCGACT CCACCGTCTG AAGCACGAAG TATCTATACT
AGACCGTTGA AATCGGGAGT CTTTGTCCAA CAAACACAGC CAATTTCATC CTCCAGTCCA
TCTCAGCTTC CAGAGCCCGT TCTCAAGCTG ACTTTGTTCG TGTGTTGATT CCCGCTTAGC
CATATACATC TTCAACTAGC TCCGACTAGC TCCCACCGGT AATCCCAACC ATAGAAAGAG
CTTATATCGC CGGCAAAGCT ATCTTAGCCC CCAATTACCC CCCAAAACGC ACTTGGCTAA
AGCGCAGAAA AAATTTCGAA GCAGAATCCA CCCTCACTCT TGCGGAAAAA AAATCCAGTC
TCACTGTTTT TTTGATATCT TGAATAATTG TTTGTTTATT TGCGGTTGCT AGCACTTCAG
TCTGGTTCAC AGCAAGACAT ACACAGATAT CCACAAGAGC CAGCAGTCAC GTTCCTCGAT
CCTCTGCCCC AGCTTGTAAT AGCCAGTTCA ATTGTTGTAA ACAAATACAA CCTTGAGTAC
TCGCAAAGTG TACCTCATTG GAAAACAACA AAGAGTAATA ACAAACAACA ATAGTAGTAC
AAGTAGTGTT ACCTATAATA TCTCAGTGTT ACTTACAGCA TATCGTTTCG AACGCGTTCT
TCCCGATCCC TTCAGTCATT TCCCTAGCAC GGTAGCTGAA CATCATCATC TGATCGTATT
GATCGTCTAG TGGTATCGTA GTTCAAGAGA AGTTGGAAAA TCTGTAGAAA TCTTTTTTCC
GGTCATCTGA ACACATTTAT TCCTGATATC TTGACTGTAT TCATTCCTTC ATCTGACACA
TCTCCATTAT GAACGGCAAC TTCAACAACG GCTTGCAACA GACGTCGGCC ACGGCCCAAC
AGTATCCTTC ACAGCCGTAC GGCTCGTACT ATATCCAACA AGGCCAACCC CAAACGACGC
AGTCGATACA ATCCACAGGG CAGCAGACCC AGCAGCAACA ACAAGTGCAA GGTCAGCAGC
AAGCACAACA TCTGCAGCTG CAACAGCCGC AAACAGGCGC GTCTGCGTCT GATTATTATG
TCCGCTACCA GACGTCGTAC GGTGCTGCTC CACCAAACTA CTCCATGTAC CAGCAATCGA
TACCAGGCCA GTTGTCGCAG CCTCAGCCTC AGCACTACAA CCAGATGGCC TCGGCACAGA
CCGGCGCTAC CACTGCCTCG TCCTACGGGT CAGCAGCGGC TGCTGTAGCT AGTTCCAGCG
TTGGAACGAC GTCTAACACC CAATCTACGC CAATTCAGGA CACCATCAAC TCCTCCAGCA
ACTCCACTGT TAGCCAGTAT CAGCCTCCGG GAATCAGACC CCGTGTCACC ACCACAATGT
GGGAAGACGA GAAGACGTTG TGTTACCAAG TAGATGCCAA CAACGTCTCA GTTGTCAGAA
GAGCCGACAA CAACATGATA AACGGTACCA AGTTGTTAAA CGTAGCCCAG ATGACGCGTG
GTCGTCGTGA CGGGATCTTG AAGTCGGAAA AAGTTCGTCA CGTCGTCAAG ATCGGATCCA
TGCATTTGAA GGGAGTTTGG ATTCCCTTCG AACGAGCATT GGCCATGGCA CAACGAGAAG
GTATTGTAGA TTTGTTGTAC CCCTTATTTG TTCGTGATAT AAAGCGAGTT ATACAATCAG
GAGTAACTCC TAGTGCTAGC AACGCTGCTT CTACTGGTGC TGCATCTGCT GCTGCTGTTA
GTTCGAAGGC TACATCGACA CCAACACCAG CTGTGCAACC TTCGCTCAAC AGCTTCTATC
TGCAATATGG TCAGCAATAT GGCCAACAGT ACTCTGCACC ACAAGTCAAC GGTACAGCAG
GTACTGCCTC CGCTGCATCT GCAAGTGCCA CCACTGCTGG TGCATCTACG GCAGGAACCT
CTGCTACGGG CCAGCAATCG CCACAGCAAG CCACATCGTC TACCGCTACT ACCGACAGCT
ATCAACAGCA GCAACAGTTG CAGTTTCAAC AGCAGGCCCA GCTGCAAACT CTTCCCCAGC
TGCAACAACA GCAGGTTTAT GGCTACCAGC AACCATACTA CGCCGCCTCG TCTGGAGCCC
CTAACCAATA CTATCCATAC AACCCTACTG CTGCTAGTTA TTCCAGCTAC GGGAACCAGC
CGGTGTATTC GCCTTTTGGC TACGTGAATC CCGTGCCTCC CCAAATGCCT CACCAAGCTC
AGCAGCCTCA TCAGCAGCAG CCACAACCAG CCCTGTTAGG TCAGACTAGC TCGCTAGCAC
TGGCTCAAAC TTCTGCAGCC AATGGTAGTG CTGTTGCTTC TTCTGATGCT GCTAGCAAGA
AGGAAGAAAA ATAGAGTGAA AAGTTCAAGC CGGAATCATA AAGAGAAGAA ATCAAAGATA
CGAAAGGTCT AATGGTTTTT GTGATAGAGC TACTACTTTG TAATATACTT TTATGAATAC
CGGCCATGAA TTTTTCAAAA AAAATTATTG TTGTGGTTTA TCTTCAGCAT GATTTCTGGC
ATTGCTCTTG ATTGCAACCG CAACCCTCCC GTTAATTCCT AGTCATACAC TCCTCGTCTA
CCAGAATTGC TAGCAGAAGA GAGAGCTTCG TTCCACCAAC TGTACATTTT TTTCTAAACA
AAGATACTTA GAAGAGTTTT TGTTACGCTA GATCTATATT TTCATTTACT TGCTTCCCAA
AATTACATTG CAGTACATCA TTATAGTTGC TAATCCCTCG TTTTTGTTTT TGTTCCATTA
TGAGTATCCA TTGTTTTACA ACTAGTTCCA AGTTTGCTTT TGCTAATCTT CCTTCCTTTG
TATTAAATAG TGTATTTATG TTCTATACTT GTCTTCATAG CGTAGCCAGA AGCGAATTAT
TACTCCTTAC TTTAAAATTA TTCATTAAAC GATAATCTTC TTCAGAAA
 
Protein sequence
MNGNFNNGLQ QTSATAQQYP SQPYGSYYIQ QGQPQTTQSI QSTGQQTQQQ QQVQGQQQAQ 
HSQSQQPQTG ASASDYYVRY QTSYGAAPPN YSMYQQSIPG QLSQPQPQHY NQMASAQTGA
TTASSYGSAA AAVASSSVGT TSNTQSTPIQ DTINSSSNST VSQYQPPGIR PRVTTTMWED
EKTLCYQVDA NNVSVVRRAD NNMINGTKLL NVAQMTRGRR DGILKSEKVR HVVKIGSMHL
KGVWIPFERA LAMAQREGIV DLLYPLFVRD IKRVIQSGVT PSASNAASTG AASAAAVSSK
ATSTPTPAVQ PSLNSFYSQY GQQYGQQYSA PQVNGTAGTA SAASASATTA GASTAGTSAT
GQQSPQQATS STATTDSYQQ QQQLQFQQQA QSQTLPQSQQ QQVYGYQQPY YAASSGAPNQ
YYPYNPTAAS YSSYGNQPVY SPFGYVNPVP PQMPHQAQQP HQQQPQPASL GQTSSLASAQ
TSAANGSAVA SSDAASKKEE K