Gene PICST_65566 details

Gene Information

Locus tag: PICST_65566
Symbol:
ID: 4838967
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 577096
End bp: 580307
Gene Length: 3212 bp
Protein Length: 734 aa
Translation table: 12
GC content: 45%
IMG OID: 640390282
Product: predicted protein
Protein accession: XP_001384073
Protein GI: 150865023
COG category:
COG ID:
TIGRFAM ID:
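The listed gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive; the record does not state this, but it is the convention that reproduces the listed 3212 bp.

```python
# Coordinates copied from the record above.
start_bp = 577096
end_bp = 580307

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1
print(gene_length)  # 3212
```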


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.445286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TGTTTTCATC ATCCAACTCC ACCATTTTTT GGTTCACCAG ACAAGTTCCT CCTACAAACA 
CCTTCACGAC ACAGCATCCA CCCGCAAATA CTCACACCAT CCTACACTAT AAATCCGCTT
TCTGACTCGC CGCCATGAGT CACCACAGTA CTCCTCATTT GTACACCAAG CGAACTCCAG
GATCGTTGGA AAACCCTGGT TCCAACCCCT TGGCACAGGC TCTTGCCACT AATTCCTCCT
CAAAACAGCA ACAGCAACAG CAACAACAGC CGTCACAACA GAATATCTCT TCATCTGTTT
CTGCTGCCAA TGCTGCCAAT GCCGTTGCTT CTGGTAAACC TACCATGGAT TCTTCTCGTT
CAGTGTCAGA CTCTACGGCT ACCAACGCCA AACAACTCTT AAACGCTTAT GTTTACGATT
TCCTCGTCAA GTCACGTTTG CCGAACACGG CAAGGATATT TGTCAATGAG GCAGAAGTTC
CCTCCGTCCA AAGCAGTGCT ATTGTTGCTG GCTCCAAGCT AGGCTCCCAC CAGCTGTCAC
AAAAGAACCT GCCCCAGATT AACCTGGGCG CCAACACCAA CACAAACACC AACACAAACA
CACCACAGAC GCCAAACCTG TCGTACCAGC AATTCCAGAA GGAAAACAAC TTGCCCAACT
TGCTGGTCGC TGTCGATGCA CCCCAGGGAT TTCTCTTTGA GTGGTGGCAG GTATTCTGGG
ACGTCTTCCA GGCAAAAAAC TCGTTTTCTC CAACTTCTGG ATTCAAGCCA AACAACATCA
ACATCAATGC TGCCAATGCC GCATCGCAAA ATATGGCTTT CCAGTACTAC CAGCTCCAGC
TCATGAAACA AAGACAACAA CAAGAAATAG GTCTTTCTAC CAATGGACAA CCGATGATGT
TTGCACCTAA TGGTGGCAAT GCTAGTATGG CTGGAAATGT CAATATGGCT GGCAATACGA
ATGGCAATCC TATGCTTCAG CAACAACTCA TGTTTCCGCA ACAACAACTA CAAAACCTAC
AACATCCACA GAATCAGCAA CCTCCTCGTT CACTACAACA GCAACAGATT CCTCAACAGG
CCCAACCGGG TACTGGTACG GGAGCTCAAT CACAGGGTCC AGTACAAGCT GGACAGCCAC
AGTTGCTTCC GCCAGGATCA CAAGGAGTAC CACAGACAAT TGAACAGCAA CAGCAACAGC
GACTTGTAAT GCAAATGATG ATGAAACAGC AGCAACAGGC TCAGCAAGCT CAACAACAGC
AGCATCCTTT ACAGCTTCCT ATGGGTGTTA ACCCCATGGA TCCTCTGCAA CAGCAACAAC
AGTTATTGGC AGGTATGGGA CCCAACAACA ATAACAACAG CACCAATAAC AATAACAACA
ATAATAATAT GAACTTACAA CAGCAATTAT TCTTATCACA GCAACAACAA CAGAATCAGA
GTAGAATTCA GCAACATGCT CAGAATCAGA TGAATAGTTT CAGACAACAG GCAGCAGCAG
CTCAACAGGC ACAACAACAA CATGATGGTT CTAACAATAA CTCTCCAGCT AATGGACATA
GATTGAGCCA GATGACACCG CAATCGGATC CTGCCCAGGC TCCACCAATG AATGGAATGA
ATTTTTCACA ACAGCCAAAC ATGATAAACG GGCAAGTACC ACCTCCCTTT GTCCAGCAGC
AACAACAGCA ACAGCAGCAG ATGCGTATGA ATAGTGTTAG CAACGGCAAG AGTGTAGGCA
CCAAGCAGGG AAGTCCCATC ATGCTTGCCG GAGCTGTGGG ACCACAACTG GGCAACAGGA
ATGGAAATGT TAGTGTTCCA GGTAATGTAA ACAACAGCAA TCCCAATCGT AACATGAATG
CTTTACAAGA CTATCAGATG CAGTTGATGT TGTTGGAAAA GCAGAACAAG AAACGTCTTG
ATATTGCCAG AAACAGCGGC GACGTCAACT TGCTTAGTTC CGGACTTATA GGACAAGCAC
AAGCACAACA ACAAGCTGGA CCTGGACCAC AGCAACAAGG TCAACAAGCC CAGCAGCAGC
AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCCGCAGAA TGCTCAAACT AATCGTCTGT
TTACCCAGAA GCCATCACCA GCAACTGCTA GCAGTTCACC TGTGCTACAC AATAAACCTT
CTCCACAGTC TACAACTGCA AAAAGAAAGA AGGAAACTAC TGGTAAGCGA GGTAGAAAGG
CAAGTGCAGC GGGACTCAGT GGAAGCACTC CAGCGATGAA CCCAGCTAAC ACGCCTAGTT
TACTCAAGAA AGAGTACACC ACACCTCTTA CGCCAGCTTC AGAACCAGCT AGTGACCCCA
AGAGAAAACG AAAGAGCACA ACTGGCAACA CTGATTCGCC CAAGAAGCAA GCAACTGCCA
AAACCACGGC AACAGCAGCT TCTACGACCT CAGCGGTAGC AGAAGCCAAA AAGGAAAAGC
CGATTAAGGA AGAAGAAGCC CCTGTTCCGG AGACCAAGAA GAAAGAAGAG TCGGAAATGC
CTCCTCCTAC ATCTTTTTCG GATCCTCTTG GACAGTCCGA CCAAATATTC TCTGTAGAGT
TGCTTGGAAA CGGTAGCACC GATTCACAAA ATTTCTTTGG CGCAAACGCC CAGGGCAACC
AGGGTGGTAT TGACGACATT GACTTTGATT TCGGTCAGTT CTTGGAGAGC AACGGCGATG
GCATTAATGA TGGCATCGGT GGCTTCAACT GGGGCAACGT GGATGCAATT GAAAACGGCG
AGTAGCACTG AAACACACCA CTTCTGAAGC TTTTCTTGCA ACATGAATAT CAGGAACATT
TTCAGAAAAA GAAATCATGA ATATCATGGA TTCTTCATCT AAGAAAATTA TATTATAGTT
GAAAGCTTAA GAATTTACAT TAAAGCTGGA GTTTAAGAAG CCACAAATAT CCTGGAAACA
CTTCTTTAGA GATTGAAACA TTTCTCAGAT TATCAACATG ACATTCTTCA CTATTAGGGC
TCATCCAGAC TGTGTCAAAT GCCTCTAATA TTTTATTCTC ATTGGTTTGT TTTATTTATT
CCTTTATTCT TATTTCTCGA CATATACCTC TTGAGAGTGA CGTTTTCAGG TTCTTTATTT
GTATTGTACT ATTATTCGTT GTTATTGTCG GCTCATTAGT CTATATATAC TATGCGATCT
TCATATTAAT AACATTCCTC GGACTTCGTT AA
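
The gene sequence above is laid out in space-separated blocks of 10 bases, 60 per line, so the spacing has to be removed before computing length or GC content. The following is a minimal sketch; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of the record. It is applied only to the first line of the sequence, so its result is not expected to match the 45% reported for the whole gene.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First line of the gene sequence, copied verbatim from the record.
first_line = "TGTTTTCATC ATCCAACTCC ACCATTTTTT GGTTCACCAG ACAAGTTCCT CCTACAAACA"
seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))  # 60 0.4
```

Passing the full sequence block to the same two helpers would give the whole-gene figure for comparison with the GC content field.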
 
Protein sequence
MSHHSTPHLY TKRTPGSLEN PGSNPLAQAL ATNSSSKQQQ QQQQQPSQQN ISSSVSAANA 
ANAVASGKPT MDSSRSVSDS TATNAKQLLN AYVYDFLVKS RLPNTARIFV NEAEVPSVQS
SAIVAGSKLG SHQSSQKNSP QINSGANTNT NTNTNTPQTP NSSYQQFQKE NNLPNLSVAV
DAPQGFLFEW WQYYQLQLMK QRQQQEIGLS TNGQPMMFAP NGGNARSQGV PQTIEQQQQQ
RLVMQMMMKQ QQQAQQAQQQ QHPLQLPMGV NPMDPSQQQQ QLLAGMGPNN NNNSTNNNNN
NNNMNLQQQL FLSQQQQQNQ SRIQQHAQNQ MNSFRQQAAA AQQAQQQHDG SNNNSPANGH
RLSQMTPQSD PAQAPPMNGM NFSQQPNMIN GQVPPPFVQQ QQQQQQQMRM NSVSNGKSVG
TKQGTVGPQS GNRNGNVSVP GNVNNSNPNR NMNALQDYQM QLMLLEKQNK KRLDIARNSG
DVNLLSSGLI GQAQAQQQAG PGPQQQGQQA QQQQQQQQQQ QQQQPQNAQT NPTASSSPVL
HNKPSPQSTT AKRKKETTGK RGRKASAAGL SGSTPAMNPA NTPSLLKKEY TTPLTPASEP
ASDPKRKRKS TTGNTDSPKK QATAKTTATA ASTTSAEEEA PVPETKKKEE SEMPPPTSFS
DPLGQSDQIF SVELLGNGST DSQNFFGANA QGNQGGIDDI DFDFGQFLES NGDGINDGIG
GFNWGNVDAI ENGE
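
The record lists translation table 12, the alternative yeast nuclear code in the NCBI numbering, in which the codon CTG is read as serine rather than leucine. The sketch below builds that table from its standard TCAG-ordered amino acid string and translates the first 36 bases following the first ATG in the gene sequence. Treating that ATG as the annotated start codon is an assumption, although the result agrees with the first 12 residues of the protein sequence above. Because the gene is spliced, translating the full genomic sequence this way would not reproduce the listed protein. The function name `translate` is hypothetical.

```python
BASES = "TCAG"

# Amino acids of NCBI translation table 12, in TCAG codon order.
# It differs from the standard code only at CTG (S instead of L).
TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TO_AA = dict(zip(CODONS, TABLE_12))


def translate(dna: str) -> str:
    """Translate DNA in frame 1; unknown codons become 'X'."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TO_AA.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# Bases from the first ATG on the third line of the gene sequence.
cds_start = "ATGAGT CACCACAGTA CTCCTCATTT GTACACCAAG"
print(translate(cds_start))  # MSHHSTPHLYTK
print(translate("CTG"))      # S
```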