Gene PICST_80003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_80003 
Symbol 
ID4841046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp143797 
End bp146671 
Gene Length2875 bp 
Protein Length931 aa 
Translation table12 
GC content44% 
IMG OID640392361 
Productpredicted protein 
Protein accessionXP_001386424 
Protein GI150866736 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.586246 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCGACTACTC GAGCATGTCA GACAACGAAG AAGACTACGA CATTGCCCGC TCGTTGACCG 
TCAACCTCGA AGACAGTGAC TCTGACTCGG GCTCTGACTT CAGTGACGAG GAGCAAGAAG
TCCAGGACAT AATTTCGAGC GAAGATGAAG CTGAAGAACA GCCCAAGAAG AAACAAAAGA
CAGCCAAACC AGCCAAAGAG GCTTTCCCAT CTCTAGAGCT CTCAGGTGAT GAAGACGAAC
AGGACGATGA CAAGGATATG GCGTCATATT TTGCTGCCAA CAATCCTCAG GCCAAGAAGG
CTAAGGCTGG TTCTTTTCAA TCGTTTGGTC TTTCCAAATT GGTACTCACC AATATTGCTA
AAAAGGGCTA TAGACAGCCA ACACCTATCC AGAGAAAAAC CATTCCACTC ATAATGGCCA
ACCGTGATGT GGTCGGTATG GCCAGAACCG GTTCTGGAAA GACTGCTGCA TTCACATTGC
CAGTAATTGA AAAATTAAAG GGCCACAGTG CTAGAGTAGG TATCAGAGCC ATTATCTTGT
CACCTCTGAG AGAATTGGCT TTACAGACAT ATAAGCAAGT CAAGGAGTTC AGTAAGGGCT
CAGATTTGCG TGCTATCGTA TTGACTGGTG GTGACTCTTT GGAGGATCAG TTCTCCAGCA
TGGTTTCCAA CCCGGATATT GTCATTGCTA CCCCAGGTAG ATTTTTGCAT TTGCAGGTAG
AAATGCAGTT GGATTTGAAA ACTGTAGAGT ATATAGTGTT TGACGAAGCC GATCATTTGT
TCGAACAGGG TTTTGCAGAG CAGTTGAATG AGTTGTTGGC TGTTTTGCCT CCACAAAGGC
AGTCTTTGTT GTTTAGTGCT ACTTTGCCAC GTTCGTTGGT GGATTTCGCA AAGGCTGGTT
TGTCCAATCC TGTGTTGGTC AGATTGGATG CTGACTCCAA GATCTCTGAC CAATTGCAAA
TGGCCTTCTT CACCACCAAG AAGAATGAAA GAGAAGCCAA TTTGTTGTAC GTCTTGCAAG
AAGTCATCAA GATGCCATTA GGAACAGCAG AACAAATCAA GAAGTTAAGA CTCATGGATA
AAAGAGTCAA CGACGAGGCA GAAGAAGAGG AGTTGGAAAA TGAATCGGGT TCGAAAAGAA
AGTACAAATT CAAGAAGGAA AGGTTACCAT CTGCTAACGT TTTGCCCTCT CCGCATTCCA
CCATTGTTTT CGTTCCTACT AAGCACCACG TAGAATACAT CACAACCTTA CTCCGTGACG
CTGGGTATTT GGTTTCGTAC ATCTACGGTA CATTAGACCA GCATGCTAGA AAGAACCAGT
TATACCAATT CAGATTAGGC ATGACTACCG TCTTGGTTGT CACAGATGTG GCTGCTCGTG
GTATTGATAT TCCTGTCTTG GCTAACGTTG TCAACTACAC GTTACCAGGC TCGTCTAAAA
TCTTCATTCA TCGTGTAGGT AGAACTGCGA GAGCAGGTAA CAAGGGTTGG GCGTACTCTA
TTGTTAACGA AAAGGAGTTG CCATATCTTC TTGACTTGGA ATTGTTTTTG GGAAAGAAAA
TCCTCTTAAC AGCAATGCAA GAACAGAAGT GTCAACTCTT GAAAGACAAG CAGGGAGACA
ATTACGTCGA GCCAAAAGTC CAGTATACCG ACCGTTTGGT ACTTGGTTCT TGTCCAAGAT
TGTCTTTGGA AACTTTTGAA GAACTTTATG AAAACTTGTT ACGTAACCAC TATGAGCTCT
CTGTCATTAA GGAAGTTGCT GCCAAGGGTG AGAAGTTGTA TTACCGTACC AGGAAGGCTG
CTTCTACCGA ATCGGTCAAG CGTGCAAAGG AAATCATGGA CACAGGAACC TGGGACGATC
AGCATCTACT TTTTGGTCCA AACTTGGAAA AGGAAAAGGA AAAGTTCTTG GCCAAGTTAG
CCAACAGACA TGTCAAGGAG ACAGTCTTTG AATTTAACAA AAAGGGTAAC GATCGTGACG
AAGACAGTCT TGTCAGTTTC ATGCATAGGA GAAGAAAGCA ACTTGCTCCC ATCCAGCGTC
GTGCAAGTGA AAGAAGAGAC TTGTTGCAAC GTGAAAGAGA AGCTGGTTTG ACCCATGGTA
TCGAAGATGA GATCTTGAAG GCTCATGGGG AGACTGGATA CTCTGCTTCC GGTATTAACG
ATGTAGACGA AGAGGAATTA CAAAATGCTT TTGAGGACGC TGATCAGTTA TCGTCCAAGT
CGAACAAGAA GAACTACCGT GACGATAGGT TCTTCATTAG TCACTACGCA CCAGCTTCGG
TAATTCAAGA TCAGCAGTTA AATATCACCT CGTCGTTTGC TAACGAGGCT GCGTCTGCTA
CATTCGACTT GGACAACGAT GACAAGATCG GAAATGGCAA ACAAGTCATG CAATGGGACA
GAAAGAAGGG TAAGTACATC AACTCCCAGT CCACAGACAA GAAATTCATC ATTGGTGAGA
GTGGTGCCAA AATTCCAGCC ACCTACAGAT CTGGAAAGTT CGACGAGTGG AAGAAGAAAC
GTAACATGCA ACCTGCCAAG GTGGGATCAC TTGAAACTGA AGGTGAAAGC AAGCAGCGTT
TCAAGCATAA ACGTAATGCT GCTCCAAAGT TGCCTGACAA ATACAGAGAT GACTACCACA
AGCAAAAGAA GAAGGTTGAA AAGGCTGTGG AATCGGGCCG TGACGTTAAG GGCTACCATA
AACCGGGTCA AAGACTGGAA ATCAAGTCTA CTGAAGATAT TAGAAAGGCT AGATTGTTGA
AAGAAAAGAA AATGGCCAAG AATGCCCGTC CTAGTAGGAA GCGCAAATAG ATATAGTTAA
CTTATAGAAT CTGTACATTA ATAATAGTCT ACTAGAAATT GTATTACGCT ATGGG
 
Protein sequence
MSDNEEDYDI ARSLTVNLED SDSDSGSDFS DEEQEVQDII SSEDEAEEQP KKKQKTAKPA 
KEAFPSLELS GDEDEQDDDK DMASYFAANN PQAKKAKAGS FQSFGLSKLV LTNIAKKGYR
QPTPIQRKTI PLIMANRDVV GMARTGSGKT AAFTLPVIEK LKGHSARVGI RAIILSPSRE
LALQTYKQVK EFSKGSDLRA IVLTGGDSLE DQFSSMVSNP DIVIATPGRF LHLQVEMQLD
LKTVEYIVFD EADHLFEQGF AEQLNELLAV LPPQRQSLLF SATLPRSLVD FAKAGLSNPV
LVRLDADSKI SDQLQMAFFT TKKNEREANL LYVLQEVIKM PLGTAEQIKK LRLMDKRVND
EAEEEELENE SGSKRKYKFK KERLPSANVL PSPHSTIVFV PTKHHVEYIT TLLRDAGYLV
SYIYGTLDQH ARKNQLYQFR LGMTTVLVVT DVAARGIDIP VLANVVNYTL PGSSKIFIHR
VGRTARAGNK GWAYSIVNEK ELPYLLDLEL FLGKKILLTA MQEQKCQLLK DKQGDNYVEP
KVQYTDRLVL GSCPRLSLET FEELYENLLR NHYELSVIKE VAAKGEKLYY RTRKAASTES
VKRAKEIMDT GTWDDQHLLF GPNLEKEKEK FLAKLANRHV KETVFEFNKK GNDRDEDSLV
SFMHRRRKQL APIQRRASER RDLLQREREA GLTHGIEDEI LKAHGETGYS ASGINDVDEE
ELQNAFEDAD QLSSKSNKKN YRDDRFFISH YAPASVIQDQ QLNITSSFAN EAASATFDLD
NDDKIGNGKQ VMQWDRKKGK YINSQSTDKK FIIGESGAKI PATYRSGKFD EWKKKRNMQP
AKVGSLETEG ESKQRFKHKR NAAPKLPDKY RDDYHKQKKK VEKAVESGRD VKGYHKPGQR
SEIKSTEDIR KARLLKEKKM AKNARPSRKR K