Gene PICST_80919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_80919 
Symbol 
ID4851742 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2714172 
End bp2717481 
Gene Length3310 bp 
Protein Length1060 aa 
Translation table 
GC content41% 
IMG OID640393450 
Productpredicted protein 
Protein accessionXP_001386850 
Protein GI126275448 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG5593] Nucleic-acid-binding protein possibly involved in ribosomal biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.212453 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACAT CCAAGTTGAA CTTGTCCTCT TTGAGAGACA AGATTTCCAA TAAATTGGGC 
AGCCTGGCTG GTTCCAAGAA TCAGAAAAAG AATAAGAGGG CTTCCGGTAA AGACGACAAA
AGTATCAAGG TCAATGAGAC GAAGCCAAAG GATTCTGAAA AGAAACTGAA AAAGACTGAA
AAGTCCAAGG TTGGAGAATC TGAAGCAGAA GTTTTGCGTC GTGAAGCTCT TGCCTTGGGT
GCTACCGAAG AAGATTTGGC CATGTTAAGT GGAGTAGAGG AAGGTGAAGA TAGTGAACAG
GAATTCGATG TTTCTGATGC AAAATTGGAC AAAGCTTTTG GAGATGACTT GACTAATTTT
ATGAAAGGAA TTGGCTTAGG TAATGGAGAA GCTGTTGTAG TCGATGACGA TGAAGAAGAA
GAAATTCCAG AATTGGTCGA AATAGCCGAA GAAGAAGAGG AAGAAGAGGA AGAAGAGGAA
GAAGAGGAAG AAGAGGAAGA AGAGGAAGAA CAAGAAGAAG AAAATGTAGA AATTAAAGAG
TCTGTTGAAG AATCTGAATC TGAATCAGAG TCTGAATCAG AGTCTGAGTC CGAGTCCGAG
TCTGAAGAGA GTGATGATGA TGTAGAAGTT GAAGAAAAAG AGATCAAGAA GGTCACTGCC
ACTTCGGACA AGGTAACTAA TGTAGCTTCT GTCTCCGGCA AAAAACTTTC CATTCCTATC
AGAACTGACT GGTACAATGT TACTCTCACC AAGAAGGACC CACAAGAAGT TGTCGATCGT
TTTGCTCGTG AAAGATTGTA CGAAAGAGCC AAGAAGCTTG TCGACAGCGA GAACAAGCTC
TACTTAGAAG AATTTGCATC CAATAACTCC CAGAAGAAGT TCTTGTCTCA AATTTTGTCT
GATGGTACTC TTAACGATAA AATTTCTGCA TTAACACTTT TGATACAGGA AGCTCCATTG
CACAACATTA AGGCATTGGA CACTTTAGTA GGCTACTGTG AAAAGAAGTC GAGAACTGCC
GCATTGCAGT CCATTAACGC ATTGAAGGAC TTGTTGTTGA ACGGAGTCTT ACCTGACCGG
AAGTTGTACG CCTTTGACAA GCAGCCTAAT TTGAGCAAAA GTAATACTGA TGCTGAGCTT
GCTGCCTACT ACTTTGAAGA TATAGTCAAA AAACACTATT TCAAGTTGGT TTCCATCTTG
GAAATCTTAT CGCATGATCC TATTCTTCAC GTGAGAATGA ATGTCGTCAG TCATATATTT
GATTTGTTGA AGGCCAAACC AGAGCAGGAA GCCAACTTGT TGCGCTTGGG AGTCAACAAG
TTGGGTGATA TCGACAACAA GGTGGCATCC AAGACCTCGT ACCAGATTTT GCAGCTCGAA
CAAGCACATC CAGCCATGAA GAAGATCATT TCCGACTCTG TCATCGATAT TGTCTTCAAG
AACAACAGTG ACTACCACGC TCAATACTAC TCTATTACAA CCTTGAATCA AACTATTTTG
ACCAGACAAG AAGACGAATT GGCCAACGCT TTGATCAAGA CCTATTTCGC ATTATTTGAG
AAAATTTTGG TAGAGACCGA CGGAAAAACA GAAGGAAAGG GCGAAGACAA GGTCCTTGGA
AAGACCGAAA AGGGTAGAAA AAATAACCGT AAGAATTTCA AGAAGGGTAA GAAGGGTGGA
AAATCTGTGA AACAGGAGCC TAAGTCGGAA TCAGAAATAA TCGAAGAAAA AAATACCAGA
TTGTTCTCGG CCCTCTTAAC TGGTTTGAAC AGAGCATTCC CATTTTCTAA CTTGCCGAAC
GAAGTTTTCC AAAAACACTT GGACACCTTG TTCAAGATTA CCCATTCCTC CAACTTCAAC
ACCTCTGTTC AAGCATTGGT GTTGGTAAAT CATATCATTA CCCAGCAGCT GTTGGATTCG
AATAGATATT ATAGAACACT ATACGAATCT TTATTGGATT CGAGATTGGT TAACTCTTCC
AAGCAAGGTA TCTACCTCAA CTTGTTGTAC AAGTCGTTGA AGAACGATTC CAGCACAGAA
AGAGTGCTTG CGTTTGTGAA AAGAATCTTA CAAGTCTGTG CGCACTGGTT GAATGTCGGT
GCAATCTCAG GTATGTTGTT CTTGTTGATG GAATTATCTA AGACATTCCC CCAAATTCTG
GACTTGAGCA TCGATATTTC ATCAAGACCC GACCCTGAAG AGGTTGAAGG AGAAGAAAAG
AAAGTTGAAA CTGAAGATGA TTCAGAATAT GATCCACGTA AACGTGATCC TAAGTTTGCC
AACGCTGATA AGTCTTCGTT ATGGGAGCTT GGACAGTTCT TACATCACTA CCATCCTACT
ATTGCTGTAT ATGCTTCATC CTTACTTGAA GGTAAGAATG ATCAGCCAAA GCCCGATTTG
GGTCTCTTTA CGTTGAGTCA CTTCTTGGAC AGATTTGTCT ATAGAAACGC AAAGCAAAAG
CCAGCCACAA AGGGATCTTC CATTATGCAA CCTTTGGGAG GTGTTCACAC TGGATCTTTA
TTGGTTAAGG CAACTGGCGT TCAAAGCACG GAAGTCCCTG CCAACACCGA AGACTGGTTA
ACTAAGAAGG CTAAAGACAT TAGACCAGAC GAGCAATTCT TCCACCAATA CTTTACTACA
AAGAAAGACA AATTGAGAGG TAAGAAACAA GACAATGAAG CAGAAGAAAT CGACGAAGCC
GAATTAGAAG CTATGGGTGA TGATGAAATC TGGAAGGCGT TGGTTAAATC CAGACCAGAA
GTTGAAGAAG GAAGCGACGA AGATGAAGGT TTCTCCTTTG ACGAAGAAGA TTTCTCTGAC
TTGAGTGATG ACGAGGAAGT CGATGATAGC GAAAAAGTCA ACTTTCTAGA TGATGAAGCC
GAAGAACAAG AAGAAGAGGA GCAAGAGGAA GAAGAGGAAG AAGAAATTGA AGATGAAGAT
GAAGAAGACG CTGAAATTGA AGACGATGAA ATTCCAGAAT TTGACGAAGA AGAAGGAGAA
ATATTTGGTA TCAACTCTGA AGACGAATTG AATGATTCTG AAGTTGAGAT CAAGATGCTT
GGTGACTTCG ATGAGGACGA CTCAGAAGAA GAAGTGCAAG AGGAAGTGAA GCCCAAGAAA
AAATCCAAGA AGCGTAGCAA ATCTGACGAC AGCGAAGACA AGAAGGACAA CAAGAAGAAG
AAGACCAAGT TGAGCAGCTT GCCTGTTTTT GCTTCTGTGG ATGATTATTC GCAATATTTG
CAATCGGATG ATGAGGATTA CAGTTAGACT ATTATTATAT ATCCTGAAAA AAATATCAGA
TTATAGACGC
 
Protein sequence
MDTSKLNLSS LRDKISNKLG SLAGSKNQKK NKRASGKDDK SIKVNETKPK DSEKKLKKTE 
KSKVGESEAE VLRREALALG ATEEDLAMLS GVEEGEDSEQ EFDVSDAKLD KAFGDDLTNF
MKGIGLGNGE AVVVDDDEEE EIPELVEIAE EEEEEEEEEE EEEEEEEEEE QEEENVEIKE
SVEESESESE SESESESESE SEESDDDKVT ATSDKVTNVA SVSGKKLSIP IRTDWYNVTL
TKKDPQEVVD RFARERLYER AKKLVDSENK LYLEEFASNN SQKKFLSQIL SDGTLNDKIS
ALTLLIQEAP LHNIKALDTL VGYCEKKSRT AALQSINALK DLLLNGVLPD RKLYAFDKQP
NLSKSNTDAE LAAYYFEDIV KKHYFKLVSI LEILSHDPIL HVRMNVVSHI FDLLKAKPEQ
EANLLRLGVN KLGDIDNKVA SKTSYQILQL EQAHPAMKKI ISDSVIDIVF KNNSDYHAQY
YSITTLNQTI LTRQEDELAN ALIKTYFALF EKILVETDGK TEGKGEDKVL GKTEKGRKNN
RKNFKKGKKG GKSVKQEPKS ESEIIEEKNT RLFSALLTGL NRAFPFSNLP NEVFQKHLDT
LFKITHSSNF NTSVQALVLV NHIITQQLLD SNRYYRTLYE SLLDSRLVNS SKQGIYLNLL
YKSLKNDSST ERVLAFVKRI LQVCAHWLNV GAISGMLFLL MELSKTFPQI LDLSIDISSR
PDPEEVEGEE KKVETEDDSE YDPRKRDPKF ANADKSSLWE LGQFLHHYHP TIAVYASSLL
EGKNDQPKPD LGLFTLSHFL DRFVYRNAKQ KPATKGSSIM QPLGGVHTGS LLVKATGVQS
TEVPANTEDW LTKKAKDIRP DEQFFHQYFT TKKDKLRGKK QDNEAEEIDE AELEAMGDDE
IWKALVKSRP EVEEGSDEDE GFSFDEEDFS DLSDDEEVDD SEKQEEEEEE EIEDEDEEDA
EIEDDEIPEF DEEEGEIFGI NSEDELNDSE VEIKMLGDFD EDDSEEEVQE EKKSKKRSKS
DDSEDKKDNK KKKTKLSSLP VFASVDDYSQ YLQSDDEDYS