Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_80919 |
Symbol | |
ID | 4851742 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2714172 |
End bp | 2717481 |
Gene Length | 3310 bp |
Protein Length | 1060 aa |
Translation table | |
GC content | 41% |
IMG OID | 640393450 |
Product | predicted protein |
Protein accession | XP_001386850 |
Protein GI | 126275448 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG5593] Nucleic-acid-binding protein possibly involved in ribosomal biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.212453 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACAT CCAAGTTGAA CTTGTCCTCT TTGAGAGACA AGATTTCCAA TAAATTGGGC AGCCTGGCTG GTTCCAAGAA TCAGAAAAAG AATAAGAGGG CTTCCGGTAA AGACGACAAA AGTATCAAGG TCAATGAGAC GAAGCCAAAG GATTCTGAAA AGAAACTGAA AAAGACTGAA AAGTCCAAGG TTGGAGAATC TGAAGCAGAA GTTTTGCGTC GTGAAGCTCT TGCCTTGGGT GCTACCGAAG AAGATTTGGC CATGTTAAGT GGAGTAGAGG AAGGTGAAGA TAGTGAACAG GAATTCGATG TTTCTGATGC AAAATTGGAC AAAGCTTTTG GAGATGACTT GACTAATTTT ATGAAAGGAA TTGGCTTAGG TAATGGAGAA GCTGTTGTAG TCGATGACGA TGAAGAAGAA GAAATTCCAG AATTGGTCGA AATAGCCGAA GAAGAAGAGG AAGAAGAGGA AGAAGAGGAA GAAGAGGAAG AAGAGGAAGA AGAGGAAGAA CAAGAAGAAG AAAATGTAGA AATTAAAGAG TCTGTTGAAG AATCTGAATC TGAATCAGAG TCTGAATCAG AGTCTGAGTC CGAGTCCGAG TCTGAAGAGA GTGATGATGA TGTAGAAGTT GAAGAAAAAG AGATCAAGAA GGTCACTGCC ACTTCGGACA AGGTAACTAA TGTAGCTTCT GTCTCCGGCA AAAAACTTTC CATTCCTATC AGAACTGACT GGTACAATGT TACTCTCACC AAGAAGGACC CACAAGAAGT TGTCGATCGT TTTGCTCGTG AAAGATTGTA CGAAAGAGCC AAGAAGCTTG TCGACAGCGA GAACAAGCTC TACTTAGAAG AATTTGCATC CAATAACTCC CAGAAGAAGT TCTTGTCTCA AATTTTGTCT GATGGTACTC TTAACGATAA AATTTCTGCA TTAACACTTT TGATACAGGA AGCTCCATTG CACAACATTA AGGCATTGGA CACTTTAGTA GGCTACTGTG AAAAGAAGTC GAGAACTGCC GCATTGCAGT CCATTAACGC ATTGAAGGAC TTGTTGTTGA ACGGAGTCTT ACCTGACCGG AAGTTGTACG CCTTTGACAA GCAGCCTAAT TTGAGCAAAA GTAATACTGA TGCTGAGCTT GCTGCCTACT ACTTTGAAGA TATAGTCAAA AAACACTATT TCAAGTTGGT TTCCATCTTG GAAATCTTAT CGCATGATCC TATTCTTCAC GTGAGAATGA ATGTCGTCAG TCATATATTT GATTTGTTGA AGGCCAAACC AGAGCAGGAA GCCAACTTGT TGCGCTTGGG AGTCAACAAG TTGGGTGATA TCGACAACAA GGTGGCATCC AAGACCTCGT ACCAGATTTT GCAGCTCGAA CAAGCACATC CAGCCATGAA GAAGATCATT TCCGACTCTG TCATCGATAT TGTCTTCAAG AACAACAGTG ACTACCACGC TCAATACTAC TCTATTACAA CCTTGAATCA AACTATTTTG ACCAGACAAG AAGACGAATT GGCCAACGCT TTGATCAAGA CCTATTTCGC ATTATTTGAG AAAATTTTGG TAGAGACCGA CGGAAAAACA GAAGGAAAGG GCGAAGACAA GGTCCTTGGA AAGACCGAAA AGGGTAGAAA AAATAACCGT AAGAATTTCA AGAAGGGTAA GAAGGGTGGA AAATCTGTGA AACAGGAGCC TAAGTCGGAA TCAGAAATAA TCGAAGAAAA AAATACCAGA TTGTTCTCGG CCCTCTTAAC TGGTTTGAAC AGAGCATTCC CATTTTCTAA CTTGCCGAAC GAAGTTTTCC AAAAACACTT GGACACCTTG TTCAAGATTA CCCATTCCTC CAACTTCAAC ACCTCTGTTC AAGCATTGGT GTTGGTAAAT CATATCATTA CCCAGCAGCT GTTGGATTCG AATAGATATT ATAGAACACT ATACGAATCT TTATTGGATT CGAGATTGGT TAACTCTTCC AAGCAAGGTA TCTACCTCAA CTTGTTGTAC AAGTCGTTGA AGAACGATTC CAGCACAGAA AGAGTGCTTG CGTTTGTGAA AAGAATCTTA CAAGTCTGTG CGCACTGGTT GAATGTCGGT GCAATCTCAG GTATGTTGTT CTTGTTGATG GAATTATCTA AGACATTCCC CCAAATTCTG GACTTGAGCA TCGATATTTC ATCAAGACCC GACCCTGAAG AGGTTGAAGG AGAAGAAAAG AAAGTTGAAA CTGAAGATGA TTCAGAATAT GATCCACGTA AACGTGATCC TAAGTTTGCC AACGCTGATA AGTCTTCGTT ATGGGAGCTT GGACAGTTCT TACATCACTA CCATCCTACT ATTGCTGTAT ATGCTTCATC CTTACTTGAA GGTAAGAATG ATCAGCCAAA GCCCGATTTG GGTCTCTTTA CGTTGAGTCA CTTCTTGGAC AGATTTGTCT ATAGAAACGC AAAGCAAAAG CCAGCCACAA AGGGATCTTC CATTATGCAA CCTTTGGGAG GTGTTCACAC TGGATCTTTA TTGGTTAAGG CAACTGGCGT TCAAAGCACG GAAGTCCCTG CCAACACCGA AGACTGGTTA ACTAAGAAGG CTAAAGACAT TAGACCAGAC GAGCAATTCT TCCACCAATA CTTTACTACA AAGAAAGACA AATTGAGAGG TAAGAAACAA GACAATGAAG CAGAAGAAAT CGACGAAGCC GAATTAGAAG CTATGGGTGA TGATGAAATC TGGAAGGCGT TGGTTAAATC CAGACCAGAA GTTGAAGAAG GAAGCGACGA AGATGAAGGT TTCTCCTTTG ACGAAGAAGA TTTCTCTGAC TTGAGTGATG ACGAGGAAGT CGATGATAGC GAAAAAGTCA ACTTTCTAGA TGATGAAGCC GAAGAACAAG AAGAAGAGGA GCAAGAGGAA GAAGAGGAAG AAGAAATTGA AGATGAAGAT GAAGAAGACG CTGAAATTGA AGACGATGAA ATTCCAGAAT TTGACGAAGA AGAAGGAGAA ATATTTGGTA TCAACTCTGA AGACGAATTG AATGATTCTG AAGTTGAGAT CAAGATGCTT GGTGACTTCG ATGAGGACGA CTCAGAAGAA GAAGTGCAAG AGGAAGTGAA GCCCAAGAAA AAATCCAAGA AGCGTAGCAA ATCTGACGAC AGCGAAGACA AGAAGGACAA CAAGAAGAAG AAGACCAAGT TGAGCAGCTT GCCTGTTTTT GCTTCTGTGG ATGATTATTC GCAATATTTG CAATCGGATG ATGAGGATTA CAGTTAGACT ATTATTATAT ATCCTGAAAA AAATATCAGA TTATAGACGC
|
Protein sequence | MDTSKLNLSS LRDKISNKLG SLAGSKNQKK NKRASGKDDK SIKVNETKPK DSEKKLKKTE KSKVGESEAE VLRREALALG ATEEDLAMLS GVEEGEDSEQ EFDVSDAKLD KAFGDDLTNF MKGIGLGNGE AVVVDDDEEE EIPELVEIAE EEEEEEEEEE EEEEEEEEEE QEEENVEIKE SVEESESESE SESESESESE SEESDDDKVT ATSDKVTNVA SVSGKKLSIP IRTDWYNVTL TKKDPQEVVD RFARERLYER AKKLVDSENK LYLEEFASNN SQKKFLSQIL SDGTLNDKIS ALTLLIQEAP LHNIKALDTL VGYCEKKSRT AALQSINALK DLLLNGVLPD RKLYAFDKQP NLSKSNTDAE LAAYYFEDIV KKHYFKLVSI LEILSHDPIL HVRMNVVSHI FDLLKAKPEQ EANLLRLGVN KLGDIDNKVA SKTSYQILQL EQAHPAMKKI ISDSVIDIVF KNNSDYHAQY YSITTLNQTI LTRQEDELAN ALIKTYFALF EKILVETDGK TEGKGEDKVL GKTEKGRKNN RKNFKKGKKG GKSVKQEPKS ESEIIEEKNT RLFSALLTGL NRAFPFSNLP NEVFQKHLDT LFKITHSSNF NTSVQALVLV NHIITQQLLD SNRYYRTLYE SLLDSRLVNS SKQGIYLNLL YKSLKNDSST ERVLAFVKRI LQVCAHWLNV GAISGMLFLL MELSKTFPQI LDLSIDISSR PDPEEVEGEE KKVETEDDSE YDPRKRDPKF ANADKSSLWE LGQFLHHYHP TIAVYASSLL EGKNDQPKPD LGLFTLSHFL DRFVYRNAKQ KPATKGSSIM QPLGGVHTGS LLVKATGVQS TEVPANTEDW LTKKAKDIRP DEQFFHQYFT TKKDKLRGKK QDNEAEEIDE AELEAMGDDE IWKALVKSRP EVEEGSDEDE GFSFDEEDFS DLSDDEEVDD SEKQEEEEEE EIEDEDEEDA EIEDDEIPEF DEEEGEIFGI NSEDELNDSE VEIKMLGDFD EDDSEEEVQE EKKSKKRSKS DDSEDKKDNK KKKTKLSSLP VFASVDDYSQ YLQSDDEDYS
|
| |