Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66043 |
Symbol | |
ID | 4840461 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 224660 |
End bp | 227901 |
Gene Length | 3242 bp |
Protein Length | 1009 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640391776 |
Product | predicted protein |
Protein accession | XP_001386255 |
Protein GI | 150866601 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.219798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAA ACCGTCAGCT TCTTTTGTTA GTTGGGCTTG TGTTCCATTT TTTCTACCTT TGGTCCATTT TCGACATCTA CTTCGTCCTG CCCTTAGTAC ATGGAATGGA CCACCATGTT TCCACGACTA CAGCACCAGC CAAACGTCTC TTTCTAATAG TGGGAGATGG ACTTCGTGCT GACAAGACAT TCCAAAAATT GAAGCATCCA AGAACTGGCG AAACAAAATA CTTGGCTCCG TACTTGAGAA GCATCGCTCA GAATGAAGGC ACCTGGGGCA TTTCGAACAC GAGAATGCCG ACTGAGTCTA GACCTGGCCA CGTAGCCATG ATCGCTGGTT TTTACGAAGA TGTCTCTGCT GTTACCAAGG GATGGAAGGA AAATCCAGTA GATTTCGACT CATTTTTTAA CCAATCGAAA CACACCTATT CCTTTGGATC GCCCGATATC TTACCCATGT TCGCTTATGG TGAAGGAGTA GTTCCCGGAA GAATCGACGT TTGTATGTAT GGCCACGAGT TCGAAGATTA TACCCAGAGC TCGATCGAGT TGGACGCATT TGTGTTCAAA CACTTTGACG AGCTAATGGC CAATTCTGAG ACCAACCAGA CACTACACGA CGAGTTGCAC GAGGAAGGAA ATGTCTTCTT CTTGCATTTG TTGGGTCCAG ACACAGCCGG TCATGCCTAC CGTCCGTATT CGGCCGAATA TTACGAGAAT ATCGAGTATA TCGACATGCA GTTGTCCAAG TTGATTCCTA GAATCCATGA ATTCTTTGGT GATGATGATT CTGCCTTTGT TTTCACAGCT GACCACGGCA TGTCCGATTT TGGATCGCAC GGTGATGGCC ATCCTGACAA CACCAGGACA CCGTTGATTG CATGGGGTGC CGGTGTTAAC AAGCCTAAAC ATATAAAGGA CTTACCTGAT CCGCAAGCCC AAAGAGCAAA ACAAGATCCA GTCCGCAGCG GATACGAAGA TACATATTTT GAAACGTGGG AACTTGACCA TTTGGTTAGA AATGACGTCA AGCAGGCTGA TATCGCTTCG TTAATGGCTT ATTTGATTGG TGCAAACTAT CCTGCCAATT CTGTAGGTGA GTTGCCTCTT GCGTACCTCG ACACCGATGC TGTGACGAAG ATCAAAGCTC TCTTCGCTAA CGCTTTAGCT ATTATTGAGC AATACTATGT CAAAGAAAAG GAAGTGTACA ACCACCAATT CAAGTTCAAG CCTTTCCGGC CCTTCGACGA AAAGTCAATC GATGAATACA GCGGTCAAAT TAACTCATTT ATTTATTCCT TACAAAATGA ACAACTCAGC CAATCTCAAA AGGAATTATT GGAAAAAGAG GCTGTCATGG TTGTGGAAGA ATTGATGAAG ACAGCTTTGG ATGGATTGAA TTATTTGCAA ACTTACAACT GGTTATTGTT GAGATCCATC GTCACATTGG GCTTCATTGG ATGGATCGTT TATGCTTTTG GTCTTTTCTT AAAGTTGTTC ATTATATCAG AAGAGGATTT ACAAACTTTG AAACCAGGTA ATCTGATATT CTTGTTGCTG TCTTTCTCTG CATTAGCATT GTCCACCAAC TACTTGCTTT TCTATCAAAA CTCGCCTTTC AATTACTACA TGTATGCTGC ATTTCCATTA TACTTTTGGT ACACCATCTT CAATGAGCTC ACCTATCTTG GAGAAGGTTT GAATCAGTTG TTGTACGGTA TCTCCATTCC AACTAGAGTT TTCATCGCAG TTTCCTTCAT TGGAATGTAT GAAGGTATAG CTTACGGCTT TTTTGAAAGA TTTGTGTTTT CGATCATCTT CGTCCTTATA GGATTGTACC CATTGTTCGT CTCAGGAAAT GAAAAAGTAT CTACTTACCA AAAGCTTGTA TGGTTAGCAA GTTGCCTGTT AATGTGTATT TTCACTAACT TGAACCCGGT GAAAGTTGAA AGCTTACTCC AAATTAATGC TGGTGCCTTG TTTTCGTTGA TAATTGCTCT GATTGGTATC AGCAAAGTCT TTAAGAGACC TATAGAGTCT GTACAGAAAA GGTTGGTCAT ACATCAATTG CTCATAATTC CTCTTATCTT GTACGCTACG AATGTTTCTG TACTTTCATT GCAAGCTAGG AATGGTCTTC CTTTATATTC ACAAGTTCTT GGTTGGTTGT CTTTTGTTGC ATCCTTATTG TTACCAGTAT TCCATTCGGT CTATCCTTCA AAGGACTACG CCTTGAGATT ATTGATTATT TTCCTTACTT TTGTTCCTGC CTTTATTATC TTGACCATTT CATTTGAATT GTTGTTCTAC AATGGATTCT CCTTGATATT GCTTCAATGG CTCAATATCG AAGAGAATTT AAAATTCCCA AGAGCAGAAA TTATCAAATC ACAAGAAGAA ACTGGAAGGT TACCGAAAGG GTATTGGTTG CAGTTGATTA GAATATCTAT CATCGGCTTC TTTTTCCTTC AGTTAGCATT CTTTGGCACT GGTAACATTG CATCGATTTC TTCTTTCTCA TTGGACTCAG TTTACAGGCT TATTCCTATA TTCGATCCTT TCCCAATGGG GGCGTTGTTG ATGTTTAAAT TGATTGTTCC ATACGTGCTA CTTTCCACTT GTCTTGGTAT CATGAACCAC AACTTGGAAA TCAGACAGTT CACAATTTCC ACATTAATTA TTTCTACAAG TGATTTCTTA TCGTTGAACT TCTTTTTCTT AGTGAGAACG GAAGGTTCTT GGTTAGACAT TGGACTCAGT ATCTCCAACT ATTGCTTAGC TATCCTATCT TCGCTTTTCA TGTTGATTTT GGAATTGGTC GGTTCCATAA TTTTGCGCGG AGTCGAATAC AATGACAAGT GGACGGAGAA AGAAGAGTAC AAAATAGAGG AAATAGAAAC TGAAGAAGAA AAAGCAGAGG AAGTTGAGGT TGTCGAAATT GATGAAACTG AAGAAATTGA AGTAATTGAA ATTGTCGAAG TTAACGAAGA GGAAATTGAA GATGAAGATG AAGATGTCGA GGATGAAGAA GAAGATGATG ATCACAACGG TGAAATAGAC CCCAGAAACA TTATACCAGC CCAATCGGAG ACCGAGGAAT TGCCGATCAG CGCAAGAATA AGACGGAGAG GCGCCAAGCG CGATTGAAAA GGAGGACAAA AACACTAGAT TTATTGTATA GTATATATAT ACAAAGGTAT ACTACCAAAA AGTTTAGAAA TTAGACACCG GGAAATTATA AC
|
Protein sequence | MNRNRQLLLL VGLVFHFFYL WSIFDIYFVS PLVHGMDHHV STTTAPAKRL FLIVGDGLRA DKTFQKLKHP RTGETKYLAP YLRSIAQNEG TWGISNTRMP TESRPGHVAM IAGFYEDVSA VTKGWKENPV DFDSFFNQSK HTYSFGSPDI LPMFAYGEGV VPGRIDVCMY GHEFEDYTQS SIELDAFVFK HFDELMANSE TNQTLHDELH EEGNVFFLHL LGPDTAGHAY RPYSAEYYEN IEYIDMQLSK LIPRIHEFFG DDDSAFVFTA DHGMSDFGSH GDGHPDNTRT PLIAWGAGVN KPKHIKDLPD PQAQRAKQDP VRSGYEDTYF ETWELDHLVR NDVKQADIAS LMAYLIGANY PANSVGELPL AYLDTDAVTK IKALFANALA IIEQYYVKEK EVYNHQFKFK PFRPFDEKSI DEYSGQINSF IYSLQNEQLS QSQKELLEKE AVMVVEELMK TALDGLNYLQ TYNWLLLRSI VTLGFIGWIV YAFGLFLKLF IISEEDLQTL KPGNSIFLLS SFSALALSTN YLLFYQNSPF NYYMYAAFPL YFWYTIFNEL TYLGEGLNQL LYGISIPTRV FIAVSFIGMY EGIAYGFFER FVFSIIFVLI GLYPLFVSGN EKVSTYQKLV WLASCSLMCI FTNLNPVKVE SLLQINAGAL FSLIIASIGI SKVFKRPIES VQKRLVIHQL LIIPLILYAT NVSVLSLQAR NGLPLYSQVL GWLSFVASLL LPVFHSVYPS KDYALRLLII FLTFVPAFII LTISFELLFY NGFSLILLQW LNIEENLKFP RAEIIKSQEE TGRLPKGYWL QLIRISIIGF FFLQLAFFGT GNIASISSFS LDSVYRLIPI FDPFPMGALL MFKLIVPYVL LSTCLGIMNH NLEIRQFTIS TLIISTSDFL SLNFFFLVRT EGSWLDIGLS ISNYCLAILS SLFMLILELV GSIILRGVEY NDKGINEEEI EDEDEDVEDE EEDDDHNGEI DPRNIIPAQS ETEELPISAR IRRRGAKRD
|
| |