Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_77242 |
Symbol | ENP1 |
ID | 4837970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1471284 |
End bp | 1472711 |
Gene Length | 1428 bp |
Protein Length | 454 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640389285 |
Product | bystin-family protein putative nuclear protein |
Protein accession | XP_001383900 |
Protein GI | 126134751 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.525338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.363907 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAAAA TCACAGTAAC TGAGACCAAA GGAAAGCAAC GCCATAATCC CTTGTATAAG GATATTTCCA CTCAAGGTGG TAATTTGAGG TCAACACCTA GGTCTTCGCA GGCGGGAAGT AGAAAAAATG AAGAAGAAGA GGAGTACCTT GATGCAAGCA CTTCTAGAAA GATTCTACAG TTAGCGAAAG AACAACAAGA AGAAATACAA GAGGAAGAGA ATGTTTTCCT AGGCAAGCCA TCATTTGCAG ATTCATTTAG AGCACAAGAG AGTGGTGAAG AAAGCGAGGA AGACGTAGAT GAAGAGGATG AGTTCGAATT TGAAGAGGAA GAACTATACG AAGAGCAAGA AATTGAAGTA GATGAAAAAG ATGCCGAGTT ATTCAATAAG TATTTCCAAA GTAACGGACC ATCCGAACAT GGCGGTGAAT CCTTTAATTT GGCTGATAAA ATTATGGCCA AAATTCAAGA AAAGGAAATG ATGAAAGAAA AGGCATCAAG ACCTACGGAT GCGGTTTTGC TACCTCCTAA GGTGATTGCT GCATATGAAA AGATTGGTAA GATCTTGTCC ACTTATACTC ATGGAAAGTT ACCTAAATTA TTCAAGGTTT TACCTACTTT GCGTAATTGG GAAGATGTCT TATTCGTAAC AAATCCAGAG CAATGGACTC CGCATGCTGT ATATGAAGCT ACCAAATTAT TTGTATCCAA CTTACAAGCC CCAGAAGCTC AAAAGTTTGT GGAGAGTGTC TTATTAGAGA GATTCAGAAC ATCTATTGAA GACTCAGAAG ACCATTCATT AAACTATCAT ATTTACCGTG CCTTGAAGAA GTCGTTATAT AAACCTGCAG CATTTTTCAA GGGGTTTTTA CTTCCTCTCG TTGACTCTTA TTGTTCTGTT AGGGAAGCTA CTATTGCTGC TTCTGTGTTG TCAAAAGTAT CAGTTCCCGT TTTGCATTCC TCTGTGGCAT TGACTCAATT ATTACAAAGA GACTTTAAAC CATCAACCAC TGTCTTTATC AGAGTATTAG TGGAGAAGAA ATATGCTTTA CCATACCAAA CTTTAGACGA ATTGGTATTT TACTTCATGA GATTTAGAAA TGCTGTCCAA CAAGATTCTA TGGAAATTGA AATATCCGAA AATAGGGAAC CTCAGTTGCC AGTAGTGTGG CACAAGGCAT TCTTGGCATT TGCACAACGC TACAAGAATG ATATAACCGA TGACCAGAGA GACTTTTTGT TAGAAACTGT TAGGCAAAGA TTTCATCACG CCATTGGACC CGAAATTCGT AGAGAACTCC TAGCAGGAAA GCCAAGATTG ACGTCGGAAG CACCCAAGAT TGCAATAATG CAAGATGCCT TCTAGGTTTT CAATAATTGT TTGTATATTA TATTGGATAT TTTACTTTAA AAAAATGATT TTTGTCTA
|
Protein sequence | MGKITVTETK GKQRHNPLYK DISTQGGNLR STPRSSQAGS RKNEEEEEYL DASTSRKILQ LAKEQQEEIQ EEENVFLGKP SFADSFRAQE SGEESEEDVD EEDEFEFEEE ELYEEQEIEV DEKDAELFNK YFQSNGPSEH GGESFNLADK IMAKIQEKEM MKEKASRPTD AVLLPPKVIA AYEKIGKILS TYTHGKLPKL FKVLPTLRNW EDVLFVTNPE QWTPHAVYEA TKLFVSNLQA PEAQKFVESV LLERFRTSIE DSEDHSLNYH IYRALKKSLY KPAAFFKGFL LPLVDSYCSV REATIAASVL SKVSVPVLHS SVALTQLLQR DFKPSTTVFI RVLVEKKYAL PYQTLDELVF YFMRFRNAVQ QDSMEIEISE NREPQLPVVW HKAFLAFAQR YKNDITDDQR DFLLETVRQR FHHAIGPEIR RELLAGKPRL TSEAPKIAIM QDAF
|
| |