Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_54653 |
Symbol | |
ID | 4836655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2051751 |
End bp | 2052797 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640387970 |
Product | predicted protein |
Protein accession | XP_001382621 |
Protein GI | 150863962 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.191622 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAT CAGTGGATGA TATATTAAAG GGCATCTCGG CATCACTTCA GTCTACCAAG ACTACCGTAG ACGAGCTTGT TTCTGGTGTC CAAAACGAAG AGAGCTCATA TCCCCAAATC ATCCAGTCAT TGTTGAGCAA GTCTTCACAG CAGAAAGTCG AGGGTATGTC CCTTTTGGCA CTCAAGAATA ATTCCTTGGT GTCCTACTTG AACAACCTTG CCTTGATAGT GTTGGCCCAG TTGGAGAGAT TAGAATCTCA CGATATCAGC GACATTGAAA AGATTAGAGA AGATATCATT AAGCGTACCA TAGTCCAGAG AGTGACCTTG GAGAAAGGTG TTAAGCCTTT GGAAAAGAAG TTGACATACC AGTTAGACAA AATGGTGAGG TCGTACACGA GAATGGAAGC TGACGAAACC AAGTTGGAAG AGAAGTTGAA GAGCAAACAG GAAAATGGCC AGGAAGGTGA AGTGAGCGAC GGATCAGACT CTTCAGAAGA CGAAGATGCC TTATCTTACA GGCCCGATGC CGCGGCCTTG GCCAAGATGG CACCCAAGAG TTCTAGAAGC AAGCCCAAGT CTCGTGACGG GGACGAAGAG TCGAACGAAA AGTATAAACC TCCCAAGATC TCTGCCGTTG CACCACCCAC AGCTCCACAG CGTGATCCAG ATGCCAAAGA AAAAGAAGAC AAGAACAGAA AATTACAGAG TATGGAAGAA TACTTGCGCG AGCAGTCTGA CTTGCCTCTG GTGGAGTCAT CGATTGGTTC TACTATTGTA GACCATGGTA GAGGAGGTGT CAAGACCCAG CATGACAAGC AGAAGGAACA GGAAGTCCAG AGATACGAAG AAAGCAACTT TGTCAGATTG CCCCAGAACC AGACCAAGAA GTCGTTCAAG CAGAGACGGA GAGACATGGC CAACACCTTT GGTGGGGAGG ACTGGTCTAT GTTCAGTGAG ACTAACTCCC GTAATGTCAG CAGTGGTACT TCTAGAAAGA GAAAGGCGGG CAGTGTGTGG GACAAAGTGA AGAAGAGACA AGGTTAA
|
Protein sequence | MSESVDDILK GISASLQSTK TTVDELVSGV QNEESSYPQI IQSLLSKSSQ QKVEGMSLLA LKNNSLVSYL NNLALIVLAQ LERLESHDIS DIEKIREDII KRTIVQRVTL EKGVKPLEKK LTYQLDKMVR SYTRMEADET KLEEKLKSKQ ENGQEGEVSD GSDSSEDEDA LSYRPDAAAL AKMAPKSSRS KPKSRDGDEE SNEKYKPPKI SAVAPPTAPQ RDPDAKEKED KNRKLQSMEE YLREQSDLPS VESSIGSTIV DHGRGGVKTQ HDKQKEQEVQ RYEESNFVRL PQNQTKKSFK QRRRDMANTF GGEDWSMFSE TNSRNVSSGT SRKRKAGSVW DKVKKRQG
|
| |