Gene Information |
Locus tag | PICST_63574 |
Symbol | |
ID | 4840774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 239286 |
End bp | 240560 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640392089 |
Product | predicted protein |
Protein accession | XP_001386064 |
Protein GI | 126139083 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3616] Predicted amino acid aldolase or racemase |
TIGRFAM ID | |
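The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. Both assumptions fit the values listed but are not stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 239286
END_BP = 240560
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the terminal stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 240560 - 239286 + 1 gives the 1275 bp gene length, and 1275 / 3 - 1 gives the 424 aa protein length.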
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTATCCCT TTCAATTTGT TGCCCTTCCA GACAAGGAAG CTCTACTAAG GGCATACAAA AATAAAAAGC TTAATGAATT GCCCACTCCA TCTGTTGTTA TCGACCGAGC TGTCTTCCAA GAGAATTGCG AAAAGATGCT TGCCAACGCC GAAAGATTAA AAGTAGATTT TAGACCGCAT ATTAAGACCC ACAAAACACT TGAAGGAGCG AGACTACAAT TGGGTTCAGG TAGAAGGAAG TCAGATAAGA TTATTGTTTC AACTATGATG GAGGCTTGGA ACTTGCTTCC CCTTGTCAAT GAAGGCTTAT CCGTAACAGA TTTTCTCTAT AGCCTACCAG TAGTAAAACC CAGGGTGGCC GAATTGGCCG AATTTGCTAC TAAGATACCA CACTTGAGGT TATTAATTGA TCATAGGGAA CAATTAGATA TCTTATCTGA GTGGAGTGAA GCTCATCCTC ATTCTAAAAG ATGGTCGGTA TTCATCAAGA TTGACATGGG TACCCATAGA GCAGGATTAA CGAATGAAAG CCATAATCTT GGTGAAACAC TTCAGCATAT CCTTACGGAT GCCACATCAA GGAAAAATAT TGAGTTGTAT GGATTCTACT GCCATGCTGG TCATTCTTAT TCTTCAACAA CAGAGGATTC AGCAAAGGAA CTATTGCTTG AAGAGATTGT CCAAGCAAAC CATGCTGCAA TCGCTGCCAA AAGTATTGAC CCAAGTTTAC ATTTGAGGCT CTCAGTTGGT GCTACACCAA CTTCTCATGC TTCGGAAATA CTTACAATCG AAGAATTGGA ATCAGCGTTG GGTCCCAATA GTTTGCAGGG TACATTAGAA TTACATGCGG GAAACTATTG TTGCTGCGAT TTACAACAGC TTGCAACAGG CTGCATAAGA GAAGAAAACA TTTCACTTTC GGTTATTGCC CATGTTATAT CTACGTATCC AAAGAGAGGT GAGAAGACTC CGGGTGAACA GTTAATAAAT GCTGGAGTGG TAGCTTTATC TCGTGAGTCG GGGCCAATCA TTGGATATGG AAAGGTGATT GAACCTGCCG AGTACAACAA TTGGATAGTC GGAAGATTAA GCCAAGAGCA CGGTATCCTC GTACCCTTTG ATGAACATCA TGCTACAAAG TTCATTCCAA TTGGAACTCA AATCAGGATT GTTCCACAGC ACTCTTGTAT TACAGCAGCT TCTAATCCTT GGTTCTTCAT AGTCGATTCG GGCGACGTAG TTGTGGATGT TTGGGTTCCA TTTAGAGGAT GGTAA
|
Protein sequence | MYPFQFVALP DKEALLRAYK NKKLNELPTP SVVIDRAVFQ ENCEKMLANA ERLKVDFRPH IKTHKTLEGA RLQLGSGRRK SDKIIVSTMM EAWNLLPLVN EGLSVTDFLY SLPVVKPRVA ELAEFATKIP HLRLLIDHRE QLDILSEWSE AHPHSKRWSV FIKIDMGTHR AGLTNESHNL GETLQHILTD ATSRKNIELY GFYCHAGHSY SSTTEDSAKE LLLEEIVQAN HAAIAAKSID PSLHLRLSVG ATPTSHASEI LTIEELESAL GPNSLQGTLE LHAGNYCCCD LQQLATGCIR EENISLSVIA HVISTYPKRG EKTPGEQLIN AGVVALSRES GPIIGYGKVI EPAEYNNWIV GRLSQEHGIL VPFDEHHATK FIPIGTQIRI VPQHSCITAA SNPWFFIVDS GDVVVDVWVP FRGW
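The record lists translation table 12, in which CTG encodes serine rather than leucine, so the gene sequence should not be translated with the standard genetic code. The following is a minimal sketch of how the protein sequence and GC content could be recomputed from the gene sequence. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only the first 30 bases of the gene are used here to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# NCBI translation table 12, written in TCAG codon order.
# It differs from the standard table only at CTG (Ser instead of Leu).
AMINO_ACIDS_TABLE_12 = (
    "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS_TABLE_12)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0 with table 12, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE_12[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
gene_prefix = "ATGTATCCCT TTCAATTTGT TGCCCTTCCA"

if __name__ == "__main__":
    print(translate(gene_prefix))  # MYPFQFVALP
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the 424 aa protein and the reported GC content to be checked the same way.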
|