Gene Information |
Locus tag | PICST_30337 |
Symbol | |
ID | 4837604 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2553594 |
End bp | 2554922 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388919 |
Product | conserved hypothetical protein |
Protein accession | XP_001382723 |
Protein GI | 150864042 |
COG category | |
COG ID | |
TIGRFAM ID | |
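
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database record: it assumes the start and end coordinates are 1-based and inclusive, that the gene is unspliced (as the record states), and that the CDS includes its stop codon. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a
    stop codon, which is not counted as an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above.
print(cds_lengths(2553594, 2554922))
```

Under these assumptions the result agrees with the listed Gene Length (1329 bp) and Protein Length (442 aa).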
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.18618 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACC CAGGTTATGT TATACTCTTC TTGGGGTTAT CGATTTCCTT ATTGTTGTCC CGTAAGCATG TTGTTTCATT AATTTCATCG TTCAGAAGCA AAACTGCAGC TGTCGATGAA GAAAAGGCAA GAAGAAATCT GTATGGAAGC GATGAACCAT TAAAGCCTCC TACTCCCTTG ATGATTACTC CAGAACAAGT TCTGAACTTT GACGATAGAC CATGGAGACC ATTCAGATGG CCATATCACC AGACTATGTC TATCTTCAAG TTGGATATGA ACCACTGGTT GGACATGGAC AAGTACTACG TTCACTACAT CGAAGAAAAG AAGAGAATTA TCCAAAAGTA TGGCAAGGAA AACATCGACT GGCTACCTGA CAGTGAGGAT GCCACTTTTG AACTCATGCA AACTGTTGTG GATCACCTCA TTGTTAGATA TCCATTGTTG TTCACTGTTT TGAAGGACGG GGACTTCTAC GAAGGTAAGG GAAAGATTAT CAAAAACGAG ATCACGAAAG AGATCTTGGA CATGACTTTA CCTTTGAAGG AACATCCTTT GATGTATGTG ACAAAGTTGG CCAAGGAAGA TTTCTACATT GTGAAGAAGA ACCCTGTGGA TGATTTACAT TACTTGGTTG CAGCTGCCGT CCCATTCCCT GGTGGATCTT TCGGAGTTGA CCACAAGATT GGTAAGACAT TGGATGTGAT TCACCTGGAC GTTCCCTACT ACAAGGAAAA GTTGAAGAAA TCGATGGAAA GATGGTTTGA CAGAATGAAG CCCAACGATC CTGTGGAAAG AGCTAGCTGG TATATCTCTT GGGATCACAA GTTGAAGGTC AACAATGTGT ACCAATTACC AAAATACGTA CCTAATTTGG TTGCAGACTT GGAATCCACC GACCCTCGTG AATTTAATGT TAGAGTTGAA AGACAGACGT TGAGAAGACT TCCAAGGTCG AACGCCATCA TCTTCACCAA CCACCCCATC TTCTACTCGA TTGAAGAAAT GAAGGACGAA CCTCTTGTTC CATCGTTGAT TAAAAAGATC ATATACGAGG GTCCCAAGGA TATCATCAAG TACAAGAACT TCGAAGTGTT CAGAGACCAC ATTGCTTCTT ACCTTGACGG CTTGATAAAG AGACAGATAG ACAAGGGTAT TATCAAGGAA GACACTCCAT TAAAGACGTT GCCCTCGTAT CCTTTTGCAC ACTGGGCCAA AACTGACTTT GACTTTGTCA ATGGCTGGAA CAACCCCAGT CCTGCGTACG ACAAGTCTGC CAACTACAGC GAGAAGGCCA AGAAGGAGTT GGTACATCAG AATGATTAG
|
Protein sequence | MIDPGYVILF LGLSISLLLS RKHVVSLISS FRSKTAAVDE EKARRNSYGS DEPLKPPTPL MITPEQVSNF DDRPWRPFRW PYHQTMSIFK LDMNHWLDMD KYYVHYIEEK KRIIQKYGKE NIDWLPDSED ATFELMQTVV DHLIVRYPLL FTVLKDGDFY EGKGKIIKNE ITKEILDMTL PLKEHPLMYV TKLAKEDFYI VKKNPVDDLH YLVAAAVPFP GGSFGVDHKI GKTLDVIHSD VPYYKEKLKK SMERWFDRMK PNDPVERASW YISWDHKLKV NNVYQLPKYV PNLVADLEST DPREFNVRVE RQTLRRLPRS NAIIFTNHPI FYSIEEMKDE PLVPSLIKKI IYEGPKDIIK YKNFEVFRDH IASYLDGLIK RQIDKGIIKE DTPLKTLPSY PFAHWAKTDF DFVNGWNNPS PAYDKSANYS EKAKKELVHQ ND
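
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on two short fragments of the gene sequence above: the first ten codons, and codons 44 to 50, which contain a CTG. It is an illustration only, written with the Python standard library; the names `build_codon_table`, `TABLE_1`, `TABLE_12`, and `translate` are hypothetical helpers, not part of any database tooling.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code (table 1), codons ordered TTT, TTC, TTA, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(aas: str) -> dict[str, str]:
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, aas))


TABLE_1 = build_codon_table(STANDARD_AAS)
# Table 12 differs from table 1 in a single sense codon: CTG encodes serine.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna: str, table: dict[str, str]) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGATCGACC CAGGTTATGT TATACTCTTC"  # codons 1-10 of the gene
ctg_fragment = "AGAAGAAATCTGTATGGAAGC"              # codons 44-50, contains CTG

print(translate(first_codons, TABLE_12))  # MIDPGYVILF
print(translate(ctg_fragment, TABLE_12))  # RRNSYGS
print(translate(ctg_fragment, TABLE_1))   # RRNLYGS
```

With table 12 both fragments match the listed protein sequence; with the standard code the CTG codon would be misread as leucine.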