Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32230 |
Symbol | |
ID | 4839175 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 998997 |
End bp | 1000353 |
Gene Length | 1357 bp |
Protein Length | 433 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640390490 |
Product | predicted protein |
Protein accession | XP_001384861 |
Protein GI | 150865586 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTATTC ACTTGCTTTC AAGACTCCCA GGGGAGTTAC CCGTTTCAAG AAACCTGCTT AATGCCATGG GGAGCCTTTT ATCCCCAGAA TTTGAGTATC TTAACGAGTT TTTGCCATCT CATCTAGATT CGAAATCAAA TTCTTTGGAT CTAGATGTTG GACTGATTTC TTCTAAAGTT GAATCAAACC TTCCAGTTAG TGATGAATTC GATAGCTTGT TCAGGTATCT TCTAACCCCA ACACTTGTTG GACCATCTCC AAATTCATTT TACTACGATT ATGTTGCCTG CGCCGACAAC AATGAGTTCT TGGGTATATC TGAACCAATA GCTAGTCAAC CTAGAGAATT CAGTCTTTCT GTGGACGGAA TCAAAGTAAC AACCTCTGAA GACGATACTT TCTTCCAAAA TCTCCTGTCT CCCAATAGGT CTGATGCTGT GAGTACTCGT GAACCAGGTG AAACAGTTGT GCAACACCAA CTGGAATTAG CAACAAGGCT GCAGAGTCAT CGGGAGGTTG TAGCACACCA AAAAGCTAGA ACTACGACTG GATCTAAGCC CAGCTCCACA TACAAGGTAG TGAAGCATAG GCCAAAGGGA TCTAAACAAG CTTGTGTTCC AATTAAAATT TCATACGAGA AACTTAAATT GACAACAAAG CTTGGAGCTG AACTCTCGGA TTCGTTTGTC GAAACTGTTG AATCCAGTAT GTCAGCGTCG GTCCGTGCAA TGTTAGCCGA AAGAAAATTA CCAGAGGAGC TTGAAAATGG TGCCTCAAGG TGTAAGATTG ATAGACAAGT CTATGAAAGA CCTCTATTGA TTGAAGAAAT GGAAAAGTTC TGTGGCCATC CAAAGGTGAG ATATATTCGA AACTCAAACT TTGGACGGAC TCCCTACGAA GCAGAGTACT ACCTGTACCA GGTGGACAAT AAGGGTCAAC TGATCAACCA TACAAGACAT GGTTTGTGTC CATATTGTCC AGAAGTCCTG TTCTTCAAGT TGAAGAATTC TGCCTACGGG AACCATTTAG GCAACATTCA CGGCATCCGC ACGAACGGGT CCCTTTTTCC AGATCCAATT CTCCCAGGAA TCTACTTGAT GGCCAAGAGT GAATTTGTAG AAACTGAAAG AAAAACTCTA GCCAAAGAGA GAGCTACAGC TGGGGTGGTA TGTCAAGCAT GTTATACAAT TCAAGAGATG CAGTGCACCC TGAGAAGCAC AGATTTGGGA CACTATCTTC GACATTACCG AGACAACCAT GTCAAATGCA AAAGCAGAAG CAAAGGTTGT CGCTCTGGTC GGAGTAATTT GGAATATAAT TAGAAAGACT TTGGTGTTTT TGGTTATCGT CAACTAG
|
Protein sequence | MSIHLLSRLP GELPVSRNSL NAMGSLLSPE FEYLNEFLPS HLDSKSNSLD LDVGSISSKV ESNLPVSDEF DSLFRYLLTP TLVGPSPNSF YYDYVACADN NEFLGISEPI ASQPREFSLS VDGIKVTTSE DDTFFQNLSS PNRSDAVSTR EPGETVVQHQ SELATRSQSH REVVAHQKAR TTTGSKPSST YKVVKHRPKG SKQACVPIKI SYEKLKLTTK LGAELSDSFV ETVESSMSAS VRAMLAERKL PEELENGASR CKIDRQVYER PLLIEEMEKF CGHPKVRYIR NSNFGRTPYE AEYYSYQVDN KGQSINHTRH GLCPYCPEVS FFKLKNSAYG NHLGNIHGIR TNGSLFPDPI LPGIYLMAKS EFVETERKTL AKERATAGVI WDTIFDITET TMSNAKAEAK VVASVGVIWN IIRKTLVFLV IVN
|
| |