Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_83093 |
Symbol | ARO3 |
ID | 4839037 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 671505 |
End bp | 672939 |
Gene Length | 1435 bp |
Protein Length | 372 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640390352 |
Product | 2-dehydro-3-deoxy- phosphoheptonate aldolase |
Protein accession | XP_001384425 |
Protein GI | 126135802 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.184515 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGTTAAACGA AAAGATCCAT AGCAGCCAAA TAGCGAAAAA TTACGCTTCT TGCTGATTGC ACTATACTCG ATCTAATTAC ATAAGATGTT CATCACTAAC GAGCACGTTG GAGACAGATC CAGAATGGAG GACTGGAGAG GTAAGTAAAA GAAGCCAGAT AATAGACACG CTGATGGCAC ATCACATATC CAGGCATGTG GATATGTCTG GCAGTTCTAG AATCTCTTTT GTGTACATGA TACTCAAAGT CGTGATCATC AGAGAATACT TTGGAGCATA TGTGATAAGA TAGAACATAC TTTACATTCA GAATACTAAC ATACTAGTCC GTGGTTACGA CCCTTTGACT CCTCCTGACT TATTGCAGCA CGAGTACCCC TTGCCTGAGT CTTCCCAAAA GACCATCTTG GACGGTAGAA ACCAAGCTGT AGACATCTTG AACGGTAAGG ATGACAGATT GATCTTGGTC ATTGGACCAT GTTCAATCCA CGACCCCAAG GCAGCTCTCG ATTACGCTGA CAGATTGCAC AAAGAGGCTC AGAAGCATAA AGGTGAGTTG CATATCGTCA TGAGAGCGTA CTTGGAAAAA CCCAGAACCA CTGTAGGCTG GAAAGGTTTG ATCAACGATC CTGAAATCGA CGGTTCTTTC CAGATTAACA AGGGGTTGAG AATCTCGAGA AAGTTATTTG TCGAATTGAC CTCCAAGTTG CCCATTGCCG GAGAAATGTT GGATACTATT TCTCCGCAGT TCTTGTCCGA TTTGTTCTCT GTGGGAGCTA TTGGTGCCAG AACTACTGAG TCTCAGTTGC ACCGTGAGTT GGCTTCTGGC TTGTCCTTCC CTGTGGGGTT CAAGAACGGT ACTGACGGTA CTTTGGGTGT AGCTGTAGAT GCCATGAGAG CTGCATCTCA CCCTCATCAC TTCCTTTCTG TCACCAAGCC TGGTATTGTA GCCATTGTAG GAACTGACGG AAATCAGGAC ACTTTTGTAA TCTTGAGAGG TGGAAAGAAG GGTACCAATT ACGATGAGAA GTCCGTCAAG GAGGCCAAGG CAGAGTTGAT CAAGGCCAAA GTGGTCAGTG AAGAAAAGCC AGGACCAAAG ATCATGGTTG ATTGCTCCCA CGGTAACTCT AATAAGGATC ACAGAAACCA ACCCAAGGTA GCAGCTGAAG TAGCACGTCA AGTTGCCAAC GGTGAAGACG GTATTTGTGG TTTGATGATC GAGTCCAACA TCGTCGAAGG CAGACAAGAC GTCCCACCTT TGGAGCAAGG AGGCAAGGAT GCATTGAAGT ACGGTTGTTC TATCACCGAT GCCTGTATTG GCTGGGACAG CACTGAAGAA GTGTTGCAGT TGTTGGCTGA TGCTGTCAAG GCCAGAAGAG CCTTGAAGTA GGTAAACTAA AACTTTGTAT ATAATAACAC ATGAAGATGT AATGG
|
Protein sequence | MFITNEHVGD RSRMEDWRVR GYDPLTPPDL LQHEYPLPES SQKTILDGRN QAVDILNGKD DRLILVIGPC SIHDPKAALD YADRLHKEAQ KHKGELHIVM RAYLEKPRTT VGWKGLINDP EIDGSFQINK GLRISRKLFV ELTSKLPIAG EMLDTISPQF LSDLFSVGAI GARTTESQLH RELASGLSFP VGFKNGTDGT LGVAVDAMRA ASHPHHFLSV TKPGIVAIVG TDGNQDTFVI LRGGKKGTNY DEKSVKEAKA ELIKAKVVSE EKPGPKIMVD CSHGNSNKDH RNQPKVAAEV ARQVANGEDG ICGLMIESNI VEGRQDVPPL EQGGKDALKY GCSITDACIG WDSTEEVLQL LADAVKARRA LK
|
| |