Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32048 |
Symbol | URA4 |
ID | 4839136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 556192 |
End bp | 557304 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390451 |
Product | Dihydroorotase |
Protein accession | XP_001384770 |
Protein GI | 150865519 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0418] Dihydroorotase |
TIGRFAM ID | [TIGR00856] dihydroorotase, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.898174 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGTAC CTACTGAGAT TGAATTAGGA ATCACTGCTG ATTTGCATGT TCACTTACGT GAAGGTGCCA TGATGGAGTT GATCACTCCT ACAGTAAAGC AGGGTGGATT TTCCATTGCA TACGTCATGC CCAACTTGGT CCCTCCAGTG ACTAGTATTG AGAGAGTGAC TACCTACCAC GAACTGTTGA AGAAATTGAG TCCTACCACG ACCTTCTTGA TGTCGTTCTA TTTAAGCAAG GAATTGACAC CAGAGTTGAT CGAAGAGGCA GGCTCTAAGA AGATTATCTA TGGTATCAAG TGCTATCCTG CTGGAGTCAC CACCAATTCT AAGTTTGGAG TTGATCCCAA CGACTTTTCA TCGTTCTATC CTATATTTGA AGTAATGCAA AAACATGGTT TGGTGTTGAA CATCCATGGA GAAAAGCCTG CTGTCAAGAA TACTACGCAA TCTGAAGAAG ATGACATTCA TGTGTTAAAT GCGGAACCAA AGTTCTTGCC TGCTTTAAGA AAATTACATC AAGATTTCCC CAAACTTAAA ATAGTGTTGG AACACTGCAC TACCCTGGAC GCAGTGGCAT TAATCAGGGA ACTCAACAAG GATACGAAGC CAGAAGACGA GGTGTATGTA GCTGGCACAA TTACTGCGCA CCATTTGTCT TTGACAATCG ACAATTGGGC TGGTAATCCA ATAAATTTCT GCAAGCCAGT CGCGAAATTG CCCAAGGACA AGCGAGCTTT GGTTGAAGCA GCAACTAGCG GAGAAAGATG GTTTTTCTTT GGGTCTGACT CAGCTCCTCA CCCAATCGAG GCCAAGAGCA CTCATGTTGG AGTCTGCGCT GGTGTTTATA CTCAAAGTCA TGCACTTGGC TACCTTGCTG ACGTATTCGA AGAACTGAAC AAGTTGGAAA ACTTAGTTAA ATTTGCAAGT ACAAACGGTC TCGGTTTCTA TGCGCAACCA CAAATTTTGG AACAGGCTGC GAAACTTGAC AAACAAAGGG CGTGGGTAGT CAAGAGACCA GTACAGGTAC CGGAAGTGAT TGCCAACCTG CAATTGAGAG TGGTTCCATT CAGAGCCGGA GAGACATTGA ACTGGGCTGT GGAATGGAGA TGA
|
Protein sequence | MSVPTEIELG ITADLHVHLR EGAMMELITP TVKQGGFSIA YVMPNLVPPV TSIERVTTYH ESLKKLSPTT TFLMSFYLSK ELTPELIEEA GSKKIIYGIK CYPAGVTTNS KFGVDPNDFS SFYPIFEVMQ KHGLVLNIHG EKPAVKNTTQ SEEDDIHVLN AEPKFLPALR KLHQDFPKLK IVLEHCTTSD AVALIRELNK DTKPEDEVYV AGTITAHHLS LTIDNWAGNP INFCKPVAKL PKDKRALVEA ATSGERWFFF GSDSAPHPIE AKSTHVGVCA GVYTQSHALG YLADVFEESN KLENLVKFAS TNGLGFYAQP QILEQAAKLD KQRAWVVKRP VQVPEVIANS QLRVVPFRAG ETLNWAVEWR
|
| |