Gene PICST_32048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32048 
SymbolURA4 
ID4839136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp556192 
End bp557304 
Gene Length1113 bp 
Protein Length370 aa 
Translation table12 
GC content44% 
IMG OID640390451 
ProductDihydroorotase 
Protein accessionXP_001384770 
Protein GI150865519 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0418] Dihydroorotase 
TIGRFAM ID[TIGR00856] dihydroorotase, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.898174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTAC CTACTGAGAT TGAATTAGGA ATCACTGCTG ATTTGCATGT TCACTTACGT 
GAAGGTGCCA TGATGGAGTT GATCACTCCT ACAGTAAAGC AGGGTGGATT TTCCATTGCA
TACGTCATGC CCAACTTGGT CCCTCCAGTG ACTAGTATTG AGAGAGTGAC TACCTACCAC
GAACTGTTGA AGAAATTGAG TCCTACCACG ACCTTCTTGA TGTCGTTCTA TTTAAGCAAG
GAATTGACAC CAGAGTTGAT CGAAGAGGCA GGCTCTAAGA AGATTATCTA TGGTATCAAG
TGCTATCCTG CTGGAGTCAC CACCAATTCT AAGTTTGGAG TTGATCCCAA CGACTTTTCA
TCGTTCTATC CTATATTTGA AGTAATGCAA AAACATGGTT TGGTGTTGAA CATCCATGGA
GAAAAGCCTG CTGTCAAGAA TACTACGCAA TCTGAAGAAG ATGACATTCA TGTGTTAAAT
GCGGAACCAA AGTTCTTGCC TGCTTTAAGA AAATTACATC AAGATTTCCC CAAACTTAAA
ATAGTGTTGG AACACTGCAC TACCCTGGAC GCAGTGGCAT TAATCAGGGA ACTCAACAAG
GATACGAAGC CAGAAGACGA GGTGTATGTA GCTGGCACAA TTACTGCGCA CCATTTGTCT
TTGACAATCG ACAATTGGGC TGGTAATCCA ATAAATTTCT GCAAGCCAGT CGCGAAATTG
CCCAAGGACA AGCGAGCTTT GGTTGAAGCA GCAACTAGCG GAGAAAGATG GTTTTTCTTT
GGGTCTGACT CAGCTCCTCA CCCAATCGAG GCCAAGAGCA CTCATGTTGG AGTCTGCGCT
GGTGTTTATA CTCAAAGTCA TGCACTTGGC TACCTTGCTG ACGTATTCGA AGAACTGAAC
AAGTTGGAAA ACTTAGTTAA ATTTGCAAGT ACAAACGGTC TCGGTTTCTA TGCGCAACCA
CAAATTTTGG AACAGGCTGC GAAACTTGAC AAACAAAGGG CGTGGGTAGT CAAGAGACCA
GTACAGGTAC CGGAAGTGAT TGCCAACCTG CAATTGAGAG TGGTTCCATT CAGAGCCGGA
GAGACATTGA ACTGGGCTGT GGAATGGAGA TGA
 
Protein sequence
MSVPTEIELG ITADLHVHLR EGAMMELITP TVKQGGFSIA YVMPNLVPPV TSIERVTTYH 
ESLKKLSPTT TFLMSFYLSK ELTPELIEEA GSKKIIYGIK CYPAGVTTNS KFGVDPNDFS
SFYPIFEVMQ KHGLVLNIHG EKPAVKNTTQ SEEDDIHVLN AEPKFLPALR KLHQDFPKLK
IVLEHCTTSD AVALIRELNK DTKPEDEVYV AGTITAHHLS LTIDNWAGNP INFCKPVAKL
PKDKRALVEA ATSGERWFFF GSDSAPHPIE AKSTHVGVCA GVYTQSHALG YLADVFEESN
KLENLVKFAS TNGLGFYAQP QILEQAAKLD KQRAWVVKRP VQVPEVIANS QLRVVPFRAG
ETLNWAVEWR