Gene PICST_83093 details


Gene Information

Locus tag: PICST_83093
Symbol: ARO3
ID: 4839037
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 671505
End bp: 672939
Gene Length: 1435 bp
Protein Length: 372 aa
Translation table: 12
GC content: 45%
IMG OID: 640390352
Product: 2-dehydro-3-deoxy-phosphoheptonate aldolase
Protein accession: XP_001384425
Protein GI: 126135802
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
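
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that one stop codon follows the 372 codons of the protein. The variable names are illustrative only, and reading the remainder as intron or untranslated sequence is an interpretation rather than something the record states.

```python
# Values copied from the record above.
start_bp = 671505
end_bp = 672939
protein_length_aa = 372

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: three bases per residue plus a single stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that the coding sequence does not account for.
# For a spliced gene these would be intron and/or untranslated bases.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 1435, matching the Gene Length field
print(coding_bp)       # 1119
print(non_coding_bp)   # 316
```

Under these assumptions the coordinate span reproduces the stated 1435 bp, and the difference from the coding length is consistent with the gene being flagged as spliced.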


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.184515
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGTTAAACGA AAAGATCCAT AGCAGCCAAA TAGCGAAAAA TTACGCTTCT TGCTGATTGC 
ACTATACTCG ATCTAATTAC ATAAGATGTT CATCACTAAC GAGCACGTTG GAGACAGATC
CAGAATGGAG GACTGGAGAG GTAAGTAAAA GAAGCCAGAT AATAGACACG CTGATGGCAC
ATCACATATC CAGGCATGTG GATATGTCTG GCAGTTCTAG AATCTCTTTT GTGTACATGA
TACTCAAAGT CGTGATCATC AGAGAATACT TTGGAGCATA TGTGATAAGA TAGAACATAC
TTTACATTCA GAATACTAAC ATACTAGTCC GTGGTTACGA CCCTTTGACT CCTCCTGACT
TATTGCAGCA CGAGTACCCC TTGCCTGAGT CTTCCCAAAA GACCATCTTG GACGGTAGAA
ACCAAGCTGT AGACATCTTG AACGGTAAGG ATGACAGATT GATCTTGGTC ATTGGACCAT
GTTCAATCCA CGACCCCAAG GCAGCTCTCG ATTACGCTGA CAGATTGCAC AAAGAGGCTC
AGAAGCATAA AGGTGAGTTG CATATCGTCA TGAGAGCGTA CTTGGAAAAA CCCAGAACCA
CTGTAGGCTG GAAAGGTTTG ATCAACGATC CTGAAATCGA CGGTTCTTTC CAGATTAACA
AGGGGTTGAG AATCTCGAGA AAGTTATTTG TCGAATTGAC CTCCAAGTTG CCCATTGCCG
GAGAAATGTT GGATACTATT TCTCCGCAGT TCTTGTCCGA TTTGTTCTCT GTGGGAGCTA
TTGGTGCCAG AACTACTGAG TCTCAGTTGC ACCGTGAGTT GGCTTCTGGC TTGTCCTTCC
CTGTGGGGTT CAAGAACGGT ACTGACGGTA CTTTGGGTGT AGCTGTAGAT GCCATGAGAG
CTGCATCTCA CCCTCATCAC TTCCTTTCTG TCACCAAGCC TGGTATTGTA GCCATTGTAG
GAACTGACGG AAATCAGGAC ACTTTTGTAA TCTTGAGAGG TGGAAAGAAG GGTACCAATT
ACGATGAGAA GTCCGTCAAG GAGGCCAAGG CAGAGTTGAT CAAGGCCAAA GTGGTCAGTG
AAGAAAAGCC AGGACCAAAG ATCATGGTTG ATTGCTCCCA CGGTAACTCT AATAAGGATC
ACAGAAACCA ACCCAAGGTA GCAGCTGAAG TAGCACGTCA AGTTGCCAAC GGTGAAGACG
GTATTTGTGG TTTGATGATC GAGTCCAACA TCGTCGAAGG CAGACAAGAC GTCCCACCTT
TGGAGCAAGG AGGCAAGGAT GCATTGAAGT ACGGTTGTTC TATCACCGAT GCCTGTATTG
GCTGGGACAG CACTGAAGAA GTGTTGCAGT TGTTGGCTGA TGCTGTCAAG GCCAGAAGAG
CCTTGAAGTA GGTAAACTAA AACTTTGTAT ATAATAACAC ATGAAGATGT AATGG
 
Protein sequence
MFITNEHVGD RSRMEDWRVR GYDPLTPPDL LQHEYPLPES SQKTILDGRN QAVDILNGKD 
DRLILVIGPC SIHDPKAALD YADRLHKEAQ KHKGELHIVM RAYLEKPRTT VGWKGLINDP
EIDGSFQINK GLRISRKLFV ELTSKLPIAG EMLDTISPQF LSDLFSVGAI GARTTESQLH
RELASGLSFP VGFKNGTDGT LGVAVDAMRA ASHPHHFLSV TKPGIVAIVG TDGNQDTFVI
LRGGKKGTNY DEKSVKEAKA ELIKAKVVSE EKPGPKIMVD CSHGNSNKDH RNQPKVAAEV
ARQVANGEDG ICGLMIESNI VEGRQDVPPL EQGGKDALKY GCSITDACIG WDSTEEVLQL
LADAVKARRA LK
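
The sequences above are printed in groups of 10 characters, 60 per line, so they need the whitespace removed before lengths or base composition can be computed. A minimal sketch of that cleanup follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only two short lines from the record are used as sample input, not the full sequences.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C characters in a cleaned nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Sample input: the first gene sequence line and the last protein line.
first_gene_line = "CGTTAAACGA AAAGATCCAT AGCAGCCAAA TAGCGAAAAA TTACGCTTCT TGCTGATTGC"
last_protein_line = "LADAVKARRA LK"

dna = clean_sequence(first_gene_line)
print(len(dna))                                # 60
print(len(clean_sequence(last_protein_line)))  # 12
print(gc_fraction(dna))                        # GC fraction of this one line only
```

Passing the whole gene or protein block through `clean_sequence` gives a string whose length can be compared with the Gene Length and Protein Length fields, and `gc_fraction` on the full gene sequence can be compared with the GC content field.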