Gene PICST_49451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_49451 
SymbolPUR7 
ID4840515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp1091703 
End bp1092623 
Gene Length921 bp 
Protein Length306 aa 
Translation table12 
GC content41% 
IMG OID640391830 
ProductPhosphoribosylaminoimidazole-succinocarboxamide synthase (SAICAR synthetase) 
Protein accessionXP_001386402 
Protein GI126139759 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0152] Phosphoribosylaminoimidazolesuccinocarboxamide (SAICAR) synthase 
TIGRFAM ID[TIGR00081] phosphoribosylaminoimidazole-succinocarboxamide synthase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.471375 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTGC ACACTACAGA ATTAGACAAC ATCTTGCCTT TGGTTACCAG GGGTAAAGTC 
AGAGATATCT ACCAAGTGGA TGAAAACACT TTATTGTTTG TAGCTACAGA CAGAATCTCT
GCTTATGATG TGATAATGGA TAATGCTGTT CCTGAGAAGG GGAAGCTCTT GACCAAGTTG
TCTGAATTCT GGTTCGAGTT TTTGTCGCAG ACTATACCTA ACCATCTTAT CCTATCTAAG
AACGATGACG AAAGTTTGTT TGCTAAATTG CCAGCTCAAT TGTCTGAAGC AAAGTATAAA
TTGCAATTGT CAGGTAGATC GTTGTTGGTG AGAAAGTTGA AGTTGATTCC ACTAGAAGTG
ATTGTCAGAG GTTACATAAC GGGCTCTGCT TGGAAGGAGT ACAAGAAGAC TCAGACTGTT
CATGGTTTGT CAGTTGAAGC TGGCTTGTTG GAATCACAAG AATTTGCAAC TCCAATTTTC
ACTCCATCGA CCAAAGCTGA TCAAGGTGAA CATGATGAAA ACATTTCCCC AGAAAAGGCT
GCTGAGATTG TTGGCCAAGA ATTGTGTGAT AAATTAGCCA AAGCTGCTAT CGAATTGTAC
ACGAAGGCTA AGGAGTACGC AAAGACTAGA GGCATCATCA TAGCTGATAC CAAGTTCGAA
TTCGGTTTGG ATACTGACCA CAATTTGGTT TTGGTTGATG AAGTTTTGAC TCCAGATTCT
TCCAGATTCT GGAATGCTTC TGCCTACAAA TTAGGCAAAT CTCAAGAATC TTATGACAAG
CAATTTTTGA GAGACTGGTT AACTTCGAAC GGCATTGCTG GCAAGGACGG AGTTAAGATG
GACGAAGATA TTGTCGCAAG GACCAGAGCC AAGTACATCG AAGCATACGA AGCTATCACT
GGCGACAAAT GGACCTCTTA A
 
Protein sequence
MSLHTTELDN ILPLVTRGKV RDIYQVDENT LLFVATDRIS AYDVIMDNAV PEKGKLLTKL 
SEFWFEFLSQ TIPNHLILSK NDDESLFAKL PAQLSEAKYK LQLSGRSLLV RKLKLIPLEV
IVRGYITGSA WKEYKKTQTV HGLSVEAGLL ESQEFATPIF TPSTKADQGE HDENISPEKA
AEIVGQELCD KLAKAAIELY TKAKEYAKTR GIIIADTKFE FGLDTDHNLV LVDEVLTPDS
SRFWNASAYK LGKSQESYDK QFLRDWLTSN GIAGKDGVKM DEDIVARTRA KYIEAYEAIT
GDKWTS