Gene PICST_67165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_67165 
SymbolHIS1 
ID4837608 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2660226 
End bp2661416 
Gene Length1191 bp 
Protein Length304 aa 
Translation table12 
GC content40% 
IMG OID640388923 
ProductATP phosphoribosyltransferase (ATP-PRTase) (ATP-PRT) 
Protein accessionXP_001383262 
Protein GI150864445 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0040] ATP phosphoribosyltransferase 
TIGRFAM ID[TIGR00070] ATP phosphoribosyltransferase
[TIGR03455] ATP phosphoribosyltransferase, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AAAATTGCAA GTTGAAAAAT TACCAGATTT TCTTTTTGTA GTTCTCCAGC CAATTCTAAC 
TATACACAAT GGATTTAGTT AACCACCTCC CAGACCGTTT GTTGTTTGCT GTGCCCAAGA
GTATGTGATC TTGTCTATCG AAATTTTACA GATTTTTCAG ATTTTTCAGT CCGATAAACG
ATTTCAGATT TCGTTCTCCG ATTTAAAAAT AAATACCTAG CTATAAGTTA TCATTGTCAT
ATTGTTACTA ACTATATTCA GAGGGTCGTT TATATGAAAA GTGTTGCAAC TTGTTGCAGG
GTGCTGACAT CCAGTTCAGA CGTTCCAACA GATTGGATAT TGCCCTTTCC ACCAACTTGC
CTGTTGCCTT GATTTTCTTG CCTGCTGCCG ACATTCCTAT TTTTGTTGGA GAAGGTAACT
GTGATTTGGG TATAACCGGT TTAGACCAGA TCCAAGAAGC TCTGATGCTT GACCATACTG
AGGACTTGTT GGACTTGAAC TTTGGCTCTT GTAAGTTACA GATTCAGGTT CCAGCCGAAG
GAGACATCAC TACTCCTGAG CAATTAGTCG GTAAGAAGAT TGTTTCGTCG TTTACCAAGT
TGTCTACCAA CTACTTTAAG TCATTGGAAA AGGTTTCGGA TGAATCACAA TTAACAACTA
GCATCAGATA CGTTGGTGGA TCTGTCGAAG CTTCATGTGC CTTAGGAGTT GCTGATGCAA
TTGTCGATTT GGTAGAAAGT GGTGAAACCA TGAAGGCTGC CGGATTGAAG CCTATAGAAA
CTATTTTACA GACCTCGGCC CATTTGATCT CGTCTAAGAA CCCCAAGTTT CCCGAGTTAG
TTGAAATCAT CCACCAAAGA TTCGAAGGTA TCTTGGCTGC TCAGAAGTAC GTATTGTGTA
ACTACAATGC TCCAAGAAGA TTGTTGCACG ACGTGTTAAG TATTACTCCT GGAAGAAGAG
CTGCCACCGT GTCACCTTTA GAAAAACACA ACCCAGAAGA CGAAGACTGG GTGGCCATAT
CGTCAATGGT GGAGAGAAAG GCTATCGGAG ACAAGATGGA CTTGTTGAAG AAGAGTGGAG
CCTCCGACAT CTTGGTGTTT GAAATCAGCA ACTGTAGAGT TTAGACCAAA ATTACAAAAC
AATTTACAAA TAATTACATA TACATAATTA ATAGACTTCA TTTATTCTGT C
 
Protein sequence
MDLVNHLPDR LLFAVPKKGR LYEKCCNLLQ GADIQFRRSN RLDIALSTNL PVALIFLPAA 
DIPIFVGEGN CDLGITGLDQ IQEASMLDHT EDLLDLNFGS CKLQIQVPAE GDITTPEQLV
GKKIVSSFTK LSTNYFKSLE KVSDESQLTT SIRYVGGSVE ASCALGVADA IVDLVESGET
MKAAGLKPIE TILQTSAHLI SSKNPKFPEL VEIIHQRFEG ILAAQKYVLC NYNAPRRLLH
DVLSITPGRR AATVSPLEKH NPEDEDWVAI SSMVERKAIG DKMDLLKKSG ASDILVFEIS
NCRV