Gene PICST_65481 details

Gene Information

Locus tag: PICST_65481
Symbol: HIS7
ID: 4838325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1475977
End bp: 1477792
Gene Length: 1816 bp
Protein Length: 588 aa
Translation table: 12
GC content: 39%
IMG OID: 640389640
Product: imidazole glycerol phosphate synthase
Protein accession: XP_001383902
Protein GI: 150864898
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0107] Imidazoleglycerol-phosphate synthase
        [COG0118] Glutamine amidotransferase
TIGRFAM ID: [TIGR00735] imidazoleglycerol phosphate synthase, cyclase subunit
            [TIGR01855] imidazole glycerol phosphate synthase, glutamine amidotransferase subunit
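
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported 1816 bp) and that the unspliced coding region is the 588 codons plus one stop codon. The names `span_length`, `coding_length` and `flanking_length` are illustrative only and do not come from the record.

```python
# Values copied from the gene record.
START_BP = 1475977
END_BP = 1477792
PROTEIN_LENGTH_AA = 588


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


# Full gene span on the replicon.
gene_length = span_length(START_BP, END_BP)

# Assumed coding length: one codon per residue plus one stop codon.
coding_length = (PROTEIN_LENGTH_AA + 1) * 3

# Bases in the gene span that are not accounted for by the coding region.
flanking_length = gene_length - coding_length

print(gene_length, coding_length, flanking_length)
```

Under these assumptions the span matches the listed gene length, and the difference between the span and the coding length is the number of bases in the listed gene sequence that lie outside the open reading frame.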


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.361476
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.151399
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTACATTAAA AAAATGGTAA AGTCAGTTTA TGTAATTGAC GTAGAGAGTG GAAACTTGCA 
GTCATTGGCC AATGCCATAA AGCGTATTGG GGAATACGAC GTCAAATTCA TTCGTAATGC
GGATGATTTC GAACAGTATG ACAATGACAT TGAAAAGCTC ATCTTTCCAG GGGTCGGAAA
CTATGGACAC TTCGTGAGGG AAATGCATGC GAGAAACTTA ATTAAACCAA TCAACAAGTA
CATTGATAGT GGAAGATCAT TGATGGGTAT TTGTGTTGGG TTGCAAGCAT TCTTTGACAG
CTCAGAAGAA AGCCCAGGAG TGGATTTTCG TGGATTGGGT TTTTTAAAAT TGAGATTAGC
CAAGTTTAAC ATTCATGATC CAATATTCGA AGAGAAAAAA TTAAAGAAGT CTGTTCCACA
TATCGGTTGG AACAGTATAA CCGATATCAA GATTGGTTGC AACTCTTTAG AGAGGTCTAA
ATCGTTATAC CATATCAACA CATTTAACAA ATATTATTTT GTTCATTCCT ACGCAGCTAT
TATAAATGAT GAGAACAAAC ATATTTTGGA AAAGGCTTCT AAAGAGGGCT GGAACTTTGC
AATTGCAAGA TATGGTTCAG AAAAATTCCT AGCAGCAATT AACTACAAGA ATTTCTTTGC
AACTCAATTT CACCCGGAAA AGTCTGGTTT GGCTGGCTTA AGGGTTATAA AATCTTTCTT
AGAAAGTATT CAGTTTGCCG ATGTTGATAA ATCTATCATT CAAGAAGTTG TTGGAGTCGA
GCAATCCTTA GGTGGAACCA CCAGAAGAAT CATAGCATGT TTAGACGTCA GATCTAACGA
TGAGGGCGAT TTGGTTGTCA CAAAAGGTGA TCAATACAAC GTCCGCGAAA CTGCCCTGAG
TGAAAGCAAA GTTAGAAATC TTGGAAAACC AGTTGAATTA GCGACCAGAT ATTACAATCA
AGGTGCTGAC GAAGTTACCT TTCTAAACAT TACCTCTTTC CGTAACTCTC CGTTAAAAGA
CTTGCCCATG CTTCAAGTTT TGAGCAAAGC CGCTGAAACC ATTTTTGTCC CTTTAACAGT
TGGTGGTGGT ATTAAGGATA TGACAGACCC AGAAACCGGC AAATTAGTGC CTGCGGTTAA
GGTTGCTGAT TTGTATTTCA GATCTGGAGC AGACAAAGTT AGTATTGGAA GTGATGCTGT
TACTATTGCA GAAGAGTATT ATGCAAATGG AAAGCAAAAA ACTGGCAAAA CATCTATCGA
AAGCATTTCT GCAACTTTTG GTGCGCAAGC TGTGGTTATA TCCGTTGATC CAAAGAGAAA
GTATGCTGCT AGTCCGATGG AAACCAAGAT GCAAACCATA AAAATTGTAG ACCCAGCCAA
GTTTGGACCT AATGGCGAAC AGTACTGCTA CTACCAAGTT ACTTCACAAG GAGGGAGAAA
GGTCCACGAG TTGGGCGCTC TTGAATTATG TACCGCTTGC GAGGAATTGG GTGCAGGTGA
AATATTATTG AACTCGATCG ATCATGATGG GTCCAACAAG GGATACAATC TCGAATTATT
GACTCAAATC AAGAGCAACG TTTCCATCCC AGTAATTGCA AGTTCTGGTG CTGGTAATCC
GCAACATTTC CAAGACGCTT TCGAATTGGA ATGTGGAATT GACGCTGCAT TGGGAGCAGG
AATGTTTCAC AGAGGTGAAT ACGAAGTCAA TGACGTTAAA AAGTATCTTC AGACCAATGG
CAAGATGGAC GTTCGATTAG ATGAAGAAGT AGAATTATAA ATCATATAAA TTTTGTATAT
AGTGCAGTCA TTATCG
 
Protein sequence
MVKSVYVIDV ESGNLQSLAN AIKRIGEYDV KFIRNADDFE QYDNDIEKLI FPGVGNYGHF 
VREMHARNLI KPINKYIDSG RSLMGICVGL QAFFDSSEES PGVDFRGLGF LKLRLAKFNI
HDPIFEEKKL KKSVPHIGWN SITDIKIGCN SLERSKSLYH INTFNKYYFV HSYAAIINDE
NKHILEKASK EGWNFAIARY GSEKFLAAIN YKNFFATQFH PEKSGLAGLR VIKSFLESIQ
FADVDKSIIQ EVVGVEQSLG GTTRRIIACL DVRSNDEGDL VVTKGDQYNV RETASSESKV
RNLGKPVELA TRYYNQGADE VTFLNITSFR NSPLKDLPML QVLSKAAETI FVPLTVGGGI
KDMTDPETGK LVPAVKVADL YFRSGADKVS IGSDAVTIAE EYYANGKQKT GKTSIESISA
TFGAQAVVIS VDPKRKYAAS PMETKMQTIK IVDPAKFGPN GEQYCYYQVT SQGGRKVHEL
GALELCTACE ELGAGEILLN SIDHDGSNKG YNLELLTQIK SNVSIPVIAS SGAGNPQHFQ
DAFELECGID AALGAGMFHR GEYEVNDVKK YLQTNGKMDV RLDEEVEL
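
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the difference on a short in-frame fragment of the gene sequence above (`GAAA CTGCCCTGAG TGAAAGCAAA`), which corresponds to the residues `ETASSESK` in the protein sequence. The codon strings follow the NCBI genetic code tables in TCAG order; the helper names `codon_map` and `translate` are hypothetical and chosen for illustration only.

```python
BASES = "TCAG"

# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Table 12 differs from the standard code only at CTG (Leu -> Ser).
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_map(amino_acids: str) -> dict:
    """Build a codon -> amino acid lookup from a 64-letter table string."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


def translate(dna: str, amino_acids: str) -> str:
    """Translate an in-frame DNA string; spaces are ignored, '*' marks a stop."""
    table = codon_map(amino_acids)
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# In-frame fragment copied from the gene sequence; it contains one CTG codon.
fragment = "GAAA CTGCCCTGAG TGAAAGCAAA"

print(translate(fragment, TABLE_12))  # ETASSESK
print(translate(fragment, STANDARD))  # ETALSESK
```

Only the table 12 result agrees with the listed protein sequence, which is why a standard-code translation of this gene would not reproduce the record.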