Gene Information |
Locus tag | PICST_36448 |
Symbol | GAH1 |
ID | 4839345 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 1528086 |
End bp | 1529711 |
Gene Length | 1626 bp |
Protein Length | 501 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390660 |
Product | guanine deaminase (Guanase) (Guanine aminase) (Guanine aminohydrolase) (GAH) |
Protein accession | XP_001384984 |
Protein GI | 150865670 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02967] guanine deaminase |
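The length fields in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene span includes the stop codon; the function names are hypothetical. Under those assumptions, the gene span is 120 bp longer than the 501 aa protein needs, which is consistent with the gene being flagged as spliced.

```python
# Values copied from the Gene Information record above.
START_BP = 1528086
END_BP = 1529711
PROTEIN_LENGTH_AA = 501


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Nucleotides needed for the protein plus one stop codon (assumed)."""
    return 3 * (protein_length_aa + 1)


gene_len = gene_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
# Bases inside the gene span that are not needed to encode the protein.
non_coding = gene_len - cds_len

print(gene_len, cds_len, non_coding)  # 1626 1506 120
```

The first number matches the Gene Length field. The last number is only an estimate of total non-coding (presumably intronic) sequence, since it depends on the stop-codon assumption stated above.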
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0130969 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCCTCCA ACTCTCCACT CATCGAACCT AAAGCTTCGT CCAGCATTGG CTACACCTTG TACTATGGTA CGTTTGTTCA CACGCCTACG TTGGAAGAAC TTGAAATCTG TTTCAACACA TTGGTCGGAG TTACGCTGGA CGGGGAAATC GACTACATCC ACAAGAACTA CAAGCCAGAG GAGCACGACT ATATGACGGC TGTCCAGTTC TTCATCTCCA CCACCAACAA TAACGACAAC AAGGATACCA GAAACAACAA TAACAACCAC AACAACTACA ATGGAAATAA TGGAAATGGT AACTACAATA ATGGTCGTCG AAACAGCAGA AACAGAAACA GACACTTTGA TTTCATCGAC TATTCCCAGG ATCCCACCAA GTTCTTTGTG CCAGGTTTCA TTGACACCCA CATACATGCT TCACAATTCC CCAACGTCGG CATTGGTTTA GACTGTCCTC TTTTGGATTG GTTGAACGAC TACACTTTCC CGTTGGAGAA CCAGTTCACT GACTCTAACG AGAAGAAGTT GCAATTCGCT AAAAACGTCT ACTCCAAAGT AATCAACAAA ACCCTTACTA GTGGCACTAC TTGTGCCTCA TACTTCACAA CAATTGACCC GCAGACTACC AACTTATTTG CTGAGTTGTT ATTGGAACAT GGCCAAAGAG GTTTTGTAGG AAAAGTGTGT ATGGACCACA ACGACACTTA CCACGACTAC GAGGAAAGCT TTGAAGACTG TGTCCATTCG ATGAACCTGA TCATCAACCA TTTGGACAAG TTGAACCCAA GTGACGACAC CTTGGTTAAG CCGATTATCA CCCCTCGTTT CGCACCCGTC TGTTCTCGTA AGATGTTGAA TTGGTTAGGA AAATTGAGCA AGACGCACAG CTTGCCCATC CAGACTCACA TCAGTGAAAA CACCAAGGAA ATTGAGTTGG TTCGTGATAT GTTTCCTGAT TGCGAAGATT ATGCTACTGT ATATGATAAA CATAACTTGT TGAGTTCTTC CACAATCTTG GCTCATGCCA TTCACTTGAC TAAGAAGGAA AGAAAGATGA TCAGCAAGAA GGAATGCTCC ATCTCTCATT GTCCAACATC TAACACATTC ATCTCCAGTG GTGAAGCTCC AGTCAAACAG TATCTTTACC AAGATAAGAT CAACGTATCA TTAGGTACAG ATGTCTCTGG AGGCTTTGAT CTGAGCATCT TGGCTGTCAT CAAACATTCC ATTTTGGTCA GTCACCACTT GGCAATGAAG ACAGGAAGGC AAGGTGACAA GTTGTCAATC ATAGATGCTC TCTACATGGC CACCCAAGGA GGAGCCAAAG CCATTGGTAT GCCAGACGTG TTGGGATCTT TTGAAGTAGG AAAGAAGTTT GATGTCCAGT TGATTGATTT GAGTTCCAAG GATTCGATCG TGGACACCTT CGAATGGCAA TTGCCTCTCG AAGAAGAAGC TAACCAACGC AAAAAGTCAA AACAAATGCA AGATTTGTTG GGCAAATGGA TCTTCAGTGG TGACGACAGA AACTGTGTCA AGGTCTGGTG TAATGGTCGT TTGGTAGTAA ACAAGATGCA TTATCAACGT GATGACAGAT GGGTCATGGT TGAAAAGGAT TTCTAA
Protein sequence | MPSNSPLIEP KASSSIGYTL YYGTFVHTPT LEELEICFNT LVGVTSDGEI DYIHKNYKPE EHDYMTAVHR NRNRHFDFID YSQDPTKFFV PGFIDTHIHA SQFPNVGIGL DCPLLDWLND YTFPLENQFT DSNEKKLQFA KNVYSKVINK TLTSGTTCAS YFTTIDPQTT NLFAELLLEH GQRGFVGKVC MDHNDTYHDY EESFEDCVHS MNSIINHLDK LNPSDDTLVK PIITPRFAPV CSRKMLNWLG KLSKTHSLPI QTHISENTKE IELVRDMFPD CEDYATVYDK HNLLSSSTIL AHAIHLTKKE RKMISKKECS ISHCPTSNTF ISSGEAPVKQ YLYQDKINVS LGTDVSGGFD SSILAVIKHS ILVSHHLAMK TGRQGDKLSI IDALYMATQG GAKAIGMPDV LGSFEVGKKF DVQLIDLSSK DSIVDTFEWQ LPLEEEANQR KKSKQMQDLL GKWIFSGDDR NCVKVWCNGR LVVNKMHYQR DDRWVMVEKD F
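The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates that difference and is not part of the database record. It translates bases 121 to 150 of the gene sequence above, which encode residues 41 to 50 of the protein sequence, under both codes. The helper names are hypothetical, and the table is built by hand rather than taken from any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code, NCBI table 1).
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_table(aas: str) -> dict:
    """Map each codon (TTT, TTC, TTA, ...) to its one-letter amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


STANDARD = build_table(STANDARD_AAS)
# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = {**STANDARD, "CTG": "S"}


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Bases 121-150 of the gene sequence above, copied with their spacing.
FRAGMENT = "TTGGTCGGAG TTACGCTGGA CGGGGAAATC"

print(translate(FRAGMENT, TABLE_12))  # LVGVTSDGEI
print(translate(FRAGMENT, STANDARD))  # LVGVTLDGEI
```

Only the table 12 result agrees with the listed protein sequence at this position. Translating the full gene sequence this way would not reproduce the protein, because the gene is spliced and the intron would have to be removed first.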