Gene PICST_36448 details

Gene Information

Locus tag: PICST_36448
Symbol: GAH1
ID: 4839345
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 1528086
End bp: 1529711
Gene Length: 1626 bp
Protein Length: 501 aa
Translation table: 12
GC content: 43%
IMG OID: 640390660
Product: guanine deaminase (Guanase) (Guanine aminase) (Guanine aminohydrolase) (GAH)
Protein accession: XP_001384984
Protein GI: 150865670
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID: [TIGR02967] guanine deaminase
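
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below makes two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the gene span contains only coding exons and introns (no untranslated regions). The variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 1528086
end_bp = 1529711
gene_length_bp = 1626
protein_length_aa = 501

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: one codon per residue plus one stop codon.
coding_bp = 3 * protein_length_aa + 3

# Assumption: the span holds only coding exons and introns, so the
# remainder is the total length removed by splicing.
non_coding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, non_coding_bp)  # 1626 1506 120
```

Under these assumptions the span matches the listed gene length, and the 120 bp difference between the gene and its coding sequence is consistent with the "Is gene spliced: Yes" flag.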


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0130969
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTCCA ACTCTCCACT CATCGAACCT AAAGCTTCGT CCAGCATTGG CTACACCTTG 
TACTATGGTA CGTTTGTTCA CACGCCTACG TTGGAAGAAC TTGAAATCTG TTTCAACACA
TTGGTCGGAG TTACGCTGGA CGGGGAAATC GACTACATCC ACAAGAACTA CAAGCCAGAG
GAGCACGACT ATATGACGGC TGTCCAGTTC TTCATCTCCA CCACCAACAA TAACGACAAC
AAGGATACCA GAAACAACAA TAACAACCAC AACAACTACA ATGGAAATAA TGGAAATGGT
AACTACAATA ATGGTCGTCG AAACAGCAGA AACAGAAACA GACACTTTGA TTTCATCGAC
TATTCCCAGG ATCCCACCAA GTTCTTTGTG CCAGGTTTCA TTGACACCCA CATACATGCT
TCACAATTCC CCAACGTCGG CATTGGTTTA GACTGTCCTC TTTTGGATTG GTTGAACGAC
TACACTTTCC CGTTGGAGAA CCAGTTCACT GACTCTAACG AGAAGAAGTT GCAATTCGCT
AAAAACGTCT ACTCCAAAGT AATCAACAAA ACCCTTACTA GTGGCACTAC TTGTGCCTCA
TACTTCACAA CAATTGACCC GCAGACTACC AACTTATTTG CTGAGTTGTT ATTGGAACAT
GGCCAAAGAG GTTTTGTAGG AAAAGTGTGT ATGGACCACA ACGACACTTA CCACGACTAC
GAGGAAAGCT TTGAAGACTG TGTCCATTCG ATGAACCTGA TCATCAACCA TTTGGACAAG
TTGAACCCAA GTGACGACAC CTTGGTTAAG CCGATTATCA CCCCTCGTTT CGCACCCGTC
TGTTCTCGTA AGATGTTGAA TTGGTTAGGA AAATTGAGCA AGACGCACAG CTTGCCCATC
CAGACTCACA TCAGTGAAAA CACCAAGGAA ATTGAGTTGG TTCGTGATAT GTTTCCTGAT
TGCGAAGATT ATGCTACTGT ATATGATAAA CATAACTTGT TGAGTTCTTC CACAATCTTG
GCTCATGCCA TTCACTTGAC TAAGAAGGAA AGAAAGATGA TCAGCAAGAA GGAATGCTCC
ATCTCTCATT GTCCAACATC TAACACATTC ATCTCCAGTG GTGAAGCTCC AGTCAAACAG
TATCTTTACC AAGATAAGAT CAACGTATCA TTAGGTACAG ATGTCTCTGG AGGCTTTGAT
CTGAGCATCT TGGCTGTCAT CAAACATTCC ATTTTGGTCA GTCACCACTT GGCAATGAAG
ACAGGAAGGC AAGGTGACAA GTTGTCAATC ATAGATGCTC TCTACATGGC CACCCAAGGA
GGAGCCAAAG CCATTGGTAT GCCAGACGTG TTGGGATCTT TTGAAGTAGG AAAGAAGTTT
GATGTCCAGT TGATTGATTT GAGTTCCAAG GATTCGATCG TGGACACCTT CGAATGGCAA
TTGCCTCTCG AAGAAGAAGC TAACCAACGC AAAAAGTCAA AACAAATGCA AGATTTGTTG
GGCAAATGGA TCTTCAGTGG TGACGACAGA AACTGTGTCA AGGTCTGGTG TAATGGTCGT
TTGGTAGTAA ACAAGATGCA TTATCAACGT GATGACAGAT GGGTCATGGT TGAAAAGGAT
TTCTAA
 
Protein sequence
MPSNSPLIEP KASSSIGYTL YYGTFVHTPT LEELEICFNT LVGVTSDGEI DYIHKNYKPE 
EHDYMTAVHR NRNRHFDFID YSQDPTKFFV PGFIDTHIHA SQFPNVGIGL DCPLLDWLND
YTFPLENQFT DSNEKKLQFA KNVYSKVINK TLTSGTTCAS YFTTIDPQTT NLFAELLLEH
GQRGFVGKVC MDHNDTYHDY EESFEDCVHS MNSIINHLDK LNPSDDTLVK PIITPRFAPV
CSRKMLNWLG KLSKTHSLPI QTHISENTKE IELVRDMFPD CEDYATVYDK HNLLSSSTIL
AHAIHLTKKE RKMISKKECS ISHCPTSNTF ISSGEAPVKQ YLYQDKINVS LGTDVSGGFD
SSILAVIKHS ILVSHHLAMK TGRQGDKLSI IDALYMATQG GAKAIGMPDV LGSFEVGKKF
DVQLIDLSSK DSIVDTFEWQ LPLEEEANQR KKSKQMQDLL GKWIFSGDDR NCVKVWCNGR
LVVNKMHYQR DDRWVMVEKD F
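
The record lists translation table 12, the alternative yeast nuclear code in which CTG encodes serine rather than leucine. As a minimal sketch, the code below translates the first 180 bases of the gene sequence (the first three sequence lines, which lie upstream of any intron) and compares the result with the first 60 residues of the protein sequence. The helper name `translate` and the hand-built codon tables are illustrative, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]
STANDARD = dict(zip(CODONS, STANDARD_AAS))

# Table 12 differs from the standard code in the meaning of CTG (Ser, not Leu).
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna, table):
    """Translate DNA codon by codon, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First three lines of the gene sequence, copied from the record.
first_180 = (
    "ATGCCCTCCA ACTCTCCACT CATCGAACCT AAAGCTTCGT CCAGCATTGG CTACACCTTG "
    "TACTATGGTA CGTTTGTTCA CACGCCTACG TTGGAAGAAC TTGAAATCTG TTTCAACACA "
    "TTGGTCGGAG TTACGCTGGA CGGGGAAATC GACTACATCC ACAAGAACTA CAAGCCAGAG"
)

# First line of the protein sequence, copied from the record.
expected = "MPSNSPLIEPKASSSIGYTLYYGTFVHTPTLEELEICFNTLVGVTSDGEIDYIHKNYKPE"

with_table_12 = translate(first_180, TABLE_12)
with_standard = translate(first_180, STANDARD)

print(with_table_12 == expected)  # True
print(with_standard[45])          # L (codon 46 is CTG; the record has S there)
```

Translating the full gene sequence directly would not reproduce the whole protein, because the gene is spliced; only the stretch before the first intron can be compared this way.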