Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_56703 |
Symbol | CGA1 |
ID | 4837769 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1084117 |
End bp | 1086999 |
Gene Length | 2883 bp |
Protein Length | 951 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389084 |
Product | Glucoamylase 1 precursor (Glucan 1,4-alpha-glucosidase) (1,4-alpha-D-glucan glucohydrolase) |
Protein accession | XP_001383835 |
Protein GI | 150864848 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATTCA GAGACTTAAT AAAATCTTCG GTTGTCGCTC TTGGAGTGAC AAACTCTGTG CTTGGTGCTG CAATTTCTTC TTCTGAGTCA GAGGTCGCTT CGAGCACCTC CTCAAGTGGA GACTCTCGAG CAACAGTTCC CAATGACTTG ACTTTAGGTG TTAAGCAGGT TCCTAACATC CTCAATGATA CTGCTGTTGA TGCTAACCAG GAAGCAAAGG GTTACACTTT GGTCAATGTT ACCTCCACAC CAAGAGGATT GACCGGTATT CTCGAGTTAA AGGAGGCTAC TAATATCTAT GGTTATGACT TTGATCATTT GAACTTGACG GTGACTTACC AGTCTGACAA GCGTTTGAAT GTTCATATTG AACCAACAAA CTTAACCGAT GTCTACATTT TGCCTGAAGA CTTGGTTGTG AAGCCAACTA TCGAAGGAGA TGTCAATTCA TTCAATTTCG AAGACTCTGA CTTAGTCTTC CAGTACCACT CGGATGACTT CTCGTTTGAA GTGGTTAGAG CTTCTACCGG AGAGGTTTTG TTTTCCACTG ATGGTAATCC ATTGGTATTT TCCAACCAAT TCATTCAATT CAACACCACT TTGCCAAAGG GGTATGCAAT TTCTGGATTA GGCGAGTCCA TCCATGGTTC TTTGAGTTTG CCAGGAACTG TAAAGACTTT GTTCGCTAAC GATGTTGGTG ACCCTATCGA CGGTAACATT TACGGTGTTC ATCCTGTTTA CTATGACCAA AGATACAATT CAAATACCAC CCATGGTGTC TACTGGAGAA CATCTGCTAT CCAAGAAGTC ATTTTTGAAG AACAATCTTT GACCTGGAGA GCACTTTCTG GAGTTATTGA CTTATACTTC TTCAGCGGTC CTGACCCTAA GGATGTTATT CAACAATATG TTTCAGAAAT AGGATTGCCA GCTTTCCAAC CATACTGGGC TCTTGGATAT CACCAGTGTA GATGGGGTTA TCGTGAAATC GAGGACTTGG AAGATGTAGT AACCAACTTC AAGAATTTCA ATATTCCATT GGAAACCATC TGGTCTGATA TCGATTACAT GGACTCATAC AAGGATTTCA CAAATGACCC ACATAGATAC CCAACTGACA AGTACCAGGA CTTCTTGGAC AAGCTACACA AGAACAACCA ACACTATGTT CCAATCTTCG ATGCAGCAAT TTACGTTCCA AATCCAAATA ATGAAACTGA CAATGACTAT ACCCCCTTCC ATGCCGGTAA CGAATCTGAC ATCTTCCTTA AGAACCCAGA TGGCTCTTTA TACATTGGTG CTGTTTGGCC CGGTTACACC GCTTTCCCAG ATTTCCTTGC TAACAATACA CAAGACTGGT GGAACGAGAT GTTTAAAGAA TGGCACGACA GAATCCCATT CGATGGTATT TGGTCCGATA TGAACGAAGT CAGTTCTTTC TGTGTTGGTT CGTGTGGTAC TGGCAGATAC TTTGAAAACC CAGCAGATCC ACCATTCTTG GTTGGAGGCG AAGTTACTCA ATATCCATCA GGTTTCAACG TTTCGAACTC AACTGAATGG AAGTCTATTT CCAGTTCAAT TGCTGCTACA GCCACTACTT CTAAGCCCAG TCCATCAAGC TCTTCTGCTT CAATCGACTC CATGAACACA TTGTTGCCAG GTAAGGGAAA CATCAACTAC CCACCTTACG CTATCAACCA CGCCCAAGGT GATCATGATC TTGCAACTCA TGCAGTTTCT CCAAATGCAA CTCATGCAGA TGGCACTGTT GAATATGATA TCCATAATCT CTACGGATTC TTACAAGAAA AGGCTATCCA CGCTGCTTTG TTGGAAATCT TCCCAAACAA GAGGCCATTT ATTATTGCCC GTTCTACCTT CTCCGGTGCT GGCCATTACA TGGGTCACTG GGGTGGTGAC AACAATGCTG ACTATGACAT GATGTACTTC TCTATTCCTC AGGCATTCAG CATGGGTCTT TCTGGTATTC CTTTCTTTGG AGTTGATGTT TGTGGTTTCA ATGGAAACTC GGATGCTGAA TTGTGCTCGA GATGGATGCA ATTGGGTTCG TTCTTCCCAT TCTACAGAAA CCACAATGTT TTGGGAGCCA TTTCTCAAGA ACCATACGTT TGGTCTTCTG TTGCTGATGC CACGAGAACC TCAATGGCTA TCAGATACTT GTTGTTGCCA TACTACTATA CCTTGTTGCA CGAATCACAT GTTACTGGTT TGCCAATCTT GAGATCCTTG TCGTGGCAAT TCCCATACGA GAAGAAGTAC AACGGTATCG ACAACCAATT GTTTGTTGGT GATGCCTTGA TTGTTACTCC AGTTTTAGAA CCAGGTGTCA ACAAGACCAA GGGTGTTTTC CCAGGTGCTG GTGTCTCAGA AGTCTACTAC GATTGGTACA CTCACGAAAA GCAAGACTTC AGAAATGGAA AGAATGAAAC CTTGGCTGCA CCATTGGGTC ACATCCCATT GCATGTCAGA GGTGGTCACA TTTTGCCACT TCAAGAACCA GGCTACACTG TTGCTGAATC CAGAGAGAAT CCATTTGCAT TGCTTGTAGC TTTGGACAAT GAAGGTAATG CTTCAGGAAA GCTCTACCTT GACGATGGGG AATCTTTGGA AATTGAAGAA TCTTTGTATG TTGACTTTGT TGCTCACAGC AAGCTGTTGA CAGCATCGAG TTTTGGAGAA TACAATGTTT CGCAGCCATT GGCCAATATT ACCATCTTGG GTGTGGAGAA GAAGCCTAAG CAGGTCGAAT TCGAGGATTC AAAGGTCAAG TTCAGTTACG AAAATTCAAC TATTTTCGTC ACAGGTTTGG AAAAGTATAC CAAAGAAGGT GCTTTCGCTA AGCAATTCAC CCTTACTTGG TAA
|
Protein sequence | MIFRDLIKSS VVALGVTNSV LEVASSTSSS GDSRATVPND LTLGVKQVPN ILNDTAVDAN QEAKGYTLVN VTSTPRGLTG ILELKEATNI YGYDFDHLNL TVTYQSDKRL NVHIEPTNLT DVYILPEDLV VKPTIEGDVN SFNFEDSDLV FQYHSDDFSF EVVRASTGEV LFSTDGNPLV FSNQFIQFNT TLPKGYAISG LGESIHGSLS LPGTVKTLFA NDVGDPIDGN IYGVHPVYYD QRYNSNTTHG VYWRTSAIQE VIFEEQSLTW RALSGVIDLY FFSGPDPKDV IQQYVSEIGL PAFQPYWALG YHQCRWGYRE IEDLEDVVTN FKNFNIPLET IWSDIDYMDS YKDFTNDPHR YPTDKYQDFL DKLHKNNQHY VPIFDAAIYV PNPNNETDND YTPFHAGNES DIFLKNPDGS LYIGAVWPGY TAFPDFLANN TQDWWNEMFK EWHDRIPFDG IWSDMNEVSS FCVGSCGTGR YFENPADPPF LVGGEVTQYP SGFNVSNSTE WKSISSSIAA TATTSKPSPS SSSASIDSMN TLLPGKGNIN YPPYAINHAQ GDHDLATHAV SPNATHADGT VEYDIHNLYG FLQEKAIHAA LLEIFPNKRP FIIARSTFSG AGHYMGHWGG DNNADYDMMY FSIPQAFSMG LSGIPFFGVD VCGFNGNSDA ELCSRWMQLG SFFPFYRNHN VLGAISQEPY VWSSVADATR TSMAIRYLLL PYYYTLLHES HVTGLPILRS LSWQFPYEKK YNGIDNQLFV GDALIVTPVL EPGVNKTKGV FPGAGVSEVY YDWYTHEKQD FRNGKNETLA APLGHIPLHV RGGHILPLQE PGYTVAESRE NPFALLVALD NEGNASGKLY LDDGESLEIE ESLYVDFVAH SKSLTASSFG EYNVSQPLAN ITILGVEKKP KQVEFEDSKV KFSYENSTIF VTGLEKYTKE GAFAKQFTLT W
|
| |