Gene PICST_62666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_62666 
SymbolAGL1 
ID4839770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp37412 
End bp39130 
Gene Length1719 bp 
Protein Length572 aa 
Translation table12 
GC content38% 
IMG OID640391085 
Productalpha-glucosidase maltase 
Protein accessionXP_001385341 
Protein GI126137636 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.97685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.170022 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATTA CTCGCAACTG GTGGAAAACC TCCACCGTTT ATCAGATTTG GCCAGCTTCT 
TATAAGGACT CTAATGGTGA TGGAATTGGT GATATTAAAG GTATTATCTC AACTTTGGAC
TATCTCAGAG ACTTGGGGGT TGATGTTATT TGGTGTAGTC CGATGTACGA CTCACCACAG
GATGACATGG GTTACGACGT TAGAGACTAT GAAAAGGTAT ATCCAAAATA TGGTACAAAC
GACGATATGC AGTTACTTAT TGACGAGTGT CACAGTCGTG GTATGAAATT GATTTTGGAC
TTGGTTATTA ACCACACTTC AAGTGAGCAC GTATGGTTCA AGGAATCGAG ATCCTCCAAG
ACAAATCCAA AAAGAGATTG GTATATTTGG AAACCACCCA AATACGATGA AGAAGGCAAC
AGACGCCCAC CCAACAATTG GGCATCTTAC TTCTCTGGTT CTGCTTGGGA GTACGATGAG
CGCACAGATG AGTACTACTT GAGACTTTTT GCCAGTTCCC AGCCAGACCT AAATTGGGAA
AATGAAGAGA CCAGAAATGC CATCTACGAA TCTGCAGTGA AGTTTTGGTT GGACAAAGGC
GTAGATGGGT TCAGAATCGA CACTGCTGGT TTATACTCTA AGGTTCAAAC TTTTCCAGAT
ACTCCGATCA TCTTTCCAGA GGAAGAATTT CAATCGAGTA AGTTGTACAG TCAGAATGGT
CCCCGTATTC ATGAGTTTCA CAAGGAAATG TACTCAAAGG TCACAAGTAA ATATGATGCA
ATGACAGTTG GCGAAGTTGG TCATTGTTCT CGCGAAGACG CCTTGAAGTA CGTAAGTGCT
AAAGAGCATG AAATGAACAT GATGTTTCTT TTCAATAAAG TGTGGGTCGG ATGTGACAGA
AACGATCGTT GGAAATTTGA TGGTTGGAAA TTGACCGATT TTAAGAAGGC CGTTGAAATA
GATTGTGATT TCATTGCTGG GACAGATGCT TGGTCAACCG TTTTCATTGA AAATCATGAC
CTTCCCAGAT GTGTTACTAG ATTTGGAGAC AAAAAACATC GTTCCCAAGC TGCTAAGTTA
CTCTCAATTT TAGGTACTAC ATTGACTGGT ACACTCTTTA TCTATCAGGG CCAAGAAATT
GCCATGGAAA ACTTGCCAAG AGATTGGTCA ATTGATGAAT ATAAAGACAT CAATACAATC
AACAGATACA AGGAGTTCAA GGATAAATAT GGAAATGATC CTGATTTCAA GGAAAAAGAG
GAAAAGTTGA TGGATATAAT TAATCTATTG GCTAGAGATA ATAGCAGATC CCCAGTTCAA
TGGGATGCAT CTCCAAACGC TGGATTTACC ACAGGAATTC CATGGACAAG AGTCAATGAG
AACTATACCA CAATCAACGT TGAAAGTCAA ATTAGAGATC CAAATTCTGT TTTAAACTTC
TACAAGAAGT CTATTCAAAT TAGAAAAAGT TATCAAGACT TGCTCATTTT TGGAGATATG
AAGATCTTAG ATTATGAAAA TCAAAAGACC TTTACCTACC TAAAGTTGAA CGAGAATGCT
TTATCGCCAA AAGCTTATAT TGTCTTGAAT TTCTCCAATG AAGAAGTCAA CTTTGAAAAG
TTGATTAATG GTGATTTTGA ACTAGTTCTC AGTAATGTAG ATGTTATCAA TGAACAGAAA
TTGTCTCCAT TTGAAGCACG TCTTTACATT GTTGATTAA
 
Protein sequence
MTITRNWWKT STVYQIWPAS YKDSNGDGIG DIKGIISTLD YLRDLGVDVI WCSPMYDSPQ 
DDMGYDVRDY EKVYPKYGTN DDMQLLIDEC HSRGMKLILD LVINHTSSEH VWFKESRSSK
TNPKRDWYIW KPPKYDEEGN RRPPNNWASY FSGSAWEYDE RTDEYYLRLF ASSQPDLNWE
NEETRNAIYE SAVKFWLDKG VDGFRIDTAG LYSKVQTFPD TPIIFPEEEF QSSKLYSQNG
PRIHEFHKEM YSKVTSKYDA MTVGEVGHCS REDALKYVSA KEHEMNMMFL FNKVWVGCDR
NDRWKFDGWK LTDFKKAVEI DCDFIAGTDA WSTVFIENHD LPRCVTRFGD KKHRSQAAKL
LSILGTTLTG TLFIYQGQEI AMENLPRDWS IDEYKDINTI NRYKEFKDKY GNDPDFKEKE
EKLMDIINLL ARDNSRSPVQ WDASPNAGFT TGIPWTRVNE NYTTINVESQ IRDPNSVLNF
YKKSIQIRKS YQDLLIFGDM KILDYENQKT FTYLKLNENA LSPKAYIVLN FSNEEVNFEK
LINGDFELVL SNVDVINEQK LSPFEARLYI VD