Gene PICST_82575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_82575 
SymbolAQY1 
ID4838270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp193001 
End bp194013 
Gene Length1013 bp 
Protein Length267 aa 
Translation table12 
GC content45% 
IMG OID640389585 
Productaquaporin 
Protein accessionXP_001383665 
Protein GI126134281 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0580] Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) 
TIGRFAM ID[TIGR00861] MIP family channel proteins 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.340504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.546499 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACTG AAAACGAAAC CTTTGACCAA GAGGCCCAAC AGACCTACAA CCCAAAGTTG 
GACGCCACTA TCACTGCTTC TCCATTGAAG AACCATTTGA TTGCATTCCT TGGTGAATTC
TTCGGTACCT TCATCTTCTT GTGGACTGCT TTTATGATCG CCCAAATTGC CAACCAAGAC
CCTAACATTC CTGAAGTTGG ATCTGAACCT CAACAATTGA TCATGATCTC TTTCGGTTTC
GGTTTTGGTG TCATGATGGC TGTATTCATG TTCTACAGAA TTTCTGGTGG TAACTTGAAC
CCAGCTGTCA CCTTAACATT GGTATTGGCC CAAGCTGTTC CTCCTGTGAG AGGTGCCATC
ATGATGATTG CTCAAATGAT CGCTGGTATG GCCGCTGCCG GTGCTGCTTC TGCTATGACC
CCAGGCCCAA TTGCCTTTGC TAACGCTCTT GGTGGTGGAT GCTCCAGATC CAGAGGTGTA
TTCATTGAAG CCTTCGGTAC TGCTATCTTG TGTTTGACTG TCTTGCTCTT GGCCGTTGAA
AAGCACAAGG CTACATTCAT GGCTCCATTT GTCATTGGTG TTGCTCTTTT CTTGGGTCAC
TTGATCTGTG TCTTCTACAC CGGTGCTGGT TTGAACCCTG CTAGATCTTT CGGTCCAGCT
GTTGCCTCTA AGTCCTTCCC AGACTACCAC TGGATTTACT GGGTTGGCCC AATCTTGGGT
TCCGTCATTG CCTTTGCTAT CTGGAAGATC TTGAAGGTTT TGAACTACGA AACCTGTAAC
CCTGGCCAAG ACGCTGACCA CTAATCGCCC GGAATTAATT GTGGTGCATT GTCATTGCAA
TTGTCCGGAA TTTAAACCGA GCCGATTGTA TGCACCTACG ATGAAACAGC ATTGGTTTTT
ATATGTATCC AATGTCAAAA AATCGAGAGC ATGTTCATTT CACGATTACC TTTAGATAGC
TCCACACCTA ATTTTATCCA TAGTGGCTAC TAATTACTAA TAATACTTAA TTT
 
Protein sequence
MTTENETFDQ EAQQTYNPKL DATITASPLK NHLIAFLGEF FGTFIFLWTA FMIAQIANQD 
PNIPEVGSEP QQLIMISFGF GFGVMMAVFM FYRISGGNLN PAVTLTLVLA QAVPPVRGAI
MMIAQMIAGM AAAGAASAMT PGPIAFANAL GGGCSRSRGV FIEAFGTAIL CLTVLLLAVE
KHKATFMAPF VIGVALFLGH LICVFYTGAG LNPARSFGPA VASKSFPDYH WIYWVGPILG
SVIAFAIWKI LKVLNYETCN PGQDADH