Gene PICST_29072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29072 
SymbolGLN32 
ID4851808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2887425 
End bp2889242 
Gene Length1818 bp 
Protein Length605 aa 
Translation table 
GC content44% 
IMG OID640393516 
Productzinc finger transcription factor 
Protein accessionXP_001387117 
Protein GI126275662 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.965221 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAAG GAAATAACTC CAATCATAAA CAAGCTAAAC CTTCCCTCGG CTCCAAGATC 
TCCTACAAGC CCTCATTGTC AAATTCAGCA GTAAACGAGA AACAACCCCT TGTCGTCTCC
AACCTTATTG CTGACACAGA TAACTCCTCC ATCGAAATCT TCAAGATGTA CCAGAACAAG
AACTACTTAC CCCACAATCA GAGAATCTCC AACATAGCAT GGAGAATCCA GAACAAGAAG
TTGATGCTGG GTGCCTCCGG AACGGCTCCA GCATCCAACC GTTCAAGCTC TGGCTCTGTT
AATGGCATTG CCAAACCCAT CCATCGTCTG AATTCAGTGT CTAGTAATCG TTCCAACTCC
ATCTCAGCCA AGAACTCAGG TCCTGTAACT GTCAATGGAT CCAAACGGGA TCTGCTTGAT
AATCTCAACG ACCCCAACTT GGACGAATTC GACTACGTAG CCCATATCCG TAGAATCAGC
CAGGAAGAAT ATAACCAGGC TAACATAATG GCGAAGAATA ACAACCAAAG TAATAAGCAA
AGTGACAGCA TCAATAACAA CACCACGAAC AGTATCACTT CCCCGGATTC AAGCACAAAC
ACTCTTACGT CGCTGAACTC CGCCATCTTT GGAACGATGA AATCTTCAGC TACAACTGCG
ACAAGCACTT CCAGCAATAA ACCACTTGAA GTTTCATTTG CTAACAATAA TAATGCTAAG
AACATTCCTG GAAATAACAA TTTTCTCTCG TCATATATAA ATTCTTTGGA ATCGACGTTG
AAGCTGGACT ATAAGCTCAA CCAGAATTCC GAGTTCGATA CTTCTTTACA ATCAAATACG
GTGTCCAACT CCACGTCGGT ATCTCCACCA AAGTTCAAAC AGAGGCAGCC ATCGGTCGGT
ACTGGTATTG GCAAACGAGT ATTGCAATGC ACCAACTGCC AAACCAAGAC GACTCCCTTA
TGGAGAAAGG CTAACAACGG CGATTTGCTC TGTAACGCCT GTGGATTGTT TTACAAATTA
CATGGAGTAT TGAGGCCATT GAATAATAAT TCCGGTTCTG GTTCAAGCAC CAATCATATC
GCTAACTCCG ATCCTATCTC TAATTCTGGC AACAATTCTG GCAATTCCTC TGCCTCGGTG
AAAAATCCTA GTGACAAGAT CATACTGAAT AACAATACCA ATCTTTTCAA CGGCTTGCAA
CTGTTGAAGA GCAACTTCTC TCCTTCTTCT GCGCCTAAGT CTAATATCAA CACTAGCAAC
GTCAACCAGA ACGATAGGTT TAGCTCTAAC GACTTTGACT TGTCCAATTA CGACGGTAAT
AAGTTCTATG ATTCTACCAA CAAGGATATG GTCAACATGG ACAGCTTCCT TGACTTTACT
CAACCAGGTA CCAACGCTGA CAACAATCCT TCTCGCACCA ACATCGGTAT CAACTCCAAC
AATGCCAATA TCTCTATAAA CGCAAATTCA GGATTATCGA GCAGTTTGCC CGTTAACAAC
TTCCAGAACC ATCAGCATAC GCCCGTAGGT GGCAACAACG TGGACGAAAT AGACAAGCTC
TTGAACATCA ACTTGTTCCA GTCGGATTCG TTCACGATAG GCAATAAGTC TGGCTCTGGC
TTCTACGATT TGGATTCGCA GCCTGGTCAA TCTGGCTTAG CTGGAGTTAA TGAAGACATG
TATGTTGGCG ATCAGATGCA ACAATCTCAT TTGAATGCCA ATTTGGATCT CGATTTGATC
GATGGCAGTC AGACTAATGG CAATGCTAAT GGAAGTGCAG GCTGGAATTG GTTGGACTTC
AGTCCACCTC AGAACTAG
 
Protein sequence
MAQGNNSNHK QAKPSLGSKI SYKPSLSNSA VNEKQPLVVS NLIADTDNSS IEIFKMYQNK 
NYLPHNQRIS NIAWRIQNKK LMLGASGTAP ASNRSSSGSV NGIAKPIHRL NSVSSNRSNS
ISAKNSGPVT VNGSKRDLLD NLNDPNLDEF DYVAHIRRIS QEEYNQANIM AKNNNQSNKQ
SDSINNNTTN SITSPDSSTN TLTSLNSAIF GTMKSSATTA TSTSSNKPLE VSFANNNNAK
NIPGNNNFLS SYINSLESTL KLDYKLNQNS EFDTSLQSNT VSNSTSVSPP KFKQRQPSVG
TGIGKRVLQC TNCQTKTTPL WRKANNGDLL CNACGLFYKL HGVLRPLNNN SGSGSSTNHI
ANSDPISNSG NNSGNSSASV KNPSDKIILN NNTNLFNGLQ LLKSNFSPSS APKSNINTSN
VNQNDRFSSN DFDLSNYDGN KFYDSTNKDM VNMDSFLDFT QPGTNADNNP SRTNIGINSN
NANISINANS GLSSSLPVNN FQNHQHTPVG GNNVDEIDKL LNINLFQSDS FTIGNKSGSG
FYDLDSQPGQ SGLAGVNEDM YVGDQMQQSH LNANLDLDLI DGSQTNGNAN GSAGWNWLDF
SPPQN