Gene PICST_31115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31115 
SymbolCTA4.2 
ID4837965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1805645 
End bp1807942 
Gene Length2298 bp 
Protein Length765 aa 
Translation table12 
GC content39% 
IMG OID640389280 
Productzinc finger transcription factor 
Protein accessionXP_001383632 
Protein GI150864697 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCCA AAACTAGACA GAGAAAGTCG TATCTGTGTA TCAATTGCAG AGATAAGAAA 
CGCAAGTGTG ATAGAGGAAA GCCCTGTTCT TCTTGTGTAA GGCTGGGAAT AGCTTCAACA
TGTAAATACA ACACACCGAC ACTTACCGTG CTAGGACCAG CGTCCATCTT ATATGATAAC
GAAAATGCTT CCAGTCTAGA ACCAGAAGGT GAAGATGATA TAAGAGACTT CTTGAGACTC
AAGAATCCTG TCGGAATACC TGGAGGCAAG ATCAACTTCT TCAAGCTCTT GTGCAACAAC
TTGATAGAGC CAGATTACAA TGTGTTTTCG TGGATTAATG TCTACAAATC GGACCCGGGA
ATGAGGATCT ACTTGAAGAA CCTTCCTCTG AATAGCAAGC TCTTTGTAGA ACAAGTAGTA
GTAGACGACT TCCACAGAGA CGATATTTTC AAGGGTCTTC TCAATCTTGA ATTGGACGAA
TTGTACAGAC GGTCGGTGGG CCCGGACCTG GAGAACAAGA TAATACACAA CGAAGAGGAA
TTGCTACTCG CCATCAGATC AGTATTGCCA AATACCGTAA TTATAGCAAC CCATGTGGAA
AAGTTCTTCT CTGAACTATA CCCCGGTTTT CCATTTTTAG ATGAAAGCGC ATTTAGGAAA
AGGCTAATGA GAATGGGTAA TGGACCTCGA ATAACAGACG AGAATGACTG GGCCATCATT
GGTATATTAC TTATAATGGT GAGAATAAGC TATTTGTCCA GTCTCTGGAA CATATCTGGC
AGTGATAGGT TAGGAGCTGT TGCAAACAGA GCCGACTACG AGCTTGTACT GTCGGCTACT
ATAGGCCCTG AATTTATTCG ATTAGCAAGA AGTTGTTTAA AGCAATATGA TCTCCAGTCT
AGAAATTCGT TGTGGGTTCT ACAATGCAAT ATCTTCCTCC AGATCTATGA GCGGGTTTCG
CCTGAGGAAG GAGAGTGTGC TACAGGAAAA AGCAGTCAAA TCATTCATGC CTCGTTGATT
CAGCATGCAC TATCATTAAA GCTCCACATT GACCCAGACT TGCTCCCTCA GCTATACGGC
AAGAATGAAA AGTACAAGAA TCTCGTGAGA AAGATTTGGC ATTTCCTCGT TTCTTATGAT
TATTTTGATA GTATTAGCTA CGGCAATTAT CCTTCGACTA CAAAAATGGT GTTTAATACA
AGGCTACCAA CTCATATTGA AGGAAATGAG AATATTCAGG ATACAAAGTT GGAAAAGGAA
GTTCTTGGTT CGTTTGAATT TCTCGACTTC ACTTATGGAC CAGTGCATGA CCTTTTAGTT
GATATCCAGA GCTTGAAAAC CGATCATAAA ATTGAGAGAA TAGTTTTTCT TGCAACTGCA
ATAGAGTCTA CTATTGCTTC AATAATTGGA CAAGTTGATA CCTTTTTTTC TGTTGAAACG
AATCCCAGTA CAGATAAAGG AATTAGATTT GCTCTCTTTC TATACCTCAA GTTCTTTCTT
TTAACAATTT TTTCTTATCT ACAGCTGTAT TATGAGAGGA CTAAGCAGTA TGAAACGGCA
TTTTTCTACT TCAAGAAAGT ATTAATTATT GGATCTTATG AAATATTGCC AGCAATGTTT
AGATTGGTTG GTTCATTTGG AGACCACTTC AAGAAAGAGA CATTCATGTT GACTCCCCAG
ATTATCCAAT TGTCATGTCA CAGACTAATC ATTACCTTAG TTTCAGCATT TATTCGAATC
GAAATTACTT TGAAGAGCAA TGGTCAGGAG CTATCAGGTC TACATGGGGT AAAGATCAAA
GTATTGCAGA TACTTCATTT GTTTCTTGTC AACGCGTCTG CGTTGAGTAT CTATGTACAT
TATACGTGGA GAGTGAAAAA GGCACTAGAA TTTGCATTGA GGAAGATTTT TGACGGACAG
CTCTACAAAC TTCAAAATTT GACAGAATTA GAGATATCAA AGGCAGAATT GAAAATTCTG
AAGGAACAAG TTGCTGATTT AGACAAGGCA TTACAATCAG CTCTTCTAGC AATAGAGAAA
GCTTGGAAAC CTTCACTTCC AATTCCATTT GCGTCGATTG ATAACTTGGA TTACATTTAC
GGACCTAAAA ACAGAGCACC CTTCGAAGCG TATGATCGAT CCGAACCAGA TAGAATGTGG
TACAAAGTTT CTAGTTGCAT GAGTAATGGG TCCAATCTGA AGATTAGTCG AAATAATATT
CCTAATAAGC TACAACTAAA CAACGTTGAA TATGACTTCA ACATCTTCAA AGGGATGACA
TTTGTGGAAC AGGGCTAG
 
Protein sequence
MTAKTRQRKS YSCINCRDKK RKCDRGKPCS SCVRSGIAST CKYNTPTLTV LGPASILYDN 
ENASSLEPEG EDDIRDFLRL KNPVGIPGGK INFFKLLCNN LIEPDYNVFS WINVYKSDPG
MRIYLKNLPS NSKLFVEQVV VDDFHRDDIF KGLLNLELDE LYRRSVGPDS ENKIIHNEEE
LLLAIRSVLP NTVIIATHVE KFFSELYPGF PFLDESAFRK RLMRMGNGPR ITDENDWAII
GILLIMVRIS YLSSLWNISG SDRLGAVANR ADYELVSSAT IGPEFIRLAR SCLKQYDLQS
RNSLWVLQCN IFLQIYERVS PEEGECATGK SSQIIHASLI QHALSLKLHI DPDLLPQLYG
KNEKYKNLVR KIWHFLVSYD YFDSISYGNY PSTTKMVFNT RLPTHIEGNE NIQDTKLEKE
VLGSFEFLDF TYGPVHDLLV DIQSLKTDHK IERIVFLATA IESTIASIIG QVDTFFSVET
NPSTDKGIRF ALFLYLKFFL LTIFSYLQSY YERTKQYETA FFYFKKVLII GSYEILPAMF
RLVGSFGDHF KKETFMLTPQ IIQLSCHRLI ITLVSAFIRI EITLKSNGQE LSGLHGVKIK
VLQILHLFLV NASALSIYVH YTWRVKKALE FALRKIFDGQ LYKLQNLTEL EISKAELKIS
KEQVADLDKA LQSALLAIEK AWKPSLPIPF ASIDNLDYIY GPKNRAPFEA YDRSEPDRMW
YKVSSCMSNG SNSKISRNNI PNKLQLNNVE YDFNIFKGMT FVEQG