Gene PICST_32705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32705 
SymbolWAR1 
ID4840097 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp453852 
End bp456524 
Gene Length2673 bp 
Protein Length834 aa 
Translation table12 
GC content44% 
IMG OID640391412 
ProductWAR1; transcription factor activity 
Protein accessionXP_001385437 
Protein GI150865992 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAAG CCAACAACAG CGATGATGCC TCTACTGGCA CCAAGAAATC CAAGTCGGCT 
CGCAGATCGG TGGCTTGCAA GAGCTGTCAC TCCCTTAAAG TCAAGTGTAC CCCAGCCGAC
CCTACCAACC CCAGTGGGAG CTGTATTCGG TGTCTAAATG CCAATCGTAA ATGCGAGATT
GATCTTAACC AAACCAGAAA AAGGCGGAAA AAGGCCGATA TCCTTTTGGC TAACCTCCGC
GATCGCAGTG AAGCACCCAC AGTTGAGTCC AGCGTTGCTC CCAGTACTCC CATCATTGCT
TCGGTACTGG AAGCCCCCTC GCCAGCCCCT ACGACCTCAG AAGAAGTTAT TGAAAATCTT
AAAAGACAGG TTAAATCCCT CGAAGCCCAG CTTCAACAAC AGCAAGCCTT CAACCAGCAT
GCAATGCACC ACAGCAGAAA CATAAACACA AGTAATGATA CTATATCGGA TGTAGATTCA
CCGCCGTTCA TCTCGAAATC TGACTTGGAA AGGGAGATCT TATTTCTCTG CGATAGCAGT
GTGACCAAGC TTACCGATTT AACCAATGAC TTGAAAACTG TAGCTGACCG TAGAACTCTG
CTATTCCGTG ACTCCCGACC TGTGGATGTG GTCTCCACTG GCTTGTTGTC CCTTGAGGAA
GCAACTGAAA GACTCGAGAC TTACAGAAAA GTATTGTTCG TGCAGCACCC ACTCATCGAG
ATTCCTAACG AGATTTCGAT ACAAGAGTTA AGAGAAAAAC TGCCCTTCTT GTTCAACGCC
GTCATGTCTG TCACCTCCGT AGTATATAAT AAACAATTGG ACATTGACAA GGCGTTGAAC
ATTGACAATG CTGCTGTGCA ATCCATAGCA GTAGAAGTCA TGGTCTCTGG TACCAAGTCT
GATGAGTTGG TCAAGAGTTT GATCTTGCTC TGTGTCTGGT ACAACAGTCC TGAACTCTTT
AGACAGCGTC GTTACCATCT TCTCAACACT CTTGGGGTCA CCATGCTCCA TGATTTGGGA
ATTGTTGCTA GACCCTCCTA CAGCTTCAAA GGAGAAGATC GCGCGGTGAC TCAGGATGAC
AATAAAAAAC AGAATTCCGA GTACCAGAGC TTGGTGTTGA TTGTATATTT CACCACCGTC
AGTATCTGTT TAATACTTAA AAGAACCATC TACGTCAAGT GGACTCCTTT TGTCGAAGAA
TGTTGTTCGA CACTTGAGAG GTCTCCAAAT AGAACATGGA GAGAACTTGC TCTTTTCTCC
AGGTTGAGCC ACCTCTTGGA TAAAATACAC CACATCATAC ACTCGCCCGA GATCTCAGAA
TCAAGACGTT CAACACCACA TTACATCATC CACGAGATGC AGAAGGCGCT CTCCATTGTC
AGGCACAAGA TCAGAGACGA TGATCATGCC TTCTTGGCTT ACTACTTTTC GGTAGAGGCT
TACTTGCATG AACCATGTCT TACTAATGTG TTCACCAATG ATGAAAATGG CGACAACATG
AAACTTACAG AGTCTTCAGC CAAGTCCATT TCAAATTGTA CCAATTCTTG TCTCAATGCA
TTGGATGAAT TCAACAAATT GTCCTTGGAG GAGGTGGCTG GTATTCCTTT GTTTTATAGT
TCTCGAATCA TTTATACTGC TGGTATGTTA TTGAGATTAA GATACTTGAT ATTATCGTTG
CCATCTCATA TTGAAAAGGA TTTAGTTCCA TATCATGCTA TTTTTGCCAT TCAGAGAGCA
AATAAGATTC TTGATCAGGC TAGTATAGCA CATTCGGCCA ACCACTTCTT GAAGAAGACG
CGTTTGGTGT TGCAATTGTT CATACAAACG TATGCAACCC AAGTTCAGGA ATTGTTGCGC
AAGAATGGTG AAACTCCACA GAACTTGAAG CCAACTCCAA AGAAAGAACT TCATGAAATG
GATAGACTTT CCAATATATT CAGAGCACAC CATCAAGCTG GACAGAAGTC TATCATTAAT
GACGATGCTA ACCAATATGA TTCAAATGTC CCATTAGACA TCTTGTCGTA TGCTGCCTCT
TACAGAAGAG ATTCTAAGGA CTCTAACATG AGAGGAAGAA GCTCGCCGTC AGCCCATGCC
ACTAACGGTC TGGAAGATTC GAAGGCTAAG TTGAATACTC CACAATCTTA TGCACCTACA
CCTAGTGCTC TTCCTTTGAC TAGTAGTGCG TCGGTAGGAG CTATTTCCAT TCCTCCTCTT
AGAGCATCTG AAACTCTTCA ACCTTCTAAT CTATCTCATT TACACCGTTC TTCCTCACTT
GGAATAATCA ACAACTCCTC ACTTCCTATT CCCAGCAATG GCCATGCCTT ACCACCGATG
TTTGGCACTA ATGGGAACCC CAATGCTGGT ATGACTCCTC CATTATTGAG CTCAGTTGTT
GCTCCATCAA CAAAGCATCA GAATCAACTG CACCAGCAAA CCTCGCTGCT GCAACTAGGT
CAGTTGCCTT CTCAATTTAG ACAACCATCC ATTGGCCTGA ACAAACAAGC TTACAACAAC
TTGGCCAATC CAGATCAACT TGAAAACTCG TACATGGCTT TGAACGATGA GTTCTGGTCT
GATTTGCTTA GCACTGAGTC CGACAGGATC AACTTCTCCA ACAACAACTA CAATTCAGCA
CAGGTCAACG ACGAGTTGTT TTTCATGAAC TAA
 
Protein sequence
MSQANNSDDA STGTKKSKSA RRSVACKSCH SLKVKCTPAD PTNPSGSCIR CLNANRKCEI 
DLNQTRKRRK KADILLANLR DRSEAPTVES SVAPSTPIIA SVSEAPSPAP TTSEEVIENL
KRQVKSLEAQ LQQQQAFNQH AMHHSRNINT SNDTISDVDS PPFISKSDLE REILFLCDSS
VTKLTDLTND LKTVADRRTS LFRDSRPVDV VSTGLLSLEE ATERLETYRK VLFVQHPLIE
IPNEISIQEL REKSPFLFNA VMSVTSVVYN KQLDIDKALN IDNAAVQSIA VEVMVSGTKS
DELVKSLILL CVWYNSPELF RQRRYHLLNT LGVTMLHDLG IVARPSYSFK GEDRAVTQDD
NKKQNSEYQS LVLIVYFTTV SICLILKRTI YVKWTPFVEE CCSTLERSPN RTWRELALFS
RLSHLLDKIH HIIHSPEISE SRRSTPHYII HEMQKALSIV RHKIRDDDHA FLAYYFSVEA
YLHEPCLTNV FTNDENGDNM KLTESSAKSI SNCTNSCLNA LDEFNKLSLE EVAGIPLFYS
SRIIYTAGML LRLRYLILSL PSHIEKDLVP YHAIFAIQRA NKILDQASIA HSANHFLKKT
RLVLQLFIQT YATQVQELLR KNGETPQNLK PTPKKELHEM DRLSNIFRAH HQAGQKSIIN
DDANQYDSNV PLDILSYAAS YRRDSKDSNM RGRSSPALPL TSSASVGAIS IPPLRASETL
QPSNLSHLHR SSSLGIINNS SLPIPSNGHA LPPMFGTNGN PNAGQLPSQF RQPSIGSNKQ
AYNNLANPDQ LENSYMALND EFWSDLLSTE SDRINFSNNN YNSAQVNDEL FFMN