Gene PICST_66589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_66589 
SymbolCAP1 
ID4851665 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2487570 
End bp2489833 
Gene Length2264 bp 
Protein Length492 aa 
Translation table 
GC content43% 
IMG OID640393373 
Producttranscriptional activator involved in oxidative stress response 
Protein accessionXP_001387049 
Protein GI126275202 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCTTAATCTA TAGCTATCCG GCCAATTGTT CTTCGGACAA GTAGTGCCGT TCTTTCAATT 
GTATATTCGA CACCTCCACT GCTGTGTTCG TGTCTCCATC CTAAGCTGTC TAGACGTTAT
AACCGACGCG TTCACCCGTC TAAGCTCCGT CCCAGTCTCT CCATAAATCG TGTCCCAAAT
ATTTGGCACC TGCAAGCATC CCTCCTGCAG ATAGATCCAC AGTCTCCAGA CTTTTTTCTG
TAGCCTTCCA CAGATAAAAA GAAATTTGTA CCAGTATCGT ACGCAGGATC CAGAAGTTAC
CCGCACGCCG TCTTTATTGA TTACTAATAT CTGTGCTAGA ATAGTGAAAT ATATCACTAG
AGCATCTCCA AAAGCAGTTT TTCACACGTC TAGTTACTCG TGTTAGCTAC TTTTCGTGTT
TCTATTTCAA CCAACAATCT TTGTAAACTC ACCAGCATGA ACGACGTGAA GAGAAATTAC
GCTGAGGTTC TATCCTCAGA GTCGCCCATG GGGTCTACAC CAGACGCACA TGACGACAAG
AAGTTACATA CCAAGCCTGG CAGAAAGCCG ATAGAGACAG AGCCGAAGTC CAAGAGAACG
GCCCAAAACA GAGCTGCTCA GCGTGCTTAT AGAGAACGTA AGGAACGTAA AATGAAGGAC
TTGGAAGACA AGGTTAAGTC GCTAGAAGAC GAAAACATCA AGGCTACAAC AGAAGCAGAC
TTCTTGAAAG CTCAGGTGGA TATGTTAAAG AATGAGTTAG CCAGATACAG AGGCCACACA
GATTTCTCGG ACTTGAATCT ACCTACTAAG GTAGGAAATT TGTCGAATCC AAACACATCC
AAGTCTGGCA GCTACAATTT CAATTCGGCT TCATCCACAG CATCTTCGGC CAAATCTGCT
AATTCTGTAC AACACACATC TACATCTTCG TCTTTAAATG ATAATTCTCC ACGTCAGTTC
TCTGTGGACT TTCCATGGTC CAAGGATAAC TTGATGAGTC TCAAGAGCGG TACAAACGTA
GCCAGCCTGG AGTATAATGC CAACCAGCAG GTTCCAGATT TGGTGAGCGG CTCTTCCTCA
TCTACTTCGC CTTTAAATGA TAACCTCTTG GTTTCGCCAG ATTCGTCTGT ATCCTCAGCT
TCTAATCCAA TTAACGTCAA CACAAACTTG GACTTCACAT CGTCTTTTGA CGAGCAGTTG
GATCCATTCT GTGTCAAGTT GAATGAAGCA TGTGGAACGA AGCAGTGCCC AGTTCCTAAG
ACTAAGAGAA ACGACTCGAG GGTCTCCCAG AGCTCGATAC CCAACCAGTT CAGCTCGCCA
TTCTCTAACT TGGTAACACC AACTCCGCAG AACTTAAACG ACATTGACTA CTTGAGCGAT
CCGTTCTTCA ACCAAGTGGG AGATCCATTC TCTCTAGACT TGTCCAACAA CCAATCAGCA
TTATCAACCA ACACTTCCGT GGATTCCAGC AATAGCATAA ACAGCAACAG TACGGCTGTG
CCAGTTCGCT CAAATAACAC CAGCATAGCC ACTCCATCGC ATAACAACGA AGATCCATTG
TCGTTCCTTA ATGACAACAA TTTCGATGTG TCGTTGGCTT TTGGTGATCC AAACCCTAGG
CATGGTAAAG ATGAATTGGA CCCTATAGCA TTGTTGACTA CTGAAGAGTC GATCTATGAT
CCGTTGAAGG ATACTAGCGG AGTGAACGTG AACTTCAACT TCAACGACTT TGTCAAGAGC
TCCTTGCCCT CTGAGACAAC CCCCAAGGAA AGAAACTATA CTTTGACTGA ACCTTCTATC
AATGAAGAGG TTGCTGAAGA TGACGATGAT GATGCTGTGG TGCCGGCTCC TGAACAAACT
ATTAGATGCA GTGAAATTTG GGACAGAATC ACTGCTCATC CAAAGTATAC TGAGATTGAT
ATTGATGGTT TGTGTAACGA ATTGAAGAGT AAGGCTAAAT GCTCCGAAAA AGGCGTGGTG
ATTAATGCTG CCGACGTGAA TCAATTGTTG GAACAAAGCG CGATGAAGAG GCGTTGAACA
GTATATGTTA ATTATTAATG CCAATTCGAC ACGTTTAATG AGATTTTTTC AGTTCGGACC
TAAGTGATAT GTCATGTACC ATATTTTCAG ACTTTTTGTC GCAGCTTGAT TTTTAGTTCA
TGTTCGATAG CAGTGCTAGT AGACTTTTTT CATGGCACTT AATTTATTCA TATCCATATA
TATGGTATTT ACCTAGAGTT ATAAAAATAT AAAAACATCA ACTG
 
Protein sequence
MNDVKRNYAE VLSSESPMGS TPDAHDDKKL HTKPGRKPIE TEPKSKRTAQ NRAAQRAYRE 
RKERKMKDLE DKVKSLEDEN IKATTEADFL KAQVDMLKNE LARYRGHTDF SDLNLPTKVG
NLSNPNTSKS GSYNFNSASS TASSAKSANS VQHTSTSSSL NDNSPRQFSV DFPWSKDNLM
SLKSGTNVAS LEYNANQQVP DLVSGSSSST SPLNDNLLVS PDSSVSSASN PINVNTNLDF
TSSFDEQLDP FCVKLNEACG TKQCPVPKTK RNDSRVSQSS IPNQFSSPFS NLVTPTPQNL
NDIDYLSDPF FNQVGDPFSL DFNTTPSHNN EDPLSFLNDN NFDVSLAFGD PNPRHGKDEL
DPIALLTTEE SIYDPLKDTS GVNVNFNFND FVKSSLPSET TPKERNYTLT EPSINEEVAE
DDDDDAVVPA PEQTIRCSEI WDRITAHPKY TEIDIDGLCN ELKSKAKCSE KGVVINAADV
NQLLEQSAMK RR