Gene PICST_31032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31032 
SymbolUGA3 
ID4838303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1566748 
End bp1569159 
Gene Length2412 bp 
Protein Length803 aa 
Translation table12 
GC content37% 
IMG OID640389618 
ProductFungal transcriptional regulatory protein 
Protein accessionXP_001383922 
Protein GI150864914 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.282349 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.13348 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAG GAACAAGACA GAATGAGACT GAAAATCTAC AAAGTATCTC AAGCATAATA 
TCAGTCGAAC AAGATCAAGA TCAACATTTG GAACATCTTT CACAGCTACA ACTACTCCAG
GAACTTCCAA AGCCGAAGAA AAGAGCCTGC GATAGATGTC ATGAAATAAA GCAAGTTTGT
ACGGGAAGTG TGCCATGTGA AAGGTGTGCC AGAATGAGCA TTAAGTGCGT GTTGGATAGG
CCATTGAAAA AAATTGGTAG ACCTCAAAGA GGAGATAGAG ATCTGTCTTA TTCTCCTAGA
AGCAACTCTT TGATACCTCT GAGTGATCGC AGAAATGGCA AACTTGAGAA GAGAACTGGA
ATACGCCACT CGAAATCGGC CTGTGAAGCA TGTAGAAAGA GAAAGAGAAA ATGTGATGAA
AGTTGGCCCA ATTGTGGCTA TTGTACCAAA CTGAATTTGG AATGTTCGGG GCCTACTGTG
CGTAAGAGGA AGCCAAAGGC TAAATCTTCA AGATTAGACC CATTGTTGCA AAGGAAGGCT
GAGATCAGCT TGTCAATACA ATCTGCTTCT AGTTCGACAT CACTAATAAG CAGTCTCTCA
GTTGATGATA GTTCAGCTTC GCCCCATTTT GACAATCCCA ACGATGGAAA TTGTTCCGAG
ACTACAGTTC TGCTGCAAAC AAGTTCCACG GAACAGAAAT CTTCAGAACC TATTAGGGAT
AGATGTGTAA AGGAGACTCA TAACATTCCA GTTTCACCAG CTGCTTTTAT GTCGACTGAT
ACCTTTCTTA ACAACGACAT TACATTATTA AACATGGATA AAAAAACAAA TACAGGGGCG
GAATCTTCAC CTATGAACAG TACTTGCAGC AGTAATGATG CACTATTTTC TGATTTGTTT
AGAAGAGATA ACGACGATGG CGACGATAAT ATTACCAGTT CCATTATAGA TCAGGATTTA
AGAAATTTTG GTCTAATTCT CGCGAACAAT CTGAATAATC CTGAGAGTAG CGAGGCCAAT
ATTAATAACG AGAGTATCAA CAATAGTAGT GATAATATTA CTAGAAAACG TATGGCCATT
TCTAGATCTG ATACCCCTGG TCCAATTCAA TTACACAAAT CAACATCCAT CAAATATCAC
CAATTGTTTG AGAACATGAA TCAAATCAGT AATAGTATAT GCTTGAGTCT TTTGAATAAC
ATTGTAATGG AAAACAACAT TTCTGTAAAG GAGAGATTTT TGTTAAAGTA TTTTGTTACT
GACGTATCAT TCACCATATT TGCTGACGAG ACCTCTAATG CATTCATGAG CACTATTATC
CCTATCTCAT TGAAAGATAA GAGGGTCAGA GATCCAATTT TGGCCATTGC TTCGGCTCAC
AGGGCAGGTA ATGATATAAA ATTCTTTAGA GACGCTGTCT TGTATCGTTC AAGTTCTCAT
GCTACATTGC TTGGGAGCAC TTCTCAAGTA GAGTACTACT TTTCTGATGA AATCTTACTA
TCAATCTTGT TGAGTGGAAT CATGGAAATA TTGAATGGAT CGTCTTTAGG TTGGTCTGTT
TTATTAGAAA AAGCATCTGA GATCACAAAA TTTAGAGGAG GTATTAAAAA GATGGCTTCA
GCTCGAAGTG GTTATGCACC AATGTTGGTT CAACTTTTTT GCTATATAGA TCTTATTTCA
AGCCTCAGTA CTTGTCATCC ACCTTATATT GAACAATCAG CCCAAAATGA CAATTGTCAA
ACAAGCGAAC TGACTATAAG TGAAAAGATA GAAATAGAAT CTCATGTTTA TGATCAAGAA
GAGGTCACAG AGATTTTGAA CAGCAAGTTT GGCTTCAGAT TTGGTATTGC TGGTGAAATA
TTCAAAATTT TGGGCAATAT TTCAACTTTG GCCTCGTTAC GTAAATCGAG GCATGACGGC
GAAGATCAAG AGAGACAATT TCAAATAATG GCGGATGATA TCGAGATGAA ACTACAAGAC
TGGGAATTAC CAACCACAAT GAACTTCAAT GATGTTTCAG ATGTCCAGAT GTCTCAATAT
GCCATGGCTT TACAATGGGC TGCTTTTTTA AGATTGCATC AAATCAAAGA TGGTTACAAC
CGTCAAGACA TAAGAGTTAA AGTGTGTCTT TCTACTATCC TAAGAGCTGT CAAATTGATC
CCAGAGAAGT CAAACTTGGA AAGTAGTTTA ATGTTTCCGT TGATACTAGC TGGTTCTGTA
GCGATAACAA AAACTGATCG AGATTTTATC ATTTCAAGGG TCAGATCTAT AAAGAAAAGA
TTGAAGTTCC ACTATATTGA AGAGTTTGAA CGAATGTTGT TGTACATTTG GAGTAGAGAT
AATAAGGAAG GAAATTTTGT TAATTGGGCT GCTGTCAGAT ACTATCAATT TCCTGGTTTG
GTTATGTTTT GA
 
Protein sequence
MSEGTRQNET ENLQSISSII SVEQDQDQHL EHLSQLQLLQ ELPKPKKRAC DRCHEIKQVC 
TGSVPCERCA RMSIKCVLDR PLKKIGRPQR GDRDSSYSPR SNSLIPSSDR RNGKLEKRTG
IRHSKSACEA CRKRKRKCDE SWPNCGYCTK SNLECSGPTV RKRKPKAKSS RLDPLLQRKA
EISLSIQSAS SSTSLISSLS VDDSSASPHF DNPNDGNCSE TTVSSQTSST EQKSSEPIRD
RCVKETHNIP VSPAAFMSTD TFLNNDITLL NMDKKTNTGA ESSPMNSTCS SNDALFSDLF
RRDNDDGDDN ITSSIIDQDL RNFGLILANN SNNPESSEAN INNESINNSS DNITRKRMAI
SRSDTPGPIQ LHKSTSIKYH QLFENMNQIS NSICLSLLNN IVMENNISVK ERFLLKYFVT
DVSFTIFADE TSNAFMSTII PISLKDKRVR DPILAIASAH RAGNDIKFFR DAVLYRSSSH
ATLLGSTSQV EYYFSDEILL SILLSGIMEI LNGSSLGWSV LLEKASEITK FRGGIKKMAS
ARSGYAPMLV QLFCYIDLIS SLSTCHPPYI EQSAQNDNCQ TSESTISEKI EIESHVYDQE
EVTEILNSKF GFRFGIAGEI FKILGNISTL ASLRKSRHDG EDQERQFQIM ADDIEMKLQD
WELPTTMNFN DVSDVQMSQY AMALQWAAFL RLHQIKDGYN RQDIRVKVCL STILRAVKLI
PEKSNLESSL MFPLILAGSV AITKTDRDFI ISRVRSIKKR LKFHYIEEFE RMLLYIWSRD
NKEGNFVNWA AVRYYQFPGL VMF