Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31032 |
Symbol | UGA3 |
ID | 4838303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1566748 |
End bp | 1569159 |
Gene Length | 2412 bp |
Protein Length | 803 aa |
Translation table | 12 |
GC content | 37% |
IMG OID | 640389618 |
Product | Fungal transcriptional regulatory protein |
Protein accession | XP_001383922 |
Protein GI | 150864914 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.282349 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.13348 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAG GAACAAGACA GAATGAGACT GAAAATCTAC AAAGTATCTC AAGCATAATA TCAGTCGAAC AAGATCAAGA TCAACATTTG GAACATCTTT CACAGCTACA ACTACTCCAG GAACTTCCAA AGCCGAAGAA AAGAGCCTGC GATAGATGTC ATGAAATAAA GCAAGTTTGT ACGGGAAGTG TGCCATGTGA AAGGTGTGCC AGAATGAGCA TTAAGTGCGT GTTGGATAGG CCATTGAAAA AAATTGGTAG ACCTCAAAGA GGAGATAGAG ATCTGTCTTA TTCTCCTAGA AGCAACTCTT TGATACCTCT GAGTGATCGC AGAAATGGCA AACTTGAGAA GAGAACTGGA ATACGCCACT CGAAATCGGC CTGTGAAGCA TGTAGAAAGA GAAAGAGAAA ATGTGATGAA AGTTGGCCCA ATTGTGGCTA TTGTACCAAA CTGAATTTGG AATGTTCGGG GCCTACTGTG CGTAAGAGGA AGCCAAAGGC TAAATCTTCA AGATTAGACC CATTGTTGCA AAGGAAGGCT GAGATCAGCT TGTCAATACA ATCTGCTTCT AGTTCGACAT CACTAATAAG CAGTCTCTCA GTTGATGATA GTTCAGCTTC GCCCCATTTT GACAATCCCA ACGATGGAAA TTGTTCCGAG ACTACAGTTC TGCTGCAAAC AAGTTCCACG GAACAGAAAT CTTCAGAACC TATTAGGGAT AGATGTGTAA AGGAGACTCA TAACATTCCA GTTTCACCAG CTGCTTTTAT GTCGACTGAT ACCTTTCTTA ACAACGACAT TACATTATTA AACATGGATA AAAAAACAAA TACAGGGGCG GAATCTTCAC CTATGAACAG TACTTGCAGC AGTAATGATG CACTATTTTC TGATTTGTTT AGAAGAGATA ACGACGATGG CGACGATAAT ATTACCAGTT CCATTATAGA TCAGGATTTA AGAAATTTTG GTCTAATTCT CGCGAACAAT CTGAATAATC CTGAGAGTAG CGAGGCCAAT ATTAATAACG AGAGTATCAA CAATAGTAGT GATAATATTA CTAGAAAACG TATGGCCATT TCTAGATCTG ATACCCCTGG TCCAATTCAA TTACACAAAT CAACATCCAT CAAATATCAC CAATTGTTTG AGAACATGAA TCAAATCAGT AATAGTATAT GCTTGAGTCT TTTGAATAAC ATTGTAATGG AAAACAACAT TTCTGTAAAG GAGAGATTTT TGTTAAAGTA TTTTGTTACT GACGTATCAT TCACCATATT TGCTGACGAG ACCTCTAATG CATTCATGAG CACTATTATC CCTATCTCAT TGAAAGATAA GAGGGTCAGA GATCCAATTT TGGCCATTGC TTCGGCTCAC AGGGCAGGTA ATGATATAAA ATTCTTTAGA GACGCTGTCT TGTATCGTTC AAGTTCTCAT GCTACATTGC TTGGGAGCAC TTCTCAAGTA GAGTACTACT TTTCTGATGA AATCTTACTA TCAATCTTGT TGAGTGGAAT CATGGAAATA TTGAATGGAT CGTCTTTAGG TTGGTCTGTT TTATTAGAAA AAGCATCTGA GATCACAAAA TTTAGAGGAG GTATTAAAAA GATGGCTTCA GCTCGAAGTG GTTATGCACC AATGTTGGTT CAACTTTTTT GCTATATAGA TCTTATTTCA AGCCTCAGTA CTTGTCATCC ACCTTATATT GAACAATCAG CCCAAAATGA CAATTGTCAA ACAAGCGAAC TGACTATAAG TGAAAAGATA GAAATAGAAT CTCATGTTTA TGATCAAGAA GAGGTCACAG AGATTTTGAA CAGCAAGTTT GGCTTCAGAT TTGGTATTGC TGGTGAAATA TTCAAAATTT TGGGCAATAT TTCAACTTTG GCCTCGTTAC GTAAATCGAG GCATGACGGC GAAGATCAAG AGAGACAATT TCAAATAATG GCGGATGATA TCGAGATGAA ACTACAAGAC TGGGAATTAC CAACCACAAT GAACTTCAAT GATGTTTCAG ATGTCCAGAT GTCTCAATAT GCCATGGCTT TACAATGGGC TGCTTTTTTA AGATTGCATC AAATCAAAGA TGGTTACAAC CGTCAAGACA TAAGAGTTAA AGTGTGTCTT TCTACTATCC TAAGAGCTGT CAAATTGATC CCAGAGAAGT CAAACTTGGA AAGTAGTTTA ATGTTTCCGT TGATACTAGC TGGTTCTGTA GCGATAACAA AAACTGATCG AGATTTTATC ATTTCAAGGG TCAGATCTAT AAAGAAAAGA TTGAAGTTCC ACTATATTGA AGAGTTTGAA CGAATGTTGT TGTACATTTG GAGTAGAGAT AATAAGGAAG GAAATTTTGT TAATTGGGCT GCTGTCAGAT ACTATCAATT TCCTGGTTTG GTTATGTTTT GA
|
Protein sequence | MSEGTRQNET ENLQSISSII SVEQDQDQHL EHLSQLQLLQ ELPKPKKRAC DRCHEIKQVC TGSVPCERCA RMSIKCVLDR PLKKIGRPQR GDRDSSYSPR SNSLIPSSDR RNGKLEKRTG IRHSKSACEA CRKRKRKCDE SWPNCGYCTK SNLECSGPTV RKRKPKAKSS RLDPLLQRKA EISLSIQSAS SSTSLISSLS VDDSSASPHF DNPNDGNCSE TTVSSQTSST EQKSSEPIRD RCVKETHNIP VSPAAFMSTD TFLNNDITLL NMDKKTNTGA ESSPMNSTCS SNDALFSDLF RRDNDDGDDN ITSSIIDQDL RNFGLILANN SNNPESSEAN INNESINNSS DNITRKRMAI SRSDTPGPIQ LHKSTSIKYH QLFENMNQIS NSICLSLLNN IVMENNISVK ERFLLKYFVT DVSFTIFADE TSNAFMSTII PISLKDKRVR DPILAIASAH RAGNDIKFFR DAVLYRSSSH ATLLGSTSQV EYYFSDEILL SILLSGIMEI LNGSSLGWSV LLEKASEITK FRGGIKKMAS ARSGYAPMLV QLFCYIDLIS SLSTCHPPYI EQSAQNDNCQ TSESTISEKI EIESHVYDQE EVTEILNSKF GFRFGIAGEI FKILGNISTL ASLRKSRHDG EDQERQFQIM ADDIEMKLQD WELPTTMNFN DVSDVQMSQY AMALQWAAFL RLHQIKDGYN RQDIRVKVCL STILRAVKLI PEKSNLESSL MFPLILAGSV AITKTDRDFI ISRVRSIKKR LKFHYIEEFE RMLLYIWSRD NKEGNFVNWA AVRYYQFPGL VMF
|
| |