Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_87984 |
Symbol | UBA2 |
ID | 4837568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 2503818 |
End bp | 2505710 |
Gene Length | 1893 bp |
Protein Length | 616 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388883 |
Product | Protein with homology to mammalian ubiquitin activating (E1) enzyme |
Protein accession | XP_001383236 |
Protein GI | 150864427 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0158101 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAGAG ATACTTATTT GAAGAAAGTC TTAGGCGATG AGTGCTTTGG TCGTGTTCAG AGAACTAGGG TGGTTATGGT TGGAGCTGGT GGTATAGGAT GTGAACTCTT GAAAGATCTA CTTTTGACAG GATATGGAGA GATTCACATT GTCGATCTTG ATACCGTGAC TCTTTCCAAC TTGAACAGAC AGTTCTTATT TCGCAAGAAG GATATCGACA AGTCAAAGTC CTTAACTATT GCCAAAGCTG TACAATCGTT CAACTATTTT GGTGCCAAAT TAGTTCCTCA TCATGGCAAT ATTATGGATA CTAACCAGTT TCCACTCACC TGGTGGTCTC AGTTCAGCTA TGTGTACAAC GCTTTGGATA ACTTAGAAGC CCGAAGATAC GTAAACAAAA TGTGTTTGTT CCTCAAGAAA CCGTTGATGG AAAGTGGCAC TACAGGTTTT GAAGGACAGA TTCAACCCAT TTACCCATAC TATAGCGAAT GTTTTGACTG TCAAGCCAAA GTCACACCAA AGACTTTTCC AGTGTGTACT ATTAGGTCAA CACCGTCACT TCCGGTTCAT TGTATCACAT GGGCAAAAGA ATTCCTCTTC CACCAATTGT TTGACGAATC TGAGATATCG AGCATGAATA ATGAAGAACA GATTAGGAAC GAGACTGATG ATGTTCAGGA AAAAGAAAAC TTGGCCAAAG AAGCCAACGA GCTTATTGAC TTGAGAAACC AGATCAAAGG TTTGGACGGG TCAGCGTTTA TAGAGCTGCT TGTGGTCAAG ATCTTTCAGG CAGACATCGA AAGGTTATTA TTGATTGATA CATTATGGAA GTCACGTAGA AAGCCTATAC CTCTAAACTT CAATGCTCTT TCTACTGAGT TGCAGCAGCT ACTTCATGCA AAAAATAACA TCATCAGCAC CGATACAAAG GTGTGGTCGG TGTTGGAAAA CTTGTTTGTA TTATATAAGT CTGGCGTGGC TTTGCAATCC AGATTGAAAT CTGGAAAGGA ACTGTTCGTA TCGTTTGACA AGGATGACGA CGATACCCTC AACTTTGTTG TGGCAGCGGC CAACTTAAGG TCTTCAATCT TCGGGATACC TTTGATGTCT AAATTTGATA TCAAGGAAAT TGCCGGTAAT ATCATTCCAG CCATTGCTAC CACAAATGCC ATCATTTCTG GGTTTTCGAG CTTGAACGGT ACCAAATTCT TCAAACATGA CTATGAACAG ACGGGTAGTG TCGACTTCTC CCCGATTGTT AAGGAATCTT CTACGGTTTT CATTAGTATC AAGCCAAACA AGTATATCAC AGCTGCATCG TTGGTTTCTC CCAATGAAAG TTGTCCTAGT TCGTCATTAC TTTCGCGCGG TATCATGAAC TTAACGAATC AAGAGTTGCA AGAAAACACC TTACGGTGGC TTGTAGATGA GTTGGTCAAG AAGTACGGTT ACGAAGATGG CGACTTGTCT ATCATCGTAG GCAAGTCGCG ATTAGTCTAC GACGTTGATT TCGACGACAA CATCGATAGC TCTTTACTGG AGCTATCAGG ATTTGAAGAT GGCGACCTTG TATTGATCCA AGATGAAAAC GATGAGTTGG AGAATCTCGA GCTTTATATT ACAGTTGTTA ATGAGCCTAC TACCGAAAAA TTGCCTAACA TACAGTTAAG AGAGAAAAAG GCAATAAAAG AGAAGAATGA GGATGAACAA GGAGATTCCA CTATTGAAAA CAATACGAAT CAGGACGGTA CTACAATGGT TATTATTGAA GATGAAGAAG ACACAGACTT GGTGATGATA GCAGAGCCGG CACTGAAGAA AAGACGCATA GCTGGTGAAG AATACATGTA AATGAAGCCA TAGAAATAGC AAGAAGAATC AAAAAGAAGT TAT
|
Protein sequence | MARDTYLKKV LGDECFGRVQ RTRVVMVGAG GIGCELLKDL LLTGYGEIHI VDLDTVTLSN LNRQFLFRKK DIDKSKSLTI AKAVQSFNYF GAKLVPHHGN IMDTNQFPLT WWSQFSYVYN ALDNLEARRY VNKMCLFLKK PLMESGTTGF EGQIQPIYPY YSECFDCQAK VTPKTFPVCT IRSTPSLPVH CITWAKEFLF HQLFDESEIS SMNNEEQIRN ETDDVQEKEN LAKEANELID LRNQIKGLDG SAFIESLVVK IFQADIERLL LIDTLWKSRR KPIPLNFNAL STELQQLLHA KNNIISTDTK VWSVLENLFV LYKSGVALQS RLKSGKESFV SFDKDDDDTL NFVVAAANLR SSIFGIPLMS KFDIKEIAGN IIPAIATTNA IISGFSSLNG TKFFKHDYEQ TGSVDFSPIV KESSTVFISI KPNKYITAAS LVSPNESCPS SSLLSRGIMN LTNQELQENT LRWLVDELVK KYGYEDGDLS IIVGKSRLVY DVDFDDNIDS SLSELSGFED GDLVLIQDEN DELENLELYI TVVNEPTTEK LPNIQLREKK AIKEKNEDEQ GDSTIENNTN QDGTTMVIIE DEEDTDLVMI AEPASKKRRI AGEEYM
|
| |