Gene PICST_87984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_87984 
SymbolUBA2 
ID4837568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2503818 
End bp2505710 
Gene Length1893 bp 
Protein Length616 aa 
Translation table12 
GC content41% 
IMG OID640388883 
ProductProtein with homology to mammalian ubiquitin activating (E1) enzyme 
Protein accessionXP_001383236 
Protein GI150864427 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0158101 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAGAG ATACTTATTT GAAGAAAGTC TTAGGCGATG AGTGCTTTGG TCGTGTTCAG 
AGAACTAGGG TGGTTATGGT TGGAGCTGGT GGTATAGGAT GTGAACTCTT GAAAGATCTA
CTTTTGACAG GATATGGAGA GATTCACATT GTCGATCTTG ATACCGTGAC TCTTTCCAAC
TTGAACAGAC AGTTCTTATT TCGCAAGAAG GATATCGACA AGTCAAAGTC CTTAACTATT
GCCAAAGCTG TACAATCGTT CAACTATTTT GGTGCCAAAT TAGTTCCTCA TCATGGCAAT
ATTATGGATA CTAACCAGTT TCCACTCACC TGGTGGTCTC AGTTCAGCTA TGTGTACAAC
GCTTTGGATA ACTTAGAAGC CCGAAGATAC GTAAACAAAA TGTGTTTGTT CCTCAAGAAA
CCGTTGATGG AAAGTGGCAC TACAGGTTTT GAAGGACAGA TTCAACCCAT TTACCCATAC
TATAGCGAAT GTTTTGACTG TCAAGCCAAA GTCACACCAA AGACTTTTCC AGTGTGTACT
ATTAGGTCAA CACCGTCACT TCCGGTTCAT TGTATCACAT GGGCAAAAGA ATTCCTCTTC
CACCAATTGT TTGACGAATC TGAGATATCG AGCATGAATA ATGAAGAACA GATTAGGAAC
GAGACTGATG ATGTTCAGGA AAAAGAAAAC TTGGCCAAAG AAGCCAACGA GCTTATTGAC
TTGAGAAACC AGATCAAAGG TTTGGACGGG TCAGCGTTTA TAGAGCTGCT TGTGGTCAAG
ATCTTTCAGG CAGACATCGA AAGGTTATTA TTGATTGATA CATTATGGAA GTCACGTAGA
AAGCCTATAC CTCTAAACTT CAATGCTCTT TCTACTGAGT TGCAGCAGCT ACTTCATGCA
AAAAATAACA TCATCAGCAC CGATACAAAG GTGTGGTCGG TGTTGGAAAA CTTGTTTGTA
TTATATAAGT CTGGCGTGGC TTTGCAATCC AGATTGAAAT CTGGAAAGGA ACTGTTCGTA
TCGTTTGACA AGGATGACGA CGATACCCTC AACTTTGTTG TGGCAGCGGC CAACTTAAGG
TCTTCAATCT TCGGGATACC TTTGATGTCT AAATTTGATA TCAAGGAAAT TGCCGGTAAT
ATCATTCCAG CCATTGCTAC CACAAATGCC ATCATTTCTG GGTTTTCGAG CTTGAACGGT
ACCAAATTCT TCAAACATGA CTATGAACAG ACGGGTAGTG TCGACTTCTC CCCGATTGTT
AAGGAATCTT CTACGGTTTT CATTAGTATC AAGCCAAACA AGTATATCAC AGCTGCATCG
TTGGTTTCTC CCAATGAAAG TTGTCCTAGT TCGTCATTAC TTTCGCGCGG TATCATGAAC
TTAACGAATC AAGAGTTGCA AGAAAACACC TTACGGTGGC TTGTAGATGA GTTGGTCAAG
AAGTACGGTT ACGAAGATGG CGACTTGTCT ATCATCGTAG GCAAGTCGCG ATTAGTCTAC
GACGTTGATT TCGACGACAA CATCGATAGC TCTTTACTGG AGCTATCAGG ATTTGAAGAT
GGCGACCTTG TATTGATCCA AGATGAAAAC GATGAGTTGG AGAATCTCGA GCTTTATATT
ACAGTTGTTA ATGAGCCTAC TACCGAAAAA TTGCCTAACA TACAGTTAAG AGAGAAAAAG
GCAATAAAAG AGAAGAATGA GGATGAACAA GGAGATTCCA CTATTGAAAA CAATACGAAT
CAGGACGGTA CTACAATGGT TATTATTGAA GATGAAGAAG ACACAGACTT GGTGATGATA
GCAGAGCCGG CACTGAAGAA AAGACGCATA GCTGGTGAAG AATACATGTA AATGAAGCCA
TAGAAATAGC AAGAAGAATC AAAAAGAAGT TAT
 
Protein sequence
MARDTYLKKV LGDECFGRVQ RTRVVMVGAG GIGCELLKDL LLTGYGEIHI VDLDTVTLSN 
LNRQFLFRKK DIDKSKSLTI AKAVQSFNYF GAKLVPHHGN IMDTNQFPLT WWSQFSYVYN
ALDNLEARRY VNKMCLFLKK PLMESGTTGF EGQIQPIYPY YSECFDCQAK VTPKTFPVCT
IRSTPSLPVH CITWAKEFLF HQLFDESEIS SMNNEEQIRN ETDDVQEKEN LAKEANELID
LRNQIKGLDG SAFIESLVVK IFQADIERLL LIDTLWKSRR KPIPLNFNAL STELQQLLHA
KNNIISTDTK VWSVLENLFV LYKSGVALQS RLKSGKESFV SFDKDDDDTL NFVVAAANLR
SSIFGIPLMS KFDIKEIAGN IIPAIATTNA IISGFSSLNG TKFFKHDYEQ TGSVDFSPIV
KESSTVFISI KPNKYITAAS LVSPNESCPS SSLLSRGIMN LTNQELQENT LRWLVDELVK
KYGYEDGDLS IIVGKSRLVY DVDFDDNIDS SLSELSGFED GDLVLIQDEN DELENLELYI
TVVNEPTTEK LPNIQLREKK AIKEKNEDEQ GDSTIENNTN QDGTTMVIIE DEEDTDLVMI
AEPASKKRRI AGEEYM