Gene Information |
Locus tag | PHATRDRAFT_54754 |
Symbol | hUba1 |
ID | 7202783 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 3178 |
End bp | 6947 |
Gene Length | 3770 bp |
Protein Length | 1050 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | ubiquitin-activating enzyme E1, protein 2 |
Protein accession | XP_002181983 |
Protein GI | 219123337 |
COG category | |
COG ID | |
TIGRFAM ID | |
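The Gene Length field can be reproduced from the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive; the record does not say so, but that convention is consistent with its values. The function name `feature_length` is hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values taken from the Start bp and End bp fields of this record.
length = feature_length(3178, 6947)
print(length)  # 3770, matching the Gene Length field

# Coding bases needed for a 1050 aa protein plus a stop codon.
# This is smaller than the gene length, as expected for a gene marked as spliced.
coding_bp = (1050 + 1) * 3
print(coding_bp)  # 3153
```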
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.13153 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTACCCCGT GCCAACAAGC CCTTCTGGAA CCGAGACAAC CGACGGACGC TGCTAACGAA CCAGACTTTC GCTCGTAGAC ACAATCTACA CGAAACAACG CAGCCTTCCT AGAGTGACCC GCTTTGGCTT CGTGTTTACA GATAAAACAA TCCACAAACA CACGTATCCT ATCTCAATTC CTTTCGACTC CCTAGTGATT TGCCTTTGTA ACACTATGTC TGCCGAAAGG ATGGAAGCAA CAGCAACTGC CGAAGGTACG AACGATTCCT TACACGACAG CAGCTGCCGG GCATACACTC GGCGTTTAGT TTCGCCTAGC GAGCGGCAAG CAACGACGAC TTGAATTCTT TTTTCGAAGA CGACGGATGT GGACATCCTG TATTTCCTCG TTCGCTTTTT GTTGCACTGG CCTGGGGCGT GTCAATACGC TTTTGGTTGT GCCCCAATGA CAGTGAGAGC CTCCCCGAGT TTGCGTTTTG CGTGCGTTTT CTAATGGAAA GCTTGGTTTT CTTCTCACTT GTTCCTGCTT TTAACCTTTT GCCAGTGGAT GAAAAGCTTT ATTCGCGCCA ACTCTACGTT ATGGGACATG AAGCTCAGCG TCGTATGATG GCCAGCAATG TGTTGTTGGT TGGTTGCTCC GGTTTGGGCG TGGAAATCGC CAAGAATTGC ATCCTTGCCG GGATTTCTTC CATGATGTTG GTCGATCCAA CGCCGCCTAC TTCCTTTGAT TTGGGCGGAA ATTTTTACCT GCAAGAATCT GATATTGGCG GAACGAAGGG ACGAGCCGCC CTGTGCAAAG ACTCTTTGGC GCAACTCAAT CAATACGTCA GCGTGACGAC GGCGGACGTT CCGGATCTCT CCGTTGACTC AGTTCTTCCG TTGATCGATG GAAGCCTTAC CTGTGTCGTC GTTACGGTCC CGCTGCCCAA GGCATTGGTC ATTCAGCTCA ACGAAGCCTG CCGTGAACAA AAAGTGTCCT TCATTTATTC ACTCACCATG AGTGTTTTTG GTATGGCCTT CTGCGACTTT GGAGACGCCT TTGTGGTGGC CGACAAGGAT GGCGAAGCCG CCGCCACGTC TCAAATCGAA TCTGTTGTCC ACGAGAATCC CGCTGTCGTC AAAGTCTTGG AAGATCACGG TCGTCATGGT TTGGAAGACG GCGACAAGGT CAGCTTTGCC CGTTTGCACG GGGTCCCTGG GTTGGAAGAA GGCAGGGAGT ACGCCATCAA GACAACCGGA CCATTTACCT TTGAGTTGCC GGAAGTCGAT CTCAGTGGAA TTGCCGACGG GGACGGTGCC GGGCACGCAG TCAACCAACA AGGTTACATC ACACAAATCA AGCAACCGGT TACACTGAAG TTTGAATCGT ACGCTGAAAA ACTGGAAAAG CCCGGGGAGC TCATGATGTC GGACTTTGCA AAGTTTGACC GCCCACCTCT ACTGCATCTA GCATTTCAAG CCGTTGCGGC GTATTTGGAT GAAAAGGGTG AATTGCCAAT GCCAGGGGAT GTGAATACAG CTAAAGAAGT GTTGGCGCTG GCCAATACAC TCGATAAAGA AGGGATTCTC AAGTCCAACT TTCAGGTTGC CGAGCGTCTC TTGATGCATT TTGCATCTGG TGCGCGGGCG TGCCTTTCGC CCATGTGCGC AGCACTTGGA GGCATGGTGG GCCAAGAAGT TCTCAAGGCT TGCAGTGGCA AGTTTACACC TATTCCAGGT TTCTTCTACC TCGATGCTGA CGAAACCCTG CCCGATACGT TGATCGACTC CTCCTTGGTC CAACCAACGG GCACGTCACG 
CTATGATAGC CAAGTTGCCG TGTTTGGGAG CGACATGCAG GAAAATATTA ACAACTTGCA GTACTTCATG GTCGGCGCTG GGGCAATCGG CTGTGAAATG CTGAAAAACT GGGCACTCAT GGGGGTTGGC TGTTCGTCCA AGGGACACGT ATACGTGACC GACATGGATC GCATCGAAAA GTCGAACTTG TCGCGTCAAT TCTTGTTTCG CAACACCGAT ATTGACAAAT TCAAGTCGGC CACCGCTGCC GACGCGGCGA AAGCCATGAA TCCAAAGCTC AACGTTACGG CGTACCAGGA AAAAGTGGCG CAAGACACAG AGCACCTGTT CGGTGACGAC TTTTACGATA AGCTCAGTGG TGTTTGTACG GCCTTGGACA ATGTTGAAGC GCGTCTTTAC GTCGATCAGC GCTGTTTGTT CTATCGCTTA CCAATGCTGG AATCTGGCAC ACTTGGTACC AAGGGCAATA CACAAGTAGT TGTGCCACAT TTGACGGAGC ATTATGGCGC CACCCGTGAT CCACCAGAAA AATCCATCCC AGTTTGCACG CTTAAAAACT TTCCGAACCA GATTCAGCAT ACGCTGCAGT GGGCGCGTGA TTGGTTTGAA GGTGCGTTCA AGCAATCGGC CGATGAGGTC AATGCTTACC TTTCCATGCC GCCATCTCAG TACTTGGAAA CATTGCAACC CAATACCAAA ACCGAGTCAC TCAAGTTGTT GCGTCGCACG CTGGTGGATG AACGCCCTTT GACATTCGAG GACTGTGTCA CCTGGGCTCG TCTGACATTC GAAAATCTCT TCAACAACCA AATTCGGCAA TTGCTGTATA ATTTTCCGCC AGATCAAGTC ACATCGAGTG GCACAAAATT TTGGTCCGGG AGTAAGCGTT GTCCGAAACC GCTCGTGTTT GATATTGACG CTGTAGATGA AGACGCAGGG ATGCGCAATC ACTTTGATTT TGTTGTGGCA GCCGCCAACA TGCGAGCCCA ATTGTACGGT ATCAAGGGAC GCACCGATGA AGACTACTTT CGGCAAACGC TCAAGGATGT GATTGTGCCT GACTTTTCTC CAGCCGAGGG AGTAAAGATT GCGGCTAACG ATGAGGAAGC CAAGGCGACG GATGGGAACG GGATGGATAC CGGCGATGCG GAAGCGGACG AACTTTGGGG CAGTCTGCCG AAGCCCTCTG AACTTGCAGG ATTTCGGTTA CAGGGCATCG ATTTTGACAA GGATTTAGAT GAACAGATGC TCTTTGTCAC GGCGTGTTCT AATCTTCGTG CCATGAATTA TCAAATTCCA ACGGAAGACA CACATCGCTC GCGCGCAATT GCCGGACGTA TTATTCCTGC GATTGCGACG ACAACCGCCT TGGTCACCGG TCTAATTTGC CTAGAGCTTT ACAAAATGGT AGGTACGGCT CGCAAGAAGC TGTCAATTGA CGCCTATAAA AACGGCTTCA TCAATCTAGC CATTCCCTTC ATGACCTTGT CCGAACCAAC AGCTCCTGCC AAGACCAAGG CACTTGTCAA GGGCAAAGAA TGGGAATGGA CGCCTTGGGA TTCGTTGGAT ATGAGTCTTG GCGACATCAC TATGGGCGAA TTTATGGATT ATTTTGAAAA TGAATACAAT TTGGAGATTT CCATGCTCAG CCATGGGGTG AGCATCTTGT ACAGCTTTTT TGCCAATAAG AAAAAGGTGG AAGAGCGCAA AAGCATGAAA ATGACGGATG TGATTACATC CATTACAAAA AAAGAGTTTC CGTCCAACCA GCTCTTCATC ATTTTGGAAA TAATTGCAAA TGATAAGGAC 
ACGGACGAGG AAGTTGACCT GCCGTATGTG CGCTTTCGTT TCCGATGAAC ATGGTTGGAA AGCAGTGTTT TGCAGCCTTG TAAGGCTGGT CAGCTGATAG CAACGTTCCA TAACTTTTAA AAGGAAGCCT GTTGCGACAA TACTTTGTGT GACCTACATT
|
Protein sequence | MSAERMEATA TAEVDEKLYS RQLYVMGHEA QRRMMASNVL LVGCSGLGVE IAKNCILAGI SSMMLVDPTP PTSFDLGGNF YLQESDIGGT KGRAALCKDS LAQLNQYVSV TTADVPDLSV DSVLPLIDGS LTCVVVTVPL PKALVIQLNE ACREQKVSFI YSLTMSVFGM AFCDFGDAFV VADKDGEAAA TSQIESVVHE NPAVVKVLED HGRHGLEDGD KVSFARLHGV PGLEEGREYA IKTTGPFTFE LPEVDLSGIA DGDGAGHAVN QQGYITQIKQ PVTLKFESYA EKLEKPGELM MSDFAKFDRP PLLHLAFQAV AAYLDEKGEL PMPGDVNTAK EVLALANTLD KEGILKSNFQ VAERLLMHFA SGARACLSPM CAALGGMVGQ EVLKACSGKF TPIPGFFYLD ADETLPDTLI DSSLVQPTGT SRYDSQVAVF GSDMQENINN LQYFMVGAGA IGCEMLKNWA LMGVGCSSKG HVYVTDMDRI EKSNLSRQFL FRNTDIDKFK SATAADAAKA MNPKLNVTAY QEKVAQDTEH LFGDDFYDKL SGVCTALDNV EARLYVDQRC LFYRLPMLES GTLGTKGNTQ VVVPHLTEHY GATRDPPEKS IPVCTLKNFP NQIQHTLQWA RDWFEGAFKQ SADEVNAYLS MPPSQYLETL QPNTKTESLK LLRRTLVDER PLTFEDCVTW ARLTFENLFN NQIRQLLYNF PPDQVTSSGT KFWSGSKRCP KPLVFDIDAV DEDAGMRNHF DFVVAAANMR AQLYGIKGRT DEDYFRQTLK DVIVPDFSPA EGVKIAANDE EAKATDGNGM DTGDAEADEL WGSLPKPSEL AGFRLQGIDF DKDLDEQMLF VTACSNLRAM NYQIPTEDTH RSRAIAGRII PAIATTTALV TGLICLELYK MVGTARKKLS IDAYKNGFIN LAIPFMTLSE PTAPAKTKAL VKGKEWEWTP WDSLDMSLGD ITMGEFMDYF ENEYNLEISM LSHGVSILYS FFANKKKVEE RKSMKMTDVI TSITKKEFPS NQLFIILEII ANDKDTDEEV DLPYVRFRFR
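The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any computation such as checking the GC content field. The following is a minimal sketch; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown in this record.
example = "TTTACCCCGT GCCAACAAGC"
print(len(clean_sequence(example)))  # 20
print(gc_fraction(example))          # 0.55 (11 of 20 bases are G or C)
```

Because the gene is marked as spliced, translating the cleaned gene sequence directly would not be expected to reproduce the protein sequence; introns would have to be removed first.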