Gene Information |
Locus tag | PHATRDRAFT_33774 |
Symbol | UBA3 |
ID | 7198040 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 320260 |
End bp | 321744 |
Gene Length | 1485 bp |
Protein Length | 462 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | ubiquitin-activating enzyme E1, protein 3 |
Protein accession | XP_002178207 |
Protein GI | 219114823 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGGAA ACGACCTGGA CAATTTGAAA CCTATGGTAC GTTGCTCGTT CGACTTCAGC ATCGATATCG ATGCATCATT TAGCGAACGC GCTAAATACA TGTAAGGTTA TTTTCTAACC GCAATCCCCC AGGAAGTTGA CACTAGAAGA TCGTCTGAAA ATGCTAGGGG TATACGTGGG TCTCTACTGA CCCTTCTCAG CCGCCCGTCG CCTTTCGGAA ACGAGACTGG ACCATTGGCA TGTGGCGAGT TTGAACCGCT TCCGAAACTA AGTTCCTGTT ACGCTACTAC AGCTTCTGAC CATGAATCCC CTTTGACGAA AGCCAAAATT CTCGTCGTCG GTGCTGGAGG GTTGGGTTGT GAAATTCTCA AGAATCTTGC GATGTCCGGC GTGAGAGATG TGGACGTAAT TGATCTTGAT TCAATCGACG TGACCAATCT AAATCGTCAG TTCTTATTCC GTCAACGAGA TGTCGGCACA TCAAAGGCGA AAACCGCAGC TGCTTTCATC AACGAGCGCT GCCCTTGGAT GAGCGTTACA GCTCACCACG GTATGATTCA GGACAAGGAG CCGTCGTTCT ACTCCTCCTT TGATTGTATC ATATCGGGAC TCGACAACGT TGAAGCTCGT CGTTGGCTCA ACGCGACTGT GGTCGGACTC GTAGAGTTCG ATGACGACGG CGATATGGAT CCAGCCTCAA TCATTCCGAT TATTGATGGC GGAACGGAAG GATTTTCAGG ACAAGCTCGT TTTATCCTGC CGCGTATCAC GAGCTGCTTT GAGTGTACAA TCGATGCTTT TCCGCCACAA ATTGCTTTTC CGTTATGCAC GATTGCCGAG ACTCCACGCA AACCGGAACA TTGCATTGCA TACGCGTCAA TTCTTCAATG GCCGAGAGAA TTTCACGATA AGAAGCTCGA CAGTGATGAT CCGGATGACA TGAAGTGGGT CTACGAAAAG GCGTTGGAGC GAGCAAAGCA GTACAACATT GACGGGGTTA CATATATGCT AACCATGGGC GTAGTCAAGA ATATAATTCC TGCCGTTGCG AGTACCAACG CAATCATTGC GGCGGCGTGC GTGAATGAGG CGATAAAATA CATCACCTTT TGCTCACAGA ATCTCAACTC ATACATGATG TACATGGGGT CTGAGGGTGT TCATTGTCAC ACGTTTGCAT ACGAGCAAAA AGATGATTGC CCGGTTTGTA CCTCGACTGT GCAAAAAATG ACAATTTCTA AGACAACTAC GCTGAACGAG CTATTGCAAG AGTTTCGCGC GGGTCCCTTG CGTCTGAAAT CGCCAAGCCT CGTCAGTTCA GGCGGAAAGA CGCTTTACAT GCAAAAGCCT CCAGCCCTAG AAAAAGCGAC TCGATCAAAT TTAGACAAGC CGGTGTCGTC CCTTGTGGAA TCTGGTGAAG AGTTGACTGT AACAGATCCC CTGCTTGAGA GCATTGCAGT TGGGGTGTCA ATTACGTTTG AATAA
|
Protein sequence | MAGNDLDNLK PMEVDTRRSS ENARGIRGSL LTLLSRPSPF GNETGPLACG EFEPLPKLSS CYATTASDHE SPLTKAKILV VGAGGLGCEI LKNLAMSGVR DVDVIDLDSI DVTNLNRQFL FRQRDVGTSK AKTAAAFINE RCPWMSVTAH HGMIQDKEPS FYSSFDCIIS GLDNVEARRW LNATVVGLVE FDDDGDMDPA SIIPIIDGGT EGFSGQARFI LPRITSCFEC TIDAFPPQIA FPLCTIAETP RKPEHCIAYA SILQWPREFH DKKLDSDDPD DMKWVYEKAL ERAKQYNIDG VTYMLTMGVV KNIIPAVAST NAIIAAACVN EAIKYITFCS QNLNSYMMYM GSEGVHCHTF AYEQKDDCPV CTSTVQKMTI SKTTTLNELL QEFRAGPLRL KSPSLVSSGG KTLYMQKPPA LEKATRSNLD KPVSSLVESG EELTVTDPLL ESIAVGVSIT FE
|
| |
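The coordinate, length, and splicing fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene span includes the stop codon (the gene sequence shown ends in TAA). The function names `span_length` and `cds_length` are hypothetical helpers introduced here.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding bases needed for a protein: 3 per residue, plus 3 for the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record above.
gene_len = span_length(320260, 321744)   # Start bp, End bp
coding_len = cds_length(462)             # Protein Length in aa
non_coding = gene_len - coding_len       # bases in the span not needed for coding

print(gene_len, coding_len, non_coding)  # 1485 1389 96
```

Under these assumptions the span reproduces the listed Gene Length of 1485 bp, and the 96 bp not accounted for by the 462-residue protein plus stop codon is consistent with the record marking the gene as spliced.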