Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54460 |
Symbol | UBA1 |
ID | 7200448 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 858830 |
End bp | 862237 |
Gene Length | 3408 bp |
Protein Length | 1108 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | ubiquitin-activating enzyme E1, protein 1 |
Protein accession | XP_002179732 |
Protein GI | 219117892 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTACT GCTGGCGATT TTGTCTGGGC TTGTTATTAG AGCAGGCGCT TTTAAGAGAC TCTTTCGGTG TTCGCTTTGT CCCCATTTTC CATCCTTCTC CTCGGAGGGT CAAGGTTCAT ATCAGAAATG CACTATCATC TCGAGGCGGT GGGGGTGGGG ATGCCGTCGG TGAAACTGAG GATGATGAAG AGCGATACAG CCGTCAAGTT TTCGCTCTCG GCGCTGAAGC ACATAAGCGA ATCCGATCGT CTACCGTATA CTTGGATGGG CCAGGACGTT CCGGACTCTA TACGAATGTG CCAAGAACCT AGCCCTCTCT GGGGTGAGGA AGCTAGTTTT AGTCAAATCT AGCGAAAAAG TCGATGCAGC TTATTTCAAA GGAGAATTAG ATGATCTAGG CCGGGCATAT CACAGAGCTG CACGGTCGGA GACTGGGAAG AGTGATGATG ACTGCGATGT ATCCGATGAA GAAGTATTGA TGGAGTACTT AAAGCGGCTC AACCCATCAG TTCAGGTATC AGTTGTAAAA TATTCAGACT TTCGGCCATT AGATGACAGT TTGCGAGGAG TTCTTCTGTG TGTCGACCGT TGTCACGAGA AGCTACTAGT CATGAATGGC TTGGCAAGGC GACACAACCT TGCGTTTGTG GGGACTGAGA CAGCTGGCGT GTACGGACGC GTCTTCTGCG ATTTTGGGAC CTCTTTCGAA GTAAATGACA CTGATGGAGA GACTCCACTG GTGATTCCGC TAGATCGAGT TGAGCGAGGG ATTAGTGACG AAATACTTTT TGTAACATGC CTTGAGGGGC AACAGCACGA TGTTTCCAAG GGTGAAGAAA TCAGGTTCAT CGATCCTAAC GGCGATTCAT CAGAGCAGAA ATGCACGGTC ATCGAAGTGC ACACTCCTTT GCGACTATCG ATTGAGGTTG ACAAAAAAGG CGGATCTTGT CAAGAGTGGA TCGAAAGTGT AAATAAGAAA TATGTGGCAT TCTCCCGGAT CAAGGCTTCT AAGAAACTTT CCTTTGACGA TCTCGCAATA GCGAGTAAAA AAGCGTCCAG CGATGCTTCC ATTTTCACTC CTAGCGATTT AGGAAAGAGT TTTGATGACA ACCGAAGAGC GGCACTTTTC GCTTGTTTCC GAGCTGCATC AAGTTTTGTT GGGGATCATC TAAGATGGGC TGACGACAAC GACTTGGATG ATTTCTGTGA GCTAGTCCGG ACGTTTATGT CTAACTGCGA GTCTGAGCAC TGCTTTCTTT CTGAATCGCA GCATTTTAAT GTTGAACAGT TTCTTGAGGT TGGAAGAGCG AAGTTCAGCC CTATCCAGGC TTTCTTTGGT GCGATAGCAT CTCAAGAGGC ATTAAAAGCG TTGACCGGTC TTTACCACCC TATCCAACAA TTCCTTCTGT ACGATTGCGA CGAAATTTTG AACTCTCCTT CAGATCGCAC ATGTTCTGTA AACGAAAAGG AGGGAAGTGA CCGAAATACA TGTGGACTTC GCCATATACT GGGTGATTCT ATCGTTGAAG ATCTACAATC CATGAGAGTG TTTGTAGTGG GTGCTGGAGC AATAGGCTGC GAGATCCTTA AGAATCTGGC GGCAATGGGT ATAGGATCCA AAAGCAAAGG CCGAGTAATT ATCACGGACA TGGATACTAT TGAAAAATCC AATTTAAGCC GACAGCTACT TTTTCGCGAC AGCGACGTCG GTAAATTCAA GAGTAGCGCT GCCACTCAAG CTATCCTTCG ATTCAACAAC AAAATGAAAA TTGATTCTCA TTCCAGCAAA GTTGGAGACT CCGAGCACAA TCCCTTTGAT GATCTGTTTT GGCGCAAAGG TGTTGACATT GTGTTGAATG CACTTGACAA CATGGAAGCT CGCTTTTTTA CAGACAGACA ATGTGTTGCC AATGGCAAAC CTTTGATTGA CTCCGGAACG CTTGGTCCGA AGGGAAATGT CCAAGTCGTT ATTCCCCATA AAAGCGAATC GTATTCGTCG AGTGCTGACC CGCCCGATCC TGCGATAGCG GTGTGTACGC TTAAGAACTT CCCTTATGCC ATTTCCCACA CTATTCAATG GGGACGTGAT CTATTTGAGG ACGTGTTTTC GAGGAGACCA TCTCAAGTCA ATGACGCAAG GGACTCTTTG TCCTCAACCT GCGTCGAAGC CTTCGTTTCA AGATTGATTC AGGAACGAGG AGAGAATGGA TTTCAACAAT TTGCTGCGGA ACTGAAGGAA GATGTGAGTC CCGATCTCGA GTCGTCAGAT ATACGGGCGC ACTCGTTAGA GTGGGCTGCG TCTACTGCAG TCAAACTTTT TCGGGATTCT ATAGAGACGC TTCTTCTGAA ACATCCCCCG GGAAGTTTGG ACGATGATGG CGAACCCTTT TGGAGTGGAA CACGGCGACA GCCACGTGTT TTATCGTTCT CTGGTTCCGT ACCTCTTGAT GCGATGCAGT CAAGTGTTAA CGAGAATCTC ATCGACTTTG TGAGGTATGC CGCTCGGTTG CGGGCAGAGA TGTACGCTAG CAAGCCTATT CGTGACCCTT TTGAATTCTC ACGAAATGAT GCTGAGGCAA GTTTAAACAG TGCAGAGCAG GCTCAACCAT CTGACAAAGA AGTGATGGAC ACAGACACAG TCAATGTTCT TATTGATTCT CTCAGGCGAC TATCATCTTT TTCAAAACCC CTAAATACCG CCGAGTTTGA GAAAGATGAC GATTCCAATG GACACATTGC GTTTGTTACT GCTGCTAGCA ATCTTCGAGC CATGAGCTAT GGAATTCCGC CTGTAAATAG ATTGCAAACA AGGCGAATAG CGGGGAACAT TGTTCCTGCT GTAATCTCGA CAACTGCAGC CGTCTCAGCT CTTTCATGCA TTGAACTCGT CAAGCTTGCG CAGGGAGCGC AATTGAAATT ACACAGGAAT GCCTTCATGA ATCTGGCACT ACCGTTTTTC GCTTTCACTT CCCCACTTCC TGCGGAGGTA ATGCCGGGCC TGCAAGGTCG TCAGTACACA ATATGGGATC GTTTGAAGGT GCGGGAAAGC AAGAAGGCCC TGGCAAAGGG TGGAATATCC CTAAGGAAGC TTATTCGTCG AATAAAACAA CTAGCTTCTA CGAACCCCAA AAAAGTGTCA GTTTTGTCCA TATCTTTTGG TCCCTACCTC CTGTATGCAA GCTTCCTCCA CGATGATGAC AAAAATCATC TCAAGTCCTC CTTGTGGAAC ATTCTTGAAG AATTGACCGA AGTCGACGAC GACTTTGTAT CTACTCGAAG CAACGACAAC AGGTCAACTG AATATTCGCC GACACAGAAA TTTGTGGATT TATCGGTCAT CGTTGAAGAT CCCGACAATG GCAGTGAATG CGAGTTGCCA TTGGTGAGGG TGTTTCGGAG ATTTCTAT
|
Protein sequence | MNYCWRFCLG LLLEQALLRD SFGVRFVPIF HPSPRRVKVH IRNALSSRGG GGGDAVGETE DDEERYSRQV FALGAEAHKR IRSSTVYLDG PGLDAAYFKG ELDDLGRAYH RAARSETGKS DDDCDVSDEE VLMEYLKRLN PSVQVSVVKY SDFRPLDDSL RGVLLCVDRC HEKLLVMNGL ARRHNLAFVG TETAGVYGRV FCDFGTSFEV NDTDGETPLV IPLDRVERGI SDEILFVTCL EGQQHDVSKG EEIRFIDPNG DSSEQKCTVI EVHTPLRLSI EVDKKGGSCQ EWIESVNKKY VAFSRIKASK KLSFDDLAIA SKKASSDASI FTPSDLGKSF DDNRRAALFA CFRAASSFVG DHLRWADDND LDDFCELVRT FMSNCESEHC FLSESQHFNV EQFLEVGRAK FSPIQAFFGA IASQEALKAL TGLYHPIQQF LLYDCDEILN SPSDRTCSVN EKEGSDRNTC GLRHILGDSI VEDLQSMRVF VVGAGAIGCE ILKNLAAMGI GSKSKGRVII TDMDTIEKSN LSRQLLFRDS DVGKFKSSAA TQAILRFNNK MKIDSHSSKV GDSEHNPFDD LFWRKGVDIV LNALDNMEAR FFTDRQCVAN GKPLIDSGTL GPKGNVQVVI PHKSESYSSS ADPPDPAIAV CTLKNFPYAI SHTIQWGRDL FEDVFSRRPS QVNDARDSLS STCVEAFVSR LIQERGENGF QQFAAELKED VSPDLESSDI RAHSLEWAAS TAVKLFRDSI ETLLLKHPPG SLDDDGEPFW SGTRRQPRVL SFSGSVPLDA MQSSVNENLI DFVRYAARLR AEMYASKPIR DPFEFSRNDA EASLNSAEQA QPSDKEVMDT DTVNVLIDSL RRLSSFSKPL NTAEFEKDDD SNGHIAFVTA ASNLRAMSYG IPPVNRLQTR RIAGNIVPAV ISTTAAVSAL SCIELVKLAQ GAQLKLHRNA FMNLALPFFA FTSPLPAEVM PGLQGRQYTI WDRLKVRESK KALAKGGISL RKLIRRIKQL ASTNPKKVSV LSISFGPYLL YASFLHDDDK NHLKSSLWNI LEELTEVDDD FVSTRSNDNR STEYSPTQKF VDLSVIVEDP DNGSECELPL VRVFRRFL
|
| |