Gene PHATRDRAFT_54460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54460 
SymbolUBA1 
ID7200448 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp858830 
End bp862237 
Gene Length3408 bp 
Protein Length1108 aa 
Translation table 
GC content47% 
IMG OID 
Productubiquitin-activating enzyme E1, protein 1 
Protein accessionXP_002179732 
Protein GI219117892 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTACT GCTGGCGATT TTGTCTGGGC TTGTTATTAG AGCAGGCGCT TTTAAGAGAC 
TCTTTCGGTG TTCGCTTTGT CCCCATTTTC CATCCTTCTC CTCGGAGGGT CAAGGTTCAT
ATCAGAAATG CACTATCATC TCGAGGCGGT GGGGGTGGGG ATGCCGTCGG TGAAACTGAG
GATGATGAAG AGCGATACAG CCGTCAAGTT TTCGCTCTCG GCGCTGAAGC ACATAAGCGA
ATCCGATCGT CTACCGTATA CTTGGATGGG CCAGGACGTT CCGGACTCTA TACGAATGTG
CCAAGAACCT AGCCCTCTCT GGGGTGAGGA AGCTAGTTTT AGTCAAATCT AGCGAAAAAG
TCGATGCAGC TTATTTCAAA GGAGAATTAG ATGATCTAGG CCGGGCATAT CACAGAGCTG
CACGGTCGGA GACTGGGAAG AGTGATGATG ACTGCGATGT ATCCGATGAA GAAGTATTGA
TGGAGTACTT AAAGCGGCTC AACCCATCAG TTCAGGTATC AGTTGTAAAA TATTCAGACT
TTCGGCCATT AGATGACAGT TTGCGAGGAG TTCTTCTGTG TGTCGACCGT TGTCACGAGA
AGCTACTAGT CATGAATGGC TTGGCAAGGC GACACAACCT TGCGTTTGTG GGGACTGAGA
CAGCTGGCGT GTACGGACGC GTCTTCTGCG ATTTTGGGAC CTCTTTCGAA GTAAATGACA
CTGATGGAGA GACTCCACTG GTGATTCCGC TAGATCGAGT TGAGCGAGGG ATTAGTGACG
AAATACTTTT TGTAACATGC CTTGAGGGGC AACAGCACGA TGTTTCCAAG GGTGAAGAAA
TCAGGTTCAT CGATCCTAAC GGCGATTCAT CAGAGCAGAA ATGCACGGTC ATCGAAGTGC
ACACTCCTTT GCGACTATCG ATTGAGGTTG ACAAAAAAGG CGGATCTTGT CAAGAGTGGA
TCGAAAGTGT AAATAAGAAA TATGTGGCAT TCTCCCGGAT CAAGGCTTCT AAGAAACTTT
CCTTTGACGA TCTCGCAATA GCGAGTAAAA AAGCGTCCAG CGATGCTTCC ATTTTCACTC
CTAGCGATTT AGGAAAGAGT TTTGATGACA ACCGAAGAGC GGCACTTTTC GCTTGTTTCC
GAGCTGCATC AAGTTTTGTT GGGGATCATC TAAGATGGGC TGACGACAAC GACTTGGATG
ATTTCTGTGA GCTAGTCCGG ACGTTTATGT CTAACTGCGA GTCTGAGCAC TGCTTTCTTT
CTGAATCGCA GCATTTTAAT GTTGAACAGT TTCTTGAGGT TGGAAGAGCG AAGTTCAGCC
CTATCCAGGC TTTCTTTGGT GCGATAGCAT CTCAAGAGGC ATTAAAAGCG TTGACCGGTC
TTTACCACCC TATCCAACAA TTCCTTCTGT ACGATTGCGA CGAAATTTTG AACTCTCCTT
CAGATCGCAC ATGTTCTGTA AACGAAAAGG AGGGAAGTGA CCGAAATACA TGTGGACTTC
GCCATATACT GGGTGATTCT ATCGTTGAAG ATCTACAATC CATGAGAGTG TTTGTAGTGG
GTGCTGGAGC AATAGGCTGC GAGATCCTTA AGAATCTGGC GGCAATGGGT ATAGGATCCA
AAAGCAAAGG CCGAGTAATT ATCACGGACA TGGATACTAT TGAAAAATCC AATTTAAGCC
GACAGCTACT TTTTCGCGAC AGCGACGTCG GTAAATTCAA GAGTAGCGCT GCCACTCAAG
CTATCCTTCG ATTCAACAAC AAAATGAAAA TTGATTCTCA TTCCAGCAAA GTTGGAGACT
CCGAGCACAA TCCCTTTGAT GATCTGTTTT GGCGCAAAGG TGTTGACATT GTGTTGAATG
CACTTGACAA CATGGAAGCT CGCTTTTTTA CAGACAGACA ATGTGTTGCC AATGGCAAAC
CTTTGATTGA CTCCGGAACG CTTGGTCCGA AGGGAAATGT CCAAGTCGTT ATTCCCCATA
AAAGCGAATC GTATTCGTCG AGTGCTGACC CGCCCGATCC TGCGATAGCG GTGTGTACGC
TTAAGAACTT CCCTTATGCC ATTTCCCACA CTATTCAATG GGGACGTGAT CTATTTGAGG
ACGTGTTTTC GAGGAGACCA TCTCAAGTCA ATGACGCAAG GGACTCTTTG TCCTCAACCT
GCGTCGAAGC CTTCGTTTCA AGATTGATTC AGGAACGAGG AGAGAATGGA TTTCAACAAT
TTGCTGCGGA ACTGAAGGAA GATGTGAGTC CCGATCTCGA GTCGTCAGAT ATACGGGCGC
ACTCGTTAGA GTGGGCTGCG TCTACTGCAG TCAAACTTTT TCGGGATTCT ATAGAGACGC
TTCTTCTGAA ACATCCCCCG GGAAGTTTGG ACGATGATGG CGAACCCTTT TGGAGTGGAA
CACGGCGACA GCCACGTGTT TTATCGTTCT CTGGTTCCGT ACCTCTTGAT GCGATGCAGT
CAAGTGTTAA CGAGAATCTC ATCGACTTTG TGAGGTATGC CGCTCGGTTG CGGGCAGAGA
TGTACGCTAG CAAGCCTATT CGTGACCCTT TTGAATTCTC ACGAAATGAT GCTGAGGCAA
GTTTAAACAG TGCAGAGCAG GCTCAACCAT CTGACAAAGA AGTGATGGAC ACAGACACAG
TCAATGTTCT TATTGATTCT CTCAGGCGAC TATCATCTTT TTCAAAACCC CTAAATACCG
CCGAGTTTGA GAAAGATGAC GATTCCAATG GACACATTGC GTTTGTTACT GCTGCTAGCA
ATCTTCGAGC CATGAGCTAT GGAATTCCGC CTGTAAATAG ATTGCAAACA AGGCGAATAG
CGGGGAACAT TGTTCCTGCT GTAATCTCGA CAACTGCAGC CGTCTCAGCT CTTTCATGCA
TTGAACTCGT CAAGCTTGCG CAGGGAGCGC AATTGAAATT ACACAGGAAT GCCTTCATGA
ATCTGGCACT ACCGTTTTTC GCTTTCACTT CCCCACTTCC TGCGGAGGTA ATGCCGGGCC
TGCAAGGTCG TCAGTACACA ATATGGGATC GTTTGAAGGT GCGGGAAAGC AAGAAGGCCC
TGGCAAAGGG TGGAATATCC CTAAGGAAGC TTATTCGTCG AATAAAACAA CTAGCTTCTA
CGAACCCCAA AAAAGTGTCA GTTTTGTCCA TATCTTTTGG TCCCTACCTC CTGTATGCAA
GCTTCCTCCA CGATGATGAC AAAAATCATC TCAAGTCCTC CTTGTGGAAC ATTCTTGAAG
AATTGACCGA AGTCGACGAC GACTTTGTAT CTACTCGAAG CAACGACAAC AGGTCAACTG
AATATTCGCC GACACAGAAA TTTGTGGATT TATCGGTCAT CGTTGAAGAT CCCGACAATG
GCAGTGAATG CGAGTTGCCA TTGGTGAGGG TGTTTCGGAG ATTTCTAT
 
Protein sequence
MNYCWRFCLG LLLEQALLRD SFGVRFVPIF HPSPRRVKVH IRNALSSRGG GGGDAVGETE 
DDEERYSRQV FALGAEAHKR IRSSTVYLDG PGLDAAYFKG ELDDLGRAYH RAARSETGKS
DDDCDVSDEE VLMEYLKRLN PSVQVSVVKY SDFRPLDDSL RGVLLCVDRC HEKLLVMNGL
ARRHNLAFVG TETAGVYGRV FCDFGTSFEV NDTDGETPLV IPLDRVERGI SDEILFVTCL
EGQQHDVSKG EEIRFIDPNG DSSEQKCTVI EVHTPLRLSI EVDKKGGSCQ EWIESVNKKY
VAFSRIKASK KLSFDDLAIA SKKASSDASI FTPSDLGKSF DDNRRAALFA CFRAASSFVG
DHLRWADDND LDDFCELVRT FMSNCESEHC FLSESQHFNV EQFLEVGRAK FSPIQAFFGA
IASQEALKAL TGLYHPIQQF LLYDCDEILN SPSDRTCSVN EKEGSDRNTC GLRHILGDSI
VEDLQSMRVF VVGAGAIGCE ILKNLAAMGI GSKSKGRVII TDMDTIEKSN LSRQLLFRDS
DVGKFKSSAA TQAILRFNNK MKIDSHSSKV GDSEHNPFDD LFWRKGVDIV LNALDNMEAR
FFTDRQCVAN GKPLIDSGTL GPKGNVQVVI PHKSESYSSS ADPPDPAIAV CTLKNFPYAI
SHTIQWGRDL FEDVFSRRPS QVNDARDSLS STCVEAFVSR LIQERGENGF QQFAAELKED
VSPDLESSDI RAHSLEWAAS TAVKLFRDSI ETLLLKHPPG SLDDDGEPFW SGTRRQPRVL
SFSGSVPLDA MQSSVNENLI DFVRYAARLR AEMYASKPIR DPFEFSRNDA EASLNSAEQA
QPSDKEVMDT DTVNVLIDSL RRLSSFSKPL NTAEFEKDDD SNGHIAFVTA ASNLRAMSYG
IPPVNRLQTR RIAGNIVPAV ISTTAAVSAL SCIELVKLAQ GAQLKLHRNA FMNLALPFFA
FTSPLPAEVM PGLQGRQYTI WDRLKVRESK KALAKGGISL RKLIRRIKQL ASTNPKKVSV
LSISFGPYLL YASFLHDDDK NHLKSSLWNI LEELTEVDDD FVSTRSNDNR STEYSPTQKF
VDLSVIVEDP DNGSECELPL VRVFRRFL