Gene PHATRDRAFT_33774 details


Gene Information

Locus tag: PHATRDRAFT_33774
Symbol: UBA3
ID: 7198040
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 320260
End bp: 321744
Gene Length: 1485 bp
Protein Length: 462 aa
Translation table:
GC content: 48%
IMG OID:
Product: ubiquitin-activating enzyme E1, protein 3
Protein accession: XP_002178207
Protein GI: 219114823
COG category:
COG ID:
TIGRFAM ID:
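
The length fields in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database: it assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the gene span includes the stop codon. The function names `span_length` and `coding_length` are hypothetical helpers introduced here.

```python
# Values copied from the gene record.
START_BP = 320260
END_BP = 321744
PROTEIN_LENGTH_AA = 462


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein (3 per codon),
    optionally counting one stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(START_BP, END_BP)      # genomic span of the gene
cds_len = coding_length(PROTEIN_LENGTH_AA)    # coding bases incl. stop codon
non_coding = gene_len - cds_len               # bases not accounted for by codons

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span matches the listed Gene Length of 1485 bp, and the bases left over after the coding sequence are what one would expect to be intronic for a gene flagged as spliced.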


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGGAA ACGACCTGGA CAATTTGAAA CCTATGGTAC GTTGCTCGTT CGACTTCAGC 
ATCGATATCG ATGCATCATT TAGCGAACGC GCTAAATACA TGTAAGGTTA TTTTCTAACC
GCAATCCCCC AGGAAGTTGA CACTAGAAGA TCGTCTGAAA ATGCTAGGGG TATACGTGGG
TCTCTACTGA CCCTTCTCAG CCGCCCGTCG CCTTTCGGAA ACGAGACTGG ACCATTGGCA
TGTGGCGAGT TTGAACCGCT TCCGAAACTA AGTTCCTGTT ACGCTACTAC AGCTTCTGAC
CATGAATCCC CTTTGACGAA AGCCAAAATT CTCGTCGTCG GTGCTGGAGG GTTGGGTTGT
GAAATTCTCA AGAATCTTGC GATGTCCGGC GTGAGAGATG TGGACGTAAT TGATCTTGAT
TCAATCGACG TGACCAATCT AAATCGTCAG TTCTTATTCC GTCAACGAGA TGTCGGCACA
TCAAAGGCGA AAACCGCAGC TGCTTTCATC AACGAGCGCT GCCCTTGGAT GAGCGTTACA
GCTCACCACG GTATGATTCA GGACAAGGAG CCGTCGTTCT ACTCCTCCTT TGATTGTATC
ATATCGGGAC TCGACAACGT TGAAGCTCGT CGTTGGCTCA ACGCGACTGT GGTCGGACTC
GTAGAGTTCG ATGACGACGG CGATATGGAT CCAGCCTCAA TCATTCCGAT TATTGATGGC
GGAACGGAAG GATTTTCAGG ACAAGCTCGT TTTATCCTGC CGCGTATCAC GAGCTGCTTT
GAGTGTACAA TCGATGCTTT TCCGCCACAA ATTGCTTTTC CGTTATGCAC GATTGCCGAG
ACTCCACGCA AACCGGAACA TTGCATTGCA TACGCGTCAA TTCTTCAATG GCCGAGAGAA
TTTCACGATA AGAAGCTCGA CAGTGATGAT CCGGATGACA TGAAGTGGGT CTACGAAAAG
GCGTTGGAGC GAGCAAAGCA GTACAACATT GACGGGGTTA CATATATGCT AACCATGGGC
GTAGTCAAGA ATATAATTCC TGCCGTTGCG AGTACCAACG CAATCATTGC GGCGGCGTGC
GTGAATGAGG CGATAAAATA CATCACCTTT TGCTCACAGA ATCTCAACTC ATACATGATG
TACATGGGGT CTGAGGGTGT TCATTGTCAC ACGTTTGCAT ACGAGCAAAA AGATGATTGC
CCGGTTTGTA CCTCGACTGT GCAAAAAATG ACAATTTCTA AGACAACTAC GCTGAACGAG
CTATTGCAAG AGTTTCGCGC GGGTCCCTTG CGTCTGAAAT CGCCAAGCCT CGTCAGTTCA
GGCGGAAAGA CGCTTTACAT GCAAAAGCCT CCAGCCCTAG AAAAAGCGAC TCGATCAAAT
TTAGACAAGC CGGTGTCGTC CCTTGTGGAA TCTGGTGAAG AGTTGACTGT AACAGATCCC
CTGCTTGAGA GCATTGCAGT TGGGGTGTCA ATTACGTTTG AATAA
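
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be computed. The following is a minimal sketch of that cleanup, not the database's own method: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line is used as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGGCTGGAA ACGACCTGGA CAATTTGAAA CCTATGGTAC GTTGCTCGTT CGACTTCAGC"

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence block through the same two functions gives values that can be compared against the Gene Length and GC content fields of the record.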
 
Protein sequence
MAGNDLDNLK PMEVDTRRSS ENARGIRGSL LTLLSRPSPF GNETGPLACG EFEPLPKLSS 
CYATTASDHE SPLTKAKILV VGAGGLGCEI LKNLAMSGVR DVDVIDLDSI DVTNLNRQFL
FRQRDVGTSK AKTAAAFINE RCPWMSVTAH HGMIQDKEPS FYSSFDCIIS GLDNVEARRW
LNATVVGLVE FDDDGDMDPA SIIPIIDGGT EGFSGQARFI LPRITSCFEC TIDAFPPQIA
FPLCTIAETP RKPEHCIAYA SILQWPREFH DKKLDSDDPD DMKWVYEKAL ERAKQYNIDG
VTYMLTMGVV KNIIPAVAST NAIIAAACVN EAIKYITFCS QNLNSYMMYM GSEGVHCHTF
AYEQKDDCPV CTSTVQKMTI SKTTTLNELL QEFRAGPLRL KSPSLVSSGG KTLYMQKPPA
LEKATRSNLD KPVSSLVESG EELTVTDPLL ESIAVGVSIT FE