Gene PHATRDRAFT_19300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_19300 
SymbolRPN2 
ID7199742 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp216710 
End bp219893 
Gene Length3184 bp 
Protein Length1008 aa 
Translation table 
GC content51% 
IMG OID 
Productregulatory proteasome non-atpase subunit 2 
Protein accessionXP_002178954 
Protein GI219116318 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTTT CGGTTTCCAA CCCCGGACAT CAGCCCGTCG GCGCAGTGGA ATCTGCTAGT 
GGCTACATGG CTCTTTTGCA AGAAGACGAC GTGACCCTAC GTTCGCACGC CTTGACGAAA
CTCTTGGGCT GCGTCGATCG CCTCTGGCAT CAAGTGGCGG AGTCCTTGCC CGATCTCGAA
GCGATGGCAG AAGACACCGA CAATCCTTTG CAAGTCCAAC AAACGGCCGC CGCAGTTGCC
TCGCGGGTTT TCTTCCATCT CGAAGAACCA ACGCAGGCGC TGCGTTTGGC GCTGGAAGCC
GGAACTCAGC ATTTTGACCC CATGGACGAT CAATCTCCAT ACGTGCAGCG GTTGGTGTCG
GCCGCCTTGG ATGCGTACAT TCAAAAGAGA CAAGCTCAAG ACGACGAGGA AGTAGATCAA
GCTAAGGAAA GTTTGGTGGA CTTGGGTCTC GATATGAATC AGTTGCAAGC CATGGTACAC
CGTCTTTTGG AAGCCTCGTG CGCGGCGGGC AAGTACGATC ATGCTCTGGG AATTGCTTTG
GAAGCACGCG AAACGTCGCA GGTACAGGAA ATCTTACGAG CCGGCGGCAA CTCCACATCC
TTATTGCAGT ACAGTATTCA GGCAGCGGCA AATACAGTGA CATCGAAGTC GTTTCGGGTT
GAAGTTCTTC AAGTAGTTGT CGGCGCCTTG ACTGTTCAAT TCGAGGAACA GAACCAAACC
AAGGTGTCGT ACGATTTGTT ACTCGTCCAC CAGCATCTCA ACCAAGCACT TCCCGTTAGT
AGGATCATGT CGAAACTATT GCAAGGTACA GAAGACGAGT TCTTGCTTGC ATTGCAGCTA
TGTTTTGATC TCATGGACAG TGGCGATCAA GCCTTTGCAC AAGCGGTGGC AGAGGGTATT
GATCAGGACG GTATCGGTGA AGCCAACCAG GGTCGATCGG ACAAGGTGCA GCGCGTCCTT
GTGGGTGGTT TTTCGGCCGA ACTCTCTTTG TCGTTTTTGC ACAAAGAAAG CAAGGCGGAT
CGGATGATTA TGGAACGACT CAAAACGGCT TTGGAAGAAC GTTCGAGTGG GAGTCGGAAT
TCGCTGTTGC ACACGGCCGC GGTCGTGACA CATTCCTATT TGTACGCGGG GACAACAAAC
GACAGTTTTC TGCGAGATTA CTTGGACTGG ATGAAAAAGG CATCTAATTG GTAAGGCTCC
AAATAAACGG GGTGCTCACG TTTTTTTGGC GTGAATACTA ACAGAAACTT CGTCTCTCCC
CCGTAGGGCC AAATTCTCTG CTACCTCTTC TTTGGGTGTT GTCCACGCAT CCCATGGTGC
CGAAGCTATG CGGCTGCTGG AGCCTTATTT GCCAATGGAA CCTTCCGAAA ACACAAGCGT
TCCTGGTGAA GGGGGTTTCG CAGAAGGTGG CTCGCTTTAC GCGCTGGGGC TGATTCACGG
TTCCCACGCC GGATCATCCG CCTCCAAACG TCAAGAAACG ACCGAATTCT TGCGCACGCA
TCTACGCACG TCGCACGCTA ACGAAGCCCA AAGTCACGGT GCAGCGCTCG GGGTTGGGCT
GACGGCTATG GGAACGGCCG ATCTTGCAGT AGTAAACGAA CTCAAGGAAC TTTTAGTGAC
GGATTCGGCA GTTGCCGGTG AAGCCGCTGG AATCGCCATC GGTATGGTGC TAGTAGGTAC
CGGGGCAGGT AACACAAACA ACTCTCTGCA GTCTCACCAA GAAGAATTGG GCGAGATAGT
TGCGGAACTT AAGAATTACG CCCGTGAGAC AACACACGAG AAAATTATCC GCGGTGTTGC
AATGGGTCTG GCCTTAATGA GCTTTGGTCA AGAGGAAAAC GCCGACGCAT TGATCGAAGA
AATGCGATCG GACCGCGATC CAGTGATGCG TTACGGTGCC CAGTATGCTG TGGCCCTTGC
TTACTGTGGT ACTGGGTCAA ACAAGGCTAT CCGTATCCTC TTGCATGCCG CTGTGAGTGA
TGTAAGCGAC GATGTGCGCA GTGCGGCAGT TGTTGGTCTA GCTTTTGTAT TGTTCAAGAC
TCCCGAACGC GTCCCGCAGC TTGTTTCGCT CTTGATTGAG TCGTTTAATC CTCATGTTCG
GTACGCGTCC TGCATGGCTG TAGGAATTGC TATGGCCGGA ACGGGCGACG CCGATAGCGT
GGCTATGCTA GAACCGATGC TGGATGACAT GACGGATTAC GTTCGACAAG GAGCTCTGAT
GGGAACAGCA ATGATCTACA TGCAGCAAAG CGACACCTGC AACGGTCGAA AGATTCGTTC
CTTCCGCGAA AAGATTTACG CGATTCCATC GGAGAAACAC CATAGTATTT TAACAAAAAT
GGGTGCAATT CTATCCCAAG GTATCATTGA TGCGGGCGGT CGCAATTGTT CGCTCATACT
AGGATCGCGA AACGGATTTA CGAAGATGTC AAGCGCTGTT GGTCTAGCAT TGTGGCTACA
GCATTGGCAT TGGTATCCGA TGCTACACAT GTTTAGTCTC GCTTTGACAC CCACAGTAAC
AATTGGTCTT AACAAGGACT TCAAGTTTCC GAAGAAATTT GAGATCCAAT GTAATTCAAA
GCCCAGTGCA TTCGCCTATC CCCGGAAACT TGAGGACAAG AAAGAAGAAA AGAAAAAGCT
CGTCGAGACG GTCACCCTCT CTACTACTGC GAAAGAGAAA GCTCGATTGG CACGGAAGCG
AGCTAAGGCC GGCGAAGTAG TCGTAGGAGA AATGGATGTT GACAAAGGCG ACGAGTCCAA
ATCGGATGAA GAAGGCGAAA AGAAGGAGAA CGCTGATTCT ATGGAGGTCG ACAATGAAGA
CGAACCGGAA AAGAAACCGA AGAAGAAACG TGTGCCAGAG CCTACTTCAT TTCGCGTGAC
CAATCCGTCG CGAATCACGA AAGCGCAATC GCAGGCCTGC TCTTTTGATT TGGATCAACG
TTATCGCCCA ATCCGCTCGG AAGAAAAACC GATGGGCGTC GTTATGCTGA CTGATAGTAC
GCCGGACGAA GACGAAGAAT TGGGAGCCGT CAAGTCACCT TCCTTAGAGC CTGATGGTGA
GCTTGCCCCT CCCGAACCCT TTGTTTGGAC GCCGCCCGCT CAACCCGAAA AAACTGAAGA
CGACAAAAAA GAAGAATAAA GCTTTTACTC TTAAAAACAT TTCAAAGTAT GTGTTGTTAG
TTTC
 
Protein sequence
MSLSVSNPGH QPVGAVESAS GYMALLQEDD VTLRSHALTK LLGCVDRLWH QVAESLPDLE 
AMAEDTDNPL QVQQTAAAVA SRVFFHLEEP TQALRLALEA GTQHFDPMDD QSPYVQRLVS
AALDAYIQKR QAQDDEEVDQ AKESLVDLGL DMNQLQAMVH RLLEASCAAG KYDHALGIAL
EARETSQVQE ILRAGGNSTS LLQYSIQAAA NTVTSKSFRV EVLQVVVGAL TVQFEEQNQT
KVSYDLLLVH QHLNQALPVS RIMSKLLQGT EDEFLLALQL CFDLMDSGDQ AFAQAVAEGI
DQDGIGEANQ GRSDKVQRVL VGGFSAELSL SFLHKESKAD RMIMERLKTA LEERSSGSRN
SLLHTAAVVT HSYLYAGTTN DSFLRDYLDW MKKASNWAKF SATSSLGVVH ASHGAEAMRL
LEPYLPMEPS ENTSVPGEGG FAEGGSLYAL GLIHGSHAGS SASKRQETTE FLRTHLRTSH
ANEAQSHGAA LGVGLTAMGT ADLAVVNELK ELLVTDSAVA GEAAGIAIGM VLVGTGAELG
EIVAELKNYA RETTHEKIIR GVAMGLALMS FGQEENADAL IEEMRSDRDP VMRYGAQYAV
ALAYCGTGSN KAIRILLHAA VSDVSDDVRS AAVVGLAFVL FKTPERVPQL VSLLIESFNP
HVRYASCMAV GIAMAGTGDA DSVAMLEPML DDMTDYVRQG ALMGTAMIYM QQSDTCNGRK
IRSFREKIYA IPSEKHHSIL TKMGAILSQG IIDAGGRNCS LILGSRNGFT KMSSAVGLAL
WLQHWHWYPM LHMFSLALTP TVTIGLNKDF KFPKKFEIQC NSKPSAFAYP RKLEDKKEEK
KKLVETVTLS TTAKEKARLA RKRAKAGEVV VGEMDVDKGD ESKSDEEGEK KENADSMEVD
NEDEPEKKPK KKRVPEPTSF RVTNPSRITK AQSQACSFDL DQRYRPIRSE EKPMGVVMLT
DSTPDEDEEL GAVKSPSLEP DGELAPPEPF VWTPPAQPEK TEDDKKEE