Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_19300 |
Symbol | RPN2 |
ID | 7199742 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 216710 |
End bp | 219893 |
Gene Length | 3184 bp |
Protein Length | 1008 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | regulatory proteasome non-atpase subunit 2 |
Protein accession | XP_002178954 |
Protein GI | 219116318 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCTTT CGGTTTCCAA CCCCGGACAT CAGCCCGTCG GCGCAGTGGA ATCTGCTAGT GGCTACATGG CTCTTTTGCA AGAAGACGAC GTGACCCTAC GTTCGCACGC CTTGACGAAA CTCTTGGGCT GCGTCGATCG CCTCTGGCAT CAAGTGGCGG AGTCCTTGCC CGATCTCGAA GCGATGGCAG AAGACACCGA CAATCCTTTG CAAGTCCAAC AAACGGCCGC CGCAGTTGCC TCGCGGGTTT TCTTCCATCT CGAAGAACCA ACGCAGGCGC TGCGTTTGGC GCTGGAAGCC GGAACTCAGC ATTTTGACCC CATGGACGAT CAATCTCCAT ACGTGCAGCG GTTGGTGTCG GCCGCCTTGG ATGCGTACAT TCAAAAGAGA CAAGCTCAAG ACGACGAGGA AGTAGATCAA GCTAAGGAAA GTTTGGTGGA CTTGGGTCTC GATATGAATC AGTTGCAAGC CATGGTACAC CGTCTTTTGG AAGCCTCGTG CGCGGCGGGC AAGTACGATC ATGCTCTGGG AATTGCTTTG GAAGCACGCG AAACGTCGCA GGTACAGGAA ATCTTACGAG CCGGCGGCAA CTCCACATCC TTATTGCAGT ACAGTATTCA GGCAGCGGCA AATACAGTGA CATCGAAGTC GTTTCGGGTT GAAGTTCTTC AAGTAGTTGT CGGCGCCTTG ACTGTTCAAT TCGAGGAACA GAACCAAACC AAGGTGTCGT ACGATTTGTT ACTCGTCCAC CAGCATCTCA ACCAAGCACT TCCCGTTAGT AGGATCATGT CGAAACTATT GCAAGGTACA GAAGACGAGT TCTTGCTTGC ATTGCAGCTA TGTTTTGATC TCATGGACAG TGGCGATCAA GCCTTTGCAC AAGCGGTGGC AGAGGGTATT GATCAGGACG GTATCGGTGA AGCCAACCAG GGTCGATCGG ACAAGGTGCA GCGCGTCCTT GTGGGTGGTT TTTCGGCCGA ACTCTCTTTG TCGTTTTTGC ACAAAGAAAG CAAGGCGGAT CGGATGATTA TGGAACGACT CAAAACGGCT TTGGAAGAAC GTTCGAGTGG GAGTCGGAAT TCGCTGTTGC ACACGGCCGC GGTCGTGACA CATTCCTATT TGTACGCGGG GACAACAAAC GACAGTTTTC TGCGAGATTA CTTGGACTGG ATGAAAAAGG CATCTAATTG GTAAGGCTCC AAATAAACGG GGTGCTCACG TTTTTTTGGC GTGAATACTA ACAGAAACTT CGTCTCTCCC CCGTAGGGCC AAATTCTCTG CTACCTCTTC TTTGGGTGTT GTCCACGCAT CCCATGGTGC CGAAGCTATG CGGCTGCTGG AGCCTTATTT GCCAATGGAA CCTTCCGAAA ACACAAGCGT TCCTGGTGAA GGGGGTTTCG CAGAAGGTGG CTCGCTTTAC GCGCTGGGGC TGATTCACGG TTCCCACGCC GGATCATCCG CCTCCAAACG TCAAGAAACG ACCGAATTCT TGCGCACGCA TCTACGCACG TCGCACGCTA ACGAAGCCCA AAGTCACGGT GCAGCGCTCG GGGTTGGGCT GACGGCTATG GGAACGGCCG ATCTTGCAGT AGTAAACGAA CTCAAGGAAC TTTTAGTGAC GGATTCGGCA GTTGCCGGTG AAGCCGCTGG AATCGCCATC GGTATGGTGC TAGTAGGTAC CGGGGCAGGT AACACAAACA ACTCTCTGCA GTCTCACCAA GAAGAATTGG GCGAGATAGT TGCGGAACTT AAGAATTACG CCCGTGAGAC AACACACGAG AAAATTATCC GCGGTGTTGC AATGGGTCTG GCCTTAATGA GCTTTGGTCA AGAGGAAAAC GCCGACGCAT TGATCGAAGA AATGCGATCG GACCGCGATC CAGTGATGCG TTACGGTGCC CAGTATGCTG TGGCCCTTGC TTACTGTGGT ACTGGGTCAA ACAAGGCTAT CCGTATCCTC TTGCATGCCG CTGTGAGTGA TGTAAGCGAC GATGTGCGCA GTGCGGCAGT TGTTGGTCTA GCTTTTGTAT TGTTCAAGAC TCCCGAACGC GTCCCGCAGC TTGTTTCGCT CTTGATTGAG TCGTTTAATC CTCATGTTCG GTACGCGTCC TGCATGGCTG TAGGAATTGC TATGGCCGGA ACGGGCGACG CCGATAGCGT GGCTATGCTA GAACCGATGC TGGATGACAT GACGGATTAC GTTCGACAAG GAGCTCTGAT GGGAACAGCA ATGATCTACA TGCAGCAAAG CGACACCTGC AACGGTCGAA AGATTCGTTC CTTCCGCGAA AAGATTTACG CGATTCCATC GGAGAAACAC CATAGTATTT TAACAAAAAT GGGTGCAATT CTATCCCAAG GTATCATTGA TGCGGGCGGT CGCAATTGTT CGCTCATACT AGGATCGCGA AACGGATTTA CGAAGATGTC AAGCGCTGTT GGTCTAGCAT TGTGGCTACA GCATTGGCAT TGGTATCCGA TGCTACACAT GTTTAGTCTC GCTTTGACAC CCACAGTAAC AATTGGTCTT AACAAGGACT TCAAGTTTCC GAAGAAATTT GAGATCCAAT GTAATTCAAA GCCCAGTGCA TTCGCCTATC CCCGGAAACT TGAGGACAAG AAAGAAGAAA AGAAAAAGCT CGTCGAGACG GTCACCCTCT CTACTACTGC GAAAGAGAAA GCTCGATTGG CACGGAAGCG AGCTAAGGCC GGCGAAGTAG TCGTAGGAGA AATGGATGTT GACAAAGGCG ACGAGTCCAA ATCGGATGAA GAAGGCGAAA AGAAGGAGAA CGCTGATTCT ATGGAGGTCG ACAATGAAGA CGAACCGGAA AAGAAACCGA AGAAGAAACG TGTGCCAGAG CCTACTTCAT TTCGCGTGAC CAATCCGTCG CGAATCACGA AAGCGCAATC GCAGGCCTGC TCTTTTGATT TGGATCAACG TTATCGCCCA ATCCGCTCGG AAGAAAAACC GATGGGCGTC GTTATGCTGA CTGATAGTAC GCCGGACGAA GACGAAGAAT TGGGAGCCGT CAAGTCACCT TCCTTAGAGC CTGATGGTGA GCTTGCCCCT CCCGAACCCT TTGTTTGGAC GCCGCCCGCT CAACCCGAAA AAACTGAAGA CGACAAAAAA GAAGAATAAA GCTTTTACTC TTAAAAACAT TTCAAAGTAT GTGTTGTTAG TTTC
|
Protein sequence | MSLSVSNPGH QPVGAVESAS GYMALLQEDD VTLRSHALTK LLGCVDRLWH QVAESLPDLE AMAEDTDNPL QVQQTAAAVA SRVFFHLEEP TQALRLALEA GTQHFDPMDD QSPYVQRLVS AALDAYIQKR QAQDDEEVDQ AKESLVDLGL DMNQLQAMVH RLLEASCAAG KYDHALGIAL EARETSQVQE ILRAGGNSTS LLQYSIQAAA NTVTSKSFRV EVLQVVVGAL TVQFEEQNQT KVSYDLLLVH QHLNQALPVS RIMSKLLQGT EDEFLLALQL CFDLMDSGDQ AFAQAVAEGI DQDGIGEANQ GRSDKVQRVL VGGFSAELSL SFLHKESKAD RMIMERLKTA LEERSSGSRN SLLHTAAVVT HSYLYAGTTN DSFLRDYLDW MKKASNWAKF SATSSLGVVH ASHGAEAMRL LEPYLPMEPS ENTSVPGEGG FAEGGSLYAL GLIHGSHAGS SASKRQETTE FLRTHLRTSH ANEAQSHGAA LGVGLTAMGT ADLAVVNELK ELLVTDSAVA GEAAGIAIGM VLVGTGAELG EIVAELKNYA RETTHEKIIR GVAMGLALMS FGQEENADAL IEEMRSDRDP VMRYGAQYAV ALAYCGTGSN KAIRILLHAA VSDVSDDVRS AAVVGLAFVL FKTPERVPQL VSLLIESFNP HVRYASCMAV GIAMAGTGDA DSVAMLEPML DDMTDYVRQG ALMGTAMIYM QQSDTCNGRK IRSFREKIYA IPSEKHHSIL TKMGAILSQG IIDAGGRNCS LILGSRNGFT KMSSAVGLAL WLQHWHWYPM LHMFSLALTP TVTIGLNKDF KFPKKFEIQC NSKPSAFAYP RKLEDKKEEK KKLVETVTLS TTAKEKARLA RKRAKAGEVV VGEMDVDKGD ESKSDEEGEK KENADSMEVD NEDEPEKKPK KKRVPEPTSF RVTNPSRITK AQSQACSFDL DQRYRPIRSE EKPMGVVMLT DSTPDEDEEL GAVKSPSLEP DGELAPPEPF VWTPPAQPEK TEDDKKEE
|
| |