Gene Information |
Locus tag | PHATRDRAFT_13622 |
Symbol | RPN1 |
ID | 7202013 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 392886 |
End bp | 395722 |
Gene Length | 2837 bp |
Protein Length | 916 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | regulatory proteasome non-ATPase subunit 1 |
Protein accession | XP_002181371 |
Protein GI | 219122058 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
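The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene sequence includes the terminal stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 392886
end_bp = 395722
protein_length_aa = 916


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally with a stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(start_bp, end_bp)    # compare with "Gene Length"
cds_len = coding_length(protein_length_aa)  # coding bases incl. stop codon
non_coding = gene_len - cds_len             # bases not accounted for by codons

print(f"gene span: {gene_len} bp")
print(f"coding sequence: {cds_len} bp")
print(f"non-coding within span: {non_coding} bp")
```

Under these assumptions the span matches the listed Gene Length of 2837 bp, and the bases left over after accounting for the 916 codons plus a stop codon are consistent with the "Is gene spliced | Yes" field, since an intron would occupy that remainder.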
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGACG TGAGCCCGAA GGATAGCGGA AACGCGAAGG GCAAAGGTCT CAAGAAGGAA GAAATTGTCG ATACCTTGTC CGAAGACGAC AAGGAACTCA AGGAGCGTTT GGAAACCTGC GTAACCACAC TCGTAAATGC AGCAAACGAA GCTTCCGTTA CTACGGCGAT TCGCAATAAC GCCTTGGATG TGATGGTTAA TGAACTCCGA ACCGCAACAG CTTCAATGAC GTCGGTGCCC AAACCGCTCA AGTTCCTTCG TCCGCATTTT GCTTTGCTGA AATCCTGTTA CGATGCGATT GGAGACTGCG ATAACGAACT CATAGAACTG CGTGCTCGCT TGTCAGATGT TTTAGCTGTT TTGGCCATGA CCATGGGTAA GCCGGAAGAA CGCGAGAGTC TGAAGTTCAA GCTTGCAGGG GTCAAGGATT ATGCGCTATT GAGGGATCGG AAATCACCGT CGAAACACGC CGACGACAAT CTAGGATCCT GGGGGCACGA GTTCGTCCGA TCACTGGCGG GTGAAATTGG GCAAGAATAC GATCAGAGAG TGATTGATGG AGCGGATCCG AATCAAGACG ACTCGTTCGA GGATTTGCTG TCGATGATCG ACGTTATCGT GCCCTTTCAC GTTTCCCACA ATGCCGAATC GGAAGCGATT GATCTCTTGA TTGAAGTTCA GAGACTCAAA AACCTATTGA AACTCGATAC AATCGATGAG ACGAATTACC AGCGCATCTG TTTGTATCTG ATCAAGACAG CGGACTACAT GTCGGACCCG GACGATCTTT CGGTACGTTG GTGCTTTTGC ACACGCTTGC ACGTGTACGT CACGGCCATG CAGCATCCCT TTTCTAATGC ATGTTTCTTG CTCTGTAGGA AATGCTGGAA ACGGCACTGG AAATATTTAA GTCTCAGCAC CAGTACTTCG ACGCACTACG TGTGGCGTTG CGAATGAACG ACACCGAAAA AATCGCAGAT TTACTCAAGG CTTGTACCGA CCCCTTGATG CGAAAGCAAA TGTGTCTCTT ATTGGGGCGC CATCGAGTCA ACTTCGACGC CGAAGAAGCC GATATCGAGG ATGACGACGT TGAGCTGCTT AGTGAGCTAA TTGGTAACGA AAAGCTGAGC GAGCAGTTTC TCAAGTTGGC CCAGGATCTC GACGTTATGG ATCCCAAAAC GCCTGAGGAT ATCTACAAGT CCCATTTGGC TGAAACCGGA GGCTTCAGTC GTCGTCGGGA TACGAGTGCG AATGTTGACT CGGCCCGGGC CAATTTGGCA AGCACCTACG TCAACGCTTT CGTCAATGCG GGTTTTGGTC AAGACAAACT GATGACTCCC GACAACGATT GGCTATATAA AAACAAAGAC CACGGGATGA TGGCGGCAGC AGCCTCATTG GGATCGATTC TTCTCTGGAA CGTCGAAGAA GGATTGACAC AAATCGACAA ATTTTTGTAT TCTAATGAAG ATTACGTCAA AGCTGGAGCT GCGCTGGCCG TCGGCATTGT AAGTAGCGGT GTACGCAATG ATGCCGATCC TGCGCTGGCT CTACTGTCGG AACACGTAGA AGGTGAATCA CATGTTATGA AATGTGCCGC TTGCACTGGT TTGGGCATAG CTTACGCTGG TTCGGCACGT GAAGATGTTA TGGAAATCCT CACGCCGGTT GTAGAGAGTT CAGAAGGCGG CCCGACCACG ATGATGGAAG TTTCGTTGGC AGGTCTTGCT CTCGGCATGA TCTTTGTTGG CACATGTGAC GATATGGTAG GAGGTACCAT TGTACAACGG TTGATGGAAG 
CAACGGATGA TGAACTGGAT CATACGCATG CCCGCTTTCT TTGTCTTGGA TTGGCTCTCC TGTTTCTGGG ACAAATGGAA AAGGCTGAAG CTATGATAGA AGCTCTCCGC ACAGTAGAAC ACAAAATTGC GAAATACGCT GTTGTCATGT TGGAGACTGC GGCCTACGCT GGGTCTGGCA ACGTTCTCAA GGTCCAAGAG ATGATGCACC AGTGCGCCGA GCACTTGACA GAGGACGCGG AGCATCAAAT GGCGGCTGTT TTGGGAATTG GTTTGATCAC GATGGGCGAA GCGGTGGGTT CTGAAATGGC GCTGCGAACA TTCGATCACT TGCTGCACTA CTGCGAATTG CCGATCAAGC GCGCTGTGCC ACTTTCATTA GCTGTACTAA ACATTTCCAA TCCAGACTTT GCAGTAATCG ACCAGCTTTC TCGTCTGTCT CACGATCCTG ACATAGAAAT TTCTCAAAAC GCTATTTTTG GGCTGGGTAT TGTCAGCGCC GGTACAAACA ACTCTCGTGT CGCCGGGCTT TTGAGGCAGT TGAGCGAGTT TTACAGCAAG GAAGCTGGCC ATATTTTTTG CGTGCGAATT GCGCAAGGCC TGCTACATAT GGGTAAGGGT TTAATGACAC TTAACCCAGT CCATTCCGAT CGTATGCTCA TGAATGGACC TGCTCTTGGT GGTATGCTTG TTTTGCTGCA CTCTTGCCTC GATCTCAAGA GTACTTTGCT GGACAAGAGT CACTATCTGC TTTACTACCT GACATGCGCA ATGAATCCCC GAATGCTGAT TACAGTCGAC GAGGAATTGA ATTGGCGACC AGTAACTGCT CGCGTGGGAC AGGCTGTCGA GACCGTTGGA CAAGCAGGGA AGCCCAAGCG TATAACTGGC TTCCAGACGC ACACGACACC TGTTCTGCTT GCTGCAACGG ACCTAGCCGA GCTTGGCACC GAAGAAGTGT TCAGCATGAG CAGTGTGCTA GAGGGTATCG TGATCTTGAA AGACAATCCA GACTACGAAC CCGAAGAAAA GAAATAA
|
Protein sequence | MADVSPKDSG NAKGKGLKKE EIVDTLSEDD KELKERLETC VTTLVNAANE ASVTTAIRNN ALDVMVNELR TATASMTSVP KPLKFLRPHF ALLKSCYDAI GDCDNELIEL RARLSDVLAV LAMTMGKPEE RESLKFKLAG VKDYALLRDR KSPSKHADDN LGSWGHEFVR SLAGEIGQEY DQRVIDGADP NQDDSFEDLL SMIDVIVPFH VSHNAESEAI DLLIEVQRLK NLLKLDTIDE TNYQRICLYL IKTADYMSDP DDLSEMLETA LEIFKSQHQY FDALRVALRM NDTEKIADLL KACTDPLMRK QMCLLLGRHR VNFDAEEADI EDDDVELLSE LIGNEKLSEQ FLKLAQDLDV MDPKTPEDIY KSHLAETGGF SRRRDTSANV DSARANLAST YVNAFVNAGF GQDKLMTPDN DWLYKNKDHG MMAAAASLGS ILLWNVEEGL TQIDKFLYSN EDYVKAGAAL AVGIVSSGVR NDADPALALL SEHVEGESHV MKCAACTGLG IAYAGSARED VMEILTPVVE SSEGGPTTMM EVSLAGLALG MIFVGTCDDM VGGTIVQRLM EATDDELDHT HARFLCLGLA LLFLGQMEKA EAMIEALRTV EHKIAKYAVV MLETAAYAGS GNVLKVQEMM HQCAEHLTED AEHQMAAVLG IGLITMGEAV GSEMALRTFD HLLHYCELPI KRAVPLSLAV LNISNPDFAV IDQLSRLSHD PDIEISQNAI FGLGIVSAGT NNSRVAGLLR QLSEFYSKEA GHIFCVRIAQ GLLHMGKGLM TLNPVHSDRM LMNGPALGGM LVLLHSCLDL KSTLLDKSHY LLYYLTCAMN PRMLITVDEE LNWRPVTARV GQAVETVGQA GKPKRITGFQ THTTPVLLAA TDLAELGTEE VFSMSSVLEG IVILKDNPDY EPEEKK
|