Gene Information |
Locus tag | PHATRDRAFT_44571 |
Symbol | CUL1_2 |
ID | 7197810 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 939355 |
End bp | 942290 |
Gene Length | 2936 bp |
Protein Length | 821 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | CULlin protein 4 |
Protein accession | XP_002178608 |
Protein GI | 219115625 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
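The Gene Length value above agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the check, using a hypothetical helper named `inclusive_length`:

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Return the span length for 1-based, inclusive coordinates.

    The inclusive convention is assumed, not stated in the record.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the Gene Information table.
gene_length = inclusive_length(939355, 942290)
print(gene_length)  # 2936, matching the listed Gene Length
```

Because the gene is marked as spliced, this span includes introns, so it is expected to be longer than three times the protein length.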
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.457132 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGGAGGTAT CGGCGACTCG AACAACAGGA CGGAATCGGG AGACAAGATT TCGCCACTAG TCCTAGAAAT CCCTCCGTGT TCGGTCCTAG TGGATATTTA AAGCTATAGA CTCGTTTGTA TCATTTACAA ACAAGTGGTG TGCGGTAAGG AGACTACGGT TGCCGTCGCC CTCTCGCGTT TCCGTCCGTC CGAGACTTCC AGAGGCTTGG TGCAATGCCA CGCAACAAGA GTCTGCTCAA AGGCGGTCAT ACGTCTCGCA AGATCGTTAT TCGACCATAC AGTAAGCCTC CAACGCTTCC AGAGAACTAC TACGATGATA CGGTCCGATC GTTGTTGCAG TCGATTTCGG AGGCTTCGTC GCACCGGACC TTTACTGGCA CCGCGCCTTC CCCGAACTCT ACCGGTGTGA GTCTTCAGAA TGCCTACAAA GCAGTAGTGT ACTTGGTCAG TCACCAGTAC GGCCCGCGAT TGTATCGAGA TCTTATGGAT CATATGAAGC AGGTGGCTGC TCGAATCCTT CCAGAAGAAA GAGAGGCTTC CGCGTCGCGA GCATCCCTCT TGATGTACAT CCCGAAGCAG TACCAGCTTT ACCTCGAGTA CCTCTTGCTG TGCAAGCACG TCTTTCTGCC ACTGGACCGG ACGCATGCCT GGCAACCCGA AACCAAAACA GTCGTTGTTG CATCGACACA GACTCCCGGC GGTCTCTTGA CTTTATGGCA AGTAGGCCTC GAAATGTTGC AAACCAGAAT GCAGGAGTTG ACTCTCGATC GGGAGCTTTA CCAAGAATGG CTCGCTGCCC TTTTGCAAGA TTGGAATCCA GCATCCAACA ATAATCTCGA TGCGGCCAAT CGACAAGATC TACAATCAGT CTGGTATCTG TGGCAGGACT TGGGGCAACT TGCTGTCCTC CCCCTACAGG AGGATTTGGA AGAATATTGG AAAAATCAGA GCCAGCAAAT GATGGAGGGC TACCGTGCCG GATCCTTTCT CCAGTTTGCG TACGATAAGC ATGTACACGT GACCATTTGG CAACCGTGGC TTCCCTCGCA GTGGCTCAGG TCGGTTCTAG AAAACTGCTT CTTTCAACCG CATCTCAATG ACCAATACTT GCTGAAACCA GAGAACTTGC ATCCGATTCT GCAGTCCGAA TTGTTTGCAA TCAAAACCGT TGTGGGGGTA TCGTCTACAG CGATGGAAAA GCTGTCCTCG ACACAACAGC TCTGGACTCT TGCGGGGCGT ATTGCCGGGG GACAGCGTCT GGTGGCGACG AGTATTGCCA ATTTTGCCAA AACCCAGGGC CTCGCCTGCG TTCAGCCAGC GGTGGAACTC TCTGACGGTG CAGGTAAGGC AGCGGCGGGA CAACATATTT TGGACAAATC ACCAATTCCA GCTACGAACA ATGTACAAAT TGTATCCGAC TTGCTGGATA CACAGCAGCG AATATCACGT TTGATTCAGA GTCTGCCTCA CGGTCCGGAG TTGATTATTC TGAAAAATGT TTGGGAGGAA GTTCTCAACG TGGAAACAAC CCCAGCACTA GCCGAGCTCT TGGCGAAATT CCTGGATCAA ATCTTGCGAT CAAATAAGAA AATGGATCAG TATCAGTCTG AATCAGAGCA ATGGTTGCAG CGCATCATTT CCGGACTGTT CATCCCGTTG CAGGCTAAAG ACATTTTTGA GGCGTTCTAT AAGCGAGATC TCGCGAAACG GTTACTTTGG AATCGCGTCG TCAATATGGA TGTAGAAAAA CAGGTTTGTT CGTTACTAAA GGCCGAGTGC GGCGCCGGAT TTACGTCCAA 
GATGGAAGGA ATGTTCCAGG ACGTTGACTG GAGTCGGGAG ACAATGATGG TATATAAGCA GTCCACGGCC GACATTTTAC CAACTGAGAA TTCAGTGGAG ATGGAAGCAC AAGTTCTGAC AACTGGTACG TAATAGTGGA CGAACGGAAT TATCAGACGT ATTGCTACAT TGTTGCTCAA ACAGGTTTCC TGCCTCGTAT TCCAGGTTAC TGGCCAGTGT ACCCTCAGTA TCCTAACTTA CATTTACCTG AATCACTCAA AGAACCTCAG GAGCGGTTTG GAAATCATTA CAAGATCAAG TACCAAGGCC GTCGTATGAC CTGGCAGTAC GCGTTGGGTC ATTGCGTTGT GCGCAGCTCG GGTTTCCCCA AAACGTACGA ATTTGTCGTT AGTTTGTGTC AAGCGCTGGT TCTAATTCAG TTTGAAGAGG CCGACACCAA ATTGTCATTG CCAACACTAA TGCAGGCTAT TGGATTAGAA GACCGTGACG AAATGGAACG AGTCTTGCAG TCGCTGGCTT TGGGGAAGGA TGGTACGCGC ATTTTGAGGA AACTAGATTA TGATTCGGAG CCAAACAAGA AAAAAAAGAT CCGGATGAAT GTGGACAACC GGGACGAATT CACAATCAAT CGCAAGTTTG AATCCAATCA GCGACGTATC CGTATCAACA ATATCATGAT GAAAGAGTCC AAAGAAGAAC GAGAAAAGAC AGTGGAAGCG GTTTCGCGGG ATCGTCTCTA CTTGATCGAC GCTGTTCTCG TACGAATCAT GAAAGCTCGC AAGACCATTT TACATCAAAC CTTGATTCCT CAAGTGGTGG AGCAAGTCAA GGTACCCGCC CAACCCGGTG ACATCAAGCA ACGCATTGAG TCTTTGATTG AGCGAGAATA CATGGAGCGT GATGCCAAAG ATCGAAACCG CTACAACTAT TTAGCTTAAC GCAAGTCTTG CTATTGCTTA AAGATGCGGC ATCCTCTGGC ATGCAAGGAA TCCGCAAGGT CTCAAGTTTA TGGCACCAAT GCTGGAATAT CATACGAGTT GAAAGTGAAC CAACAACCAT TCTTCAGCTG ATGGTTATTG ATCAAAACAC GATTTTGATG CAGTGTATAA TATATAGTGA GCTTGTCTCT TGGCTG
|
Protein sequence | MPRNKSLLKG GHTSRKIVIR PYSKPPTLPE NYYDDTVRSL LQSISEASSH RTFTGTAPSP NSTGVSLQNA YKAVVYLVSH QYGPRLYRDL MDHMKQVAAR ILPEEREASA SRASLLMYIP KQYQLYLEYL LLCKHVFLPL DRTHAWQPET KTVVVASTQT PGGLLTLWQV GLEMLQTRMQ ELTLDRELYQ EWLAALLQDW NPASNNNLDA ANRQDLQSVW YLWQDLGQLA VLPLQEDLEE YWKNQSQQMM EGYRAGSFLQ FAYDKHVHVT IWQPWLPSQW LRSVLENCFF QPHLNDQYLL KPENLHPILQ SELFAIKTVV GVSSTAMEKL SSTQQLWTLA GRIAGGQRLV ATSIANFAKT QGLACVQPAV ELSDGAGKAA AGQHILDKSP IPATNNVQIV SDLLDTQQRI SRLIQSLPHG PELIILKNVW EEVLNVETTP ALAELLAKFL DQILRSNKKM DQYQSESEQW LQRIISGLFI PLQAKDIFEA FYKRDLAKRL LWNRVVNMDV EKQVCSLLKA ECGAGFTSKM EGMFQDVDWS RETMMVYKQS TADILPTENS VEMEAQVLTT GFLPRIPGYW PVYPQYPNLH LPESLKEPQE RFGNHYKIKY QGRRMTWQYA LGHCVVRSSG FPKTYEFVVS LCQALVLIQF EEADTKLSLP TLMQAIGLED RDEMERVLQS LALGKDGTRI LRKLDYDSEP NKKKKIRMNV DNRDEFTINR KFESNQRRIR INNIMMKESK EEREKTVEAV SRDRLYLIDA VLVRIMKARK TILHQTLIPQ VVEQVKVPAQ PGDIKQRIES LIEREYMERD AKDRNRYNYL A
|
| |