Gene PHATRDRAFT_44571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44571 
SymbolCUL1_2 
ID7197810 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp939355 
End bp942290 
Gene Length2936 bp 
Protein Length821 aa 
Translation table 
GC content49% 
IMG OID 
ProductCULlin protein 4 
Protein accessionXP_002178608 
Protein GI219115625 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.457132 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGGGAGGTAT CGGCGACTCG AACAACAGGA CGGAATCGGG AGACAAGATT TCGCCACTAG 
TCCTAGAAAT CCCTCCGTGT TCGGTCCTAG TGGATATTTA AAGCTATAGA CTCGTTTGTA
TCATTTACAA ACAAGTGGTG TGCGGTAAGG AGACTACGGT TGCCGTCGCC CTCTCGCGTT
TCCGTCCGTC CGAGACTTCC AGAGGCTTGG TGCAATGCCA CGCAACAAGA GTCTGCTCAA
AGGCGGTCAT ACGTCTCGCA AGATCGTTAT TCGACCATAC AGTAAGCCTC CAACGCTTCC
AGAGAACTAC TACGATGATA CGGTCCGATC GTTGTTGCAG TCGATTTCGG AGGCTTCGTC
GCACCGGACC TTTACTGGCA CCGCGCCTTC CCCGAACTCT ACCGGTGTGA GTCTTCAGAA
TGCCTACAAA GCAGTAGTGT ACTTGGTCAG TCACCAGTAC GGCCCGCGAT TGTATCGAGA
TCTTATGGAT CATATGAAGC AGGTGGCTGC TCGAATCCTT CCAGAAGAAA GAGAGGCTTC
CGCGTCGCGA GCATCCCTCT TGATGTACAT CCCGAAGCAG TACCAGCTTT ACCTCGAGTA
CCTCTTGCTG TGCAAGCACG TCTTTCTGCC ACTGGACCGG ACGCATGCCT GGCAACCCGA
AACCAAAACA GTCGTTGTTG CATCGACACA GACTCCCGGC GGTCTCTTGA CTTTATGGCA
AGTAGGCCTC GAAATGTTGC AAACCAGAAT GCAGGAGTTG ACTCTCGATC GGGAGCTTTA
CCAAGAATGG CTCGCTGCCC TTTTGCAAGA TTGGAATCCA GCATCCAACA ATAATCTCGA
TGCGGCCAAT CGACAAGATC TACAATCAGT CTGGTATCTG TGGCAGGACT TGGGGCAACT
TGCTGTCCTC CCCCTACAGG AGGATTTGGA AGAATATTGG AAAAATCAGA GCCAGCAAAT
GATGGAGGGC TACCGTGCCG GATCCTTTCT CCAGTTTGCG TACGATAAGC ATGTACACGT
GACCATTTGG CAACCGTGGC TTCCCTCGCA GTGGCTCAGG TCGGTTCTAG AAAACTGCTT
CTTTCAACCG CATCTCAATG ACCAATACTT GCTGAAACCA GAGAACTTGC ATCCGATTCT
GCAGTCCGAA TTGTTTGCAA TCAAAACCGT TGTGGGGGTA TCGTCTACAG CGATGGAAAA
GCTGTCCTCG ACACAACAGC TCTGGACTCT TGCGGGGCGT ATTGCCGGGG GACAGCGTCT
GGTGGCGACG AGTATTGCCA ATTTTGCCAA AACCCAGGGC CTCGCCTGCG TTCAGCCAGC
GGTGGAACTC TCTGACGGTG CAGGTAAGGC AGCGGCGGGA CAACATATTT TGGACAAATC
ACCAATTCCA GCTACGAACA ATGTACAAAT TGTATCCGAC TTGCTGGATA CACAGCAGCG
AATATCACGT TTGATTCAGA GTCTGCCTCA CGGTCCGGAG TTGATTATTC TGAAAAATGT
TTGGGAGGAA GTTCTCAACG TGGAAACAAC CCCAGCACTA GCCGAGCTCT TGGCGAAATT
CCTGGATCAA ATCTTGCGAT CAAATAAGAA AATGGATCAG TATCAGTCTG AATCAGAGCA
ATGGTTGCAG CGCATCATTT CCGGACTGTT CATCCCGTTG CAGGCTAAAG ACATTTTTGA
GGCGTTCTAT AAGCGAGATC TCGCGAAACG GTTACTTTGG AATCGCGTCG TCAATATGGA
TGTAGAAAAA CAGGTTTGTT CGTTACTAAA GGCCGAGTGC GGCGCCGGAT TTACGTCCAA
GATGGAAGGA ATGTTCCAGG ACGTTGACTG GAGTCGGGAG ACAATGATGG TATATAAGCA
GTCCACGGCC GACATTTTAC CAACTGAGAA TTCAGTGGAG ATGGAAGCAC AAGTTCTGAC
AACTGGTACG TAATAGTGGA CGAACGGAAT TATCAGACGT ATTGCTACAT TGTTGCTCAA
ACAGGTTTCC TGCCTCGTAT TCCAGGTTAC TGGCCAGTGT ACCCTCAGTA TCCTAACTTA
CATTTACCTG AATCACTCAA AGAACCTCAG GAGCGGTTTG GAAATCATTA CAAGATCAAG
TACCAAGGCC GTCGTATGAC CTGGCAGTAC GCGTTGGGTC ATTGCGTTGT GCGCAGCTCG
GGTTTCCCCA AAACGTACGA ATTTGTCGTT AGTTTGTGTC AAGCGCTGGT TCTAATTCAG
TTTGAAGAGG CCGACACCAA ATTGTCATTG CCAACACTAA TGCAGGCTAT TGGATTAGAA
GACCGTGACG AAATGGAACG AGTCTTGCAG TCGCTGGCTT TGGGGAAGGA TGGTACGCGC
ATTTTGAGGA AACTAGATTA TGATTCGGAG CCAAACAAGA AAAAAAAGAT CCGGATGAAT
GTGGACAACC GGGACGAATT CACAATCAAT CGCAAGTTTG AATCCAATCA GCGACGTATC
CGTATCAACA ATATCATGAT GAAAGAGTCC AAAGAAGAAC GAGAAAAGAC AGTGGAAGCG
GTTTCGCGGG ATCGTCTCTA CTTGATCGAC GCTGTTCTCG TACGAATCAT GAAAGCTCGC
AAGACCATTT TACATCAAAC CTTGATTCCT CAAGTGGTGG AGCAAGTCAA GGTACCCGCC
CAACCCGGTG ACATCAAGCA ACGCATTGAG TCTTTGATTG AGCGAGAATA CATGGAGCGT
GATGCCAAAG ATCGAAACCG CTACAACTAT TTAGCTTAAC GCAAGTCTTG CTATTGCTTA
AAGATGCGGC ATCCTCTGGC ATGCAAGGAA TCCGCAAGGT CTCAAGTTTA TGGCACCAAT
GCTGGAATAT CATACGAGTT GAAAGTGAAC CAACAACCAT TCTTCAGCTG ATGGTTATTG
ATCAAAACAC GATTTTGATG CAGTGTATAA TATATAGTGA GCTTGTCTCT TGGCTG
 
Protein sequence
MPRNKSLLKG GHTSRKIVIR PYSKPPTLPE NYYDDTVRSL LQSISEASSH RTFTGTAPSP 
NSTGVSLQNA YKAVVYLVSH QYGPRLYRDL MDHMKQVAAR ILPEEREASA SRASLLMYIP
KQYQLYLEYL LLCKHVFLPL DRTHAWQPET KTVVVASTQT PGGLLTLWQV GLEMLQTRMQ
ELTLDRELYQ EWLAALLQDW NPASNNNLDA ANRQDLQSVW YLWQDLGQLA VLPLQEDLEE
YWKNQSQQMM EGYRAGSFLQ FAYDKHVHVT IWQPWLPSQW LRSVLENCFF QPHLNDQYLL
KPENLHPILQ SELFAIKTVV GVSSTAMEKL SSTQQLWTLA GRIAGGQRLV ATSIANFAKT
QGLACVQPAV ELSDGAGKAA AGQHILDKSP IPATNNVQIV SDLLDTQQRI SRLIQSLPHG
PELIILKNVW EEVLNVETTP ALAELLAKFL DQILRSNKKM DQYQSESEQW LQRIISGLFI
PLQAKDIFEA FYKRDLAKRL LWNRVVNMDV EKQVCSLLKA ECGAGFTSKM EGMFQDVDWS
RETMMVYKQS TADILPTENS VEMEAQVLTT GFLPRIPGYW PVYPQYPNLH LPESLKEPQE
RFGNHYKIKY QGRRMTWQYA LGHCVVRSSG FPKTYEFVVS LCQALVLIQF EEADTKLSLP
TLMQAIGLED RDEMERVLQS LALGKDGTRI LRKLDYDSEP NKKKKIRMNV DNRDEFTINR
KFESNQRRIR INNIMMKESK EEREKTVEAV SRDRLYLIDA VLVRIMKARK TILHQTLIPQ
VVEQVKVPAQ PGDIKQRIES LIEREYMERD AKDRNRYNYL A