Gene PHATRDRAFT_34592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_34592 
SymbolCPF2 
ID7199524 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp983416 
End bp985541 
Gene Length2126 bp 
Protein Length610 aa 
Translation table 
GC content53% 
IMG OID 
Productcry-dash from the cryptochrome/photolyase family 
Protein accessionXP_002178889 
Protein GI219116188 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGTA GCAGGTCCAA GCAGTCTTTT TGGTTCCTTA TCTTTGTTCA CACCGTCCTT 
ATCTTGACCT TTGCTGCCGG TGCCGCCACT GCCATGAGAG GAGCATCCGT CAGTACCGTG
ACGAAAGCGA CCTCACGCGC CTCGTCGCAA GTTGTCTTGC ACTGGTTCCG ACACGGAGAT
TTACGCTTGC TGGACAATCC GGCCTTGATC CATTCCAGGT AAGAGTTACA CGTTATTTTG
TCTTTGCGAC AGCTCGAGTA GGAATGGAAC GCACAAAAAC TCGAGTCCGT CCCCACGAAC
TTCCAACACA AACCTTTCCT CACTCATCTT TACGTGTGAC CCTTCTTGTT GGCTTGTTCG
TTAGCAAAAC TGCCGAATCT TGTGTTCCCG TCTTTTGTTT CGACGACAGT GTCTACGGCA
ACGACAATCG GACTCCGGAC ACTCGCGCGC CGCATTCGAA CGATCGCGGT CAACTCAAGT
GCGGACCCCG TCGTGCGCAG TTCGTACTGG ATTCGGTCCA GGATCTACGA CGCAGTCTAC
AATCCCGAGG TAGCGCCTTG TACGTGGCGC ACGGGAAACC CGCACAAGTC TTCCAAAGAT
TGGTGGATGC CTGGCCGGCG GTTCCAGCCG CCGACACTGC CGCTCCGAAC GGCAGTCTAC
TCACCATTGT TTGTCAGCGG GAAGTGGTGC GGGAGGAAAA CGACGCGGTC CGAGCGGTAC
AATCCGTTCT ACGCCGACGC TTTCCGCAAG CCAAAGTTCA ACAAATTTGG GGCTCCACCA
TGTACGAACT GGACGATTTG CCCTTTGCCA CGGACCTCGC CAACATGCCC GATACCTTTA
CGCCCTTTCG GAACAAGGTG GAAAAGAATT GTCAAATTGG CACACCCCTG CCGGTACCCA
AGCAACTCTC TCTGCCGGAG AATTTTCCTT CCGCCTTGAA ACAAGGCCTG GAGTACCTAC
CCACCTTAAA AGAGCTGGGG TACACGGATG CGCAAATTCA GCAAGTCGAA ACCCATGACG
AGCGCGGAGT CCTGCACTTT ATGGTGGCGA AACAGCCGGA CTCGCACGTG TTGAGGACTA
CATTTGGACA CAGGATTGTT TGAAGGATTA CTTCGAAACG CGCAACGGGA TGCTGGGGCC
GAACTATTCG ACCAAATTCA GTCCCTGGTT GGCCCACGGC AACGTTTCAC CCCGGTACGT
GGCCGCACAG TGCCGTAAGT ACGAAGAAGA ACGCGTGGAG AACAAATCTA CCTACTGGGT
AGTCTTTGAA CTGCTGTGGC GCGATTTTTG CAAATTTTTT GCCACGAAAC ATGGTGACGC
TATTTTTTAT CCGTACGGGA CGACCGAACG CACGGATCGG CACAAGCCTT GGTCGACCTT
TGGTCGTAAT TTACAGGCGT GGCAGGAGGG TCGCACGGGC TACCCTCTCG TAGACGCCAA
CATGCGAGAA TTGGTCGCCA CCGGCTTCAT GTCAAATCGT GGCCGTCAGA ATGTGGCATC
CTTTCTCGCC ATTAACTTGA ATCACGATTG GCGATGCGGC GGGGATTTCT TCGAAAGCCA
TCTGGTACGT TAAGGAGAAC ATTGTGTGGA CGTTTCCACG TTTTGGAATT GTTACTGACA
CTTAATATTC GCGCATTTCT GAAATTCTAT TTTTGTCTTC GTTGCGGTGT TGGTGGTTGC
TGCAGTTGGA TTACGACGTG TATAGCAATT GGGTGAATTG GTGTGCGGCC GCGGGTATGA
CGGGTGGTCG TCTCAACCGA TTCAACATTT CTAAGCAAAG CAAGGACTAC GACCAACACG
GCGACTACGT GCGGCATTGG TTACCGGAAC TGGCCAAGGT GCCCAACGAA TTCGTCCATG
AGCCTTGGAA AATGACGTCG TTCCAGCAGA TGGAGTACGA GTGCAAACTC GGCGTGGATT
ATCCCAACCC GATTGTTCCG CCGTCCCGGC CCAACCCCCA CACGGACCGG AACAATCGCG
GCCGTGGTGG GCACCAGTCA AAAGGCAACA ATCGCCACGG TCCACCCGAC AAATCCCGCA
AAGCCAATGC GAGTGGAAGC AACCGACACC AAAAATACGA AATGAAGAGT CTCCAACCCG
GCAGTTTTCG AGTCAAGGAA TCGTAA
 
Protein sequence
MSSSRSKQSF WFLIFVHTVL ILTFAAGAAT AMRGASVSTV TKATSRASSQ VVLHWFRHGD 
LRLLDNPALI HSSKTAESCV PVFCFDDSVY GNDNRTPDTR APHSNDRGQL KCGPRRAQFV
LDSVQDLRRS LQSRGSALYV AHGKPAQVFQ RLVDAWPAVP AADTAAPNGS LLTIVCQREV
VREENDAVRA VQSVLRRRFP QAKVQQIWGS TMYELDDLPF ATDLANMPDT FTPFRNKVEK
NCQIGTPLPV PKQLSLPENF PSALKQGLEY LPTLKELGYT DAQIQQVETH DERGVLHFMV
AKQPDSHDCL KDYFETRNGM LGPNYSTKFS PWLAHGNVSP RYVAAQCRKY EEERVENKST
YWVVFELLWR DFCKFFATKH GDAIFYPYGT TERTDRHKPW STFGRNLQAW QEGRTGYPLV
DANMRELVAT GFMSNRGRQN VASFLAINLN HDWRCGGDFF ESHLLDYDVY SNWVNWCAAA
GMTGGRLNRF NISKQSKDYD QHGDYVRHWL PELAKVPNEF VHEPWKMTSF QQMEYECKLG
VDYPNPIVPP SRPNPHTDRN NRGRGGHQSK GNNRHGPPDK SRKANASGSN RHQKYEMKSL
QPGSFRVKES