Gene PHATRDRAFT_51952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_51952 
SymbolCPD3 
ID7201113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp553266 
End bp554799 
Gene Length1534 bp 
Protein Length511 aa 
Translation table 
GC content50% 
IMG OID 
Productclass II CPD photolyase 
Protein accessionXP_002180071 
Protein GI219118604 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCAA ACCGTACACG AGTCCTTACT TCGGAAGGAA CCGAGCCAAA AGAGGGACAA 
TCGGTAGTGT ACTGGATGCA ACGTGACGTG CGATCGGTCG ATAATTGGGC TCTCTTATGG
GCTCGAGATC TAGCGATGCA GCACGATGTT CCCCTACACG TCGTGTACGC GTTGCCACCA
CCGGCTTCGT CGGACGGATC AGACAACGAT AGAGATTTGC CTCCAGCTCT CATTCAATTA
CCCATGACGA AGCGACATGG CGCCTTTTTG CTGGGTGGGC TAGAATGCGT GTACAAGGAG
TTGAAAGAGA TGAAGATTCC GTTGTACGTC TGCCTTCCCG ACTCTCACGA GAAGGTTGGC
GAGACTGTCT GCGAAGCAAT CCTGCATAAA TACAAGGCAA AAATTGTTGT CTCTGATTTT
TCTCCGATAC GCGAATACCG TCAATGGATG GAACTCCAGG CCGTACCTAT CTTGGAGGAA
GCGAAGGTCC CGTTTTATCA GGTTGATGCC CACAACATTG TGCCGGTGTG GACGGCAACC
GACAAACGAC AAGTTGGAGC TCGAACCCTA CGGCCGCGAA TTCATAAAGT GTATAATGAC
TACCTACAAG ACTATCCGGA TCTCAAAGGC AACAGCCATT CGGTTGACCA ACCCAAGTTT
GACCGGGTCG AATACGAATC GTTTTTACAG ATGGACGAAT CGGTGGAATC CGTCGACTGG
GCACAGCCTG GAACAGAAGC AGGTATGAAA CAGTTTGAAT TTTTTAGCAA GAATGGCCTA
AAGATTTTTC ATGAGCAACG TAATGATCCC GTGCAAAAGC ACGTCTGCTC TGACATGTCC
CCCTGGATCA ATCACGGCCA TATTTCGTTT CAACGGTTGG CCCTAAATGT CAAAGCATTG
AACAAACACG CCAACGGAGC TGCAGCCTTT ATTGAAGAAG GCGTTATTCG TCGCGAGCTA
TCAGACAACA TGTTGTACTA TTCTCCGAAC GACTACGACT CGCTCGAAAC GGCAGCTGGC
TGGGCACGGG AGAGCCTGCA ACTGCACGCG TCCGACGAAC GTGAATTCGT GTACTCACTC
TCCGAGCTGG AGGAAGGACG CACTCACGAT GATTTGTGGA ATGCAGCTCA GTTGCAGATG
GTACGAGATG GAAAAATGCA CGGCTTTATG CGCATGTACT GGGCCAAAAA AATCCTCGAG
TGGTCCGAAT CTCCGGTTGG GGCGTTGCGG ACGGCGCAAT ACCTGAATGA CAAGTACGAA
TTGGACGGCC GCGATCCAAA CGGCTTTGTC GGGGTCGGCT GGTCCATAAT GGGAATCCAT
GATCAAGGAT GGAAGGAACG AGAAGTGTTT GGTAAAATTC GGTACATGAA CTACAACGGG
TGTAAGCGTA AATTCAAAGT GGAGGAGTAT GTTGCTCAGT ACAAAGGTGC CGCCGAGAAT
GCTGCCAATG CAGTGGAGGA AACAAATGGG TCGTCCAACA AGCGCAAATC GTTACCAAGC
TCTTCAAATT CGAAACAAAA GACTGCCAGA AAGT
 
Protein sequence
MLANRTRVLT SEGTEPKEGQ SVVYWMQRDV RSVDNWALLW ARDLAMQHDV PLHVVYALPP 
PASSDGSDND RDLPPALIQL PMTKRHGAFL LGGLECVYKE LKEMKIPLYV CLPDSHEKVG
ETVCEAILHK YKAKIVVSDF SPIREYRQWM ELQAVPILEE AKVPFYQVDA HNIVPVWTAT
DKRQVGARTL RPRIHKVYND YLQDYPDLKG NSHSVDQPKF DRVEYESFLQ MDESVESVDW
AQPGTEAGMK QFEFFSKNGL KIFHEQRNDP VQKHVCSDMS PWINHGHISF QRLALNVKAL
NKHANGAAAF IEEGVIRREL SDNMLYYSPN DYDSLETAAG WARESLQLHA SDEREFVYSL
SELEEGRTHD DLWNAAQLQM VRDGKMHGFM RMYWAKKILE WSESPVGALR TAQYLNDKYE
LDGRDPNGFV GVGWSIMGIH DQGWKEREVF GKIRYMNYNG CKRKFKVEEY VAQYKGAAEN
AANAVEETNG SSNKRKSLPS SSNSKQKTAR K