Gene PHATRDRAFT_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_1947 
SymbolCPD1 
ID7201298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp693015 
End bp694559 
Gene Length1545 bp 
Protein Length515 aa 
Translation table 
GC content49% 
IMG OID 
Productcyclobutane pyrimidine dimer 1 
Protein accessionXP_002180488 
Protein GI219119456 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000588882 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGTGA TCTGGCTTAA GCGGGACTTG CGCCTTACAG ATCACGGCCC ACTGGCGGCT 
GTGGCCCAAC GCAAAGACCG AGATGTCTGT ATACTCTACT TGTACGAACC CGATCAGCTG
GCAGAACCAA GCGTTCACGG ATCCCACGTA CTATTTGCTA ATGAAGGGCT AGTAGACTTA
GATACGAAGT TGTCGAATCT TCGAAGGTTG TCGTCTGGCC AAAACGCTGC GGCCAGCAAA
AGCTTTGGTT CGTTGACAGT CTGCCACTGC GAGGTAATCC AGGCTCTACA AGCTATTCAT
GCGCAAAAGA AAATAGCGAG GCTACTGGCA CATATGGAGA CCGGACACAT GCGGTCCTAT
GCTCGCGATA AACGAGTTCG AAAATGGTGT AGGGATAGAA AAATACCTTT CGTGGAGTTG
CCTCAAACGG GTGTGTCACG ATGCCTTACG AATCGCGACG ACTTTCATCG CAATCTGCAA
ATGTTTTTGA AGAAGAAACA GTATCGCACA CCAACAGCCC TCGAATGCAA CATAGTCATC
GATTTAGAAT TACCAGGAAG AAGCATGGAG CCCTTGTTCG CCGAGTTGAT CGAGATTCCT
TTAGAGCAGA GAGTCGACCG TACAGAACGC CAGCAAGGAG GGGAAACAAC AGCATTGGAA
ATCCTCCGGT CTTTTCTTTA CCATCGAGGA GTAGGATTTT CAAAAGGAAT TTCGTCACCC
AATTCTTCGT GGACGTCATG CAGTCGGCTT TCCCCGTACC TTACATGGGG CCAAATTTCC
TTAAGACACG TAGTACAAGC ACTCCAGGAA CGTCAAGCTC AGCTGAAGAC GCAGAAATGT
CGATCTGATG ATCGCTGGTT GCGCTCCTTT ACTGCGTTCT CATCTCGCGT GCACTGGCGA
TCGCACTTTA TTCAAAAGCT CGAGTCGGAA CCGGAAATGG AACAACGCGA CGTAAATGCA
GCCTTTCAAC CACTCCGTCG ACAACCCGGC GATTGGAATG AATGCTACTA TCAGGCTTGG
TCAACTGGAA ACACAGGCTA TCCAATGATG GACGCATGTA TGCGCTGTTT GCACCGACAT
GGTTGGGTCA ACTTTCGAAT GCGGGCCATG CTGGTTTCAT TCGCAAGCTA CAATTTGTGG
CTGGATTGGC ATCGGTTCGC TCCCCACTTG GCTCGCGTTT TTCTAGACTA TGAACCGGGA
ATTCATTATC CGCAAATTCA AATGCAGTCG GGTACAACAG GTATTAACGC CTTGCGCTGT
TATTCTGTAA CAAAACAAGG AAAAGAGCAC GATCCTCGAG GAATTTTCGT TCGCAAGTAC
ATTCCTGAAC TCCAGTCCGT ACCAAATGAC TACATTCACG AGCCTTGGAA GATGTCTAAA
TCTATGCAGG CCAAGTGCGG CGTTCACATT GGCGAACACT ATCCCGCACC TATTGTGAAT
GAACAGGAAA CAGCGAAAAG CGCCAAAGAA CGCATCGCTG CCGTCCGTCG AAGAAACGAA
ACTCAGGAGG CCTCACGAAA GGTTTACGAA AAGCATGGGA GCCGT
 
Protein sequence
MDVIWLKRDL RLTDHGPLAA VAQRKDRDVC ILYLYEPDQL AEPSVHGSHV LFANEGLVDL 
DTKLSNLRRL SSGQNAAASK SFGSLTVCHC EVIQALQAIH AQKKIARLLA HMETGHMRSY
ARDKRVRKWC RDRKIPFVEL PQTGVSRCLT NRDDFHRNLQ MFLKKKQYRT PTALECNIVI
DLELPGRSME PLFAELIEIP LEQRVDRTER QQGGETTALE ILRSFLYHRG VGFSKGISSP
NSSWTSCSRL SPYLTWGQIS LRHVVQALQE RQAQLKTQKC RSDDRWLRSF TAFSSRVHWR
SHFIQKLESE PEMEQRDVNA AFQPLRRQPG DWNECYYQAW STGNTGYPMM DACMRCLHRH
GWVNFRMRAM LVSFASYNLW LDWHRFAPHL ARVFLDYEPG IHYPQIQMQS GTTGINALRC
YSVTKQGKEH DPRGIFVRKY IPELQSVPND YIHEPWKMSK SMQAKCGVHI GEHYPAPIVN
EQETAKSAKE RIAAVRRRNE TQEASRKVYE KHGSR