Gene PHATRDRAFT_54072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54072 
SymbolERCC2 
ID7197063 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp24858 
End bp28911 
Gene Length4054 bp 
Protein Length782 aa 
Translation table 
GC content49% 
IMG OID 
Productxeroderma pigmentosum group D complementing protein 
Protein accessionXP_002177848 
Protein GI219112193 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGACTTCATT ACCGTAAACA GAAACAAGTC TTGTCAGTCA TCAATGAGGA ATTGACCGTT 
CTCTTCATAG GCAATTCCTG AAAGCTTTGC ATAAGCGGAA GGTGTGAAAA AGGCGTCAAA
ATGAGATTCG ATTTGGACGG CCTGGATGTG TTCTTTCCGT ACGATCGCAT CTACCTTGAG
CAGCATCAGT ACATGCGAGC ACTGAAACAG TCGCTGGACG CAGGTGGGCA CTGTCTCCTG
GAGATGCCTA CGGGAACGGG AAAAACCGTA TGTTTGCTAT CGCTGATTAC TTCCTACCAG
TTTGCGAATC CGTCGGCGGG TAAGCTCGTG TATTGCACTC GAACCGTTCC GGAAATGAAT
CACGTTATGG AAGAACTCGC GACAGTTTTA GCGTACCGGT CGCAAGAACT ACAACGGCAC
CAGGAAGAAA ACATCTTGCC AATGGACACT GACGGGGATA TTGAAAACGT TCCGGTTGCG
GGCAGTAGCG GGGATGTAAA CGTCACTGCC ATCACCCACA TTACCAGCAA TGGCTCGCCC
AATCGCAAGA AGCCTCGAAA GGTCTATACC GCCAATTTAC AAAAACCTCC CATGGGACCG
ATCTCGGACG GTCGCGGAGC TGGTGGAAGC GGGGCTCTAG CATTGTGTTT AAGCAGTCGA
CGGAATATGT GTGTTCACGA GCGAGTCATG GCCGAAAGCG ATCGGGAAGC CGTCGATGCG
GCTTGCCGAA GTCTAACGGC GAGCTGGGTT TTGGAAGAAG CACAGAAGCG TCCGGGCTCG
ATCGAAACCT GCAGCTATTA TGATAATTTT CATGCTGCAG GCGAGTCCAC TTCAATGCCC
AGTGGCGTGT ACGATTTGGA AGAGCTGCAG AAGTGGGGCA AGCGGCGTGG ATGGTGTCCA
TATTATCTGA CTCGGCAAGC CATTAATCAC GCCAATATAC TCGTATTCAA CTATCAGTAC
ATGCTTGATC CAAAAGTTGG TACGTCGCTT GAGGCTGCGT TTTGTACAAT CCTTGAGGTT
CACGCGCTTA CACTCTGCTG TTCCTTTTCT CGTTATGTAG CCAAAATGGT TTCGAAAGAA
CTCGAAGCCG AAAGTATCGT GGTGTTTGAC GAAGCCCACA ACATTGATAG TGTATGCATT
GAAGCGTTAT CAGTGACCAT CAACGACCGA GGATTGGAAC AAGCTACACG TTCTCTTGGA
CGCTTGTCAT CAGAAGTGTC TCGCCTGAAA GCATCTGACA ATCAAAGGTT GCAGACCGAG
TACCAAAATC TCGTCAACGG CTTGATTGAC CAAGGATTAC TGGATGCGCC GGCGAATGAT
GCTGGACTCA CAAGCGTACG TTTTTTTCTT GGTCCGACAC TGGTTATCCT TCCCATAGTA
TAAATATTTT TGTCTCACAT TCACCTGTGC TCTGGAATAC AGAATGTGCT CAATCCTGAT
GTTTTGAATG AAGCCGTACC CGGCAACATT CGTCGAGCCG AACACTTTAT AGGATTTATG
AAAAAAGTCG TGGAGCATTT GAAAGCTAGG CTGCGTTCCG TTGCAGGGCC GAATGGAGGT
GTGCGGAGTG AAACGCCGCT AGCATTCTTA CATCGCATGA CAAATGGTAC CAGTCTGGAA
GCCAAGCCTT TGCGGTTTGC CTATTCCCGG CTATCGTCCT TGCTACGCAC CCTGCAAGTT
TCGAATCTGG ACGATTTTAA TTCACTGACA GACGTTGCGG ACTTTTCAAG TCTACTTGCT
GCTTACAGTG AAGGAGTCGC CAAGTTTGCT ATTATCATGG AGCCGAACGG GTCCTCCATT
CCAGGGGCTA CGGACCCTGT AATTCAACTT GTTTGTTTGG ATTCGTCACT GGCTATTGCA
CCTCTTTTCA AGCGATTCGG GAGCGTGATC ATCACATCGG GAACACTGTC ACCCATTGAT
CTCTATCCCA AACTGTTGCA GTTTGAACCT TGTGTCTCCG AGTCGCTGTC CATGTCTACG
TTCCGTCCAT GTATTCGCCC ACTGGTGATC ACGAAAGGAT CGGATCAGCT GGCGGTATCT
ACAAAGTACG AAGACCGAGG TGACATGGGT GTCATTCGAA ATTATGGAAC CATGTTGGTC
GAACTGTGCA GTACCATCCC TGATGGCGTG GTTGCTTTCT TCACATCGTA CAGTTATATG
GAATCCCTGA TCAGTGAGTG GGATGCGATC GGAATTCTAC GAGAGTTGAC CAAATCGAAG
CTTGTTTTTA TAGAAACCAA AGATGTGGTC GAAACCACCT TAGCTTTGGA CAATTACCGC
AGAGCCTGTG ATTGTGGGCG GGGCGCCGTT TTTCTATCAG TAGCACGGGG AAAAGTATCG
GAAGGTATCA ACTTTGATCG ACATTACGGA CGAGCTGTGA TTATGTTTGG TGTTCCCTTT
CAGTACACTC TTTCGCATAC CCTCAGAGCT AGACTGGAGT ATCTTCAGAC ACACTACCAG
ATTCGTGAAC AAGACTTTCT AAATTTTGAC GCCATTCGCC AAGCTTCACA GTGTGTTGGT
CGTGTCATTC GAAGTAAGAC CGACTATGGG CTGATGATCT TTGCAGACAG TCGATATAAT
CGCCATGACA AACGATCGAA ATTGCCTAAA TGGATCCAAC AATTCATGTC CGATCATTTT
TTGAATTTGA GCACAGACAT GGCGCTTTCG CATGTCAAGC AATTTCTGCG CCTCATGGGA
CAACCTATCG ACCAGGAGGC GCTGCAAAGC GTTCTACTTA CGCTAGACGA AGTCAATCGA
ATGAATCCGC CCACAATTCG GACAAACATA GCTGCAGGGG AAGGCGGTGC AATGATTACC
GAGGTATCGT CCGCTCAGCC AATTCCAGCA TTTTCGGTAA TGTAGATTTG GAAGGTGTGT
CTTGTTGTTC TCTATCTATC AAGATGGAAA TTTAATACAG AGCAATAGGT GCAGTTTTGG
CATAGGCTGT TTACCATTCG CTAGTTCATC TTTCGCCCCC ACTCTCGGAG AGCAGTCGAA
CAACCCATAA TGTTCTGAGC TATCGCTCCT TCCAATTCGC GATACACGTC AACCGTTAGC
TGCAAAGGAA CAGCTTGCAA AAGAGACGGC TCCAAATCCA AGACGGAACG ATCATGTTGG
TACTGCAGGG GTGCCCCACC GCTATGATCT GAAGCGGGAC GTAGCAGGCC AACACTAAGC
AAGTCAACAA AAGCCTTGGT CAGAACTTGT TTGGAATACC GATTTGAACT ACCTTTGTAC
GAGCCAAGGT ACTCTTCCAA CATGCGTTCA AGCGTCAAAA CGGCGTTGCT TTGCTCCCGG
CGATTGTCCC GGGATAGAAT TCGCCTCGTA GCCAATACCA GCGCTGCTTG CGGGCCCGAC
AAATCTCGCA AGGCTTGCAT ACGGGGATCG ACAGCCACAT CCTCTACTAT CACCAGGTCG
CGTTCTCCCG ATTCAGATGC CAAGGATCCT CCCATATCGG CCAATGCTTC CAAGAGGTAT
TCGGCATCAA ATGTTGGAAT CGGAGATGCC GATGAGCTAG CATCACCTTC CATGGCGGCC
AAGCAATCCA TCCGATAGTT TGTAATCGCG AATGTCAAGA CGCGGCTGAA CCAGCGAACA
TCTTGACCCA ACATGCAATC CCGCACGAAA GCATTATAAA CTCGTCTCCT TGTGGGATGA
TTGTCGTCTT CTGCAGGTTT GGTCAGGAGT GCGCGAATTT CGTGGTAAAG CGATGGCTGA
TAAGCCGCAT CATAGTGGTC TTTTCGTGTC GATTCCAACT TGCACGACAC AATATCGCGA
AGTTTGCCGA TGGACGTGCA CGGTCCAAAT TGAAGAAATT GAGATGTACC CTCGGCTCGA
CTACGAATTC GTTTTTCCAG TAAACCGATG AGGCCTAGAT GGCAGGTCAT ACCGACAAAG
CACACAGATG AACCGTCGGA AGCTACGCGA TCGAGAAGGT GATAAAGTAA CAGTTGTCGA
TCTTTATGCT CCGTCAAACC ACTATCCAAA GAATTTGCCG TCGCTTTCTG TTGTCCCAGA
AAAAGATCGA GCTCATCCAG AACAATTACG ATAG
 
Protein sequence
MRFDLDGLDV FFPYDRIYLE QHQYMRALKQ SLDAGGHCLL EMPTGTGKTV CLLSLITSYQ 
FANPSAGKLV YCTRTVPEMN HVMEELATVL AYRSQELQRH QEENILPMDT DGDIENVPVA
GTLCLSSRRN MCVHERVMAE SDREAVDAAC RSLTASWVLE EAQKRPGSIE TCSYYDNFHA
AGESTSMPSG VYDLEELQKW GKRRGWCPYY LTRQAINHAN ILVFNYQYML DPKVAKMVSK
ELEAESIVVF DEAHNIDSVC IEALSVTIND RGLEQATRSL GRLSSEVSRL KASDNQRLQT
EYQNLVNGLI DQGLLDAPAN DAGLTSNVLN PDVLNEAVPG NIRRAEHFIG FMKKVVEHLK
ARLRSVAGPN GGVRSETPLA FLHRMTNGTS LEAKPLRFAY SRLSSLLRTL QVSNLDDFNS
LTDVADFSSL LAAYSEGVAK FAIIMEPNGS SIPGATDPVI QLVCLDSSLA IAPLFKRFGS
VIITSGTLSP IDLYPKLLQF EPCVSESLSM STFRPCIRPL VITKGSDQLA VSTKYEDRGD
MGVIRNYGTM LVELCSTIPD GVVAFFTSYS YMESLISEWD AIGILRELTK SKLVFIETKD
VVETTLALDN YRRACDCGRG AVFLSVARGK VSEGINFDRH YGRAVIMFGV PFQYTLSHTL
RARLEYLQTH YQIREQDFLN FDAIRQASQC VGRVIRSKTD YGLMIFADSR YNRHDKRSKL
PKWIQQFMSD HFLNLSTDMA LSHVKQFLRL MGQPIDQEAL QSNLPSLSVV PEKDRAHPEQ
LR