Gene PHATRDRAFT_45710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45710 
Symbol 
ID7200482 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp1012498 
End bp1015462 
Gene Length2965 bp 
Protein Length890 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179766 
Protein GI219117963 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.262092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTTCG CGAATGGCTT TGGAGACAGT AGTGATGATC TGATGGAAAA CGCAAGGATC 
GTTAGCCCTT ACGGCGATGG AGAAACCCAA GCGTTTCCTG CTGATCCTTG GGCAAATTTT
CAGTACGCAC TGCCTGAAGG GCCCTCGCAG TATAAACATC GTGCTTCTAG ACAAAGACCA
ACGTTTGGAG GATCACGATC TCCCTCGGAG CCAATTCCCA CCCCACGAGA AAGGATGGTG
ACATCTTCTA AGATCTCAAC TCCTCACCGA GACGGTATGG TAAATCGAGG CCATGCTAGT
CGATACGAAT ATTCTAGTGA CCAGCGATGC GATGACGTAC ACGTAATATT TACAGATCCC
TCCGACGGTG TTGGGATCAG CGTTGAAAAA AGGAGCCAAC AGGACTTGTT GCGACAAAAG
GTTCGGGAGG ACACCAGTCG TAGGAGCTTA CAACATTCAA CCAGCGCACA AAGTATGTAC
CGGAAACGTA GCGAAAACAA AGGTGCGTTT CCTTCATTTC CACAAGTAGC GCCGAGCAAA
GAGAAAGCAC TGGTACGTAC TCCGAACACC TAAACTTGAA TGCATTGACC AATTTTGAAA
AGTGAACTAC ACTCCTCTAA ATTACGCCTT CAATTCGTAT TCTTTTGTCT GAAGCGAGGC
CAGCGTCCAT CTAAACATAC TCGGGACCAT ATCTCACGGC ATACTTCGCC AAAATCATCA
GTGGAAATAA CTTCAGAAAG CACAAACGCA GAGACCGATG GTCTAAATTT GAATTGGGCA
TTCTCGCGTG TTCAAGTTCG AGCGAATAGT GAAACGGAGT CAACGACGGT TTGGCTAAAT
ACTGATCACA GGCAGGATTT AGAAGGAGAA GACATTTGGT TACAAGAAGA ACATGCATTC
CCTTTGTTGT CAACGGGGGC TTCAAGCAGA TTTTCGGGCC CATCTGGTAA AGATGGCCTC
CGAAAACGTA CTGATAAGAA GTCAAATCGT GTCCGTTTTG CCGAACCGAT ACAGAAGCCG
CGATCTGTCC CTTTTCTGAA AACTGTTTCA CGGGAAACTA TACGGGAAGA ATCCTCAGAT
CCTCGTTCCA AAAAGATCAC ACATAGTGAC AATCGCGCGT GGTTCGTGGC CCAACCGAAG
TCAATTCTCC GTCGGCGGCG TTTCGCTGGC GAAACCGTGT CCCATGACCC ACAGTACCCC
CAGAAGAATC GCCCATCCTC TCATCGCGCA GCTCCACAAC GGAAGTCTGC TACATCGTTT
TTGGATACAC AAGGATCCCT ACTCTCTCCC ATTCATTCAG ATAGGCGGCC TTGGGACCGC
ATCTCTGAAA CAGGCTCGGA GTCACTCAGT CCTTCGTACA GTGATGTTGA GCGAGAGAAA
CGCGTTAGTC TTGGTCCCTA CCACCTGCAG GAGCTAAATG AGATGTATCC CGATCCTCCT
CTTGAGTTGC AGGTAAGACA TTTTTGACTT TACAAAGCCC TATCTTTTGT AGCCGACCTA
CTCACCGACA TACATTTTCT TTTAGTTCGA CGACGAGTCA ACTGTAGTAC CAGCACGCGC
TTCTTTCATC GACACTGTCG CTGCTGTTGT CGTTCAAGCC GCTGTTCGTA GATTTCTTGC
CCAAAAAGTG ATGCATGAGA TGGTCGGCAA AGCATATTCC TTTCCGCATT TAGAATCTGA
CGATAAGAAA TATCGACCAT TGTCGTCGCG AAAGGTAACT CCTGAAAAAA GGTCGTCCCG
AAAAAGCTAT GCAGAGAGCC CGTATGGTAG GAATTTGGGA GGAGCAGTGT TCATAGAAGT
CATGGCTGCG ATAAAAATCC AATCTGCCTT CCGAGGCTTT TGGGTTCGAG ATTCGTTGAA
TGTGGATCAT TTTTGCGCGA CTATGATCCA GAAATGGTAT CGACGACATC ATCAGAGGCA
CCACTATTTT GCAGATCTTT CTCGGATCAT ACTGGTCCAG TCCATTTGGA GGCGCAGTAT
AGCCAGGGAG CACGCTGCCT TTTTCCTTGG GAGCGTAATT ACAGTTCAGT CGCTGTTTCG
CTCGTACAGC GCTCGCAAAA AGCTCTACTC AGGACTCACT TGCCTACGAA AGGATACTAT
GGCAGCTGTA GTGATCCAAT CGCAATGGCG TACATATGCT TGCGAATGCA ACTTTATTCG
CGATCTTGTC GATATTTTGA TCGTTCAAAG TGTTGTGAGA ACTTGGTTAG CAAGACGACA
CCTGTCATCA CTACGCTCCA GGGCCCAAAG TATTTCCGGC AAAAAGTCAC CAACAGTATC
AAAAAAATAC GCGAATCAAG TGGCGGCGCA ACCTACTGGA AGTCCTCGAC CTGGAGAGGC
CAATCGTAAA TTGGCGACAG GGCAATGCTA CTCCTCGTAT AGGTCTGTCG AAGAGAGTTC
GTTCAGCGCT ATTCTTGGCA ATATAAAGAG CAAGGAGAAC AATCACCTCA TTGTGTTGAT
TACATCTCAG TCTCTCTCGC GCAATCAAGC TTCCACAAGA AGTAATATTG GTACAATCTT
ACGCGTCCAT AATGTCTCAT TCGAGGAAGT GGATGGAGCA AATCCGCTAA CCCGAGGACG
ACGCGACGAA CTCTTTGCTA TATCACAAAT GCGCGGCGTG TACCCGCAGT TCTTTGTGGT
AGACTATGAA ACAGGGCTCA CGTTATTTTT CTGCAACAGT GATTCTTTTT TCGGTGCCAA
CGAAGAAGGC TCTCTACCCA GGATACTCAA TATTGCTGGT GTTGTGCAGA GCGCGATCGG
AGGACATCAA GAAAGAAATA GTACCATAGA CGAAGCTCCT AAAGCAAACA AGCACCTGTT
TGAGCCAAAG AGGCAAAGCT CACATACTAC GGTTTCAATT GACAGTGAAA CTTCGGAGCC
CTCTGTAGGA CGGAACAGTT TGCTTTCGAT GTGGAAAAAT CTTGACAAGA AGAACACATT
AGTATTAAAT GGACACAGGA ATTGA
 
Protein sequence
MDFANGFGDS SDDLMENARI VSPYGDGETQ AFPADPWANF QYALPEGPSQ YKHRASRQRP 
TFGGSRSPSE PIPTPRERMV TSSKISTPHR DGMVNRGHAS RYEYSSDQRC DDVHVIFTDP
SDGVGISVEK RSQQDLLRQK VREDTSRRSL QHSTSAQSMY RKRSENKVEI TSESTNAETD
GLNLNWAFSR VQVRANSETE STTVWLNTDH RQDLEGEDIW LQEEHAFPLL STGASSRFSG
PSGKDGLRKR TDKKSNRVRF AEPIQKPRSV PFLKTVSRET IREESSDPRS KKITHSDNRA
WFVAQPKSIL RRRRFAGETV SHDPQYPQKN RPSSHRAAPQ RKSATSFLDT QGSLLSPIHS
DRRPWDRISE TGSESLSPSY SDVEREKRVS LGPYHLQELN EMYPDPPLEL QFDDESTVVP
ARASFIDTVA AVVVQAAVRR FLAQKVMHEM VGKAYSFPHL ESDDKKYRPL SSRKVTPEKR
SSRKSYAESP YGRNLGGAVF IEVMAAIKIQ SAFRGFWVRD SLNVDHFCAT MIQKWYRRHH
QRHHYFADLS RIILVQSIWR RSIAREHAAF FLGSVITVQS LFRSYSARKK LYSGLTCLRK
DTMAAVVIQS QWRTYACECN FIRDLVDILI VQSVVRTWLA RRHLSSLRSR AQSISGKKSP
TVSKKYANQV AAQPTGSPRP GEANRKLATG QCYSSYRSVE ESSFSAILGN IKSKENNHLI
VLITSQSLSR NQASTRSNIG TILRVHNVSF EEVDGANPLT RGRRDELFAI SQMRGVYPQF
FVVDYETGLT LFFCNSDSFF GANEEGSLPR ILNIAGVVQS AIGGHQERNS TIDEAPKANK
HLFEPKRQSS HTTVSIDSET SEPSVGRNSL LSMWKNLDKK NTLVLNGHRN