Gene PHATRDRAFT_43224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43224 
Symbol 
ID7196585 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2376385 
End bp2379590 
Gene Length3206 bp 
Protein Length835 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177501 
Protein GI219111499 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTTGCCACAA CAGCAACACG ACTCTCTATC GTGGGCAACA ACGATTTCAT TCGGTAGACT 
CGGCGTACTT CGTTGGTGAA GGGTGCATAC TTCTTCGACT ACCGAGAGAC TTGCGCCTTT
TGCCATCTTC TCTCGTCTCT TTGTTGGTTT CCTACATCCA GTGAAGATTG CTTGTCGGTG
ATAGTCTTTA ACAACAACAA CACCATCCAA CAACATTCAT CAGCACTATG GGGGAAAAAT
ACCGATCGTG GTACACGAAC ACTTGGATCG CGTGGAGCTG TGGGGTCGCT CTATGGCTGA
GTAGAACGAC GATTGCTGCC GGATCTACCG AAGAATTCTA CCTTTATTAC GGCGACACGG
ATGCCTCGAC GTACGTGCAA GAGTACCGGG CGACCAAAGA CGGGATTCCT CTCGACGGAT
TGTTCGAACA ATCCGAACAT CCCCGCATTG TCGAGTTCTA CTCACCGTAT TGTGTAAGTG
AAGCGAAAGC GCTTAGCTAG CCCTTGTGCA TTGCGGAGAC GACTCTTGCT GTGTACCGTT
GTACATTGGT AGGAGGTCTT TAGTCGACTA CCGTGCGATG GACTGCTTTT AAACTCACAA
TGTCACTCAC ACCTGGCCAC TCCTACACGC ATTGTTTCGT GCCATGCTGT AATTGACTTA
CTTTTTTTGT GCGTTAGCCG CACTGTCGTC ACTTCAAACC CAAATACGTC CGATTGGCGC
GAGACGTGGG ACAGAAGTAT CCCGACGTGG AGTTTTACGC GGTTTCCTGT GTCGCTCACA
GTGACATGTG TCAGAAATAC AACGGTACGT CTCTGTGTGT GTGTGTGTGT GTGTGTGCGA
AGTTGACGTG GACGTGGGGG TATATGGGAA TGCGAGTGGA ATCACTCTGA ACGTTTCGCT
TTCGTCTTGC TCCATTGACT ACCTTGCGTT ACTTCGTTCT GCCACTCACG TCTTTTCCGT
CTACTTTCAA TTGGTTTTCG ACTGCAATGT GGTGTTAAAC AGTTCGAGGG TTTCCGCAGA
TCTTGGCCCT GTCGGCCTCT ACTGCCGAAC CGACAGTGTT GTCCAAGAAC TCTTACACGG
TGCGAGGCAT TGCCTCCGCC TTGAAGCTCG GACCGGTCCT TGGCGGTACG TCCACCGGGA
CCGTCGGTCG GACGGCCCGA CTTCTGGAAG ACGGTACCGA AGCAGGAGAC TCGGACGAGC
CACCCAACGA TGACTCGGAG CAAACGGTGG ACCAACCCGA CGAAAACTCC AACGACCCGA
GTCTGGACGG CAAAGTATCT TTCGAACTTC ACGTTCCTAT TCCGGATCTG CGACCCCACC
CCACCGGTGA CGACGTGCCC AATTCCAACG ATCCGAGTGC CGACGAAAAA ACAGAAGACG
AACCCGCCGA CTCGCAAGAT ACACGGAAGC ACTACTCGGG CGAGGACCCC ACCGTCGGGA
ACGGTCTTTT TAACGTGGAT ATTGATGATT CCCCCGATGA AGGCTTTGAT CAATCCGCCG
ACGATCGAGA CGACCCTGAC GCGGAAGAAT ACTACAGTGA CAACCGCGAC GACGAAGATC
TCGACGAAAG CGAAGACCAG GAGAGTGAAC AAGACCCCTA CGGAATGGAC CTATTCTTCG
AGGCTCAGAA ACGTGGCGGT GACGAGTCGG AGTCCTCCGA CCTTCCCGGT CCCGTCCAGC
GCAAACCCGG AGCCGTCACC GCGCAGTTTG CCCCTCGAGA TATGGACAAG TACCGCGACG
TCCTACGCCA GAAACGGGCC GCCGAACAGA AGAAACGTAA GTTTGTCGGA CTGCGAAAGG
ATCGTCAAGT AAAGCCACTG GGTCCTGTAG TCCGGGACGG TGCCTCGAAG GCTATGAGGG
CCAATACGCC GGGAACACTG GAATACAAAC AACGCCAGAA AGAGTCCTCG GAACGTTTGG
CAAAGTTGCT GGAGAAAAAG TTTGGCAAGA AAAGGGCCGC CAAGTACGTT TCGTCTAAAG
GTATCGGAAA CCGTATCGGG AATTACACGT CGCTGCCCTT CAAAAAGGAA GTGGCCAAGC
CGCGCTTGGT GGAGCAGCTT CCAATACTCA AAAGAGTAGT ACGCATGGAT GACGAGGAGA
TGCTCATCCT CGACTCCTCC CTGTCGTTTC TCCGAGGGCT CCGCTATGGA GTCTTTCAAG
ATCCAAAACC ATTGACAGGC AAAAAGAAGC GTGCGCTCAA AGACTGGTTG GATTTACTGA
GTGGCTCGCT GCCACAGGAA TGGGGACTCC ACGAGATCAT TGACGATCTT TTGGACAACC
TGGATTATAT TGCCCAAGGG AGCAAGAACT TGCACGATAT TCTCGACAAA CACCCGATTC
AACGCAGGAA CTGGAGTCGC TCTTGCACCA AAGGTGGTCG AAACGTCAAC GGTTTTACTT
GCGGATTCTG GAAGTTGCTG CACGTCATGA CTGTTGGTGT GGCAGAACAC CGAGGTGGCA
AGAACCTTGT GGCTACGGGA CTGCGTCGGG ACATCCGTGT CTTCGCGCCC ATGGAAGCTG
CGGATACTCT GCGCGAGTAC ATGGCGCATT TCTTTAGCTG TACCGAATGC TCGAAGCACT
TTCTGGTCCA GTACGATCAG TGTGACATGA ATCGCCGCTG CGGTCGTCTC GCTACGGATG
CGCACGATGC TACCGATTCG GACTGGAAGG AATTGGCGAA ATGGCTCTGG GAATTCCACA
ACGACGTGAG TGTCCATGTT TTGAACGAAC GCACGGACAA CAAACGCAAA CAAATGCAGC
AACGCACATG GCGTCGTGCG GAGTCCGGTC CCGGAGCAGC AGGACTGTTT GAACAGGTCA
GCGTGGTTTG GCCTTCGACC TTGTCATGTA CAGAATGCAT CAAGGCGGAC GGTACGTTCG
ACGAGGATGC CGTCTTTACG TACTTGGAAC AAACCTACTG GCCTGGTTTG GAAGATTCCA
TTGATCGTGT AATACAGTTT TACGACGAAC ACGAGTCCGG ATCCAAGGTC TTGACACTCA
TCCTGTTGTG CATTGGTGCG TACTTGGCTT TCGTGATGCG TAAGAGTCTC GGTCCGAAAA
GTCTCCAACA ATCCCTGATT ATGGCACGGA AAATGAGGCC GAAAGGCTCC GTCGGCGTGG
ACAAGCGTTC GGTTTGAGAT CTCCGTTTAG CCTTTGTTTT AGATAACAAA CCGTCAATCT
ACTTAGTTTC ACAGTCAAGC AGCTCA
 
Protein sequence
MGEKYRSWYT NTWIAWSCGV ALWLSRTTIA AGSTEEFYLY YGDTDASTYV QEYRATKDGI 
PLDGLFEQSE HPRIVEFYSP YCPHCRHFKP KYVRLARDVG QKYPDVEFYA VSCVAHSDMC
QKYNVRGFPQ ILALSASTAE PTVLSKNSYT VRGIASALKL GPVLGGTSTG TVGRTARLLE
DGTEAGDSDE PPNDDSEQTV DQPDENSNDP SLDGKVSFEL HVPIPDLRPH PTGDDVPNSN
DPSADEKTED EPADSQDTRK HYSGEDPTVG NGLFNVDIDD SPDEGFDQSA DDRDDPDAEE
YYSDNRDDED LDESEDQESE QDPYGMDLFF EAQKRGGDES ESSDLPGPVQ RKPGAVTAQF
APRDMDKYRD VLRQKRAAEQ KKRKFVGLRK DRQVKPLGPV VRDGASKAMR ANTPGTLEYK
QRQKESSERL AKLLEKKFGK KRAAKYVSSK GIGNRIGNYT SLPFKKEVAK PRLVEQLPIL
KRVVRMDDEE MLILDSSLSF LRGLRYGVFQ DPKPLTGKKK RALKDWLDLL SGSLPQEWGL
HEIIDDLLDN LDYIAQGSKN LHDILDKHPI QRRNWSRSCT KGGRNVNGFT CGFWKLLHVM
TVGVAEHRGG KNLVATGLRR DIRVFAPMEA ADTLREYMAH FFSCTECSKH FLVQYDQCDM
NRRCGRLATD AHDATDSDWK ELAKWLWEFH NDVSVHVLNE RTDNKRKQMQ QRTWRRAESG
PGAAGLFEQV SVVWPSTLSC TECIKADGTF DEDAVFTYLE QTYWPGLEDS IDRVIQFYDE
HESGSKVLTL ILLCIGAYLA FVMRKSLGPK SLQQSLIMAR KMRPKGSVGV DKRSV