Gene PHATRDRAFT_50571 details


Gene Information

Locus tag: PHATRDRAFT_50571
Symbol:
ID: 7199359
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011699
Strand:
Start bp: 156295
End bp: 157646
Gene Length: 1352 bp
Protein Length: 346 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002185532
Protein GI: 219130774
COG category:
COG ID:
TIGRFAM ID:
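The reported gene length is consistent with the Start bp and End bp values if the coordinates are read as 1-based and inclusive. The following is a minimal Python sketch of that arithmetic; the inclusive-coordinate convention is an assumption, and the function name `feature_length` is hypothetical rather than part of the source database.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the Gene Information table.
print(feature_length(156295, 157646))  # 1352
```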


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGCATTTACT GCCGGTCCAT GAGGGTATGT ACTCGGACAA AATGCGGAAT TGTCTTGTAC 
ATGATCCAGA GCGGATGCAG TTTGGTGAGG CCCGCTCCCG CTCACAGTCA ATGCAGATTC
GTACGGCACG ACGGCGGACG TGTTGAAAGT CGAAGAGCAT TTCTTCTAAC CATGGCTGGG
ATGGCGTCAG AGCATATACA GCAAACATGG CTACCGAAGC GCATCCGTAT TACAAAAAGT
TGCACCATCT GACGGTGTGC ATGGTACCAC ACGATGAGGA TCGGCGAGTT TGGGAGCAAC
TCACCAAAGT ACGCACACAA TTACGGGATC CCGGACTGTA CCGCTGGCCT CCCCATGCCA
ACCTGTTGTA TCCGTTCTTG GACATTAGGC CCACACCCGA GTCGGGTGAC ACCACACCTA
GAAATGCTAT CAATATGGAT ATAGTCGACG GGTTGAAAAG AGTATGTCAT CTGTATGATC
CGTTCACTGT TCGGCTGGAA AAATTCGGTA CCTTTGGTGG TTCCAAACGC GGCGTCCTTT
GGATATACCC TGATTCAAGG CCTAGAGGTA ATGGAGGTGA CGAGGCTCAC GATGCAGAGC
CGTTGGTAGC TTTACAAGCT TCTTTGGAAG CCCAATTTCC CATGTGTATG GATCAGCGCA
AAACCGGAGC TTTCAGTCCT CATATTACGG TCAGTCACTT TGCTGACTTG AAGTGCGCCA
GGGAGGCGCA AGCTCTCGTT GAAACTGAAT GGCCGACCGA CCTATCGTTC CAAGTGCGGG
AAGTCTACTT GATGCAGCGA TTAGGCGACG ACGGTCAGTT TGAGCGGGTG GCAACTATTG
GACTAGCGGG GCACGAAACA ATTGTTCATC GTCAGCCGCT ACGATTTGCC AACATGCCGG
AAAGCGAAGA GGAATGGGTA CGTCGCGAGC GCATGGCAAT GAAGGACCGC CGAAATAAAG
GATCTCGTGG ACACCGAAGA CAAAGAGAAA GCATCCGAAC CGGTCGACCC TCGAGGGTAA
AGGATTCTCC TGTCGTTATC GAAGCCAAAC GCGCTATGCG CGAAGCCAAG CGGGAGATGT
TGCTAGCTGA AAACAACGGC GAACCTCCAG AAACGGCTAA AATCCACGAG GCAATTTTCA
GGGCGAAGGA AGAGGAGCTC AGAGCGTTTC AGAATTTGCA GGGACAAGCA CAAAGCGAGA
AACAGGTCAC AAGTACACCG GCAACAAACG AGAGCACATC TTCCTAAAGA TGGGTTTGTG
TTGACATTTG CTCGTGTACA ATTTTTAATT ATCCCTCTTG GGGAAATCTC TGTATGTAAT
AACAAGACAT AAACGCTCCG TGAATATCGT GT
 
Protein sequence
MATEAHPYYK KLHHLTVCMV PHDEDRRVWE QLTKVRTQLR DPGLYRWPPH ANLLYPFLDI 
RPTPESGDTT PRNAINMDIV DGLKRVCHLY DPFTVRLEKF GTFGGSKRGV LWIYPDSRPR
GNGGDEAHDA EPLVALQASL EAQFPMCMDQ RKTGAFSPHI TVSHFADLKC AREAQALVET
EWPTDLSFQV REVYLMQRLG DDGQFERVAT IGLAGHETIV HRQPLRFANM PESEEEWVRR
ERMAMKDRRN KGSRGHRRQR ESIRTGRPSR VKDSPVVIEA KRAMREAKRE MLLAENNGEP
PETAKIHEAI FRAKEEELRA FQNLQGQAQS EKQVTSTPAT NESTSS
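
Both sequences are printed in space-separated blocks of ten characters, so the spacing has to be removed before the sequences are measured or passed to other tools. The following is a minimal Python sketch of that cleanup, together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the GC calculation assumes a nucleotide sequence containing only A, C, G and T.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing and line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the bases of a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used as a short demonstration.
first_line = "CGCATTTACT GCCGGTCCAT GAGGGTATGT ACTCGGACAA AATGCGGAAT TGTCTTGTAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applied to the complete gene and protein sequences, `len(clean_sequence(...))` should match the Gene Length (1352 bp) and Protein Length (346 aa) fields of this record.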