Gene PHATRDRAFT_46375 details


Gene Information

Locus tag: PHATRDRAFT_46375
Symbol:
ID: 7201637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011678
Strand:
Start bp: 149603
End bp: 150824
Gene Length: 1222 bp
Protein Length: 403 aa
Translation table:
GC content: 55%
IMG OID:
Product: predicted protein
Protein accession: XP_002180953
Protein GI: 219120429
COG category:
COG ID:
TIGRFAM ID:
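
The Gene Length entry is consistent with reading Start bp and End bp as inclusive coordinates (End bp - Start bp + 1). A minimal Python sketch of that check follows; the function name `gene_length` is a hypothetical helper introduced here for illustration and is not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Gene Information entries above.
print(gene_length(149603, 150824))  # 1222
```

The result matches the listed Gene Length of 1222 bp.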


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.389874
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTTGATCGCC ATGAAGCTGG CGTTCAATCT ATCTACAACG GCACTATCGC TTTGTGCTGT 
CTGCTGTTGC AATGTGCTTC TTACGCAGGC CCAGACGCCG CTGATTCTAC TCGCCGGCCA
ATCCAACATG GTTGGCAGTC CCAACGAGGC CGGAGAAAGG TACGAGCAAG ACGTGCTCGC
CCGCACTAAT ACCATCCAGC CGGGATTGAC TTTCGACAAC CTACTGCCCT TGCTCGTCTC
ACCAGCTATG CAGCAGACCA ACATAACAAC GGAGTTTCGT CGAGAACTCG TCGACCTGCT
CACACGAATC TTAGGATACG TCCACAACGG TACCGATGTC GTTCCCGAAC ACCAAGTCGA
CCTCTTGATC GAGCTGGGAC AACGGTTTCC TACTCTGTAC ACGTCTCTCG CAGACCCACT
CGGGAATGTT TATTGCTCGG ATGTGACTCC GCCCATCGCG GTACAAGAAT TCATGTCCGC
GGTGCCTTTG TCTCCCAACA GTGGATGCGG TAATCCGTAC GGCCCCGAAC TAGTCTTGGG
CCACACGCTC GGATTCCTTC CCGACGGGAA AGAGTCGGGC AGTGACCTAT CTTTCATCAT
GCCCAAGGTT TCTCGTGGAG GCACACAAAT CCGCGGAAAC TGGTCCAAAG CGGAAGGTGA
CTTGTGGAGT ACACTCCAAA GTCGTATCGC GCACATCGAC TCCGTCAGTA CCCAGTGCCA
AACCGGCAGC GGATGCTCGT GGGATGCCTT TGTTTGGTTT CAGGGAGAGA ACGATTCCAT
GGACCAGCTC AACGCCGAGA ACTACGAAGG CGATTTAATC ACCTTTTTGG CGGACGTTCG
CGCGGAATTG TTCGCGGCGG GCACGCGCTA TGCCGCACCG GAGGAAATCC CTGTCGTCAT
TGTGCAGATT GGTAGCTTTT TCCGCGCCCG TGAGTTCGGA ACCGTCGTGG CACGCGCCCA
AGCCAGTGTG GCTGCTAGCG ACGCGTTTGC CAGCATTGTA TGGACGGACG ATTTGGGTAC
GTTCTATCAC TACGACGCCT CGTCACAGTT GATCATTGGC GATCGCGTGG CTCGAGCCCT
GGAAGGCCTG TGGAAAGATA CCATCGTCAT GCCCCCTTTG GATCGGTCCT GGTTTGGCAG
CTGTCAGCAA AATGGTCAAA AGTGCCACAC GAATGCCGCT TGCTGTGGTG GTAAATGCGC
CTATTCAAGA TGTGGTACAT GA
 
Protein sequence
MKLAFNLSTT ALSLCAVCCC NVLLTQAQTP LILLAGQSNM VGSPNEAGER YEQDVLARTN 
TIQPGLTFDN LLPLLVSPAM QQTNITTEFR RELVDLLTRI LGYVHNGTDV VPEHQVDLLI
ELGQRFPTLY TSLADPLGNV YCSDVTPPIA VQEFMSAVPL SPNSGCGNPY GPELVLGHTL
GFLPDGKESG SDLSFIMPKV SRGGTQIRGN WSKAEGDLWS TLQSRIAHID SVSTQCQTGS
GCSWDAFVWF QGENDSMDQL NAENYEGDLI TFLADVRAEL FAAGTRYAAP EEIPVVIVQI
GSFFRAREFG TVVARAQASV AASDAFASIV WTDDLGTFYH YDASSQLIIG DRVARALEGL
WKDTIVMPPL DRSWFGSCQQ NGQKCHTNAA CCGGKCAYSR CGT
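
Both sequences above are printed in groups of 10 letters, so the spacing has to be removed before the text can be used in other tools. The following is a minimal Python sketch of that cleanup, with a simple GC percentage calculation added for nucleotide input. The names `clean_sequence` and `gc_percent` are hypothetical helpers, not functions of any particular library, and the sketch assumes the sequence contains only plain letters and whitespace.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence printed in grouped form."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a small example.
first_line = "TTTGATCGCC ATGAAGCTGG CGTTCAATCT ATCTACAACG GCACTATCGC TTTGTGCTGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Passing the full gene sequence block through `clean_sequence` and then `gc_percent` gives a value that can be compared against the GC content entry in the Gene Information section.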