Gene PHATRDRAFT_48300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48300 
Symbol 
ID7203729 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp56489 
End bp59446 
Gene Length2958 bp 
Protein Length746 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182889 
Protein GI219125231 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0439212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG CGGCTGTACT TTTCATCATC ACGTACGTAT TGTGAGAGAG ACGGAGAGAC 
GATTTCCACC AAATCCTATT CTAGCGACGA TCAATCATCT TGTCGACACC AAAAAGTAGC
CTCTGTATCA AAGGTAGCAC GATATCGGTT TCCATCATCT TCTGGCCTCA AATCTCTTCG
TAGTGAGTAG TTGCTGAACT AGCAAGCATT AAGAACGCGC TGAAACTTCT CATATCCTTG
TACTAGCTGG CTTTCACGAT TCAGTTCTCG TTGGCCAAAT CGTGGAACTC ACAGTCAAAC
GAAAAGCCAA CCAGCAGTAT ACTACTGAGT CTCTGTCCTT TGTTATATTC GAGACTTAGC
GGGTCGCACT TTATCGACTC CTTGTCGAAT TCTTCAGTCA TGGACGCAAA AAGGCGGTTG
TCGTTCTTTA TGGGGATAGG AATGGTGTTC TTTCCAGGAG CAAACGGCAA CAACTCTTGG
ATCGATATTG AAACACCATT GAAGAAAAGG ACCACCAAGT CTTTGGTTGA TGGCTCAACT
TACCATTTGG TGCGTCTTTT TCTATCGCTA TGTACAGAAA TGCAATTGCA AGCTAAAATC
TCACACCAGA CTTCGTAAAC TACTTTGTGG CTGCGTGCAC AACAGGTCAT GTCAGACGAG
TTTAACGTAG AAAATCGAAC ATTTAAAGAT GGACACGATC CGATGTGGAC GGCCCTAGAT
AGAAGCGACG ATGACGCTTC AAGCGCTGGA GGTGGATCCC TGCAGTTCTA CAATAGCTCC
GCCGTTAGTA CAGAAAATGG CTTTTTAAAA ATTGCCACCT ATCTGGAAAC CACTTCTTGG
ACTCGGTACG ACCACGTCAA CAAGCACTGG AAAACGGAAA GGACGAACTT TACTTCAGGT
ATGGTCCAAT CGTGGAACAA ATTTTGCTTC ACCGGTGGAA TAGTGGAGGT GGACGTTGTT
TTTCCTGGCG AACCATTCAT TGGGGGATTG TGGCCAGCCG TTTGGATGCT GGGTAATCTT
GGACGAGCTA CGTACGAGGC CTCCACCAAC AATATTTGGC CGTGGAGTTT CGATACATGT
GATCGTGAAA TGCAGGATGC CCAAGCCATT TCGGCTTGCA ATCGCGAAAA TCACTACGGA
ATGCATCCGT TTCAAGGGCG AGGTGCCACC GAAATTGATA TCATTGAAGT TATGACCGGT
GATTCCAACG GGCCGTTGCC GTCCACCGAA CCACCCATTA CGTTGCCCTA TGGAGATATG
ACGTTACAGG TACGTTGACT GATAAACTGA TGTTGTTTCT TTATGGCTTG TCCCGGTCCT
TTGGCTTTGA TGAGTGGTAG TCGGGCTCAC ATGTGGTAAT GTTCGCTTTT TTAGGTTGCG
CCCGGTGTAC CAAAAAATCG ACCTCAGAGT GGATCGCTTC CTCTTCGGAA AAATACTTTC
TCCGACAATG GGCATACAGA GTTTTTGGCA AATGTTTGGT ACAAAGATTT GGAAATGCAC
GGCAATACTT CAATCAATCC CTTTTTTTAT GGCACGTATC TTGGAGAAAC CAAACCTGGC
GAACCCGTGA CTCGCGGAAA GCATGAAGCG TTTCAAGCCG ATGCGGTTGG AGCGGCCCAT
CAGTTAACAC CGGCGCACTT TAAGAGACCC CATACCTTTC GGATTGAGTG GCAACCAGGT
AAAGGAGGAC GACTGGATTG GTACACTAAA GGCTATCGCA TGAATGAGAC GACGTACATG
GAAGGTGATG GTGAAGGACA GGAGTGGACA CACGTATTTT CACTCAAGGA TAAATCCTTG
AGCGATTTGA TGGGGTCGCA GATTCCGAAT GAACCAACAT ATCTTATTTT CAATACAGCC
ATATCCAGTA CCTGGGGATT TCCTTACGAT CCGCCCGACT GGTGTCCGAA ATGTTTTGAC
TGCAACGATC CGACTTGCTC TTGTAACTTC TATCCTGGCT TTTGCCAAAT GCTTGACTCG
CGAACGGTTG CCATGTTGAT TGACTCTGTC CGTGTATACC AATCCTTCAA CACTTCGGCA
CACGTTGGCG GCAAGCATAC CTTGGGTTGC GATCCACCGG ATTATCCTAC AAGAGAATGG
ATCACAGGCC ACGAGTATCG ATACATGAGG AACGAGCCTT TTTCATATAA GGACAAAGCG
CATTCTTTAC AACCATTGCA ACGAGGCGGT GGCGTTTGCC GAGACGATTC GGATTGTGGC
GGAAACGTGT CTTTAACGAA TTTAACGGCC GTATACGACA TGCTGGGAAC TGACAGTGAG
CGGAAACTTT TTTCGACGGA ATCGCGAGAG ACGGTTGATC TTGTCATAAG TCAAGGCCAG
TGTGCCTCGC AGACGAACAC ATTCTTCTCA AGCAAGTCAT GGACGGGGAA AGTTTGTCGG
TGTAGAGTGG GCTTTACGGG GCCAATGTGC TTATCGTTGG ATCGCATAGA TACTTTCCCA
AGTGCGCACA AAATAAGGAC AGACGTATCG CCTTTCAATC GGATTGCCAA TTTTGAAGCC
CCAACGTTCA TGCTGACGGC AATCGCCAGC ATGATTGTTA TGCTGCTTTC TATTTTGGTC
TCCAAAGTTG TAGACGAAAA GAAAGCAAGG AAACGAAAGT CGGTGTCGAG ACAGTTCAAA
CGTCCCACTT TTGTCACAAC AAGCAATGAC TCCAACGTCA CCATCATTAC CGGTACCAGC
ATATGAGCAT GATAGCCAAA TACAATGCTA ACAATACCAT TACTAATTGA ATATGCGTTC
CACATCAGAC CACGGTCTCA GGATGTGAAC TTTAAATGTG ATGATTGCTT GATTTAAGCA
ACTTTGAGAC AGTTGTTGTA AACAGACCGG TGGAAGAACA CTAATGTAAT TCCAACAAGG
ACTGGCTGTG AGTTCTAAAT TTGTGGAAAG ACGACTGGTT ATTTTTTCAA AAACATGTAA
TCAATATACG TTAGCCTT
 
Protein sequence
MKKAAVLFII TLCIKGSTIS VSIIFWPQIS SYGSHFIDSL SNSSVMDAKR RLSFFMGIGM 
VFFPGANGNN SWIDIETPLK KRTTKSLVDG STYHLVMSDE FNVENRTFKD GHDPMWTALD
RSDDDASSAG GGSLQFYNSS AVSTENGFLK IATYLETTSW TRYDHVNKHW KTERTNFTSG
MVQSWNKFCF TGGIVEVDVV FPGEPFIGGL WPAVWMLGNL GRATYEASTN NIWPWSFDTC
DREMQDAQAI SACNRENHYG MHPFQGRGAT EIDIIEVMTG DSNGPLPSTE PPITLPYGDM
TLQVAPGVPK NRPQSGSLPL RKNTFSDNGH TEFLANVWYK DLEMHGNTSI NPFFYGTYLG
ETKPGEPVTR GKHEAFQADA VGAAHQLTPA HFKRPHTFRI EWQPGKGGRL DWYTKGYRMN
ETTYMEGDGE GQEWTHVFSL KDKSLSDLMG SQIPNEPTYL IFNTAISSTW GFPYDPPDWC
PKCFDCNDPT CSCNFYPGFC QMLDSRTVAM LIDSVRVYQS FNTSAHVGGK HTLGCDPPDY
PTREWITGHE YRYMRNEPFS YKDKAHSLQP LQRGGGVCRD DSDCGGNVSL TNLTAVYDML
GTDSERKLFS TESRETVDLV ISQGQCASQT NTFFSSKSWT GKVCRCRVGF TGPMCLSLDR
IDTFPSAHKI RTDVSPFNRI ANFEAPTFML TAIASMIVML LSILVSKVVD EKKARKRKSV
SRQFKRPTFV TTSNDSNVTI ITGTSI