Gene PHATRDRAFT_50643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50643 
Symbol 
ID7199480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011701 
Strand
Start bp73698 
End bp76075 
Gene Length2378 bp 
Protein Length724 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185613 
Protein GI219130947 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGATTCAGTT ACCGATTAAT TACAATACAA TCGGGGCCTT ACGAGGCTTT CCCCTTTTAT 
TCATCTTACT CCCTCATTGT ATCAACAATA CATTGCGTGA GTATAGAAGA ATCCGTTAGT
TTATCAGACC CAGCAATCAT CGCTGGCGTT GCTCTCACCA ACATACTTAC TTTACACGAT
TGTCTCTCTT TTTGAACTGA GCCATGGCCC GAGTTCGCAA GGCAACCGGT CCTACCCGGA
AGGGAGCGAC CGAAACGGTG CCGGAGGAGC GAGTGGAAGA AGAAACGCCC TTTGAGGCCG
TTGAGTCGCC GTCCAAGGAC AGTGACAATG AGACGCAACC ATCGTCCATG GGCGATGACA
ATGACTCACA GTCTGAGATC GAGTCGTACA AGATTGATAC CGACATTGAT TTCAAGTACA
ACCCAAACTT TTTTGAGGAC AAGAAAGCCC TTGAAAGTGT TCTAAGGAAT ACTATGGGAT
TTGGAGATAT CCATGTGAAG TCACTCCAAA ACGAAGGTTT GAAGACCGCA AATGATTTCT
TGCTTATTTC TATGAGTGAC ATCAATGATC TTTGCGACAA GCTTTTGTTT GCAACAGTTT
ACAGGGCTCG CCTACGGGCA TTTGCTACAT GGTTACGTAG TCAACCCGAC AACATAAATA
TTACCCAAGA ATGGACAATT CCAGTTATGC AATTGGAAAT GCAGATGAAG GCGCAAGCGT
CTCCATTTGG AACCTCCGAG ACCAACAAAA CAGACAAGTC AGTCTCCAGT CTGGTGCCTG
ATCCCTTTGA TGGTACACAG AAGAAGTGGC TCGCCTTTCG ATACAGTTTT GAGGCATGGG
CCGGAGCAAG TGGGCAATCT TTTGATGCCT GTATCTCACA TGACTCGGAG CGATATTCCC
GTTCAGAACC AACAGCGACC TACAATGACA TCAATGACGA ACCTGATTCA TTTAAATATG
ACTGGAACGT TAAGTCAGTT CGCAATTCAA ACATCTTTTT TATGCTCAAG TCGCTCACAA
GCGGCGGAGA TGCATGGGGC CTTATCGAAC CTTACGAGGT TTCAAAAAAT GGCCGTCATG
CCTGGATCGC CTTGTGTGCG TTCTATGAAG GGGCCAGTCA GGTGGGCTTA ACCACAGAAG
AAGCTCGCAC TACAATTCTG ACATCGAAGT ATACCGGACA ATCCCGGAAC TTCACTTTTA
CCAAGTATGT TCAAAAGCAT CTTACTGGTA ACAACATATT GGCTCGCAAC AAAGAGGCCT
ACACGGACTC ACAGAAAACA AACTTTTTCC TACGGGGAAT TGTTGATCCT GAACTTATGG
CATTCAAGGC AGCTGCTGAA GCTAACCTAA ATGAATGGAA GTTCGAACGC GTTGTCACGT
ACATGCGTAC TCAAGCCGCC AAGCTCACGA GCAAGGACGG TAAGGATTCC CGAAACATTC
GTCAGGCTAC GGGCTTGTCG AAAAACAGGA ACAACAAAAA CAACCGGCGC AAGCGCTCGG
AATACCAAAG CCAAGGCAAA GGTAATAAAG AGTCGGGCAA AGGAAACAAT GCTCCTAGTA
CTCAACTCCG CAAGGACATC TGGGATGAAT TGTCTCCCGA GATAAAGGAT GCCATCAAAG
CGGCAAAGCG TAGAGCGTCT ACGGACCCGC GCACGGCTAA AAGAGCCAAG ACTAGTAGTA
CGGATAACTC TAACGCAAGC GTTGAGTCCT ACTCGCCTGA TTTAAGGTCA ATGTCTACTG
AAATATTTAA AGCAGATGAT GACAAGGACT TGGCTTCAGG TCAGCCTGAG GCGAAAGATA
CACCACTTCA TTTGGAACTT GAAGATACGC TTAAGAAACC TACATATGGA GCAGGTACCC
TATTTGGGCG ATCTGCTGAC AGGGTCTCCT TTAATCGTAT GGTATGCAGT TCAGAAGAAA
ACAAAGTCAC TCCTTGGCGC ATGTCAGAAC TACGGCTTGC GGATGCAACA ATAAGACGCA
TTTGTAAGAA TCGCACACGA AATCCTACCG GCCGTTCAAC ATGGGGCGAA GCTGCCATTG
ATACTGGTGC CGACACAATT TGCATTGGTT CAGGCTATAC TGTACTTGCC CATACAGGTC
GATATGTGAG TCTGCGAGGT TTTCATGACA GTGGTGATAC TCTTGATCGA ATTCCAGTTG
TGACGGCTGC TACAGCATAT GACTACGATG ACGGAACCAC CGTTATTCTG GTTTTCCATG
AAGCTTTGAA TCTTGGGCCT ACACAGTCCA CATCTCTCAT CAACTTGAAT CAGATTCGGC
ACGCCGGACA TCAGACTGAT GACATTCCGA AGTTTTTATC CCAAGGGAAA TCTCTTCACG
GAATTGAAAC AATTGATGGC GACTACATTC CTTTTTAA
 
Protein sequence
MARVRKATGP TRKGATETVP EERVEEETPF EAVESPSKDS DNETQPSSMG DDNDSQSEIE 
SYKIDTDIDF KYNPNFFEDK KALESVLRNT MGFGDIHVKS LQNEGLKTAN DFLLISMSDI
NDLCDKLLFA TVYRARLRAF ATWLRSQPDN INITQEWTIP VMQLEMQMKA QASPFGTSET
NKTDKSVSSL VPDPFDGTQK KWLAFRYSFE AWAGASGQSF DACISHDSER YSRSEPTATY
NDINDEPDSF KYDWNVKSVR NSNIFFMLKS LTSGGDAWGL IEPYEVSKNG RHAWIALCAF
YEGASQVGLT TEEARTTILT SKYTGQSRNF TFTKYVQKHL TGNNILARNK EAYTDSQKTN
FFLRGIVDPE LMAFKAAAEA NLNEWKFERV VTYMRTQAAK LTSKDGKDSR NIRQATGLSK
NRNNKNNRRK RSEYQSQGKG NKESGKGNNA PSTQLRKDIW DELSPEIKDA IKAAKRRAST
DPRTAKRAKT SSTDNSNASV ESYSPDLRSM STEIFKADDD KDLASGQPEA KDTPLHLELE
DTLKKPTYGA GTLFGRSADR VSFNRMVCSS EENKVTPWRM SELRLADATI RRICKNRTRN
PTGRSTWGEA AIDTGADTIC IGSGYTVLAH TGRYVSLRGF HDSGDTLDRI PVVTAATAYD
YDDGTTVILV FHEALNLGPT QSTSLINLNQ IRHAGHQTDD IPKFLSQGKS LHGIETIDGD
YIPF