Gene PHATRDRAFT_21789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_21789 
Symbol 
ID7202678 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp352157 
End bp354042 
Gene Length1886 bp 
Protein Length559 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181891 
Protein GI219123145 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.207672 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGCAAGGAAC GACGGTAGAT GCATTCCTAG TCTTTTCATT GTCCTTACAA AACCATTCAT 
CGCACACCTA TCTCATAATC ATGATGCAAG GAGGAGGACC CATGGGACAG GTGATGGTCT
TGAGTAAGTG TAAAGCACGG GGGGCGGTGA ACACGAATTA GTTTAGGTAC CCACCGCAAA
CACACAGACA CAAACTGGCC TCACACGTTT CTGGCCTTTT TCATCCTTCG TCTTTTTCGA
TTCCACCAGA CACCAAAACC CAACGCCAAA CGGGTCGCCA GGCCCAACTC GCCAACATCC
AAGCCGCCAA AGCCGTCGCC AACATTATCC GCACGACCCT CGGTCCCCGC AGTATGCTCA
AGATGCTACT CGATCCCATG GGTTCCATCG TATTGACCAA CGACGGGCAC TGCATTCTGC
GCGAAGTCGA CGTCTCCCAT CCCACCGCCA AGTCCATGAT TGAACTCAGT CGCTCGCAGG
ATGAAGAAGT CGGGGATGGC ACCACGTCCG TCATTGTCCT CGCCGCCGAA GTACTCGCAC
AGGCCGAACC CTACCTGCGA CAAGACGCCA TGCACCCGAC CGTCCTCGTG TCGGCCTACA
CCAAGGCACT CGCACAGGCC ATGATCATTC TGGAGGAACA AAGTGTCACC ATTGATGTCG
AAAAAGACCA CGAACTCATG AAACTGCTCG TCCAAAGCTC CTTGGGCACC AAGTTCTCCT
CCCGCTGGAA CGATCAAATG GTCGAAATGG CACTGCAGGC CGTCCTCACC GTTTCCCAAA
AACGAGCCAC CGCAGCCGAT GGTGTACTCA AACAAGTCGA GATTGATATC AAACGCTACG
CCAAGGTGGA AAAAATACCC GGTGGCGAAA TACAGGAGTG CGCCGTACTG GAGGGTGTCA
TGTTCCAGAA AGACGTGGTG CACGCCAATA TGCGCCGCCG CATTGAAAAT CCCAAAATAT
TGTTACTGGA TACGCCGCTC GAGTACAAAA AGGGCGAGTC ACAGACGAAC ATGGAAATCA
CGGACGAAAA TGATTGGAAT ACGCTGCTCA AATTGGAAGA AGAGTACGTT GCCAACATGT
GCGCGCAAAT TATTGCGGCG CAACCGGATA TTGTGGTGAC GGAAAAGGGC GTCAGCGACT
TGGCCCAGCA CTATCTGCAC AAGGCCAACA TCGTCGCCTT TCGCCGGGTA CGGAAAACGG
ATAACAATCG GATTGCACGA GCCGTGGGTG CCACCATTGT GTCCCGCACG GACGAAATCG
ATGATTCCGA CATTGGAACC GGCTGTGGCT TGTTCGAAAT GCGACAAATT GGATCGGATT
GGTTCTGCTA CCTCACCAAG TGCAAAGAAC CCAAGGCCTG TACTATTGTA CTCCGCGGTG
GCTCGAAGGA TGTTTTGAAC GAATTGGAAC GCAATTTGCA GGATGCCATG CAGGTGGTGC
GTAACGTTGT CTTTTCGCCC AAGCTTGTAC CAGGCGGCGG GGCCATCGAA ATGGCTCTCG
CCGTGGGCCT CAAACGCACC GGCCAAAAGG TCCAAGGCAT TCAGCAAGGC CCCTACATGG
CGGTCGGCGA AGCTTTGGAA GTCATTCCCC GCACACTGGC ACAAAATTGT GGCGTTTCCG
TCATACGCGT GCTGACTGCC CTGCGGGCCA AGCACGCGGC CGCCTACGAC GAAGCCCAGG
ATAAGACCGG AAGCGACGAC AGCAAGCAGG CCGCCTTTTG TTCGTGGGGA ATTAACGGTA
CGACGGGTGA ATTGGTAGAT ATGAAGGAGT TGGGTATTTG GGAGCCGTTT GCGGTGAAAG
CACAAACCAT CAAGACGGCC ATTGAAAGTG CCTGTATGAT TCTACGCATT GACGACATTG
TATCGGGGTC CAAAAAGCGC GGGTGA
 
Protein sequence
MMQGGGPMGQ VMVLNTKTQR QTGRQAQLAN IQAAKAVANI IRTTLGPRSM LKMLLDPMGS 
IVLTNDGHCI LREVDVSHPT AKSMIELSRS QDEEVGDGTT SVIVLAAEVL AQAEPYLRQD
AMHPTVLVSA YTKALAQAMI ILEEQSVTID VEKDHELMKL LVQSSLGTKF SSRWNDQMVE
MALQAVLTVS QKRATAADGV LKQVEIDIKR YAKVEKIPGG EIQECAVLEG VMFQKDVVHA
NMRRRIENPK ILLLDTPLEY KKGESQTNME ITDENDWNTL LKLEEEYVAN MCAQIIAAQP
DIVVTEKGVS DLAQHYLHKA NIVAFRRVRK TDNNRIARAV GATIVSRTDE IDDSDIGTGC
GLFEMRQIGS DWFCYLTKCK EPKACTIVLR GGSKDVLNEL ERNLQDAMQV VRNVVFSPKL
VPGGGAIEMA LAVGLKRTGQ KVQGIQQGPY MAVGEALEVI PRTLAQNCGV SVIRVLTALR
AKHAAAYDEA QDKTGSDDSK QAAFCSWGIN GTTGELVDMK ELGIWEPFAV KAQTIKTAIE
SACMILRIDD IVSGSKKRG