Gene PHATRDRAFT_48855 details


Gene Information

Locus tag: PHATRDRAFT_48855
Symbol:
ID: 7194941
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011687
Strand:
Start bp: 486371
End bp: 489395
Gene Length: 3025 bp
Protein Length: 903 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002183493
Protein GI: 219126498
COG category:
COG ID:
TIGRFAM ID:
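
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive. The record does not state this convention, but it is the reading under which the coordinates reproduce the reported Gene Length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 486371
end_bp = 489395
protein_length_aa = 903

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Bases needed to encode the protein (3 per codon), not counting a stop codon.
coding_bp = protein_length_aa * 3

# Bases in the gene span that the protein-coding codons do not account for.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 3025
print(coding_bp)       # 2709
print(non_coding_bp)   # 316
```

The gene span is longer than the 2709 bases that 903 codons require, which is consistent with the record marking the gene as spliced.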


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.105124
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCCGAC TTGTTTGGTC CATAGAAGTC TTGACGACGA CTTTTCTCGC AGGCAACTGC 
GTTTGTTTCC TGTTATCAAT AATACCTTCC GTGGCCGCTT TTGCCGAAAC ACTCCGCGTA
CGACGCGTTC TACCATCGGT CTCGCCACAA GACGCGTATG CGCGCTGCTT GCGGGGATGG
AAGCGGGATA ACCTTGGTCT CGTGGGGGTA CCTCCGCCCA TTCTACTAAG AGACGGCCAT
CCTGATACCG GCGTCGGTAT GCTTCTTTTG CGAATACCGC CCTTCGGCTT GAAAGAGGGC
ATCGTCGGCT GCCAGCGGGA CGAGTGTTCT ACCACCATGA ATTACCAAGT TTTGAATCCG
GGATGGTTTA CTTGGCCGGT TGCTGAGCAT GAAGGATGGA TTCGGTTCGC CTCTGATGGG
GAACAGGGAT GTATTTTGGA ATGGACGGTC CAGTGGACAC CGCTACCATT GCCGCTATCC
TTTTGGAACG GTTTTCTCAA AGCACTAACA ACAGTGATAG TGGAAGCCGC GGCAAGCTAT
GTCGCCAGAG GGGGCTCAGG CGCCTACCAA TCAGTTGCTA TCCGCAAAAA AGAGTAGCCA
AAAGATACTG CCCCTATCTG GTACACGTGT AGATGAATGT AATTAATGCA GCAACGCAGT
ATAGATTAAG AAAGATCGTA AACATCGAAG TCATCCGGAA GAGCTGCGAA TTTAACAAGA
ATATCGTGAT GAGATACTCC TGTAGCTCTA GTGCATGGGA CAACAAAAAA TCCTTCGGTT
TGAACAGATC AGTCCGCGTA CCTACGATGA TGTTCTCGTG TGCTACGGGA GGAGCTTTCG
GCGCGGCATT CCCCTTGGCG GAGGTTGATT CTTCTTGCAC ACGCCGAGCA GGTCGAGCCT
GCACAGGGAC GCTTCCGGGA TTCTGTCTCG TTTGGGCGGA TCCTTCGCCA CCCGGCGTCC
GGAGACTGAT GGGTCGCACT GGCGGCCTGG AAAACTCAGG CTCCATCACT TTGGGGGCGA
TCGAGAAAAT TGATCCTATT GTCTGCAAAG GATGCAAATA ATGTTACATG GTCCTCGAAG
AGCTCACAGT CAGGCATTCG CCAACTAACG TTGGATCTGT TCTGCGTTTT CCAGTGCGAT
AGTATCTTCG GCTCTCGATT ACAGAACTCC GGGTTGTATT GTCACATCAT ACTCACAGTC
GATGGGTAAA GGGGTCTGTC CGGCTGCCCC CGATCACGGA TGTACGGGAC ATCTTGTCCA
GGGACCGTCA AACGAATTTG GGGCACGCTT CCGACATGCA TCACACGACT TCGCGATTGC
AATGATCTGT GCACCCTCCG TTGGTTCATT TCACAGCTCG AGCTCGGAAA TACTGAGGTC
ACTCGAACAA GACAAGATGA GGGAATCTAG TCGAAAACGA CCACTTAGAG CAGAAGATGG
CCTACAATCA TCGTCGTCGG CGTCTGACTT GGACCAACAG CAACAACATC ACCAAGACGA
CGTGGATACC CTGGGCGACG CGACGATTGC TTGGTACGGT TCAAACGAGG CGACGGTAGT
ACTAGGGGCA ACACCTCTGT GCCTTTTAGG ACGTGGAAGA ATACGTCTTA TCTCGGGAAC
TGTCAGCTTG CACGGCTTTT GCCTTACCGA TGAATTTCGA GATTTCGAAA GTCCTATTTA
TGCGAGCTGG CTTACTATTG TCGCCGAAGA GCACGGTAAG AAGGGAGGAG AGGTAGCCAA
GGTAGCAGTG GAATCAACAC GGCGTGTTGC TGGTGTTCTC GTACCAACTT TTGAAATTAA
GCTGGCGACC GAGCCCCATT CACGACCGAC CGTTATTCCC CGTCGATGGA CAGAGTCACT
AGACTCAATT CTACAGGAAC AATTGTGTCG ACAACTGGAA AACGAGAAGG CAAGCCAGCG
GAAAAGACCC ACCGCCTTCA CACGTTTACT CGAAAAGGAG AGCAGTAGTG AGAAACTTGC
TGCCAAGGAA ATGGAACAGT CAGCTCCTGG GTTCAAAGTC GTCGTGGTGG GTGCCAAGAA
CGTCGGCAAG TCCACCTGTT TACGCTACGC CATCAATCGA CATCTGTCGA TGTGCAGCGA
AGTAGCGGTA TTGGACGGAG ACTTGGGACA GCCGGAACTA TCGCCTCCAG GGATGGTGAC
TTTAACACGT TTGCGACAGC CAATCTTTAG CCAGCCGCAT TTACACCTAG TAACGAACGA
AGATAATGCG TCAGCTGCAG CGCCTCGACA CGAAATGGCG TATTGGTTTG GGGCATCTAC
TTCACAAGGG GATCCCGAAA AATACGTGAG CAGTCTCACA AAGCTAGTGC GCTACTATCA
CGAAAAGCTT CTTCCTCAAA AGCCTACGTT GCCTTTGCTG ATCAATCTCG ATGGGTGGGT
CAAAGGACTT GGGATGCAGA TCTTGGAAGC TATCCTGCTA CAAATTCAAC CAACGCACGT
GATTCAGATA CTAGGAGATC TCAATTCCAA AGTCTTCGAA TTGTCGTTAC CTGACGAAAT
TCATCTCCAT ATATGTCACG CCTACCACGT GATACCACCA GAGGAACAAC AAAGAAAAAC
AAAAATGTCT ACGCCCACTT CCGATTCCGA GTCGAAGGAA ACATGTGCGC TGGTCGACAG
GCAACACGAC ACTGGTCGCG TTACTTCTCT GTCCGACATT GCGAGTCTCC CAACAATCCC
CTCGTCCATA CCGGCATCAG CGCTCCGGAC ACTCCGCTTT GTCTCCTACT TTTTGGACGA
TGTTTCCATT TGGGATCGCA TACGCTTCGG TCAAAAGGAA CTGATCGTGG ATATTAATTG
TGTGATTGCC AAACGTTTTG CCTCGCAGAA GCCGTACATA GTACCGTTTG AAGCAATTGC
GGTGGATTTT AGTTCCGACG AGTTTCGACG TGACATATGG ACACCAGAAC GGATTCTGGA
CAGTTTGAAT GGATCCATAG TAGGCTTATG CTGTCGAACG GGCAAATCAG ACGACGAGTT
TGACTGTTGT GTTGGTCTCG GTATT
 
Protein sequence
MVRLVWSIEV LTTTFLAGNC VCFLLSIIPS VAAFAETLRV RRVLPSVSPQ DAYARCLRGW 
KRDNLGLVGV PPPILLRDGH PDTGVGMLLL RIPPFGLKEG IVGCQRDECS TTMNYQVLNP
GWFTWPVAEH EGWIRFASDG EQGCILEWTV QWTPLPLPLS FWNGFLKALT TVIVEAAASY
VARGGSGAYQ SVAIRKKDAW DNKKSFGLNR SVRVPTMMFS CATGGAFGAA FPLAEVDSSC
TRRAGRACTG TLPGFCLVWA DPSPPGVRRL MGRTGGLENS GSITLGAIEK IDPISMGKGV
CPAAPDHGCT GHLVQGPSNE FGARFRHASH DFAIAMICAP SVGSFHSSSS EILRSLEQDK
MRESSRKRPL RAEDGLQSSS SASDLDQQQQ HHQDDVDTLG DATIAWYGSN EATVVLGATP
LCLLGRGRIR LISGTVSLHG FCLTDEFRDF ESPIYASWLT IVAEEHGKKG GEVAKVAVES
TRRVAGVLVP TFEIKLATEP HSRPTVIPRR WTESLDSILQ EQLCRQLENE KASQRKRPTA
FTRLLEKESS SEKLAAKEME QSAPGFKVVV VGAKNVGKST CLRYAINRHL SMCSEVAVLD
GDLGQPELSP PGMVTLTRLR QPIFSQPHLH LVTNEDNASA AAPRHEMAYW FGASTSQGDP
EKYVSSLTKL VRYYHEKLLP QKPTLPLLIN LDGWVKGLGM QILEAILLQI QPTHVIQILG
DLNSKVFELS LPDEIHLHIC HAYHVIPPEE QQRKTKMSTP TSDSESKETC ALVDRQHDTG
RVTSLSDIAS LPTIPSSIPA SALRTLRFVS YFLDDVSIWD RIRFGQKELI VDINCVIAKR
FASQKPYIVP FEAIAVDFSS DEFRRDIWTP ERILDSLNGS IVGLCCRTGK SDDEFDCCVG
LGI
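
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup with a simple GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of the source database or of any library.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, used as a small demonstration input.
first_line = "ATGGTCCGAC TTGTTTGGTC CATAGAAGTC TTGACGACGA CTTTTCTCGC AGGCAACTGC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
```

Run on the full gene sequence, the cleaned length and `gc_percent` result can be checked against the record's Gene Length (3025 bp) and GC content (51%) fields. The same `clean_sequence` helper applies to the protein sequence, whose length can be compared with the Protein Length field (903 aa).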