Gene PHATRDRAFT_16704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_16704 
Symbol 
ID7199008 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp46189 
End bp49368 
Gene Length3180 bp 
Protein Length1045 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185112 
Protein GI219129893 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTCCG ATACCGAATC GGTTGCCAGC GCAAAAAATA AGTGGGACGA TGTCATATCC 
ACGGGAGCTT CGTCACGTCG AAGGAAAAGA TGGGACGAAA CTCCTGTGAT GGCGGCGTCA
TCATCAGTAG CAGCCACGCC TCTTGTCACG ACCGGGTGTC GATCAAAATG GGACGAGACT
CCTGTATTGG CTTCAGGTGG CGTAGGAGTG ATAAAGACCC CTACGCTTGC TGGCGCTCGG
AACCGTTGGG ATGCAACTCC GTTGTCTACA CAGCCTTTGG CAGGTGCGTC GCAAACTCCC
ATGGGAACGC CGCTAGATAA AGCAATGTTG TTGGAGCGAG AAATGGAATC AAGAAACAGA
CCATGGACTG AAGGTGCTTT GGACGCAATT CTTCCTTCTG AAGCATACAA CATTGTTCGT
CCACCGTCAA CCTACATTCC TCTCCGGACA CCCGGTCGAA AGCTCCTGGC TACACCCACG
CCGATGTCGA TGACCCCTGC GGGCTTCCAG ATGGAAGTGC CCGCTGAGCA GCGCATAGAC
GCGTCAGTTC AAGATATTCG AGAGGCGTAC GGTATCCCTT TGGCCCCGAC TGCCGACGAG
ACGGGTGCTG TAGGATCATT ACCATATATT AAACCCGAGG ACATGCAGTA CTTTGGGCGG
CTTGCCGAGG AAGTCAACGA AGACGATATA TCAAAGGATG AGTTGAAGGA ACGTCAAATT
ATGACAATGC TGTTGAAAAT CAAGAGTGGC ACGCCCCCTC AACGAAAGAC AGCTATGCGA
CAGATTACGG ACAAAGCGCG CTCCTTCGGC GCAGGTCCTC TGTTTAATCA GATACTTCCA
CTTTTGATGA GTCCAGCTTT AGAAGATCAG GAGAGGCATT TGCTTGTCAA AGTTATCGAC
CGGGTCTTGT ATAAGCTTGA CGACTTGGTA CGTCCTTATG TTCACCGGAT TCTCGCTGTG
ATTGAACCGT TGCTAATCGA CGAAGACTAC TATGCCCGTG TCGAAGGTCG GGAGATTATC
AGCAATTTAG CGAAGGCTGC AGGACTTGCT ACGATGATTG CTACGATGAG ACCGGATATC
GACAGTCCCG ATGAGTACGT CCGAAACACG ACTTCACGTG CATTCGCGGT AGTGGCAAGT
GCCCTTGGTG TTCCGGCCCT GCTGCCTTTC CTAAAGGCTG TCTGTCAATC ACGAAAATCC
TGGCAGGCGC GCCATACGGG CATAAAAATT GTGCAGCAAA TAGCTTTGCT CATGGGCGTT
GCTGTGCTCC CCTACCTGAG AGAGTTGGTC GAGATAGTCA GCCATGGTCT TGTTGATGAT
ATGCAGAAGG TTCGGATTGT CACAGCATTA ACCTGCGCAG CTTTAGCGGA AGCCGCTCAT
CCGTACGGAA TCGAGAGTTT TGATCCTGTA ATTCGCCCGC TCTGGAAAGG TACTATGGAG
CAACACGGAA AGGCCCTCGC TGCTTTTCTC AAAGCTGTTG GCTTCGTGAT TCCTCTCATG
GAAGAGAACT ACGCTAGTCA CTACACCCGG CTCGTCATGC CCATTCTTAT TCGTGAATTT
CACTCCCCCG ATGAAGAAAT GAAAAGGATT GTGCTCAAGG TTATCGAACA GTGCGTTGCC
ACGGCTGGGG TCGAACCTGA CTATATTCGC ACAGAGATCC TGCCCGAGTT TTTCCGAAAC
TTTTGGATCC GCCGTATGGC ACTAGATCGT CGCAACTACA ACCAAGTTAT TGAAACCACC
GAAGAATTGG CAAACAAGGT TGGTTGCTCT GATATAATCA TTCGTATCGT GGATGATCTC
AAGGATGACT CAGAACCTTA CCGACGGATG GTGATGGAAA CTTTAAAAAG AGTCTTGAAC
AATTTAGGCG CTAGTGATAT TGACGAACGC CTGGAGGAAC GGCTCATCGA CGGTATTCTA
TATGCCTTCC AGGAGCAAGC TGTGGACGCC AGTAGTACGG GTTCTAATTC TTTTGGCAGG
GAAAGCCAAG TGATGCTGGA GGGCTTCGGA ACTGTTGTAA ACGCTCTCGG CGAGCGATGC
AAGCCATATC TTAAGCAAAT TGCCGGTACC ATCAAATGGC GACTCAACAA TAAGGCGGCA
TCTGTTCGTA TGCAAGCTGC TGATCTTATC GGACGAATCG CAGTTGTGAT GAAGGCCTGT
GGCGAAGACC AATTGATGGG GCATCTGGGC GTTGTCCTTT ACGAATATCT TGGTGAAGAA
TACCCGGAAG TGCTGGGATC TATCCTGGGT GCTCTTCGCG CCATTGTGAA TGTCATTGGG
ATGACAAAGA TGACCCCACC TATCCGCGAC TTGCTTCCGC GTTTGACACC GATTCTGCGG
AATCGCCATG AAAAAGTGCA AGAGAATGTC ATTGATCTGG TCGGCCGTAT TGGGGACCGT
GGTGCAGAGT TTGTGTCGGC TAAGGAGTGG ATGCGTATCT GTTTTGAGCT ACTTGAAATG
TTGAAGGCCC ATAAGAAGGC CATTCGACGT GCTGCAGTTA GTACGTTTGG TTTCATTGCG
AAGGCAATCG GTCCGCAGGA TGTGCTGCAT ACTCTCTTGA ACAACTTGAA AGTACAGGAT
CGGCAGATGC GCGTCTGCAC GACTGTCGCC ATTGCTATCG TTGCAGAAAC GTGTGGGCCA
TTTACAGTCT TACCCGCTTT GATGAACGAA TACCGTGTTC CGGAACTCAA TATACAAAAT
GGTGTATTGA AATCTTTGAG CTTCGTCTTT GAATACATCG GGGATATGGG TAAGGACTAC
GTCTACGCTG TTACCCCGCT CTTAGAAGAT GCCTTGATGG AGCGCGACCC TGTGCATCGT
CAAACGGCAT GCTCCATCGT GAAACATCTT TCACTCGGCG TTGTTGGCTT GGGATGCGAA
GATGCACTAC TTCATTTGTT CAATTACGTC TGGCCAAACA TTTTCGAAGA AAGCCCTCAC
GTTATCCAAG CAGTGTTTGA TGCAGTACAA GCACTCATGG TGGCATTGGG ACCCAACGTA
ATCTTGGCCT ACACAATCCA GGGGCTCTAT CACCCCGCAC GGCGCGTACG GGATACATAT
TGGCGTGTAT TTAACATGCT CTACATTTAC AATGCCGATG CTCTCGTAGC GGGGTACCCT
TCCATGAGGG ACGAAGGTGG AAACACTTAC AAGCGCACTT CTCTTGAACT CTTTATTTAA
 
Protein sequence
MVSDTESVAS AKNKWDDVIS TGASSRRRKR WDETPVMAAS SSVAATPLVT TGCRSKWDET 
PVLASGGVGV IKTPTLAGAR NRWDATPLST QPLAGASQTP MGTPLDKAML LEREMESRNR
PWTEGALDAI LPSEAYNIVR PPSTYIPLRT PGRKLLATPT PMSMTPAGFQ MEVPAEQRID
ASVQDIREAY GSLPYIKPED MQYFGRLAEE VNEDDISKDE LKERQIMTML LKIKSGTPPQ
RKTAMRQITD KARSFGAGPL FNQILPLLMS PALEDQERHL LVKVIDRVLY KLDDLVRPYV
HRILAVIEPL LIDEDYYARV EGREIISNLA KAAGLATMIA TMRPDIDSPD EYVRNTTSRA
FAVVASALGV PALLPFLKAV CQSRKSWQAR HTGIKIVQQI ALLMGVAVLP YLRELVEIVS
HGLVDDMQKV RIVTALTCAA LAEAAHPYGI ESFDPVIRPL WKGTMEQHGK ALAAFLKAVG
FVIPLMEENY ASHYTRLVMP ILIREFHSPD EEMKRIVLKV IEQCVATAGV EPDYIRTEIL
PEFFRNFWIR RMALDRRNYN QVIETTEELA NKVGCSDIII RIVDDLKDDS EPYRRMVMET
LKRVLNNLGA SDIDERLEER LIDGILYAFQ EQAVDASSTG SNSFGRESQV MLEGFGTVVN
ALGERCKPYL KQIAGTIKWR LNNKAASVRM QAADLIGRIA VVMKACGEDQ LMGHLGVVLY
EYLGEEYPEV LGSILGALRA IVNVIGMTKM TPPIRDLLPR LTPILRNRHE KVQENVIDLV
GRIGDRGAEF VSAKEWMRIC FELLEMLKAH KKAIRRAAVS TFGFIAKAIG PQDVLHTLLN
NLKVQDRQMR VCTTVAIAIV AETCGPFTVL PALMNEYRVP ELNIQNGVLK SLSFVFEYIG
DMGKDYVYAV TPLLEDALME RDPVHRQTAC SIVKHLSLGV VGLGCEDALL HLFNYVWPNI
FEESPHVIQA VFDAVQALMV ALGPNVILAY TIQGLYHPAR RVRDTYWRVF NMLYIYNADA
LVAGYPSMRD EGGNTYKRTS LELFI