Gene PHATRDRAFT_42735 details

Gene Information

Locus tag: PHATRDRAFT_42735
Symbol:
ID: 7196125
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 931544
End bp: 934006
Gene Length: 2463 bp
Protein Length: 796 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002177192
Protein GI: 219110881
COG category:
COG ID:
TIGRFAM ID:
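
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene span includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 931544
END_BP = 934006
PROTEIN_LENGTH_AA = 796


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def noncoding_bases(gene_len: int, protein_len: int) -> int:
    """Bases in the gene span not explained by the codons plus one stop codon."""
    return gene_len - (protein_len + 1) * 3


length = gene_length(START_BP, END_BP)
print(length)  # 2463
print(noncoding_bases(length, PROTEIN_LENGTH_AA))  # 72
```

Under these assumptions the computed length matches the listed Gene Length of 2463 bp, and the 72 bp not accounted for by the 796-residue protein and its stop codon is consistent with the gene being marked as spliced.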


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATTCG TTCTAGAAAC ACTGCTAAAA GACGCTTCCA GAAACGAAGT ACCGCCGGCA 
AAACGGCGAG CAGCACTGAC GAAGGCGTAT GACTTACTGA GTCGGATAGA TAAAGAACTC
ATTGCTCTTA ATGCGCGAAA CACTCCATCT GAAAACGATA GTGAGCAATC GGATACAGAT
GTCGAAACGA AATCAACAAA CGGAATCGCC GTGGCGGTTG CGGGCGCGTC GTCCAAGGCG
TCCATCGATA CCGGACTACA CTCACCTTCC GGGGGGCGAT CCAAGAGTAC TTTGAACGGT
GGGATCACAA CTTCGAGAGC TGCGAGCCCT TCTGATTTGA GAAGTAGTGG GGAAAAATCA
GCGCCGGCGA TACTTGATGG CCCCGTTGCT GCTACAAATG TTCCATCTAC GACGAAAGCC
ACCACCTCCA GTTCTGGGTC GGGACTGTTG GCTACAACGT CGTCGGAAAA GGCACCTTCT
GACGTAAACA TCGCCGTTGT AGCCAAGAAA TCATCAACGA GATGGGGCGA GCCCCTTTCA
AAGAAGATAT CTGCAGGCCG GAATGGTGCA CATCATGAAG TGGTCGAGGC AAATTATCCC
TCTGCTGTAC CAAATAATGA CGCACCGAAA CCTCTACATG AATCTTCTGC AAGAATACAA
ACTGATGGTA AACCGAGATC TGAACATCTG GAAAATGAAT CTCTTCATCT GCCGGGGTCA
GCTTTAAGGG TAGATCCGAT AGCAACAGGG GACGAAGCGA CGAGACCAGC AAGCCCCGTT
CAGACTTCCA AACATTCATG TCGTACCAGC CTACCTCCGT TCATTCCAGA AACAACAAGT
AAGCGCGCTA CCAGCGGTGG TCCTTCTTCA GTGCTCTTGG ACAATACATG TAGGTTGGCA
TCACGTGATT CGGACAAAGA ACAACGTCCT ACGTTGGCAG AAAAGGGAGG CCAAAACATT
GGCGTGCTTG TCTCTAGCAG TACTGTCACA GCGAAGAACG TACAGAACCC CAAACACAAG
GTGGATTTGG ATGAATCCGG CTCATCTAGT AAACTTTCTA GTAAAGGAAT TCAACGACCT
CAAAGAGATC TGTCTCTACG ATGGGAACCG CCGAAACGGA GATCTTTTGA AGTCGAAAGG
AGCGCATCGA ACGATACATT TGATCGACAA ACCAGGAATC GCGAATCGCA TTTGAATGAA
AGAATGGATG ACGCCATATC CCAGAAGGAT GTGACAGCTA ATTTTTACAA TGAACAACGC
TCTAAGCAAG AAAGAGTGAG ACAAGACGCG AACTCGATTT TTCAATCCAA TAGAAAGCCG
CTGGTGAACG ATGTTTGGAT CAATTCCGGC GTTCCAAGTC ATGCAGTAAC AGTCAAAATG
GCCGCTGACA GAAGTCCAAC GTTTTCAGGG TCTTTGTCTG AAAGAGGGGA AATAAACTTG
AAACTCAAAC TGGATCCCTT ACCAACAAGC ATATCCAAAA GCCTACCGGT CCGACTGAAA
AAATGGGATC CTTTTTTTAC TTTCGCGGGG CCTTGCAGAT GCACGTTGAC GGTGCCTGCA
GGCAGCACAG ATGCAACTAA AACAGGATCA TCCTTGAGAA TCAATCTCAA CACACCTATG
TACCAAGAAT TCATCAGAAA AATTCAGCCC GAAATGTGGG GAAAACCTCG CCAGGGTGCT
TTGAAATGGA AGAAGGGAGA TTGGGGATTA ATGTTACGCG CTCTTCCCTT ATCAAGGACC
TCAAAAAATC GAGCTGATTG CCACTTATGG CCTAAGGGAA CTTTTCTGCA ACTTAACGGA
AAGCCACTGC GATTGGCTCA AAGGCAGCAG CAGAGTCATG ATAAGTCTCT ATGGAAGAAT
CAATGCACGC AGTTAGACTT GACTGAACAT GTATCTATGT CTGACCCGAA TGTTTCGATT
GAGATATGTT GCTATGATGA AGAACCGTTT ATCTTAATGG TGGGGTTTTG CAGATATGAA
TCTGCCGATT CGATATTCTC CACGATACGA AACCCAAATA ACGGGCTACT GAACCGAGTC
ACTGTAAAAG AAGGGATGCA GCGAGCCATT CAAAGAGCAT CTGGACAGAT GCACATTATT
GATGGGAGCG ATGGTGAGAA GGTAGAAGAA GTAGGCAAGT TTGTTTTCAG CTTGACTTGC
CCTATTTCCA AGGCTTTAAT GAACTCTCCT GTTAGAGGAA GGTCATGCAA ACACTGGCAG
GTATGTGACT AGATGGTCTG CCTTCTTCGT AGTATTCTAC AATTCTTATA ACTCGATTTG
TATTCACCAC AGTGTTTTGA TTTGAAGACA TACTTGGATG CAAATCAAAG AGTCACGGGC
AGCCGTTGGC GTTGTGCTTC ATGCGAGTTG TTTGTGCCAT ATGATGAGCT CGAAGTGTGT
GAGTTTACGC TAGCCGCGTT GCAGCGATAC AGAAATGAAG CATCCACGGC CGAGATCGCA
TAG
 
Protein sequence
MEFVLETLLK DASRNEVPPA KRRAALTKAY DLLSRIDKEL IALNARNTPS ENDSEQSDTD 
VETKSTNGIA VAVAGASSKA SIDTGLHSPS GGRSKSTLNG GITTSRAASP SDLRSSGEKS
APAILDGPVA ATNVPSTTKA TTSSSGSGLL ATTSSEKAPS DVNIAVVAKK SSTRWGEPLS
KKISAGRNGA HHEVVEANYP SAVPNNDAPK PLHESSARIQ TDGKPRSEHL ENESLHLPGS
ALRVDPIATG DEATRPASPV QTSKHSCRTS LPPFIPETTS KRATSGGPSS VLLDNTCRLA
SRDSDKEQRP TLAEKGGQNI GVLVSSSTVT AKNVQNPKHK VDLDESGSSS KLSSKGIQRP
QRDLSLRWEP PKRRSFEVER SASNDTFDRQ TRNRESHLNE RMDDAISQKD VTANFYNEQR
SKQERVRQDA NSIFQSNRKP LVNDVWINSG VPSHAVTVKM AADRSPTFSG SLSERGEINL
KLKLDPLPTS ISKSLPVRLK KWDPFFTFAG PCRCTLTVPA GSTDATKTGS SLRINLNTPM
YQEFIRKIQP EMWGKPRQGA LKWKKGDWGL MLRALPLSRT SKNRADCHLW PKGTFLQLNG
KPLRLAQRQQ QSHDKSLWKN QCTQLDLTEH VSMSDPNVSI EICCYDEEPF ILMVGFCRYE
SADSIFSTIR NPNNGLLNRV TVKEGMQRAI QRASGQMHII DGSDGEKVEE VGKFVFSLTC
PISKALMNSP VRGRSCKHWQ CFDLKTYLDA NQRVTGSRWR CASCELFVPY DELEVCEFTL
AALQRYRNEA STAEIA