Gene PHATR_46641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_46641 
Symbol 
ID7204570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp57925 
End bp61245 
Gene Length3321 bp 
Protein Length1036 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185815 
Protein GI219121172 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.335573 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AATTCCAGGA ATCATCCTCA CTGTCAGCAC AACATCTTGG TTTTCATAGT AAACCTTTGT 
ATCCGACTAA TGTAGCAGCG GTCGTGCACT ATACGTGTCT CCAGAAGCTG CAACATGGAA
CAAGGGAGTA CGTGCAACAG CTCAGCACCA GCAGAGACAG CGGCGGCGCC GTGGCTGACC
TGCTTTCACG ATACTATGAT TATCTTTCCA AGAAACGTTG TCCGATCGAC GGTATCACGA
TGGAAGCTGC CGTATTCGAC TACGCTTTTT GGTGCTTGTA GCAATATTGG GTACATCGCG
TTTCGTTGCG AGTCTTCTAA AGATGTGATA CCGCTGGTAC TGCTGATACT GTTTATCGCA
ATAGCCTCGG TTTTACCAAA GTGCTATCGA AAGCGCTGCA AAGAAACAGT ACCGACAACA
TTGCACGATC AATGGACACT TTCTTCAACC ACACAGCGCA CAACGTGCGT TCCCGACGTC
GTTGACCAAC CCCAACACGA GCAACACTTT TACCTGAATC AACTCCGGCG CAAACGAGAC
TTTCTGGCCC GTGCCGGAGA CGATTACGGC TATCGCAACT CACCCGCAGG GTTTATTGAC
GATTGGAGAG ACTTCGAATT TCCTCTCCTG ATATCTCCTA TCCGGCTGGA TAGCAAAGTT
CCCTTCACAG AGCCAGGTCC TGCCGTGTCC AAAAATACTA AATGCGGTCC GAATCCCAGT
CCTGACGATA ATACTTGTGA ACAGCAGGTC TACGCAGATT ACGCGGGTGC CGCCTTGCCA
ACACGATCCC AATGGAAAGC TACCACCAAC GAGACAGATT CGCCTCTACT GTTGGCTAAT
CCTCATTCCA CGGGGCCCTC TGCTGGACAC ACATCCCTTC TGATCGAGCA AGCAAAAAAA
AGAATTCTCG AGTTCTTTTC TGCCACTCCT GGACAGTTCG GTGGCCCACT ATCGAGCCGA
GCTCTCCCGG ACACCACCGG CATTAGTAGA AAAGAAAACC AGCAGTATCC GCAACGACAA
GAGACATTTC ATCCCGGCTA CGAGATTGTT TTCACATCCA ATGCGACCGA CGCGCTTCGC
ATTGTTGCCG AAAGGTTTCC CTGGAAGACA GCGAAATCAA CGTCTTGTCA ATGTCAATGC
CAGCAAGCGT CAACCCTAGT TTACCCACAA GATTCACACA CGTCTGTGGT AGGAATGCGC
GGACCAGTCC TACAGAAAGG TGGCCGCTTG ATGTGCAGGC CAGCGACCGA CTTGCTGTTG
GATATGGACA AGCCACAAGG TGTACGCACA TGGAGTAGTC TCCCCAATGA AGGGCTCCGG
GAAAGTCACC CGTGCAATTG CTGCAAGAAT GAGGTGACAA ATCATTTGTT AGTATTGGCT
GCCGAATGCA ACTTTAGTGG CGACCGGAAG AATGTGAAAC GCGCATTTCG ACGAGTACGC
GAAAGCTCGA ATGCTATAGA GACGAGTGAT CGCTGGTTTA CCATGCTCGA CATGGCCAAG
GCCGCCAGTT CAGCTCCAAT CAACTTACGT TCCCTTGATC CAGATTTTGC CTGCGTCTCC
TTTTACAAAT TGTTTGGAAT GCCGACGGGG TTAGGGTGCT TGTTGGTCAA GCGAGGAGCG
GCTGTGGAGC TTTTGAAGGA GAATCAGAAT ATATATTTTG GCGGGGGTTC TGTGGATGTT
TTGCTACCAA GCACCGACTT TGTGGTGCAT CGATCTGGGC CAACATCTTT GGCGTCTCTT
ACAAATGGTA CTGTTCACTT TCGCGGCATA GCCTCCTTGG TTCACGGGTT TGATGCCCTG
GCCCATGTTG GAGGAATGCA TTCCATTGAA GGTCACACTG TCACTCTAGC TAGAGAATTT
GCTAGTCGAA TCAGCGCAAT GCAACACGCG AACTGCCGAC CATTGGTGGA GATACACAGT
TCATGGGCTA AGGCAGGAAA AGCTCTTCGT CATGGACCCA CGGTAACCTT CAACGTGCTG
CGTAGCGATG GAGCGTACGT GGGCTTCAAC GAAGTCTCTA AACTGGCAGC ATTAAATCGA
CCACCCATAC AGATGAGAAC AGGATGTTTC TGTAATCCTG GTGCCTGTCA GCTTGCTCTA
GGACTCAGTG ACAATGATGT CCGGCATAAT TACGAAGCTT CCGGTCACGT TTGTGGTGAC
CAAATGGATG TAATCAACGG TCGACCTACA GGAGCGATCC GTGTCAGCTT TGGCAAAGAT
AGTATCTGGG AGGATGCGGA CGCAATCGTC ACTTTTCTGG AGCGGATCTT CGTATCGGTT
CAAAGTTTGG ACAATAAGTC CAATGTTGGC TGGGATGCCT CGCCTCGTCG GGTCATGCTG
TCCGAAATGT ACATTTTTCC AATTAAAAGT TGCGCCGCCT TTCGTGTGAA ACGGTGGAAA
TTTGACGCCA TAAGTACGAA ACCCGATTTT GATCGAGAAT TTGCCTTGGT CGATTCGTCT
GGAACAGCCA TGCGCCTCCA ATCCTACCCC AAAATGGCAT ACATACAGCC GCATATTGAT
GTATCGCGAC GTGTAATGAC CGTTCATGCC CCCGGGCAGT CACCGTTGGA GCTTCACTTG
GACACGGATT CTTCGAACAC GATCGAGATT GACAGCGTCG TGAAGATTTG CGGCAACCGT
TGCGGAGCTC GCGTTTGGGG GGATTGTAGT GTCTCAGAGT GGTTCAGCTC CTTCCTTGGT
GTCCAGTGTT GGTTGGCCCG TCATTCTGTC TATGGAAACC AACAAGTTTC GAGCAAATAT
GCTGTTCCGT CAACAGCAGC AAGACGTCAA AGTGTAGCTT TTGCAAACGA GCAACCCATT
TTGCTGATAT CTGAGCACGC AGTTGATACT CTAAATGAGT CTCTGAGGGC ACAGCACCAA
AAGCAAGTCA GCTCCAGGCA TTTCCGGCCC AATATGGTGG TCAGGCTAGT CGGGCAACAA
TTTCAGAACG ACGCTCTTCA TGCCGAAGAC GCCTGGAGCA CAATACAGAA TAATTCCAAG
GAGATTGTCT TTGACGTTGT CGGACCATGC GCTCGTTGCT CGATGGTAGA TGTGGACCCA
TCCTCAGGAA TGAAAGGAAA CACGCTGCGT GCGCTTGCCG AGTATCGACG CCAAAACGGA
CAGATCATTT TTGGAATCTT CCTCAAGGGT AGAACTGCTC GCTCAGAGTC TGAGAAGCGA
GCCGACATGT GGCTGGAGGA AGGGGATTTT TTCCTTTCCG AATGAAGTTT GGAAAAGGGC
TATTGTGTAG CTGTTTGTGC ACCGTAACAA CGAAATCGGT TCCTACTTTA ATTAATGTAA
CATGTTATTG CTTTACATTC A
 
Protein sequence
MEQGSTCNSS APAETAAAPW LTCFHDTMII FPRNVVRSTV SRWKLPYSTT LFGACSNIGY 
IAFRCESSKD VIPLVLLILF IAIASVLPKC YRKRCKETVP TTLHDQWTLS STTQRTTCVP
DVVDQPQHEQ HFYLNQLRRK RDFLARAGDD YGYRNSPAGF IDDWRDFEFP LLISPIRLDS
KVPFTEPGPA VSKNTKCGPN PSPDDNTCEQ QVYADYAGAA LPTRSQWKAT TNETDSPLLL
ANPHSTGPSA GHTSLLIEQA KKRILEFFSA TPGQFGGPLS SRALPDTTGI SRKENQQYPQ
RQETFHPGYE IVFTSNATDA LRIVAERFPW KTAKSTSCQC QCQQASTLVY PQDSHTSVVG
MRGPVLQKGG RLMCRPATDL LLDMDKPQGV RTWSSLPNEG LRESHPCNCC KNEVTNHLLV
LAAECNFSGD RKNVKRAFRR VRESSNAIET SDRWFTMLDM AKAASSAPIN LRSLDPDFAC
VSFYKLFGMP TGLGCLLVKR GAAVELLKEN QNIYFGGGSV DVLLPSTDFV VHRSGPTSLA
SLTNGTVHFR GIASLVHGFD ALAHVGGMHS IEGHTVTLAR EFASRISAMQ HANCRPLVEI
HSSWAKAGKA LRHGPTVTFN VLRSDGAYVG FNEVSKLAAL NRPPIQMRTG CFCNPGACQL
ALGLSDNDVR HNYEASGHVC GDQMDVINGR PTGAIRVSFG KDSIWEDADA IVTFLERIFV
SVQSLDNKSN VGWDASPRRV MLSEMYIFPI KSCAAFRVKR WKFDAISTKP DFDREFALVD
SSGTAMRLQS YPKMAYIQPH IDVSRRVMTV HAPGQSPLEL HLDTDSSNTI EIDSVVKICG
NRCGARVWGD CSVSEWFSSF LGVQCWLARH SVYGNQQVSS KYAVPSTAAR RQSVAFANEQ
PILLISEHAV DTLNESLRAQ HQKQVSSRHF RPNMVVRLVG QQFQNDALHA EDAWSTIQNN
SKEIVFDVVG PCARCSMVDV DPSSGMKGNT LRALAEYRRQ NGQIIFGIFL KGRTARSESE
KRADMWLEEG DFFLSE