Gene Information |
Locus tag | PHATR_46641 |
Symbol | |
ID | 7204570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 57925 |
End bp | 61245 |
Gene Length | 3321 bp |
Protein Length | 1036 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185815 |
Protein GI | 219121172 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.335573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATTCCAGGA ATCATCCTCA CTGTCAGCAC AACATCTTGG TTTTCATAGT AAACCTTTGT ATCCGACTAA TGTAGCAGCG GTCGTGCACT ATACGTGTCT CCAGAAGCTG CAACATGGAA CAAGGGAGTA CGTGCAACAG CTCAGCACCA GCAGAGACAG CGGCGGCGCC GTGGCTGACC TGCTTTCACG ATACTATGAT TATCTTTCCA AGAAACGTTG TCCGATCGAC GGTATCACGA TGGAAGCTGC CGTATTCGAC TACGCTTTTT GGTGCTTGTA GCAATATTGG GTACATCGCG TTTCGTTGCG AGTCTTCTAA AGATGTGATA CCGCTGGTAC TGCTGATACT GTTTATCGCA ATAGCCTCGG TTTTACCAAA GTGCTATCGA AAGCGCTGCA AAGAAACAGT ACCGACAACA TTGCACGATC AATGGACACT TTCTTCAACC ACACAGCGCA CAACGTGCGT TCCCGACGTC GTTGACCAAC CCCAACACGA GCAACACTTT TACCTGAATC AACTCCGGCG CAAACGAGAC TTTCTGGCCC GTGCCGGAGA CGATTACGGC TATCGCAACT CACCCGCAGG GTTTATTGAC GATTGGAGAG ACTTCGAATT TCCTCTCCTG ATATCTCCTA TCCGGCTGGA TAGCAAAGTT CCCTTCACAG AGCCAGGTCC TGCCGTGTCC AAAAATACTA AATGCGGTCC GAATCCCAGT CCTGACGATA ATACTTGTGA ACAGCAGGTC TACGCAGATT ACGCGGGTGC CGCCTTGCCA ACACGATCCC AATGGAAAGC TACCACCAAC GAGACAGATT CGCCTCTACT GTTGGCTAAT CCTCATTCCA CGGGGCCCTC TGCTGGACAC ACATCCCTTC TGATCGAGCA AGCAAAAAAA AGAATTCTCG AGTTCTTTTC TGCCACTCCT GGACAGTTCG GTGGCCCACT ATCGAGCCGA GCTCTCCCGG ACACCACCGG CATTAGTAGA AAAGAAAACC AGCAGTATCC GCAACGACAA GAGACATTTC ATCCCGGCTA CGAGATTGTT TTCACATCCA ATGCGACCGA CGCGCTTCGC ATTGTTGCCG AAAGGTTTCC CTGGAAGACA GCGAAATCAA CGTCTTGTCA ATGTCAATGC CAGCAAGCGT CAACCCTAGT TTACCCACAA GATTCACACA CGTCTGTGGT AGGAATGCGC GGACCAGTCC TACAGAAAGG TGGCCGCTTG ATGTGCAGGC CAGCGACCGA CTTGCTGTTG GATATGGACA AGCCACAAGG TGTACGCACA TGGAGTAGTC TCCCCAATGA AGGGCTCCGG GAAAGTCACC CGTGCAATTG CTGCAAGAAT GAGGTGACAA ATCATTTGTT AGTATTGGCT GCCGAATGCA ACTTTAGTGG CGACCGGAAG AATGTGAAAC GCGCATTTCG ACGAGTACGC GAAAGCTCGA ATGCTATAGA GACGAGTGAT CGCTGGTTTA CCATGCTCGA CATGGCCAAG GCCGCCAGTT CAGCTCCAAT CAACTTACGT TCCCTTGATC CAGATTTTGC CTGCGTCTCC TTTTACAAAT TGTTTGGAAT GCCGACGGGG TTAGGGTGCT TGTTGGTCAA GCGAGGAGCG GCTGTGGAGC TTTTGAAGGA GAATCAGAAT ATATATTTTG GCGGGGGTTC TGTGGATGTT TTGCTACCAA GCACCGACTT TGTGGTGCAT CGATCTGGGC CAACATCTTT GGCGTCTCTT ACAAATGGTA CTGTTCACTT TCGCGGCATA GCCTCCTTGG TTCACGGGTT TGATGCCCTG 
GCCCATGTTG GAGGAATGCA TTCCATTGAA GGTCACACTG TCACTCTAGC TAGAGAATTT GCTAGTCGAA TCAGCGCAAT GCAACACGCG AACTGCCGAC CATTGGTGGA GATACACAGT TCATGGGCTA AGGCAGGAAA AGCTCTTCGT CATGGACCCA CGGTAACCTT CAACGTGCTG CGTAGCGATG GAGCGTACGT GGGCTTCAAC GAAGTCTCTA AACTGGCAGC ATTAAATCGA CCACCCATAC AGATGAGAAC AGGATGTTTC TGTAATCCTG GTGCCTGTCA GCTTGCTCTA GGACTCAGTG ACAATGATGT CCGGCATAAT TACGAAGCTT CCGGTCACGT TTGTGGTGAC CAAATGGATG TAATCAACGG TCGACCTACA GGAGCGATCC GTGTCAGCTT TGGCAAAGAT AGTATCTGGG AGGATGCGGA CGCAATCGTC ACTTTTCTGG AGCGGATCTT CGTATCGGTT CAAAGTTTGG ACAATAAGTC CAATGTTGGC TGGGATGCCT CGCCTCGTCG GGTCATGCTG TCCGAAATGT ACATTTTTCC AATTAAAAGT TGCGCCGCCT TTCGTGTGAA ACGGTGGAAA TTTGACGCCA TAAGTACGAA ACCCGATTTT GATCGAGAAT TTGCCTTGGT CGATTCGTCT GGAACAGCCA TGCGCCTCCA ATCCTACCCC AAAATGGCAT ACATACAGCC GCATATTGAT GTATCGCGAC GTGTAATGAC CGTTCATGCC CCCGGGCAGT CACCGTTGGA GCTTCACTTG GACACGGATT CTTCGAACAC GATCGAGATT GACAGCGTCG TGAAGATTTG CGGCAACCGT TGCGGAGCTC GCGTTTGGGG GGATTGTAGT GTCTCAGAGT GGTTCAGCTC CTTCCTTGGT GTCCAGTGTT GGTTGGCCCG TCATTCTGTC TATGGAAACC AACAAGTTTC GAGCAAATAT GCTGTTCCGT CAACAGCAGC AAGACGTCAA AGTGTAGCTT TTGCAAACGA GCAACCCATT TTGCTGATAT CTGAGCACGC AGTTGATACT CTAAATGAGT CTCTGAGGGC ACAGCACCAA AAGCAAGTCA GCTCCAGGCA TTTCCGGCCC AATATGGTGG TCAGGCTAGT CGGGCAACAA TTTCAGAACG ACGCTCTTCA TGCCGAAGAC GCCTGGAGCA CAATACAGAA TAATTCCAAG GAGATTGTCT TTGACGTTGT CGGACCATGC GCTCGTTGCT CGATGGTAGA TGTGGACCCA TCCTCAGGAA TGAAAGGAAA CACGCTGCGT GCGCTTGCCG AGTATCGACG CCAAAACGGA CAGATCATTT TTGGAATCTT CCTCAAGGGT AGAACTGCTC GCTCAGAGTC TGAGAAGCGA GCCGACATGT GGCTGGAGGA AGGGGATTTT TTCCTTTCCG AATGAAGTTT GGAAAAGGGC TATTGTGTAG CTGTTTGTGC ACCGTAACAA CGAAATCGGT TCCTACTTTA ATTAATGTAA CATGTTATTG CTTTACATTC A
|
Protein sequence | MEQGSTCNSS APAETAAAPW LTCFHDTMII FPRNVVRSTV SRWKLPYSTT LFGACSNIGY IAFRCESSKD VIPLVLLILF IAIASVLPKC YRKRCKETVP TTLHDQWTLS STTQRTTCVP DVVDQPQHEQ HFYLNQLRRK RDFLARAGDD YGYRNSPAGF IDDWRDFEFP LLISPIRLDS KVPFTEPGPA VSKNTKCGPN PSPDDNTCEQ QVYADYAGAA LPTRSQWKAT TNETDSPLLL ANPHSTGPSA GHTSLLIEQA KKRILEFFSA TPGQFGGPLS SRALPDTTGI SRKENQQYPQ RQETFHPGYE IVFTSNATDA LRIVAERFPW KTAKSTSCQC QCQQASTLVY PQDSHTSVVG MRGPVLQKGG RLMCRPATDL LLDMDKPQGV RTWSSLPNEG LRESHPCNCC KNEVTNHLLV LAAECNFSGD RKNVKRAFRR VRESSNAIET SDRWFTMLDM AKAASSAPIN LRSLDPDFAC VSFYKLFGMP TGLGCLLVKR GAAVELLKEN QNIYFGGGSV DVLLPSTDFV VHRSGPTSLA SLTNGTVHFR GIASLVHGFD ALAHVGGMHS IEGHTVTLAR EFASRISAMQ HANCRPLVEI HSSWAKAGKA LRHGPTVTFN VLRSDGAYVG FNEVSKLAAL NRPPIQMRTG CFCNPGACQL ALGLSDNDVR HNYEASGHVC GDQMDVINGR PTGAIRVSFG KDSIWEDADA IVTFLERIFV SVQSLDNKSN VGWDASPRRV MLSEMYIFPI KSCAAFRVKR WKFDAISTKP DFDREFALVD SSGTAMRLQS YPKMAYIQPH IDVSRRVMTV HAPGQSPLEL HLDTDSSNTI EIDSVVKICG NRCGARVWGD CSVSEWFSSF LGVQCWLARH SVYGNQQVSS KYAVPSTAAR RQSVAFANEQ PILLISEHAV DTLNESLRAQ HQKQVSSRHF RPNMVVRLVG QQFQNDALHA EDAWSTIQNN SKEIVFDVVG PCARCSMVDV DPSSGMKGNT LRALAEYRRQ NGQIIFGIFL KGRTARSESE KRADMWLEEG DFFLSE
|
| |
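Consistency check (illustrative) |
The coordinate, length, and GC fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is not part of the source database: the helper names `inclusive_length`, `coding_bases`, and `gc_percent` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon. Under those assumptions, the 3321 bp gene span is 210 bp longer than the 3111 bp needed to encode 1036 residues plus a stop codon, which agrees with the listed gene sequence beginning upstream of the ATG start codon and continuing past the TGA stop codon.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both counted
    (assumption: 1-based inclusive coordinates, as the record's values suggest)."""
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int) -> int:
    """Bases needed to encode a protein of the given length plus one stop codon
    (assumption: the listed protein length excludes the stop codon)."""
    return (protein_aa + 1) * 3


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; the spaces that group the
    sequence into 10-base blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Values copied from the Gene Information table of this record.
gene_len = inclusive_length(57925, 61245)
orf_len = coding_bases(1036)

print(gene_len)            # 3321, matching the "Gene Length" field
print(orf_len)             # 3111
print(gene_len - orf_len)  # 210 bases outside the open reading frame

# GC percentage of the first two 10-base blocks of the gene sequence only.
print(gc_percent("AATTCCAGGA ATCATCCTCA"))  # 40.0
```

Applying `gc_percent` to the full gene sequence should give a value comparable to the GC content field, although whether the database computed that figure over exactly this span is not stated in the record.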