Gene Information |
Locus tag | PHATRDRAFT_27838 |
Symbol | |
ID | 7201503 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 910054 |
End bp | 913669 |
Gene Length | 3616 bp |
Protein Length | 1040 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180730 |
Protein GI | 219119960 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
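The coordinate fields above can be cross-checked against the reported gene length. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which this record does not state outright but which matches its 3616 bp figure. The function name `gene_length` and the assumption that the coding region ends in one stop codon are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
length_bp = gene_length(910054, 913669)
print(length_bp)  # 3616, matching the "Gene Length" field

# Assumption: the coding sequence is 1040 codons plus one stop codon.
coding_nt = (1040 + 1) * 3
# Bases in the gene span not accounted for by coding sequence
# (plausibly introns, since the record marks the gene as spliced).
non_coding_nt = length_bp - coding_nt
print(coding_nt, non_coding_nt)  # 3123 493
```

The 493-base difference is consistent with the "Is gene spliced | Yes" field, although the record itself does not say how those bases are distributed.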
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGGTTTCGG TGCGTGTAGG TAAGAATGGA TCGAGAGGAA GACCTATGGA TTGGACTAGG ATACACACTA TCGCTGCTAC TGATGGACAG TGTAGGAATA CAGGAGACAA TGTGCTGCGA CGGTCGAAAG TCGGTCGTTT CGGTAGTGGG AATGCGTTTT CGTTCCCCAG TCATTTGCAC TGTTCACGGT ACTCTTCCTG TCCCACTTTA TACTCACATA CACGCTGACG CTCGTCGTGC TTGCGCCACA GATTTTTTGT TGGAATACGT ACGGGAACGA ATACATCATT ATCATGTCTT TGGAAGCCAA ATTTGCCGCT CTCAAGCTCG ACGACGTGGA CAGTATTGTA CAAGCCGTCC AGAAGGACGG GGTCGAAAAG TCGGGTCTGG CCGCCAACAT TGGTGTGTTG GCCGCTCGTT GCGCCTCGAG CGACGACGAC GAAGCCTTGG TGGCACTCAA AACTACCAAG ACTTTGGTGG AACAGTGCCC TACCGCACAA GCCTTCACCA AGGACTGTTT GACAGCTTGT AAGTAACGAC TCATCTTTGG CTGGTTCACA CCGGAAGCGA ATCTGTGTGG AATCTCTGCG CGACTCGCAT CGAACGTGCA TTCTCACGTT CCATTTCATG ACTTTTGTTG TTGGCAGGTC TGGAACAAGC CTTGTCCAAG AATACAGATG TCCGCAATAC CGCCGAAGAA ACGGCCTTTG CCATTTGCGA AAACATCAAT CCGTTCGCCA TGAAGTCTTT GCTGCCCCAG ATCTTTGCCC AGCTTCCCGT AGAGAAGAAG TGGCAAATCC GGGAGTTGGG ACTCAAGTGC ATCGCCAAGT TCAACAAGAC CGCCCCGCGC CAGTTGGGTG ATGCCCTCCC GGAGGTCATT CCCGAAGTCA CCGCCTGTAT GTGGGACACC AAGAAGCAGG TCAAGATTGC CGCCACCGCA GCCATGGAAG CCGCTCTCCA AGTTATTGGG AACCGCGATA TCGAACACAT GACGGACAAA ATTTTGGTCG CCATCACCAA GCCCAAGGAA GTGCCCGAAA TTATGCACAA GATGGCGGGA GTTACCTTTG TCCAATCGGT CGAATCACCC GCCTTGGCCA TGGTCGTGCC CCTTTTGTTG CGTGGACTAC GGGAAAAGCA GATTGCTACC AAGCGTCAGT CAGCCGTTAT TATCAACAAC ATGAGTAAGC TGGTCGACAA CCCGCTTGAC GCCGCTCCTT TCCTGCCCCT CTTGCTGCCG GCTCTACAGA CCAACGCGGA AAGTATTGCC GACCCCGAGG CCCGCGAAGT CACCGAACTG GCCGTGGCGC AGTTGAATCG ACTGAAAGGA CTCGCGGACA AGCAATTCTC CGTCCGAGGA GATATCTCCA AGCTCGAAGA CGAATTTAAG AAGGCTTTGG GCGCCGAAAA TGCGGAAGGA GGCCTCTTGG TCGTTATCAA GCAAGCCTCG ACCATTGCCA CCACAATGAT GGACTTGCAC TTTATGGAAG ATGTGCAGTG GACCAAGCAT GTTTCGTCGC AGTTTGTGGA CTATTTGGAT AAGGCCAAGG TGGAAGCCGG TATCGAAAAG GTCCGTGAAG AAGCCGAAAA GATGTTGGTC GTCCCCGAGG AAGATGAAGA CGAGGACGAC TCCGAGGAAC TTTGCAACTG TACCTTCACA CTCGCCTACG GTACCAAGAT TTTGTTGCAC AACACCAAAA TGCGCCTCAA GCGTGGTAAG CGCTACGGTC TGTTGGGACC CAACGATTGC GGAAAGACCA CGCTCATGCG AGCCATTGCC AATAACCAGG TCGAGGGATT 
CCCCGATACT GGTCAGGTTC GTACCGTATT CGTCGAAGCC GATATTCAGG GTGAACAGTC CCACTTGTCG TGTGTCGACT ACGTTCTACA CGATCCCAAG ATTGAGGCTT TGGGCATTAC CACGGAAGAA GTGCGCAACG TGCTCGCGAC GGTTGGTTTC ACCGAAGACG GCAAGGCCAA GCCTAACCAC GCCGTGTCGA CTTTGAGTGG AGGATGGCGC ATGAAATTGG CCCTGGCGCG TGCCATGCTT CAAAAGGCAG ATATTCTTCT GCTCGACGAA CCTACCAACC ATTTGGATGT CATTAACGTT GCCTGGGTGA AGACCTACCT GAACTCATTG ACCAACGTGA CGTCGATTAT TGTCAGTCAC GATTCCGGTC TCTTGAACGA TTGCTGCACC CACATTCTTT CTTTCGACAA CCTGAAGCTC AGCACTTTCA AAGGAAATCT CGACGAGTAC GTCAAGGCCC ACCCGGCTGC CCGTGCCTAC TTTACCTTGA GCGACTCCAA GATGAAGTTC AAGTTTCCCC AGCCGGGACC GATCGAGGGT GTCAAGTCCA AGGGTAAGGC CTTGATGAAG ATGGCCAACT GCACCTTCAC CTATCCCGTC AACGACAAGC CGACGCTTTT CGACATTACT GTGCAAGTAT CACTCAGTTC CCGTATCGCT TGTATTGGGG AGAACGGTGC TGGCAAGTCA ACAATGATCA AGCTATTGGT CGGTGAGATT GAGCCGCAAG TTGGTGACGT CTGGAAGCAC CCCAACGCTC GTGTCGCCTA CGTCGCACAG CACGCCTTTC ACCACATTGA ATCCCACTTG GATAAGACCC CGAACGAGTA CATTCGTTGG CGTTTTGCCA ACAATGGTGA AGACAAGGAG TCACTGGTCA AGGTATCTCT ACAATTTTCC GATGAAGAAA TTAAGCTGCA AAAGGCCCCC TTTGAGATTC AGGTGGTAGA CGAAGCCAGT GGAAAAATTT CGAAGATCAA GAAGGTTGTG GGTGAGCTCA TGGGAGGACG TAAACAGAAC AAGAGCAAAG AATACGAGTA CGAAGTCCGT TACGCCGGCT CGACCGTTGA CTCGGGTGAA TATTTGTCGT CGAAGATCCT CAAGAAGATG GGTTGGGAAA AGGCCATGAA GGCTGTCGAT CTCAAGATTG CCCAGACTGC CGGTATGTTC ATCCGTCCTT TGTCGACCAA GAATGTCGAA GAGCATTTGG AAGGCTGCGG GTTGGGACGT GAGTTCGGTA CCCATTACCG TATGTCGGCG TTGTCCGGAG GACAGAAGGT GAAGGTTGTG TTGGCTGCTG CCATGTGGAT GCAGCCACAC ATTGTCATTC TAGACGAGCC CACGAACTAC CTGGATCGCG AATCGCTAGG TGCCCTGGCT GGTGCGATTG AGGAGTTTGA TGGTGGAGTG ATCATCATCT CGCACAACAA CGAATTCGTT TCCAAGCTGT GTCCCGAAAC CTGGGTCATG GACGCCGGTC ACCTCGAGAC CAAGGGTGAC GCTGACTGGA TGTTGAAGCA GGATTCTAAG ATTTCTGATC AAATGCAGAT CAATACGGAC GTTACCGATG CGGCCGGTAA CAAGATCGAG ATCAAGCAGG ACAAAAAGAA GCTGTCCAAG AAGGAAGAGA AGGCACTCAT CAAGAAAATT AAGGCCAAGA TGAAGGCTGG TGAGGCCCTT GACAGTGAAG AAGAAGAACT TGCCTTTGAG AAGGAACTGG TGTAAAGCTC ATTTGAATTC TCTATGGGGC AGTCTGCTTT TGTAGTCCAG TGACGACTTT ATATATAGAA CGAACGAACG AACAAA
|
Protein sequence | MSLEAKFAAL KLDDVDSIVQ AVQKDGVEKS GLAANIGVLA ARCASSDDDE ALVALKTTKT LVEQCPTAQA FTKDCLTACL EQALSKNTDV RNTAEETAFA ICENINPFAM KSLLPQIFAQ LPVEKKWQIR ELGLKCIAKF NKTAPRQLGD ALPEVIPEVT ACMWDTKKQV KIAATAAMEA ALQVIGNRDI EHMTDKILVA ITKPKEVPEI MHKMAGVTFV QSVESPALAM VVPLLLRGLR EKQIATKRQS AVIINNMSKL VDNPLDAAPF LPLLLPALQT NAESIADPEA REVTELAVAQ LNRLKGLADK QFSVRGDISK LEDEFKKALG AENAEGGLLV VIKQASTIAT TMMDLHFMED VQWTKHVSSQ FVDYLDKAKV EAGIEKVREE AEKMLVVPEE DEDEDDSEEL CNCTFTLAYG TKILLHNTKM RLKRGKRYGL LGPNDCGKTT LMRAIANNQV EGFPDTGQVR TVFVEADIQG EQSHLSCVDY VLHDPKIEAL GITTEEVRNV LATVGFTEDG KAKPNHAVST LSGGWRMKLA LARAMLQKAD ILLLDEPTNH LDVINVAWVK TYLNSLTNVT SIIVSHDSGL LNDCCTHILS FDNLKLSTFK GNLDEYVKAH PAARAYFTLS DSKMKFKFPQ PGPIEGVKSK GKALMKMANC TFTYPVNDKP TLFDITVQVS LSSRIACIGE NGAGKSTMIK LLVGEIEPQV GDVWKHPNAR VAYVAQHAFH HIESHLDKTP NEYIRWRFAN NGEDKESLVK VSLQFSDEEI KLQKAPFEIQ VVDEASGKIS KIKKVVGELM GGRKQNKSKE YEYEVRYAGS TVDSGEYLSS KILKKMGWEK AMKAVDLKIA QTAGMFIRPL STKNVEEHLE GCGLGREFGT HYRMSALSGG QKVKVVLAAA MWMQPHIVIL DEPTNYLDRE SLGALAGAIE EFDGGVIIIS HNNEFVSKLC PETWVMDAGH LETKGDADWM LKQDSKISDQ MQINTDVTDA AGNKIEIKQD KKKLSKKEEK ALIKKIKAKM KAGEALDSEE EELAFEKELV
|
| |
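A GC content figure such as the 52% reported above is normally the fraction of G and C bases in the nucleotide sequence. The following is a minimal sketch of that calculation for a sequence printed in space-separated blocks of ten, as in this record. The helper name `gc_content` is hypothetical, and the example string is only the first two blocks of the gene sequence, so its result is not expected to equal the value for the whole gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C among the A, C, G, T characters in seq.

    Whitespace and any other characters (such as block separators)
    are ignored.
    """
    bases = [c for c in seq.upper() if c in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for c in bases if c in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence in this record.
example = "TCGGTTTCGG TGCGTGTAGG"
print(f"{gc_content(example):.0%}")  # 60% for this short prefix only
```

Passing the full Gene sequence field to the same function gives the value to compare with the reported GC content.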