Gene Information |
Locus tag | PHATRDRAFT_40070 |
Symbol | |
ID | 7195888 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 71755 |
End bp | 74814 |
Gene Length | 3060 bp |
Protein Length | 996 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184179 |
Protein GI | 219127932 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.625757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTATGTAT ACGTGCCCGC TGGTTTCAGC TCTGGACTTC CGGATATTGG AGCCCTACTT GTACGTTCTC ACCCGGGCTC GTCTGTCGTC GTACCACTCT CGTTAACAGA CACGTACATC TGTCAACAGT CACCACGAAA ACTAGAAATC GGCCGGAAAC TGTATCACAC TGTTGGGAAA GACGATTCCC GGCTCTCGAT CCAAAGCGAG GACCCGACCA ACCCATTCCC CACCATCACT GTAAGGATAA GCATGGTAGA AGAAACAAAG AATATCAGTC GAAAGGACCG TCGGCGAGAG GAGCGAGTCC GCAAGAAACA GAAGCGGCGT CCCCCACCGG TCCCGTCCGA GCTCCTGGAA GAGGCGCACG TCGAACCTGT CGTCGTGGCA AGTAAGGACA AGAAGAAACG CAAGGGCAAA GTTCAAGATT TACCCGAACG AAAATTGGCG AAACAAAATG ATGCTTACGG ACACCTGGAG TCGGGTGTGG CGGCGGCCTT GCGTCGTGAC GACGAAGAAA TCGCTGCACT CGAAGCCAAA TTGAAATTGT CCAGTAAGGC GGACAAATCT CGGCTTAATA AAGAATACGC CAAACTCGAA GGATACGGTG ACGATTTTGG GGACTTTTTG GACGATCTGG ATGGCATCAT GCAACGAGTG ACTCACGGTG AAGATTTTGC GGACGAGGGC GACCTAGAAA GCTATGTACA GATAAGGAAT GACGTAAAAA CCATCGAGAC CAACAAGTCA CAGAACAGAG CAAAGGCAAA GACAGCGGTT TATTCCAACT TGGACTGGCA TGTGGCAGCT GCACTCCGTC GCGATGACGA AGAAATAGCT GACCTCGAAG CTAAGTTAGG CCTCGGAAAC AAAAAGGAAA AGAGCCGACT GAAAAAGGAA TACGCCAAGC TCGAGGGCTA TGGCGACGAC TTTATCGACT TTTTGGACGA TTTGGATCAT TTGACGGATC GAGTAGTTAA ACAATCGAAA AAGGAAGATT CAGACGACGA TAACGATACT AGCCCTGACG ATGACGGCCA ATCAGAAAGC GAAGGTGAAG AAATAATCCC GATGAAAGAA CCGGCTTTTA ACGATCTCGA CGAGGACGAC AGCGTAATGG ATAGTCTCGA GTCGACTCAA AGCAGTGACT CCTCGGATCA TGAGGCTTTA GAAGATGATA TGACTAGGGA CAGATCTGAA TCTCAAGAGA ATTTGCAATC TGATGATATG GACATAGAAC AGGACCATGA ACCCGAACAT ACGTACCGAC CATCAGCTGG CGAAAACATT TACGGAAAAG AGATTGGCGC AGCAGAGAAT ATGGAAAAAC CCCGCAAGTA CGTTCCGCCT CACTTAAGAA ATACGCAAGA AGTGGAGGGG AAAGAGGACA GCGCTGCAAG GCAAGACGCC CTGCGTGAAA TTCAAAGATC TTTGAACAAT GCATTGAATC GACTTTCCGA CGACGCATTG ATTTCAGTAG CTCAATCGAT TTGTCAGCTG TATCCGATGC ATCCGACTTC AGATGTGAAT ACAATGATTT GGAATAACTT GCAAAACGCA TGTATCGCCA GAAGCCACCT GATGACAGGA CTGATTCCTG CTTATGTGGC TGCCATAACC GGTGTACATA TTGAAAAGGG CGACACCGCT CAACTTGGGG AGTTTTTGAT AGAAAAGACG GTTTTGGAAA TCTGGAAAAA ATTGGAGGTT ATCCGGTCGG TCAACAGTCA GAATGATAGC CCCCTAGGAG AAGAGTCTTT GACTGTAAAT AAGGAAACCA GCAATCTTAT ACTGGTTCTC 
TGCTACCTCT ACAACTTTGG CGTCGTTCAT TGCTCCCTTA TCTACGATGC TATACGAAAC TTGATTGAAA GCTTTACCGA AATTGATGTG GAATTACTTC TTCTTATTTT GAGTCACTCT GGCCGCGCAC TGAGGAGTGA CGACCCGTTG GCCTTAAAAG AAATTGTTTT CCTTGTCCAA AAACAATATA CTATTGCAAA AAAAAGTAAC ACGAATGCTT CGCGCTTGGA GTACATGGTT TCGGCAGTCA TTGATTTGAA GAACAATCGA AAGCGGAAAC AAGACGCGTT ACTCGAAGAA AAGACGACGA AGCTACGGAA ACTCCTCGGT CAAATAAAAT CTAAGGTGGC TCAGAACAAC GTTGGCTATA AAGCTTCCGA TTCATCGCTC CGGATTGGTC TTAGAGACAT ATTCAATGCA GAAACCAAGG GGCGTTGGTG GAAAGTTGGA GCATCTTGGG TCGGCCATCT GGTAGGAGAG AAAAGTAGCG AATCACCAAA CCAAGGGACG ACAAACGAGG TCAAACCTTC GATAGAAGAC GAAAAATTAT TGAGATTAGC GTCAAAACAT CGCATGAATA GTGATACGCG GCGATCGATC TTTTGCATAA TCATGAGTTC TGCTGACTGT GAAGATTGTT TTGAAAAACT CGTCAGGGCG GGAATGCTAA AAAACCGCGT CGAACGAGAT ACTGTACGAG TCCTCATTGA ATGCTGTGGC AACGAGAAAG CGTACAACAA ATTCTACTCT CATTTAGGAG CAAGGATTTG CGAGTACCAA TCATCTTGCA AATTTACGAT ACAGCTTGCG TTCTGGGATG TCTTTAAACA GTTTGATGAC ATGAGTGTGC GCAAAGCTGC TAATCTTGCT AAACTTTTGT TCAGTCTGAT TGTCGACCAT CACATTTTGA AGCTCAACGT TTTGAAAGCG ATTGATATTT CTTCCCCAGA CGAGCTCTCC GAGACGGCGC TAGTCTTTAC AACAGTTCTT TTGTCTAGTA TTATGGAAAA ATTTGACGAT CCATCTCAGG TCCAGCAACT TTTTGAGACG GGAATTTCTC ATAAAAAAGC GATTGCATCC GACAGTGTGG ACGACATCGA TGGATTTGGA GAAGCCGACG AGAGCGAGGC ATTAAGGGCT AGCCTCACCA TTTTTTTCAT GCAAGTCCTT AAGGGAAGCC CAAGGTACAA GAAAGGAAGC AGATATCGTG CGAATCTGAA GGCTGCCATT AAATCATGTG ATGTGGACGA GTTCTTTTAA
Protein sequence | MYVYVPAGFS SGLPDIGALL SPRKLEIGRK LYHTVGKDDS RLSIQSEDPT NPFPTITVRI SMVEETKNIS RKDRRREERV RKKQKRRPPP VPSELLEEAH VEPVVVASKD KKKRKGKVQD LPERKLAKQN DAYGHLESGV AAALRRDDEE IAALEAKLKL SSKADKSRLN KEYAKLEGYG DDFGDFLDDL DGIMQRVTHG EDFADEGDLE SYVQIRNDVK TIETNKSQNR AKAKTAVYSN LDWHVAAALR RDDEEIADLE AKLGLGNKKE KSRLKKEYAK LEGYGDDFID FLDDLDHLTD RVVKQSKKED SDDDNDTSPD DDGQSESEGE EIIPMKEPAF NDLDEDDSVM DSLESTQSSD SSDHEALEDD MTRDRSESQE NLQSDDMDIE QDHEPEHTYR PSAGENIYGK EIGAAENMEK PRKYVPPHLR NTQEVEGKED SAARQDALRE IQRSLNNALN RLSDDALISV AQSICQLYPM HPTSDVNTMI WNNLQNACIA RSHLMTGLIP AYVAAITGVH IEKGDTAQLG EFLIEKTVLE IWKKLEVIRS VNSQNDSPLG EESLTVNKET SNLILVLCYL YNFGVVHCSL IYDAIRNLIE SFTEIDVELL LLILSHSGRA LRSDDPLALK EIVFLVQKQY TIAKKSNTNA SRLEYMVSAV IDLKNNRKRK QDALLEEKTT KLRKLLGQIK SKVAQNNVGY KASDSSLRIG LRDIFNAETK GRWWKVGASW VGHLVGEKSS ESPNQGTTNE VKPSIEDEKL LRLASKHRMN SDTRRSIFCI IMSSADCEDC FEKLVRAGML KNRVERDTVR VLIECCGNEK AYNKFYSHLG ARICEYQSSC KFTIQLAFWD VFKQFDDMSV RKAANLAKLL FSLIVDHHIL KLNVLKAIDI SSPDELSETA LVFTTVLLSS IMEKFDDPSQ VQQLFETGIS HKKAIASDSV DDIDGFGEAD ESEALRASLT IFFMQVLKGS PRYKKGSRYR ANLKAAIKSC DVDEFF
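The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes, which the record itself does not state, that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon. The function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, stop_codons: int = 1) -> int:
    """Nucleotides needed to encode the protein plus its stop codon(s)."""
    return 3 * (protein_aa + stop_codons)


# Values copied from the record for PHATRDRAFT_40070.
start_bp, end_bp = 71755, 74814
protein_aa = 996

gene_bp = span_length(start_bp, end_bp)   # should match "Gene Length"
cds_bp = coding_length(protein_aa)        # codons for 996 aa plus one stop
noncoding_bp = gene_bp - cds_bp           # bases in the span not used by codons

print(gene_bp, cds_bp, noncoding_bp)
```

Under these assumptions the span length agrees with the listed Gene Length of 3060 bp. The nonzero remainder is what one would expect for a gene flagged as spliced, since intronic bases are counted in the gene span but not translated.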