Gene Information |
Locus tag | PHATRDRAFT_46292 |
Symbol | |
ID | 7201223 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 835308 |
End bp | 837856 |
Gene Length | 2549 bp |
Protein Length | 772 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180717 |
Protein GI | 219119933 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
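The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span runs from the start codon through the stop codon (no untranslated regions included). Under those assumptions, the difference between the gene length and the coding length is the total length of the spliced-out introns.

```python
# Values copied from the Gene Information table.
START_BP = 835308
END_BP = 837856
PROTEIN_LEN_AA = 772


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def coding_length(protein_len_aa: int) -> int:
    """Coding nucleotides for a protein, counting the stop codon."""
    return (protein_len_aa + 1) * 3


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LEN_AA)
# Assumption: the span holds only exons and introns, with no UTRs.
intron_len = gene_len - cds_len

print(gene_len, cds_len, intron_len)  # 2549 2319 230
```

The first value matches the listed Gene Length of 2549 bp, and a non-zero intron total is consistent with "Is gene spliced | Yes".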
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGATA AGAGTACCGC GAATTACCCC ATAGTGGTCG TGGATGTGGA CGACGATACG ACCGACGGCG AGAACGACGA AGTATCGACC ATTGTCCGTA CGCGAATCGA CACGGCGAGT CCGACAACGT CACCGATTCG GCCCCACACT CCACTCGAGG AAGCAGTGCA CGGGGGCTTG CGGTTGACCC GGACGCGCCG CACTTATCAC GAAACAGCCG CACCCCCGAG CTCCCCCGTG ACCGCGTCGT CCCCCCCGTC GACACCACTC TCCTTCACGG CCCAAGCGCC GAAAAAACGA AAGGCACCCG TCGTACACGA ACCGTATTCT GCCGCAGCTA ATATGCGAGC CGCCATGGAA CTGGCCAAGT ACGAATTGTT GGTGGACCCG TATCGTGGAA TGACAAAGTT ACGAGACGCA CACCACGACG AGGAACGGGA CGAGACGGAT ACGAACGGAC ACGTGGCTAC GTCGCCGTCG CTAGCAACGG TGGAAATAAC TCCCAGGCCC AGTCCCTGGC AAACATCCTG CGCCGTCGCA ATCCCGGAGA GTGGTGATTG TAGGAGCCCC ACGGATGATG GAGACGACAT TGCGAATTGG GAGGCGGCCG ACATTGTGAA ACGTGCTTGT AACTGTTCTT GTCCTTCATT GACGTCCTGG ACCAGCGCAT TGGCTGTCGC TCGCGACAAG AAGAGGAAAA AGTCCAAACA CCAACGCAAA GCCAATTCGA CTGTCAATAC ACATGTATTG GACGGTGATA CGGTGCGGAT TGGGCAACGA CCCGATGGAC ACGCCGTACG CTGCCCGTGC GATTATAATC CTTTCTGTCT CGTGTCCTTG GGCGGTGTTG TGAACGAAAT TCTCGTCGAC CGTTACAAAG AGTTGGAGAC CAAAAACGAT CGAGCGGGCA ATGGCGAGGT CGAGGAAGTT GTCGACGACT CCTCTGTCAC GAACGATGCC GAAGGTCCCG TTCAACCAAA CAGAATCATA TCGGACGCAA CCCAAGAAGA CATGAATGCC GTCCGCCGAA GTATATCCGT GAGAGTTGAG CCTATTCGTA GCTATCTGGA ACATACCTTG CAAGATTTGA CGCCGGCCCT TACCCTGGAG GACTGCATCA GCCGTATTCG AAAACGCCAT GCTGCTCTAA TATTTGTCAA TCCACTTTTG AAAGAGAAGG CCGATTCACC CAAGGATAAT GATGCACTGG TCATGTCCAT TCCACCCGGG ATGCAAAACT TGGGCGCCAC TTGCTATCTG AATACACAAC TGCAGTGTTT GGCACAGAAT CTGGTCTTTT TGGAAGGGGT CCTGTCGTGG CGTCCGCCAA CGTCCACAGT GGACGGGAAT GCAAACCCCG ATCCCATTCC CCAAATGATT GAAACGTTTC AATCGCTCCT GGCTTCCATG CGTATCGGAC CACACTATGT CTTGAATACC AAAGACTTTT CCAACGCACT ACGCTTGGAC CACTATGAAC AACAGGACCC GAACGAGTTC AGTCGGCTCC TGCTTGACTG TATACAGCAA AGTTTTCAAA GCGCCACGCA ACAACGCGAT TTGGCAACCT TGCTGCCCCA TCTTTTTCAC GGCAAAACAA CCTACACGAC CACCTGTCAA GTATGTCACA AAATGTCCAC CACGACAGAG AACTTCATGG ACGTGACTCT GCCGATCGTA AAGCCGCTGC GAGAAAAAAG CATGCCCGGC CAACAATCGC TTGCCGATGC CTTTGGTAAA AGCAAAGCAA AAAAGAACCT GCAAAGCTAT GATACGGATG TCCAGTATTG CTGGGATCGG 
TACGTTTATG CAGAAACGTT AGAGGGTGAC AATCAGTATT TCTGCACCGA ATGCGAAGCG AAAGTAGACG CTCAGCGGGC CTTGACTTTC TCTGCACTTC CACCCGTTCT AAACATTCAA TTGTGTAGAT ACGTTTACGA TAGAAATCGC GGTACAAAAA AGAAGGTGAC GGACAAGGTC CTCTTGCCAA CGGAACTAGA AATTGAGCAA GCGACGACTT CTGCACCTTC AACATCACCG CAAGTCGAGA CGGAGCCCAC CAAACATCGG TACGTGTTGT GCGCCGTCAT GCTACACAAA GGAAATTCGG CGTACAGTGG TCATTACGTG GCTGAAGCGA TGGATTGGCA AACAGGACAA TGGTTTGAGT TTAATGATGC TCATGTAACT CTGCTGGAGG CGCCCTCATG CAGTTGGGAC CCTTTGTTTG ATAGTGACAG AAGTGACGAT CGCAAGAAAG ATACAAAAGA GACCAAGGCA AAGAAGGGCA GCGAGGATGC CTACAATATG TATTACGTTG AAGAGTCGTT TTTGGCGCAG AGTGTTTTGG ACAGCATTCG AGAAATTGAT CCACAGACTG GTGCTTCGAA GTCTGAATCA GAGGACGGAG CGTCCGCTTT AAAGACAGCT GCCCTGACAA GATCTGAATA CTTTGCTGAT CTGAGACGGT AAGTTTTGGG ACACAAAGCA AGAGTGCATT TCGTTTCTGT TCTTTGTTTA GTACGTACGG AACGTTCGGT AACTTATAA
|
Protein sequence | MTDKSTANYP IVVVDVDDDT TDGENDEVST IVRTRIDTAS PTTSPIRPHT PLEEAVHGGL RLTRTRRTYH ETAAPPSSPV TASSPPSTPL SFTAQAPKKR KAPVVHEPYS AAANMRAAME LAKYELLVDP YRGMTKLRDA HHDEERDETD TNGHVATSPS LATVEITPRP SPWQTSCAVA IPESGDCRSP TDDGDDIANW EAADIVKRAC NCSCPSLTSW TSALAVARDK KRKKSKHQRK ANSTVNTHVL DGDTVRIGQR PDGHAVRCPC DYNPFCLVSL GGVVNEILVD RYKELETKND RAGNGEVEEV VDDSSVTNDA EGPVQPNRII SDATQEDMNA VRRSISVRVE PIRSYLEHTL QDLTPALTLE DCISRIRKRH AALIFVNPLL KEKADSPKDN DALVMSIPPG MQNLGATCYL NTQLQCLAQN LVFLEGVLSW RPPTSTVDGN ANPDPIPQMI ETFQSLLASM RIGPHYVLNT KDFSNALRLD HYEQQDPNEF SRLLLDCIQQ SFQSATQQRD LATLLPHLFH GKTTYTTTCQ VCHKMSTTTE NFMDVTLPIV KPLREKSMPG QQSLADAFGK SKAKKNLQSY DTDVQYCWDR YVYDRNRGTK KKVTDKVLLP TELEIEQATT SAPSTSPQVE TEPTKHRGHY VAEAMDWQTG QWFEFNDAHV TLLEAPSCSW DPLFDSDRSD DRKKDTKETK AKKGSEDAYN MYYVEESFLA QSVLDSIREI DPQTGASKSE SEDGASALKT AALTRSEYFA DLRRTYGTFG NL
|
| |
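The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below is illustrative only and uses hypothetical helper names: it strips the block spacing and computes the GC fraction, which could be compared against the listed GC content once the full gene sequence is pasted in. Only the first three blocks of the gene sequence are used here to keep the example short.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three 10-base blocks of the gene sequence in this record.
sample = clean_sequence("ATGACGGATA AGAGTACCGC GAATTACCCC")

print(len(sample))          # 30
print(gc_fraction(sample))  # 0.5
```

To check the whole record, pass the complete Gene sequence field to `clean_sequence` and confirm that its length equals the listed Gene Length before computing the GC fraction.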