Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50067 |
Symbol | |
ID | 7198754 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 292775 |
End bp | 294253 |
Gene Length | 1479 bp |
Protein Length | 409 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184853 |
Protein GI | 219129349 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0352666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTACAGGTCT GTTGGGGGAA GCCGATGAAC AACAACCGCA TGCATAACAC ATACGGGATA CGCTACGACT CCAGAGCATG ATTGTACACA CGTTGGTCAC CCGTGGATCA CTAGTAACGA AAAAAAAACG CTATAGTGTC GTACTTGCCT TGTTGTATAC AAGTTGAACC AATTGTAGTT GAGTTAGCCT TTGTTGGATT GACTGTGTGA TACGTGGAGG AGTATGAAGG ACCGCGACCG ATACGACGAC GGCGAAGCCC TCAGCATCAA GAAGGAGGAG GGGTACACGG CTCTGTACGC TGTCCTGAAT GTGTCACCGG ACGCTTCCCG AGCCGATATA CAAAAGGCCT TTAAGCGACT GAGTCGAGTC TTTCATCCGG ACAAGCGGGT GCGTCTCGGG GTGTCGACCC AAAACAATGG TACCAACAGT ATGGCGGAAG AAGCCTTCCA GACGATTCGT CAAGCGCACG ATGTTCTTTC GGATCCGGTT TTGCGCTTGA CTTACGATTA CGCCGGCATG TTGGCGGTGG AACTCCTGCT GCGCTCGCAT TTGGCACGGG GCGATCGTAA CGAACCAAGG GGCAGCCACG ACGAGTCGTC GCGATCGACC ACGACCAACA AAAATGAGGA GGATTCCGAC GCGGAAGATC CCTGGGACCA GGACGACGAC GACGACGACG ATGACGAGGG TAACTCTTTG GACTTGTACG TTCAAGTACG AGACGCCCCA TCCTACCAGT ATGCTACTCA AATACTAGAC GATGCCTTGT ATCGAGTGCA ATCGCACCAA GCATCCTCAC GCACACACTC GCTTAACGGT TCTCTCGCAT TCCCTCACGT GCTCGGTGGT GGCGGTACAC AGGACGGCTT TTGGGAGCAA GATCGCGGTA GTTTGCAATG GCAGACCAAA CGACAAGTAT CGGCGCAATG GACGGCAACG CTCGGGGCCG GTTCGGAAGT ATCCCGGACG GCGCAAACGG AAATGTCCAC GCAACTTTCG CTCGCCTATA CCCGACCCGG TTACGGACCG GTGGGATCCG TGGATGTCAT ATCGTCCTCC CGAATGCCCG CAGCACCGGT CGTCAAAATT CAAAGTGGAC GTACACTTGC CAACCAGACC AACGTGTTGT TCAGTTTAGC CGGATCCGTG GACAATCCGG AGACCTGGAC GTATTCATTT ATGTCCAGTC GGAATATTTT GTGGAATTCG CCTAGTAGTG AAAGACGCAA GCAAAGTCAT TCGGACCCGT CGTCACCTAA AACGATTCAC GCCTCGTGGC AGTTGGGAAT TTCGTTGCTG GGAAAATTGC AGTATTTTCG GGTGGAATTG CGTCAACCTA CGTTGCCGCA TAAATGGAGT GCCCGCATCG GCCTCGATGC GCTCGCTGGT ACCTACGAAA CCGCCACGTA CACTGTATCG TACGCGCGTC ACTGGATGTG GACGCGTTGG AAGGCCCTTT GGCACCACAA GTGGGCTGA
|
Protein sequence | MKDRDRYDDG EALSIKKEEG YTALYAVLNV SPDASRADIQ KAFKRLSRVF HPDKRVRLGV STQNNGTNSM AEEAFQTIRQ AHDVLSDPVL RLTYDYAGML AVELLLRSHL ARGDRNEPRG SHDESSRSTT TNKNEEDSDA EDPWDQDDDD DDDDEGNSLD LYVQVRDAPS YQYATQILDD ALYRVQSHQA SSRTHSLNGS LAFPHVLGGG GTQDGFWEQD RGSLQWQTKR QVSAQWTATL GAGSEVSRTA QTEMSTQLSL AYTRPGYGPV GSVDVISSSR MPAAPVVKIQ SGRTLANQTN VLFSLAGSVD NPETWTYSFM SSRNILWNSP SSERRKQSHS DPSSPKTIHA SWQLGISLLG KLQYFRVELR QPTLPHKWSA RIGLDALAGT YETATYTVSY ARPLAPQVG
|
| |