Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35702 |
Symbol | |
ID | 7201118 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 565065 |
End bp | 566678 |
Gene Length | 1614 bp |
Protein Length | 511 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180076 |
Protein GI | 219118614 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTCGT CGAGCGCGGC GAGCGGTAGC TACCGACCCG AGCCGACGAG CACTGGTCGA TGTCAATACT GTGCACACTT GGCCTTCAAG GCCAGGAAAG CTGCTGTTGA TACGTTTATG GCAGGGAACG ATATGCCTTG GGATTTTGAG GTCGGAATGT ATGACCTTGT TTTACGGCTG AATCAGATGG AAGAAGAGAC TAGGTTGCAT CCCGAAACGG CACGGCACTT AGCCGACAAA GCAACCCGCT ATACGGCAAG ACTCCGCACT CAACTATCGG ACGGAGAACT GACGCCGGTA CAGTCGCTTA ACAATTGCCG AATACGAACC ATGGCGCCTT TGGAGTACGA AGAGGTAACG TTGGGACCAT TATTGGGGGC GGGGGGATTT TCTACTGTCT TTGCGGTGGA CGGGTTTCAT CTAAAGGACC ACTCGACACC TGATTACGAC TCGGACGAAC GCGCTGCACG GGAATTTTTG AATAAGCACG CACGACGATA CGCTGGGAAC GGTGGACCGA CTATACACCG CTCGGTCGAA GGCACGAAAT CCAATCAAAC TACCGCGCGC TATGCGGTAA AGCACCTCCG CCGCGGGCTC GCCGAGGAAG CGGATAGGTT CGAACGCGCT GCGGTGGATC TAGTGATGGA AGCCCAGCTG CTGCTGGCGC TCGATCATCC CAACATGTAA GTGGCGGTCG GAGACAAAAA GTACCTTGCG AGTCGAGTAC GATATCTCAA CACTGTGCTC TTTGGCCAAT TTAGAATATC CATACGTGGG TGGTCTCGTC AAGGACCAGG TGGTTACGTG AACGGAAAAC ATACGGACTT CTTCGTCATT CTGGACCGAT TGCCCGAAAC ATTAGAAGAC AAGATTTACG CCTGGAGAAA AAAACTGAAA CGGTACAAAG CCATGGGGAA GCTACCGTGG GGACGGAAGA AGTATATCGC CAAGACTAGT GCTCTGTTGT TGAAACGAAT GCAGGCGGCG CTTGACATTG CTGCCTCCTT GGAATATATG CACGATCGGC GCATCATCAA CCGCGACGTA AAAGCCAGCA ACATTGGCTT TGATATCCAC GGCGAGCTGA AAATGTTCGA TTTTGGCCTT TCTCGTCTAC TGCCGGCCGA AGAAGAGCGC GTCGAAGGTG GCTTTGTCAT GTCGCGGGTT GGCACAAAGT ATTACATGGC TCCCGAAGTA TGCGACAAGC AGCCCTTTGA CCTGTCGGCG GACGTGTACA GCTTTGGAGT GGTCCTGTGG GAATTGCTGA CCTTGTCCAC GCCTCGCGAA GTGATTCGGA AGCTCCGACG TCTCACGAAC AGCTCGTATA TTCTACCGAT CTGTCCGTGC TGGCCTAAAG CACTGCAAAG GCTGGTGGGA CTGTGTCTTG CGGACGACCC TAGTGTACGC CCGACCATGG CGGTCGTACG AGCGTCAATC GAGATAATGT TGGAACGGCT GGGGGTACCA CGTGCTGTAG GGGGAAAAGC ACGTCGCCGT TCCACCTTTC GATTGGAGAC GACCGCAACT GATAGTATTG CCAACGCCGC TCTCACCGTG CAATCGTTTT CCGATTACGA TCCGTCCATT CCAACTTCTG TCCAATGCAA TTGA
|
Protein sequence | MESSSAASGS YRPEPTSTGR CQYCAHLAFK ARKAAVDTFM AGNDMPWDFE VGMYDLVLRL NQMEEETRLH PETARHLADK ATRYTARLRT QLSDGELTPV QSLNNCRIRT MAPLEYEEVT LGPLLGAGGF STVFAVDGFH LKDHSTPDYD SDERAAREFL NKHARRYAGN GGPTIHRSVE GTKSNQTTAR YAVKHLRRGL AEEADRFERA AVDLVMEAQL LLALDHPNII SIRGWSRQGP GGYVNGKHTD FFVILDRLPE TLEDKIYAWR KKLKRYKAMG KLPWGRKKYI AKTSALLLKR MQAALDIAAS LEYMHDRRII NRDVKASNIG FDIHGELKMF DFGLSRLLPA EEERVEGGFV MSRVGTKYYM APEVCDKQPF DLSADVYSFG VVLWELLTLS TPREVIRKLR RLTNSSYILP ICPCWPKALQ RLVGLCLADD PSVRPTMAVV RASIEIMLER LGVPRAVGGK ARRRSTFRLE TTATDSIANA ALTVQSFSDY DPSIPTSVQC N
|
| |