Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_18735 |
Symbol | |
ID | 7203903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1148774 |
End bp | 1150129 |
Gene Length | 1356 bp |
Protein Length | 442 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186192 |
Protein GI | 219113217 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.609957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGCCGGGAC TACTACTAGT AGCGGCAAGA TGTTCAACTT TGGTATGTTC GCGCTCGAAC AATTTAAAGG CCGGCTTCAT GAATGGCCAC AGTATTGCTC CCATATTGTA CAGATACCAC ACTTGAAAGA TGGCTACGCT GCGTTAGTTT CTGAGATTGA ACGAGAAATG AGCAAGAATC AGGGCGCTGC TTCCGTGGCA GGCCAACAGC CTCCTGTACC AAAAGTTCTT GAATCAGGAC TCGCTGGCGA CACTTTACTA CCTCATCCAC CGGGCGAGGC AATAACGTCC CATCGATCCC AAAGTGCTCC CGTACCAAGT GCGCTTGAGC TATCGTCGGA GGTGCTGAAC TTATCCTCGC CGCCGCGGGT TGCCGAGTTT GGGCCTAAGC TTGGGCGTGC CGTGACGGAT AGTCCTGATT CCGAAAACGA CTTTGATGCT CCGACGGATA CGGTTTTAGA CCGCGTCCAG TTCTTGGTGA ATAATCTTGC ACAGTCAAAC GTAGAGCAGA AGGCTCAGGA CCTCAAGGAG ATGTTGGACC CAAAGTACTT TGGTTGGCTT GGTCATTTTT TGGTTGTAAA GCGCATTAGC ACACAAGCAA ATTTTCATTC GCTCTATTTG TCATTCCTCG ACAATCTAGG GGACTATGGA AAAGGTTTGA TGGAAGCGAT CATCAATAGC GTTTACCGCA ACATCGGAAA GCTTCTTCGA TCGTCCAAAA TTACGACGTC GTCATCGGAG CGAGGATATC TCAAGAATCT AGGAATTTGG CTTGGTCAGA TTACTCTGGC TCGAAATCGG CCAATTCTGC AGATTATGTT GGACGCAAAG GAATTGCTAC TCCAGGGATA CGAAACGGGC AAGCTGATCG CGGTTGCCCC GTTTTTGGCA AAGACCCTTG AGGGTGCGAA AAATTCCAGA ATTTTTCGTC CACCAAATCC TTGGCTCATG GGCATTCTTG GTGTATTTCG GTCGGTTTAC ATGGTCGACG GTCTCAAGAT GAATATTAAA TTCGAAGTTG AAGTTCTCTG CAAAAATCTT GGCATAAAGC TGGAAGAGAT TCCGCTCCGC AACGGCGTCC TGGCGAAGCG CATCGCACCG GTGAAAGAAC GTAATCCTGA TTTTAACATC AAGAGCGCAT CATCGGGGTC AAAGTCGTCC GCAACGGGAG TCGTGCCTTC CCGCTCTCTT GTCGGTAGCT CTGAGGCTCA GTCTCTCAAT TTGCCGATTC CGTCGACAGT ACCAAGTAGC GAGGATAAGT CGGGGCAAGA CCCACAGGAC ACGGTAATAC CTAATCTTGC CTCTTACGTC ACAGTCAACG CAAGCTTGCC GCAGCTTCTC CAAACCCAAG GGAGCC
|
Protein sequence | MFNFGMFALE QFKGRLHEWP QYCSHIVQIP HLKDGYAALV SEIEREMSKN QGAASVAGQQ PPVPKVLESG LAGDTLLPHP PGEAITSHRS QSAPVPSALE LSSEVLNLSS PPRVAEFGPK LGRAVTDSPD SENDFDAPTD TVLDRVQFLV NNLAQSNVEQ KAQDLKEMLD PKYFGWLGHF LVVKRISTQA NFHSLYLSFL DNLGDYGKGL MEAIINSVYR NIGKLLRSSK ITTSSSERGY LKNLGIWLGQ ITLARNRPIL QIMLDAKELL LQGYETGKLI AVAPFLAKTL EGAKNSRIFR PPNPWLMGIL GVFRSVYMVD GLKMNIKFEV EVLCKNLGIK LEEIPLRNGV LAKRIAPVKE RNPDFNIKSA SSGSKSSATG VVPSRSLVGS SEAQSLNLPI PSTVPSSEDK SGQDPQDTVI PNLASYVTVN ASLPQLLQTQ GS
|
| |