Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_54658 |
Symbol | |
ID | 7204707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 616820 |
End bp | 618215 |
Gene Length | 1396 bp |
Protein Length | 422 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185917 |
Protein GI | 219121384 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000768502 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCTTCTTC CGAGAAACCA ACACACATGG ACTAATCCTG GAAGTTTTCG GTGATGGATC GAGATTACTC ACTATTTTTT GTTTTCTAAC ATCAATATTC TCAAGTAGCA TGCATACCGC GAAAGTAGTT CAACGTTTCT CTTTTTGTCG ATTACTGCAA CAATGGTTGT TGCTCGGTAC ACTCCTCGCG GGGCGTCCAG GCGTTTCAGC TTTCTGTGAT GAACTTTCAG ACCCTTCTTG GGATTCCTTG GTTGATTGGA TGACACTGGT CGAGCCTTTG GGGTTCGGTA TTGTCTGCCC TTTTCAAATC TCTGGATCTG CTTGTCCGAA GATCGAAACA TCCTTCGAAG TTACATCCCC AAACTTCTAC ATTATCTGCG CGAATTTTCT GGGCGGATCG GGGGATTCCT GCTTGCTTGA CTGTCCTTCT ACGCATATAT CAATAGCTCC TTATTCTCAA TTGATTCTGG AAGGCTGGAC TTTAAGAAGC ACAAAAGGAA CGCCCGCAGT AGTTGTGGAA TCAAACGCTG AGTTTGTTGC CTACGGCTCA ACTTTTGAAA ACAACCAAAA TACCGTTGGA AACGGTGGTG CCATTTTCGC CGATACATAT TCGACGGTAG TTTTGGAAGA TTCTTCTTTT GTGGAAAACT TTGCTCTTAA TGGTGGTGCA CTATACAGTT TGGGCACTGT AGTCATTGTG AATGGCAATT TTCAGAGAAA CAAAGCAATA GAAAGCGGAG GTGCGGTCTA CTTGGATGGA CCCCTCGGAA GTTCGGCTAG CATCCGTACC AGTCGCTGGG AAGGAAACCG TGCCCGTGTT ATTGGGCCTT CCGTATTTCA GAACTCAACT GACGTTGCTA TACAAACACA AGGAAATACC GCATGTAACA ATTTGAATTT CGAAGCACTT TCGTATTGTG ACGGAATTGA AGATGGCAGC CGAAAATGCT TGCCGTTTGG AGAGAAATGC ACCTTTTCAA CTAGTGCCCC TACATCAATC CCTACCTTGA CTCAATCTCT GGTTCCAACA TCAAGCTCAA TTGAGTCTCC CACTCAAATT CCAGCGTCGC GTCCGTCACT GCGCCCAACA CTGGCTGGAG CAACACCTAC AGTAACACCA ATCAATACAC CGTCCGATGT CCCATCACTT GTTGAAGCCG CCACTTTGTC AACTGCAGTG CCTGCCGGTC TTGATTTTAC ATCTGACGCA CCATCACCTG TTCCATTAAC ACCAGTACCT ACACGGATGC TAAGCAATAT ACCGTCTGAT GTCCCATCCT TGGTCCCAAC ATCAAAAGCA TCAACTGACG TGCTGACTGC GAAAGTATTT ACTTCCGGCG TCCCTTCACT TATTCCAACT ATTATAAACA GAGAATAGCC CAATTGTTAA ATTGAC
|
Protein sequence | MHTAKVVQRF SFCRLLQQWL LLGTLLAGRP GVSAFCDELS DPSWDSLVDW MTLVEPLGFG IVCPFQISGS ACPKIETSFE VTSPNFYIIC ANFLGGSGDS CLLDCPSTHI SIAPYSQLIL EGWTLRSTKG TPAVVVESNA EFVAYGSTFE NNQNTVGNGG AIFADTYSTV VLEDSSFVEN FALNGGALYS LGTVVIVNGN FQRNKAIESG GAVYLDGPLG SSASIRTSRW EGNRARVIGP SVFQNSTDVA IQTQGNTACN NLNFEALSYC DGIEDGSRKC LPFGEKCTFS TSAPTSIPTL TQSLVPTSSS IESPTQIPAS RPSLRPTLAG ATPTVTPINT PSDVPSLVEA ATLSTAVPAG LDFTSDAPSP VPLTPVPTRM LSNIPSDVPS LVPTSKASTD VLTAKVFTSG VPSLIPTIIN RE
|
| |