Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31948 |
Symbol | |
ID | 7196444 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1315748 |
End bp | 1316848 |
Gene Length | 1101 bp |
Protein Length | 317 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177266 |
Protein GI | 219111029 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.115339 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAAA TCAATGAAGG TATGTTTCAG GCAAGCTTAG CACAGCACTG GTGGATCCGC TGTGTCGGTG CGTTGTGTCT AACGGCTACA CTTGTTGATG GACTGTCCCG TGCCCTTGTC TCATCCAGGA GCGTGGATGT AAATGCCCTG CTCTCATCGT GTGTAGATGC GTGTGTACGA GGGTGCCATG AGATCCGCGA CGTCCAAGAA GAAAGGGACC AAGGGTCGTC CTTCCATGTG GAATTCAAAG ATTCATCTGA TCCAAGGTCA GCGCTTACGG AGGCTGACAG TAGAGCTCAA AAAGCTATTG TCGCTGCTTT GAAACAAGAT TGGGGGCCCG AACTAAGAAT CATCGGAGAA GAGGACCCAA AGGATGGATT TCTTTCGGTT GCAACAAACA AAACAGCACT ACGGATCAAT CTGTGCTCAT CCATTTTTAA GGAAGACTGT GACATGCAAA CTCTTTCGGA CATCATTGTA TTTGTGGACC CTCTGGATGG AACAAGGGAA TTCGTCGAAG GACGGCTTCA AAACTGTCAG GTTCTGATTG GTATTTCTGT CGGCGGAATT GCGTCAGCAG GGGCAATCGG AATTCCTTTT CCAACAGGAA ATTTAGACGA ATCGCCCACA GTTGTATATG GGAAAGTCGG ACTCGGGCAC GGGATTATTG GAAGCCCACT TCCCAATGCT CATGGTCGGA TTCCGCTATC ACAACCCTTA CTGGCGACAG GGGATACAAT TCTTCCAATT ATGGCTGAGG CAAGACGTAT AGTGCAAGAG CATTTTGGAG GTATAAACTG CCTCTACGGA GGTGCAGGGA ACAAGATCCT CGCTGCTGCT CTTGGACTTG TGGATTGCAC AATACAACAC AAATTTGGTG GCCCGTGGGA CACATGCGCC CCCGCAGCAG TGCTAAAATC AATGGGTGGT GAAATCACTG ACCTGCTAGG AGAGGATCTT TCTGTCTATC AAGAAGGAGA GCAAGAACAG TCAACCAGGA GAGGATTTGT AGCCACAGGG AGAAACTCCC TCATCCCACA CAAAGTTCTT GTTGCGGCCC TGCAGGAATC TAAAAGAATC ACCACATATA TAAACGGCTA G
|
Protein sequence | MQQINEDACV RGCHEIRDVQ EERDQGSSFH VEFKDSSDPR SALTEADSRA QKAIVAALKQ DWGPELRIIG EEDPKDGFLS VATNKTALRI NLCSSIFKED CDMQTLSDII VFVDPLDGTR EFVEGRLQNC QVLIGISVGG IASAGAIGIP FPTGNLDESP TVVYGKVGLG HGIIGSPLPN AHGRIPLSQP LLATGDTILP IMAEARRIVQ EHFGGINCLY GGAGNKILAA ALGLVDCTIQ HKFGGPWDTC APAAVLKSMG GEITDLLGED LSVYQEGEQE QSTRRGFVAT GRNSLIPHKV LVAALQESKR ITTYING
|
| |