Gene Information |
Locus tag | PHATRDRAFT_44522 |
Symbol | |
ID | 7197782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 798784 |
End bp | 800012 |
Gene Length | 1229 bp |
Protein Length | 352 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178307 |
Protein GI | 219115023 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.431246 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACAATTCG TGTAGCAGAC TTATTCTTGG AAGCCCGGAG AACAAAATCC ATGCTCGATT CTGCGATACC TACTTTTCTT TGTGAAATAT TCCGCTCATG AAGTTCTGCA TGTACGAGTT CATGTAAGGC AATTGACTGA CCGTGAACAA CCAAAAGATA TCAAGTCGCC ATGCTCCCTT TACAACGCAA GAAGGTTGCC ATGGCATTTC CAAAATGGCC AACTGTCACG CTAAATCTGT GGACCCTTCT TGTTTCGTCA AGCCTTCTTT GCTTTGTTGT CTACCAACTG GCCGTACAAA AATTCAAAGT TGAACTGTAC GAGATGGGCC GCCAGCAGCG TATTCATGAG CTTATTCAGC AAAGCGCACC AATAGATGCG TCTGTGATCG AAGAGTTGTA TCAAGCTATT TCATCAGAGA GGTTGGCTCG GATTGCAAAC AAGTTCAAGG AAGTATATTC ACTCAATGAT CCGTTTCCAC ATATAACAAT CGACGGAATT TTTCCCAAGC GATTTTTGGA GGAAGTTTTA AAAGAAAACC ACGAATCTGT TGGTGCGAAT GGATGCCTCG CAGGTGCTGG GAAGTGTTTT CAAGAGAGAG TCCAGAAAAA GAAATCAGCT ATTGAAGAAG ACGATCTGAT GGGACTACAC ACCAAGATTC TATTTTCTGC GATGAAAAGC TCAAACTTCA TTCATTTTCT GGAAATGCTA ACGGGCATTC AAGATATCAT ACCCGATCCG CATTACCGTG GTAGTGGCAT ACATTTGACA GCTACCGGCG GGAACCTCAA CGTCCACGCT GACTTCAACA AGTATGAGGC CTACCAGCTG GACCGGCGAG TTAATTCTTT TGTATTTCTA AATCAAGACT GGCCTGAGTC CTACGGTGGG CATTTGGAGT TCTGGACAAA AGATATGGGT TCCTGTGTCC AGCGCATTCT TCCAGTCTTT GGGCGCTTCG TTGTCTTTTC GTCCACCGAT TTTTCGTACC ATGGTCATCC CCAGCCTTTA TCTACACCGG AAGGTCGGGC TCGCAGATCC TTAGCTTTGT ACTACTATAC AAATGGAAGA CCTGCGAATG AATGTCTGAA AGGCGACTGT AGTGGTAGAG GGCATTCTAC GTTGTTCCAA AGACCGGTTG GCTGCAAGAT ATGTGAGGAA CAGACTTGCA AGGCGTACAG AGACAATGAA AAACTTCCGT ACTGGGAAGA TTCGGAGGCG AACACATGA
|
Protein sequence | MLPLQRKKVA MAFPKWPTVT LNLWTLLVSS SLLCFVVYQL AVQKFKVELY EMGRQQRIHE LIQQSAPIDA SVIEELYQAI SSERLARIAN KFKEVYSLND PFPHITIDGI FPKRFLEEVL KENHESVGAN GCLAGAGKCF QERVQKKKSA IEEDDLMGLH TKILFSAMKS SNFIHFLEML TGIQDIIPDP HYRGSGIHLT ATGGNLNVHA DFNKYEAYQL DRRVNSFVFL NQDWPESYGG HLEFWTKDMG SCVQRILPVF GRFVVFSSTD FSYHGHPQPL STPEGRARRS LALYYYTNGR PANECLKGDC SGRGHSTLFQ RPVGCKICEE QTCKAYRDNE KLPYWEDSEA NT
|
| |
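The length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are both inclusive (the record does not state its coordinate convention) and that the coding region ends with a single stop codon. The function names `span_length` and `coding_length` are illustrative and do not come from the source database.

```python
# Values copied from the record above.
START_BP = 798784
END_BP = 800012
PROTEIN_LENGTH_AA = 352


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
leader_len = gene_len - cds_len  # bases not accounted for by the coding region

print(gene_len, cds_len, leader_len)  # prints: 1229 1059 170
```

Under these assumptions the coordinates reproduce the listed gene length of 1229 bp, and the 352 aa protein accounts for 1059 bp of it. The remaining 170 bp agree with the gene sequence as shown, where the ATG that opens the listed protein (ATGCTCCCTT..., translating to MLPLQRKKVA) starts at position 171 and the sequence ends with the stop codon TGA.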