Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49451 |
Symbol | |
ID | 7195810 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 279210 |
End bp | 280534 |
Gene Length | 1325 bp |
Protein Length | 389 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184103 |
Protein GI | 219127773 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCCAT ATCGATCCAG TGATGACATG AAAGATGGCA CTGCGAGAAG AATGAACGGA AAGCGTTCGA GGGAAGCGAA GATACAAAAA TGGCAAAACA AACAAACGAA GTTAGAAAAG AGAAGAGAAG AAATTAGTAA TCGGGACGAG TGGATCGAAA AGCATTCAGA CTTTGTTTCG GTGATAAACC TAGAGGACGT GAGAGATGCA AAGAACATCG CGTCCGGAAA CGCTCTTCGC GAGCGTTTGT CTCCATACTT TAATCTACAG GACAACTCAT TGGTTTCCCG AGCGAAAGAT GGCCGACGGC GCTTTATTGC TGAAGGAACA GAAACTGTCC GACTCCTGAT GCAGCAATTA ACTGTAAGCA ACAATTCTTC TTCCGGACTT TTTCCGGTTG AGGTTGAGTC CATCTTTGTA AAGCCGAGTG TCTTCTTCGA CCCTCCTGTT TCGCTCATTT TCGACTTTCA GAAGATGATT GACTTGACAA AGCATACTAC TGTTTGTGTA AGCGAGGCTG CTAAAAGGGC AAAAGTCCAT GTCATGATCG GAGCTGAAAA TGTATTGAGC GAAGCCGCTG GATTTACAAT ATCGAGAGGG GCCTTGGCAT GCGGCTTCGT CCCCGAAAAT CGTAACTTTG CCTGGTTGAT GGAATATTTT AGAAAAACAA GAATGTCTGG TGAAGGGGAG CTTCGCCTGT TGGCGCTAGA TGGAATTTGC GACACCGCAA ATCTAGGATC CGTTGTACGG TGCGCCTCGG CGTTTGGGGT TCACGCCGTT CTTTTAAGTA AGGATTGCTG CGACCCATGG TACCGTCGAG CGGTGCGTGT GTCCATGGGT CACATTTTCC GAATACCATG TGTGCGAGTC GACAATTTAG TTCAAGCCCT AACTGCACTA TCGCAAGAAC CATTCGCAGT CACTTCCTAC GCAGCTGTGA TCGACCCGAG AGCGGATCTT CTGTTGGAGA ACATCGCACA AGGTATGTCT TTCATTTTAA ACCTAAACGA AGTCAAGAAT ACATATAGTT GACCGTTCGC TCCTCTTTTC ACTCGACAGG CGCTATTCCA AAATCGTGGT GTTGTATTGT GGGTAGTGAG GTATGTCGCT TTCGCACTAT TGAAAACTTG AAATAAAATT GCCTCTCTCT GACAATGAAT GTTACATTTT GACAATAGGG GAAAGGAATT TCTTGTGACG TTATCCAAGC TGCGACTACA ACCCTCAGGA TCGGGATGTA CGACCACGTT GATTCACTGA GTGTCCCTGT GGCTACAGGC ATTTTGCTGC ATGGTTTGAG CGAGCGCTCA AAACCGCTTC TATAG
|
Protein sequence | MDPYRSSDDM KDGTARRMNG KRSREAKIQK WQNKQTKLEK RREEISNRDE WIEKHSDFVS VINLEDVRDA KNIASGNALR ERLSPYFNLQ DNSLVSRAKD GRRRFIAEGT ETVRLLMQQL TVSNNSSSGL FPVEVESIFV KPSVFFDPPV SLIFDFQKMI DLTKHTTVCV SEAAKRAKVH VMIGAENVLS EAAGFTISRG ALACGFVPEN RNFAWLMEYF RKTRMSGEGE LRLLALDGIC DTANLGSVVR CASAFGVHAV LLSKDCCDPW YRRAVRVSMG HIFRIPCVRV DNLVQALTAL SQEPFAVTSY AAVIDPRADL LLENIAQGAI PKSWCCIVGS EGKGISCDVI QAATTTLRIG MYDHVDSLSV PVATGILLHG LSERSKPLL
|
| |