Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31981 |
Symbol | |
ID | 7196522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1395648 |
End bp | 1396856 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176777 |
Protein GI | 219110050 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCGC TTGGCATCGT GTCGTCCTCC CTGCTGACAT TGCGGAGCAG CCACGGACTC GTGCTGGTTC CATTGCCACT GCGACCGTCA TATAGCTCAA GCCACCGCTA TCAAGATAGA CATGCAACAA CGTCCATGTC ATCCTATCGA GGGGCGCCAG AAGACATTTA TGGAGCTGTT CATCGAAAAG AATACGAAAT GAAGAAGGTC AAGGCCCAGC ACATGTCTAC GACGGATCCT GTTCGAATGG CGATGGGGTA CGCACAGGAA TCGGTCTCGC CGATGAAGCT CGCCAAAGCC TTACGAAGAG TTTACGAGGA TCCATCCAAT CCAGCCAATC CCGACCATGT TCCTCTATCT GATGAAGAAA AGAAGCGCGC TCAGACACTG CAACAAGTTG GAATGGCCGA TTTGGGAATG CGACGAGGTA GTTTCATTGT CGACATTAAG CGCAAGTCGT TGAGTCGACC AGGCGAAGTT TTTTGTAATT ATGATGATGC TGGTATGGTA GCAGAGGCTA TGGTACGCTT GGGAGCCGAC GCCGTCTTTG TAAACACCGA CTATCAGGCC TACGGCGGTG ACATGACGGA ATTGAAATCG GCTGTCAAGG CAGTTCGCGC CGTTTCAAAA TCGGCGGCGG TCGTGATGAA AGATATTGTA GTGGATGAGA TTCAATTGGG ACTCGCGAAA GAGGCCGGTG CTGACGGAAT CGTTCTTATG TCATCAGTGT TGGGGCCTAC GCTCGAGAAT TTCTTGAACC TGGCAACCAT GATTGGTTTG GAGACGATCG TTGAGTGTCA TACACACGAT GAAGTACAGA GAGCCATCGA CATCCTGGCA CCCAATATTT TGGTCAACAA CTACGATCGA GTTGCACAGG AACTTCACCC AGAGCAGGCA ATTAAGCTTG CAGGTATGTT CCCTGGCTCT GGTGGACCCA TTATTTGTTT GGCTGCGGGA GAGATCGAAA CCACCGATCA AATGAAGCGC CATTTGGCGG TTGGGTACGA CGGAGTTGTG GTCGGTAAAG CAGTCATGGG AAGTCCGGCA GCTCCCGAGT TCATTCGAGC GGTTCGGGAT CGAACACTGC TACCAGCCGA ATTCTCGGCT TGGGGTTTAG AAGACGTGGA GTTTGACATG GACGGAAATG TCATGTCTGG ACCCAAGCGC GGCACTCCTC AAGATGGTGA TGCCGACGTC TACCAATAA
|
Protein sequence | MTALGIVSSS LLTLRSSHGL VLVPLPLRPS YSSSHRYQDR HATTSMSSYR GAPEDIYGAV HRKEYEMKKV KAQHMSTTDP VRMAMGYAQE SVSPMKLAKA LRRVYEDPSN PANPDHVPLS DEEKKRAQTL QQVGMADLGM RRGSFIVDIK RKSLSRPGEV FCNYDDAGMV AEAMVRLGAD AVFVNTDYQA YGGDMTELKS AVKAVRAVSK SAAVVMKDIV VDEIQLGLAK EAGADGIVLM SSVLGPTLEN FLNLATMIGL ETIVECHTHD EVQRAIDILA PNILVNNYDR VAQELHPEQA IKLAGMFPGS GGPIICLAAG EIETTDQMKR HLAVGYDGVV VGKAVMGSPA APEFIRAVRD RTLLPAEFSA WGLEDVEFDM DGNVMSGPKR GTPQDGDADV YQ
|
| |