Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43429 |
Symbol | |
ID | 7197437 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 437214 |
End bp | 438799 |
Gene Length | 1586 bp |
Protein Length | 442 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | chorismate synthase |
Protein accession | XP_002177933 |
Protein GI | 219112363 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0435185 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGTGATCTC TTCCGAGGTC ATCCATCTGC CTAAATTGAG CAAACGTTCG CTTCTCCGAT CATTACATTC GTGACTACAC TTGCCTACAC ATGAGGATAC TAGCCTCCGT CTGGGGCGTC GTTACGCTCT TCCCTTCGGC GCTCGCCTTT TCTCTGACCA TGGGTACAGG AAATTCCTTC GGTCGCGTCT TCCGGATCAG CACGTGGGGT GAATCGCACG GCGGCGGCGT CGGTGTCGTC TTGGACGGTT GTCCTCCCCG CATTCCTGTC TCGCGCGAAG AAATTCAAAT GGATCTGGAC CGCCGTCGTC CGGGACAATC CCGCATCACC ACTCCCCGGA ACGAAGCCGA CGCAGTGGAG ATCATCTCGG GTCTTTCCCC GGATGGTTTG ACACTCGGTA CGCCTATTGC CATGCTGGTG CGGAACAAGG ATCAGCGCTC GCAGGATTAT TTGGACAACG ACATGAAGGT TGCCTACCGT CCATCGCACG CGGACGCCAC GTACGACGCC AAGTACGGAG TCCGGGCCGT GGCCGGTGGG GGACGCTCCA GTGCACGTGA AACAATTGGA CGGGTGGCGG CGGGAGCGAT TGCTCGTAAG CTCTTGCACA AGTACAACGG CATTGAGATT CTCGCGTACG TTTCCAAAGT GCAGGACATT GGCTGTTCCG TGGACGACGA TACCTTCACG ATGGAGGACG TGGACGCCAA CATGGTGCGG TGCCCGGATC AGGCAGCAGC GGAAACCATG TTGGCACGGA TTGACGATAT ACGCAAGTCA GGGAACTCGA TTGGTGGGGT GGTGACCTGC GTGGCGCGCA ATGTCCCGGC TGGACTCGGT GCTCCCGTCT TTGACAAATT GGAAGCGGAC TTGGCCAAGG CTTGCTTGTC CATTCCCGCG GCCAAGGGTT TTGAGAGTGG AGACGGCTTT GCCGGAACAC TCTTGACTGG TAAGGATCAC AACGACGAGT TCTACGTGGA TCCCGAGACC GGGGCGACCC GCACCAAGAC GAACCGATCC GGTGGCATCC AAGGTGGAAT CAGCAACGGA GAAAACATTG TCATTCACGT TGCCTTTAAA CCAACCTCCA CTATTGGACA AGCACAGGTA AGACTTTGAG AGATTCAATC AACATGTCGG AGTGGACTGC CCCGAGGTTG ATTTCGCTGA CGCTCCAATC ACTTAATTGT TCTTTCGCCC TTTGTTGCAG AGCACGGTGA CGCGTGATGG ACAAGAAGTT GAATTACGCG GCAAGGGCCG ACACGATCCA TGCGTCCTTC CTCGAGCCGT ACCCATGGTT GAAGCCATGG TCGCGCTGAC CTTGGTGGAT CACCTTATGA TGCAGCAGGC TCAGTGTGAA CTCTTTCCGA ACGAGGCGGC GGCAGAAGAC TTGCCCAATC CGATGGGTAA GACCGCACGA CGACAAGGTG GACCCGTGCA CCAGGAGCCT GTTGCTAGCG GTCCGGTAAG TCAAAGAGTA GACGAAGAGT AACTCATAAA CACCGCAAAC GCAATTCGTC TTTTTCTAGA CGTGGGCACT AAATAATGGT ACCCTATTGC GACATTCGTT GTTCGCTGTG TACTGA
|
Protein sequence | MRILASVWGV VTLFPSALAF SLTMGTGNSF GRVFRISTWG ESHGGGVGVV LDGCPPRIPV SREEIQMDLD RRRPGQSRIT TPRNEADAVE IISGLSPDGL TLGTPIAMLV RNKDQRSQDY LDNDMKVAYR PSHADATYDA KYGVRAVAGG GRSSARETIG RVAAGAIARK LLHKYNGIEI LAYVSKVQDI GCSVDDDTFT MEDVDANMVR CPDQAAAETM LARIDDIRKS GNSIGGVVTC VARNVPAGLG APVFDKLEAD LAKACLSIPA AKGFESGDGF AGTLLTGKDH NDEFYVDPET GATRTKTNRS GGIQGGISNG ENIVIHVAFK PTSTIGQAQS TVTRDGQEVE LRGKGRHDPC VLPRAVPMVE AMVALTLVDH LMMQQAQCEL FPNEAAAEDL PNPMGKTARR QGGPVHQEPV ASGPTWALNN GTLLRHSLFA VY
|
| |