Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40988 |
Symbol | |
ID | 7198826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 70547 |
End bp | 71617 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | enyol-coa hydratase |
Protein accession | XP_002185042 |
Protein GI | 219129745 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.569479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGAG CGTTGTTCGG TTGGTCTCGG CATTGCGGGA GTTCCGTTGC GGACGCCGGA GTCCCACGCC ACCGACTCGA GCGTGAGCTT GTCCTTCCCC GCTCGTTCCG TACAAGGTTC TACGCGTCTA CCGGTGAAAC ATCAACAACT CCCAATGCTA GCAAATCCTC GTCGACGGAA ACGTCACCGT CCGATGAATC CCGCGTCTTG GTGGACGTGG ATCACCAAGG CGTCGCTCGT GTGTGCCTCA ATAGACCCAC CAAACTCAAC GCACTCGACA TGCCCATGTT CGACGCCGTA GCCGATACGG CCCTTTCACT ACAAAAGGAT CGAGCCATTC GCGCCGTCAT TCTATCCGGC TCTGGCCGTG CTTTTTCCGC CGGTCTCGAC GTCGCTTCCG TTCTCTCCAC CAATCCCTTG AAAAACTCGG AACGTCTCCT CACGCGAGAC GACGTTGACG ACAACAACAA CGACAACAAA AACAACTACT CGAACGAAAG ACGGTCCATC GCGAATTTGG CACAACGCGT CTCCATGGCC TGGCGGGATA TCCCGGCACC CGTCGTAGCC TGTCTGCACG GTGAATGTTT CGGGGGCGGT TTGCAGATTG CCCTAGGGGC CGACGTGCGC TTGGCCACCC CGGATTGCCG CTTGGCGATT ATGGAAGCCA AGTGGGGATT GATTCCCGAC ATGGGTGCGT CCGTGCTCTT GCGGGAACTC GTACGCATCG ATGTCGCCAA GGAGTTGACC ATGACGGGAC GGATTGTGGA CGGGAACGAG GCTGCCGCAC TCGGACTCGT CACGAGGGTC GTCGACGATC CCCTCGAACA AGCCGAAACA CTCGTACAAG CCTTCCTGCA GAGGTCGCCG GATTCTCTGG CCGCCACCAA ACAACTTTAC CACCAAACGT GGGTTGCACC GGAAGAGTAT AGTTTAAAGG TGGAAACAGC ACTGCAACGA AAACTGTTGG TTTCGTGGAA CCAAATGGCC GCCGCGGGTA GGAGCTTTGG CTGGAAGGTT CCTTACTTTC AACGCAAAGA CGGGACCCTT GATACGGAGA AGAAGTTATA A
|
Protein sequence | MKRALFGWSR HCGSSVADAG VPRHRLEREL VLPRSFRTRF YASTGETSTT PNASKSSSTE TSPSDESRVL VDVDHQGVAR VCLNRPTKLN ALDMPMFDAV ADTALSLQKD RAIRAVILSG SGRAFSAGLD VASVLSTNPL KNSERLLTRD DVDDNNNDNK NNYSNERRSI ANLAQRVSMA WRDIPAPVVA CLHGECFGGG LQIALGADVR LATPDCRLAI MEAKWGLIPD MGASVLLREL VRIDVAKELT MTGRIVDGNE AAALGLVTRV VDDPLEQAET LVQAFLQRSP DSLAATKQLY HQTWVAPEEY SLKVETALQR KLLVSWNQMA AAGRSFGWKV PYFQRKDGTL DTEKKL
|
| |