Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_4059 |
Symbol | |
ID | 7201598 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 591539 |
End bp | 592600 |
Gene Length | 1062 bp |
Protein Length | 312 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180863 |
Protein GI | 219120240 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGCTGGCGG GTCCTTCTTC CGAAGACCTC GGTTGCGACA TCGCGCACTT ACTGGGCGTA CCAGCGAATC GCATGGATGT TGGCCAGTTT GCCGACGGCG AAACTCGAGT CCAAGTTCAA GACTCAGTCC GCGGCAAGAA CGTTTATATC GTAAACTCTA CTACATCGTC GGATTCTATA ATGGAGTTAT TGCTGCTGAT TTCTACGTTA AGAAGGGCGA GTGCCAAACG GATAATAGCA GTGGTCCCGT ACTATGGATA TTGTCGACAA GACGAGCGTA GGCAAGCGCG ACAACCGATT GCCGCCAAAG ACATGGCACT CATGATGGAA GAAATGGGAG TTGATCGGGT CATTTGTATG GATCTACATA ATGACTCTCT TCGTGGTTTT TTCTCTCCTT CCGTCCCCGT TGAAGTACGT CAACGACACG AGCAATTCTA TTTCGCCGGA TTCGTTTCCG CTTTAAATTA TCTCAATCCG CCACTTCTTT TGTTTTAGCA CTTAATGCCT GTTCCCGTTG CGGCCGCTTA CTTTCACGAG GAATTGAGTG CTGGAGTGGA ACAGAAGAAT GAAGCATTTC CTAAGGTGAC AATTGTTGCT TCTCATGAAG GGCAAGTCGG CCGAGCGACA CAATTTCGAT CCGTCTTACA ACGCCTTTCA GGAGAGAATA TTGAACTGGC CGTTCTGTCG AAATCTCGCA TGAGACCGGG CGAAAAGCAG TACGAACCGA AATTGGTAGG CAATGTGAAG GGACGGAAAT GTATCCTGGT GGACGATATC GTCAACACGG GTACCACGTT GGTGAGTAAC GTGCAAATGT TGAAGCAGGA AGGGGCGGAT AGTATCTACG CATGGGCTAC CCACGGCGTT TTTGGTGCCA GTGAGCTCAA CGATGCACCG GAAAGATTGC AAGAACTACA AGATCTCGAT TACTTGTTAG TGAGCAACTC AGTCAGCAAC CCGAGATCGT TGCCGTCCAA GATTCGCCTT TTAAACGTAG CGCCCTTGCT AGCTGAAGCC ATTGCCCGGG CGCTTCATGA CCAATCTATT AGTGGTATTT TG
|
Protein sequence | LLAGPSSEDL GCDIAHLLGV PANRMDVGQF ADGETRVQVQ DSVRGKNVYI VNSTTSSDSI MELLLLISTL RRASAKRIIA VVPYYGYCRQ DERRQARQPI AAKDMALMME EMGVDRVICM DLHNDSLRGF FSPSVPVEEL SAGVEQKNEA FPKVTIVASH EGQVGRATQF RSVLQRLSGE NIELAVLSKS RMRPGEKQYE PKLVGNVKGR KCILVDDIVN TGTTLVSNVQ MLKQEGADSI YAWATHGVFG ASELNDAPER LQELQDLDYL LVSNSVSNPR SLPSKIRLLN VAPLLAEAIA RALHDQSISG IL
|
| |