Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42255 |
Symbol | |
ID | 7194985 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 668045 |
End bp | 669209 |
Gene Length | 1165 bp |
Protein Length | 349 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183528 |
Protein GI | 219126571 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTACG TTCAGCTTGG GGATTCGGAT CTCGTCGTTT CTAAGGTTTG CATGGGAACT ATGACTTTCG GTGAGCAAAA TACTCTAGAA GAAGGTGTGG AGCAGCTTAC AAGGGCTTTT GATGAATTCG GTGTTAATTT TTTGGACACT GCCGAAATGT ACCCGGTACC AACGAAAGCC ACAACGCAAG GCGCCACCGA CAAGGCGATA AAAATGTTTT TGCAATCACG GAAGCGCGAA GACGTAATTT TGGCCACAAA AGTATGTGGT AGATCCGAGC GCATCAAGTG GTTGCCCCGA CGAGATTCGG AAACGCCGGC CGCTTTGACC AGGGATCAGA TTCTCGATTC GGTAGATGCA TCTTTGGAAC GACTTGGAAC CGACTACATT GATCTTCTGC AGCTTCATTG GCCAGGTAGG AAATTAGTCA CGGATTTCTC GAGTACGTGT ATTTTTCTTC GAATTCTGAA TTATCTCATC GTGTTTATTG CTATTTTTAG ATCGTTATGT CGGCTCGATG TTTGGATCAG GGGATTTTAG ACCGTCGCAG TACCAAGATA ATCCCAAGCC AACAAGTTTT GAAGAGCAGC TTTCTGCCTT GCAGGAGCTT GTGACGACGG GAAAAGTTCG TTACGTAGGC GTTTCCAATG AATCTGCATA TGGCGTGTGT TCCATGGCTG CACTCACCAG GCAGTTTCCC GAGCTCTATC CCAAAATTGT ATCGATTCAG AACAGTTTTT CGCTTGTCGT ACGCAAAGAC TTTGAGGCCG GTCTTGGTGA AGCCTGCTTC CATCACAATG TAGGACTCTT AGCATATTCA CCTCTGGCAG CAGGTACGCT GAGTGGAAAG TACCGCAAAA ATGTTCCAAA AGGTGCTCGC TTGACACTCT TTCCTGGATT TATGGAACGA TATTTGGGTT CTTCTAATGA GGAAGCCGTG AACGCCTATT GTGATCTTGC AAAGAAGGCA GATTTGACGC CGACACAACT TGCTCTAGGC TGGTGCTACC ACAATGAACT TGTAGCGAGC TCCATTATTG GTGCTACAAC CATGGACCAA CTGGAAGAAA ATATCCAAGC CTACGACGTC CGATTAAGCG ACGATGTCAG TAAAGAGATT GAAGCGATTT ACGCAAAGTA CACGGACCCG ACCAAGGCTC GCTAA
|
Protein sequence | MDYVQLGDSD LVVSKVCMGT MTFGEQNTLE EGVEQLTRAF DEFGVNFLDT AEMYPVPTKA TTQGATDKAI KMFLQSRKRE DVILATKVCG RSERIKWLPR RDSETPAALT RDQILDSVDA SLERLGTDYI DLLQLHWPGD FRPSQYQDNP KPTSFEEQLS ALQELVTTGK VRYVGVSNES AYGVCSMAAL TRQFPELYPK IVSIQNSFSL VVRKDFEAGL GEACFHHNVG LLAYSPLAAG TLSGKYRKNV PKGARLTLFP GFMERYLGSS NEEAVNAYCD LAKKADLTPT QLALGWCYHN ELVASSIIGA TTMDQLEENI QAYDVRLSDD VSKEIEAIYA KYTDPTKAR
|
| |