Gene Information |
Locus tag | PHATRDRAFT_32202 |
Symbol | |
ID | 7196856 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1939057 |
End bp | 1940579 |
Gene Length | 1523 bp |
Protein Length | 485 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176874 |
Protein GI | 219110245 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.334711 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGAGG TTGGGGTGGA TCCGTTCGAG TACACGTTTG CGACAACGCA TACTTCCGCG GATCTCAACA AACTGTACGA ACACAAGTTG GAGAACGGCG AAGAAGATGA AAGTGCCAAC GTTTCAGTCG CTGGTCGTAT CATGACAAGG CGTGTGTTTG GAAAACTGGC ATTTTTCACA ATGCAAGACG AAATGGGGAT GATTCAGCTT CAGTTTGATA AGGGTCGTCT TGGAGATACG TTCAAGGTAA GAAGAAGTAC GAAAAAGTGT TGTGAATTGC TCCAGTTCCT CATGGCTATT GCGGATCATA GAATCTTAAA GATTGGACAG ATGGCAGTGA CATAATTGGC GTGAGGGGAA CTATTCGTCG TACAGACAAG GGCGAACTTA CAGTTTTTGC AACCGAGTGG AAAATGTTGA CCAAGTCCAT TCTACCTTTG CCCGATAAAT ATCATGGCTT ACAGGATGTG ACAAAAAGAT ACCGCTCTCG TCATCTGGAT ATGATCTCAA ACCCCGGCGT TCGGGAGACC TTCCGCAAGA GAGCTTTGAT CACGTCCAAG CTGCGACGGA TGTTGGATGA CAAAGGATTT CTTGAAATTG AAACACCAAC CTTGCATACT CAGAGTGGAG GCGCAGAAGC AAAGCCTTTT GAGACTTATC ACAACTCGAT GGATATGCAA TTGACTTTGA GAATAGCGAC AGAGCTCCAC TTGAAGCGCT TGATTGTTGG AGGCTTCGAC CGCGTCTATG AGGTCGGTCG TATTTTCCGC AACGAAGGTA TATCGACACG ACACAATCCT GAATTCACTT CTGTAGAATT GTATCAAGCA TACGCCGATT ACGATGACAT GATGAACTTA ACTGAAGAAT TAGTATGCAC GATTGCGGAA GAGGTCTGCG GGAGTCTATC GATTCCTTAT GGCGAGCACA TGGTTAGCCT AGAGCGTCCT TGGCGTCGGG TCACTATGCA TGATATTGTC AAAGAAGAAA TGCCTGATTT CGACTTTTCT GCACTAGATT CCCAACTCCC TGAGTCTCTG AATACGGCCA AAGCAGCGGC TATAGCTGCT GGTGTTCCCA ATGTCCAGGG ACTCAATACC ATTGGCTATG TCCTCAACGC TTGCTTCGAG GAGCTTTGTG AGCCAAAACT AATCCAGCCA ACTTTTGTAA TAGACTACCC TGTGGATGTT AGCCCCTTGG CCAAACGCCA TCGCAACAAA CCGGGGCTGA CTGAACGTTT TGAGCTTTTC GCTGTTGGTC GAGAACATGC AAATGCGTTT AGTGAACTGA CAGACCCGAT AGACCAACGT GAGCGATTTG AAGCACAGGC TGCCAAAAAG GCAGCTGGGG ATGAAGAAGC ATGCGATGTA GACGAGGACT TTCTTCAAGC TCTTGAACAA GGAATGCCAC CTACAGGAGG ACTGGGTATT GGAATAGATC GGCTTGTAAT GCTGTTGACA AACTCACCAT CTATTCGAGA CGTTATTGCG TTCCCCTTAC TCAGACCCGA CTCGTCCACT TAA
Protein sequence | MREVGVDPFE YTFATTHTSA DLNKLYEHKL ENGEEDESAN VSVAGRIMTR RVFGKLAFFT MQDEMGMIQL QFDKGRLGDT FKNLKDWTDG SDIIGVRGTI RRTDKGELTV FATEWKMLTK SILPLPDKYH GLQDVTKRYR SRHLDMISNP GVRETFRKRA LITSKLRRML DDKGFLEIET PTLHTQSGGA EAKPFETYHN SMDMQLTLRI ATELHLKRLI VGGFDRVYEV GRIFRNEGIS TRHNPEFTSV ELYQAYADYD DMMNLTEELV CTIAEEVCGS LSIPYGEHMV SLERPWRRVT MHDIVKEEMP DFDFSALDSQ LPESLNTAKA AAIAAGVPNV QGLNTIGYVL NACFEELCEP KLIQPTFVID YPVDVSPLAK RHRNKPGLTE RFELFAVGRE HANAFSELTD PIDQRERFEA QAAKKAAGDE EACDVDEDFL QALEQGMPPT GGLGIGIDRL VMLLTNSPSI RDVIAFPLLR PDSST
| |
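The length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative and not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates (which matches the reported 1523 bp) and that the gene span runs from the start codon through the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int) -> int:
    """Coding nucleotides for a protein: one codon per residue plus a stop codon."""
    return (protein_aa + 1) * 3


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character groups."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a (possibly space-grouped) sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(1 for base in s if base in "GC") / len(s)


span = gene_length(1939057, 1940579)   # Start bp, End bp from the record
coding = cds_length(485)               # Protein Length from the record
print(span, coding, span - coding)
```

Under these assumptions the span is 1523 bp and the coding portion is 1458 nt, leaving 65 bp of non-coding (intronic) sequence, which is consistent with the record marking the gene as spliced. Passing the Gene sequence field to `gc_percent` gives a value to compare against the reported GC content.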