Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46843 |
Symbol | |
ID | 7204694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 603703 |
End bp | 605070 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185742 |
Protein GI | 219121021 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.190085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACATG TCACTTTCTT CCAGGCATGT CACACTCGAG TGCCAATCTT GACCGATGTC ATCAAACCTC TGTTCCGCCG TTACGACGCA GCTTTTTCGG CTTCCAACTC TTTTGTGCTT TTGCCGAGAA CTGTTGCGCT CGGATCTGTA GCAAATGGCG ACGAATGCTC GCAGCCTTCG CAAAAAATGG ATGAGAAGCC AGCAAACACG GTGCTTGCAG TTAGCGAAAA CCCAGAGGGC GTTCAATGGA AACGTCCCGA TGATTCGATG CACGCACATT TCGACTGTTT CTCTGGTGCT GCGGGTGACA TGATGCTTGC TTCGTGTATC GATGCCGCAG GGGATATCGG TGACCAGCTG CTGGGATATG TGGGAGAGTC TATACGCCGC GGCTTCCCGG AGCTTGATGG CGAATTCGAG TTGTATAGGG AACGCGTCTG GAGAGGAATG GGTTCTATCG CTGCCACCAA GATAAACGTG AGCAGTCTCT ACGGCCACAG AGCTGCACCG ACTCCCAAGC TAACAACAAA AACCATAGAT GAGCGCGATT ACAAGGCTAC TGAAATGGCG CCGCATACCG ATCATGATAA TTCCGACCCG TCTTTTAATC AAACTCACCG TCACGTACAC TCTCACGCTC ACGAGCAAGA AGGACACACT CATCATCTTG AGCACAATAC ATCTGCCAAT CAGCATTCGC ATGATCACGA ACATGTCGGC GAACATTCGC ATGTACATAG CCATGAAATA CGAAACAGCT ACTCAAAGGA CCATAATCAC GCGTCAGGAC CACTTCGAAA CCTGCCGCAA ATCCGAGAAA TGCTCCAAAG CTCGTCGGTC GAATTTATTC CTTTGTGGGT AAGGGACGCC GCGATCGAAG CATTTACCGA GCTCGCCCGT GCCGAAGCCA CGGTACACGG TGCATCGGGA AAGGACACGG TGTACTTCCA TGAAGTGGGA GCGGTTGACT CAATTGTCGA CACTGTCGGC ACTCTTCTGG CGTTGCATGC GCTAGGTGTT GAAAGTGTAT CGTGTAGTCG GTTACCCTTG GGCGAAGGAA CCGTTTGGAC AGACCATGGC TTGCTTCCTG TACCAGCTCC GGCAACACTT CTTTTGATGG TTGGTATGCC AACAAGTCCT GGTCCACCAG GTGTTACCGG AGAGCTTGTC ACACCGACAG CCGCAGCTCT TTTGCGTGTG CTGACGAAAA AAGATGGCAT TTCTTCTATC GCGGGTAGAC CTCCCCGGTT CGTTGTCAAT TGTGTGGGAA TAGGTGCCGG AACAAAAAAC TTTCGAAAGC ACCCAAACAT TTTGCGTCTT TTGATTGGTG ACTCCGTAGT GATTGACGAG AGCACAGAAA GGACATAG
|
Protein sequence | MRHVTFFQAC HTRVPILTDV IKPLFRRYDA AFSASNSFVL LPRTVALGSV ANGDECSQPS QKMDEKPANT VLAVSENPEG VQWKRPDDSM HAHFDCFSGA AGDMMLASCI DAAGDIGDQL LGYVGESIRR GFPELDGEFE LYRERVWRGM GSIAATKINV SSLYGHRAAP TPKLTTKTID ERDYKATEMA PHTDHDNSDP SFNQTHRHVH SHAHEQEGHT HHLEHNTSAN QHSHDHEHVG EHSHVHSHEI RNSYSKDHNH ASGPLRNLPQ IREMLQSSSV EFIPLWVRDA AIEAFTELAR AEATVHGASG KDTVYFHEVG AVDSIVDTVG TLLALHALGV ESVSCSRLPL GEGTVWTDHG LLPVPAPATL LLMVGMPTSP GPPGVTGELV TPTAAALLRV LTKKDGISSI AGRPPRFVVN CVGIGAGTKN FRKHPNILRL LIGDSVVIDE STERT
|
| |