Gene Information |
Locus tag | PHATRDRAFT_47436 |
Symbol | |
ID | 7202562 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 616571 |
End bp | 618371 |
Gene Length | 1801 bp |
Protein Length | 573 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181598 |
Protein GI | 219122534 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
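The reported gene length follows from the start and end coordinates if both are 1-based and inclusive. The record does not state the coordinate convention, so this is an assumption, though it matches the listed 1801 bp. A minimal sketch of the arithmetic, where `span_length` is a hypothetical helper name and not part of any database API:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp / End bp rows of this record.
gene_length = span_length(616571, 618371)
print(gene_length)  # 1801
```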
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCCC AAATCCTTCC GAGGCTTAGG CTAGTCCACT TTAGCGACGT TTACGAACTC AAAAACCTAT CCAAATTGCA AACATTCCTT AGCCAGCTTT CTCCACCGCC GGTCGCGCTT GTGTTGGGTG GAGATTTTCT GTCGCCGTCG ACCCTTTCCG CTCTTGATGG TGGGAAAGGT GCGTTTATAC CGTACGGAAA GCTCACGTGA CATCAAAACA CGCATTTGTG TTAAAGGTAT ATTGACTCTC ATGGTTTTGG GCTCTGCAGG AATGGTCTCT ACATTACGAG CGTTAGGTGT CACACACGCC ACTGTCGGGA ATCATGAAGC CGACTTGAAG GTTTCCACAC TAGAGAAACG GCTTCGAAAG CTAGCCAAAG GAGCCAAACT GATCAATAGC AATATTAAAG ATGTTCCAGA TGCTCCCTGG TTTAAAGAAG TGTTCCATCC TTGGAGTGTT TTGCAAACAC CGTGCCGTAA GGTTACCGTT GCCATGGGTG GGTTTCTATC TGACGAGGAG GGAGTTTTTC GGGATAATAC TTTCAAAGGT GCACATATTG GTGATGTCAC AAAAGCGTTC GACCGCGTGT ACCGTGAAAG TGTCCTGAAT GGGCCTGCCG ATGTGTTTCT ACCCGTCACG CACCAAACAA TATATCGCGA CACGTTGTTC GCCAAACACG TCTTGCAGAC GCAGCCAGGT CTTACGCTCG TATTGGGAGG CCACGAACAT TCTCACTACG ATTTGATTGT AAAATCAGAC AACGAGCTGT TCCCCGATCG AAGCGCTCGA ATTCTCAAGG GTGATTCCGA TGCTACCAGT GTGAATTTGG TTGATTTGAC GTTTGATATC GTAAACGATC AGTTCCAGAC CGTCGAGATC GATGCCTCTT TAATTGACAT AACAGCATAT GAGCCATCAA AAGTTGTGCA AAGCATTGTC GATTCCCACA TGTCTTTATT GGATAGTTTG GATCACGAAA CCATCGTGAA CGCGGACAAT TTATTGCCAC CCGGTACACC GTTGAGTTCG GAACGTACCA GATACAGCCA GACCACCGTT GGAAGTGTAC TGTGTCAGGC AATAAAAGAG GAGCTAGAAG TGGACGTGGC TGTAATAAAT GGTGCCACCA TTAAAGGAAA CAAAAGGTAC GAGGGGTCTG TAATGAGCTA TGCGCAGCTT AAAGAGGAGC TACCTTTTCC TACCAAAATG GTAGTAGTTC CCATGAAACG ATGGGAGCTC CAGCAAGCCA TTTACTATTC TCGCACAAAC CCTCCTTCAT CAAACGATAT CGAAGAGCCG GAACGGAAGG GATTTTTGCA GGTGGATGTG GATTTCGACC GGCTTGGGTC ACATACAGGA GGACAGGAAG ACGAACTTCT TGTCGCTCTC CCGCGCAATC TACTAAAAGG CTTTTGCAAG ATTTTACCGC TCATGGACAT TGGTGATCGC CTAGAAAAGG TGGGAAGCTT AGATTTGGAA AACCACAACT TTGTACCAGC TATTGATCTT GTGGTACGAC ACTTCTGCAA GGAACAATGG TACGAGCTGG TACACGACAA TACCTTTGAG AGCTTGGATA GAGGAGACAA AGGATTCTTG TCACGAGAAG ATGTGAAAAC TATGATGCAC AAAGCCTTGG GTCATGAACC TGCGGCCTTT CTGGTAGACG ATATGATATC GTCTATCGAC GCAGACAACA ACGGTGTGAT TGATCCCGGC GAGTTCAGTC ACCTGCTGGC ACAGATGGAA CGGGATCATG GTATGATAAA GTTTGACTAG CGCAGACAGT TTTGATTGAT T
|
Protein sequence | MSSQILPRLR LVHFSDVYEL KNLSKLQTFL SQLSPPPVAL VLGGDFLSPS TLSALDGGKG ILTLMVLGSA GMVSTLRALG VTHATVGNHE ADLKVSTLEK RLRKLAKGAK LINSNIKDVP DAPWFKEVFH PWSVLQTPCR KVTVAMGGFL SDEEGVFRDN TFKGAHIGDV TKAFDRVYRE SVLNGPADVF LPVTHQTIYR DTLFAKHVLQ TQPGLTLVLG GHEHSHYDLI VKSDNELFPD RSARILKGDS DATSVNLVDL TFDIVNDQFQ TVEIDASLID ITAYEPSKVV QSIVDSHMSL LDSLDHETIV NADNLLPPGT PLSSERTRYS QTTVGSVLCQ AIKEELEVDV AVINGATIKG NKRYEGSVMS YAQLKEELPF PTKMVVVPMK RWELQQAIYY SRTNPPSSND IEEPERKGFL QVDVDFDRLG SHTGGQEDEL LVALPRNLLK GFCKILPLMD IGDRLEKVGS LDLENHNFVP AIDLVVRHFC KEQWYELVHD NTFESLDRGD KGFLSREDVK TMMHKALGHE PAAFLVDDMI SSIDADNNGV IDPGEFSHLL AQMERDHGMI KFD
|
| |
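The sequences above are printed in space-separated blocks of 10 residues, so the spaces need to be stripped before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction as (G + C) / total bases. The record does not say how its GC content figure was derived, so this definition is an assumption, and `clean_sequence` and `gc_fraction` are hypothetical helper names.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (assumed definition)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence above, used only as a short demo.
example = "ATGTCTTCCC AAATCCTTCC"
print(len(clean_sequence(example)))
print(round(gc_fraction(example), 2))
```

Passing the full gene sequence string to `gc_fraction` would give a value to compare against the GC content row, and `len(clean_sequence(...))` can be checked against the Gene Length and Protein Length rows.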