Gene Information |
Locus tag | PHATRDRAFT_20608 |
Symbol | |
ID | 7201190 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 664512 |
End bp | 667792 |
Gene Length | 3281 bp |
Protein Length | 830 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180684 |
Protein GI | 219119866 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGATG TTCCATCTCG TCGGACTAGT CGAGAAACAA GACTACACGG CGGAGGCTAC AAGTGCCGAA ACCCCGCGAC AAGCTGGTAG TATTTGAGAT ACTCATCTCG CACAAGGCGA AGCGAGTCCT CCTTCAGTGG TCGATTGTGT CATCCGTACA AGTTTTCGTC GGCTCGTCAA CTCCATCTAA AGTTGCCTTT CCGTCGGCGT ACGCAACGAA ACCCATTATC CAATGGCATT CAATATGAAG TTTCTCGCTT TGTTTCTCAC GTTCTTTCAA GGCGAAGAAT GTTGGGCTTT CCATCCGATT GCTTCCTTTC GTTCAGGTCG TAGCGGTAGA AAAGGGGTAG ACGGCTCAGC CTCGACATCG TCACGTTGGG CGTCCGACGG TTCCGGGACA GCCTTTTCGA TTGAATCGGA CCACAGCGGA CCCGCCGGTT TCCCCGTACA CCGCGTCATA TTTCACTCGC TGCCCGGTCA AGAGCAGGAT GTTCCTCCTA TTTGCATTGA AACGGGAAAG ATCGGACGAC AGGCGGCGGG TGCTGTCACA TTGACTCGCG GTGATTCCGT TCTCTACGCG ACAACGTCGC ACGACAAGGA CCCCAAGGAA GAAATCGACT TTTGTCCCCT CAGTGTGGAC TACCAGGAAC GATTCAGTTC CGCCGGGCTC ACATCCGGTG GGTACAACAA ACGGGACGGA CGTCCCGCTG AACACGAAAT TCTTACCTGT CGTCTCATTG ACCGACCGCT CCGGCCATTG ATTCAGTCGG GATGGCGGCA CGAAACGCAG CTTCTCTCCT GGGTCTTGTC CTACGATGGG GAACGGTCCT GCGATCCTCT TGCGATCATT GCTTCGGCCT CGTCGCTCTT CATTTCCGAT GTACCCCTGC ACAAACCCGT GGCGGCGGTG CAAGTGGGCA TGAAGGATGA TGGCACGTTG GTGGTGAATC CTACAAATCT ACAGATGGAG ACGAGCAAGC TCAACCTGAT GGTGGCCGGT ACCGAAGAAG CAGTTTTGAT GATTGAAGGA GCCGCCGACT TTTTACCAGA ATCCACGATG ATTCAGGCCG TGAAAACGGG CCATGAGGCG ATTCAGGTCC TATGTCAAGG GCTGACCGCG CTAGGAAAAG TCGCCGGAAA GGAAAAGAAG CTGGATACTA TTAAAGCTAC TCCAGAGAAC TTGCAAATTC GAGTAGATGA ACTTTTCAGC GATCGTATCG ATGATATGTG GTCGTCTGGT CTGGGCAAAG AAGCCCAAGG GAAAATCATG ACTGATTTAT ACACGGCCGT AGTCGGCGAG CTTATCGAAG ACTATCCTGG CGAAACGGTT GCCATCAAAG GCGCGTTTAA AGATCTGTTG TGTCGACGCA TGTTTTTCCG TGCCCGAGAA GAAGGGCTTC GTTGTGACGG TCGTGGACCG ACCGACATTC GTCAGCTTAC AATGGAGACT GGGTTGCTGC CGCGTGTGCA CGGGTCTGCC CTCTTCACAC GAGGTGAAAC ACAGTGTGTC GCCACGACAA CACTGGGTGG TTCCGGCATG CGACAAAAGA TTGAAAAACT GGACGGAACC GATGAGAAGC GCTTTTATCT GCAGTATACG TTTCCGCCTA GTTGCGTTGG TGAGACGGGC CGAGTGGGAG CTCCGGGACG TCGAGAAGTC GGACACGGTA ACCTGGCGGA AAGGGCCCTG ATTCCAACCA TACCAGCACT AGCTGATTTC CCTTATACAA TTCGGGTAGA GTCTCTCATC ACTGAGTCGC ACGGTTCCAG TTCCATGGCC AGCGTCTGTG GCGGATCACT 
GGCTTTAATG GATGCCGGTG TACCCATCAA AGCACCCGTG GCCGGTATTG CAATGGGAAT GCTGCTAGGC GATAAAGGCG GCGTATCCGA CGAGAACGCT GTAATACTTT CGGATATTCT TGGAACCGAG GATGCACTTG GTACTATGGA TTTTAAGGTT GCAGGTGATC GAGTTGGAAT ATCCACTTTT CAACTGGATA TCAAGTGTGA AGGTTTGACT TTGGAAACCA TGGAAAGCGC ACTGGAACAA GCACGCACGG GCCGCTTACA TTTGTTGTCG GAAATGGAAA AGGTAATTGC CTCACCGCGA GAGGAGCTCC CCGCAACTGT CCCAAAAATG ATGTCGTTTT CAATTCCTGT TGAGGCCATT GGTAAGATTA TTGGACCAGG TGGTAAACAG ATTCGTGCTA TCATCGAAGA TTTTGAGCTT GTAAACATGG ACGTCGGTGA AGAGGGAGGG GTACAGTTAT CCTCGTTTGA TACAGCTAAA ATGGGAGAAG CCCAGACCTT TATTACTACT TTGGTTAGTA GTGCCGGACG AAATGGACGT GGGCCAAGAG AGGAACGTCC AAAGTATGAA GGACCGGAAC CCGTTGAAGG TGAAACCTAC ACCGGAAAAA TTACTGGTAT TCATCCGTTT GGAGTTTTTC TTGAGATTTT GCCCGGTGCC GAGGACGGTT CTTACCCGGG TCTTGAAGGA TTGGTTCATG TCTCGGAGCT GGCCCACGAG CGTGTTCGAA ACTGCGAAGG TTTCATGAAG AGCATGAATG TTGAAGAACT GACGGTCAAG TATCTCGGTA AAGACAAGGG TAAACTGCAG CTTAGTCGAA AGGCTCTACT CGAAGAACAA GGTGGAGACG GAGGCCGAAG AAATGGTTCT CGAGGACCGA GTCGAGAAGC AGCCGCGCCA ACTCCCGAAA TGACAAAAGA TGAGATCGAC GTGATTGCGC AAGCTATTGA GGGTGTAACA GAGCTATAGA TTTCTTAGGC TTCGACTCAA GCTATTCCTC TTCGAAGTGA CCGTCCAGTA CATCCAGTCC CATTTCGTTC AAGATGTTTC GAACTGGACT CGGCTTTCCG TGTCCAGCTT CTAGCGATTT CAACAAGTTT GTCAGAATCT CAGCGTTGTG ACGGACACTC TCATCTTGTT GGACATCTTT ACTTCCATTT TCGGAGATCG GTCGGATTCG CAACTCGTTG TCCATAGCGC TCATGACATC GGTCATGGCT TTATCAACGC CATCGCCACC TTCCTCATCA TCTAACTGGT AGTCGTCTTT CGAGAAAAAC AGATCCTTCT CGATTCTCGT AAAATCGAGG TCGTCTGCAC ATTCTGCTTG CAAAGTTGAG TGCAAAATGT TCAAGAACAC AGTAGGATCA ATGCGCAGTG GACGAGTCAA TCCAATTTGA TGCGAAACAC CTTCAATCGT GCTTTTGCCG TGCATGAAAG ACTGGACTCC GCTCAGCATA TCGCTTAAAG C
|
Protein sequence | MAFNMKFLAL FLTFFQGEEC WAFHPIASFR SGRSGRKGVD GSASTSSRWA SDGSGTAFSI ESDHSGPAGF PVHRVIFHSL PGQEQDVPPI CIETGKIGRQ AAGAVTLTRG DSVLYATTSH DKDPKEEIDF CPLSVDYQER FSSAGLTSGG YNKRDGRPAE HEILTCRLID RPLRPLIQSG WRHETQLLSW VLSYDGERSC DPLAIIASAS SLFISDVPLH KPVAAVQVGM KDDGTLVVNP TNLQMETSKL NLMVAGTEEA VLMIEGAADF LPESTMIQAV KTGHEAIQVL CQGLTALGKV AGKEKKLDTI KATPENLQIR VDELFSDRID DMWSSGLGKE AQGKIMTDLY TAVVGELIED YPGETVAIKG AFKDLLCRRM FFRAREEGLR CDGRGPTDIR QLTMETGLLP RVHGSALFTR GETQCVATTT LGGSGMRQKI EKLDGTDEKR FYLQYTFPPS CVGETGRVGA PGRREVGHGN LAERALIPTI PALADFPYTI RVESLITESH GSSSMASVCG GSLALMDAGV PIKAPVAGIA MGMLLGDKGG VSDENAVILS DILGTEDALG TMDFKVAGDR VGISTFQLDI KCEGLTLETM ESALEQARTG RLHLLSEMEK VIASPREELP ATVPKMMSFS IPVEAIGKII GPGGKQIRAI IEDFELVNMD VGEEGGVQLS SFDTAKMGEA QTFITTLVRP EPVEGETYTG KITGIHPFGV FLEILPGAED GSYPGLEGLV HVSELAHERV RNCEGFMKSM NVEELTVKYL GKDKGKLQLS RKALLEEQGG DGGRRNGSRG PSREAAAPTP EMTKDEIDVI AQAIEGVTEL
|
| |
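The coordinate and sequence fields above can be cross-checked with a few lines of code. The following is a minimal sketch, not the database's own method: it assumes Start bp and End bp are 1-based inclusive coordinates (which is consistent with 664512, 667792 and the reported 3281 bp), and that the spaces in the sequence fields are only display grouping. The function names `clean_sequence`, `gene_length` and `gc_percent` are hypothetical helpers introduced for illustration.

```python
def clean_sequence(grouped: str) -> str:
    """Strip the whitespace that splits a sequence into 10-letter groups."""
    return "".join(grouped.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the record above.
    print(gene_length(664512, 667792))  # 3281

    # First two groups of the gene sequence, used only as a short demo;
    # paste the full Gene sequence field to check the reported GC content.
    fragment = clean_sequence("ATGATTGATG TTCCATCTCG")
    print(len(fragment), gc_percent(fragment))
```

Because the gene is marked as spliced, the protein length is not expected to equal the gene length divided by three, so this sketch deliberately does not compare the two.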