Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_43865 |
Symbol | |
ID | 7204283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 254402 |
End bp | 258492 |
Gene Length | 4091 bp |
Protein Length | 1224 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186028 |
Protein GI | 219112889 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0745753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCTTTTGTA CCAATCTGCA TGTTCCATTT TTTTGTACTG GTAGACTTGG TTCTGTGGAC CCCATCCCCG GTCACCGGCT ACATCCTACC AGCCGCACGC CCGCACACTC GCTCCTCCGC CGGTCGACAT CGAGTGGAGC GAACGAATAC ACGCTACTCG TGCCTTCCGG CGTCTTCCAG GGCCTTGTCG AAGTCGCGTG TCACTGCCGA CATGATGGTA CTGGAAGAAA AGAATATACC AACGTCGGAA CGCAACGAGG CGTCCAACGA GGTTGCGGGA CGATTGTGGA CGTACGCCTC TTATTTACAA CGGGTACGCC GGCAAGAACA ACAGCGGGAA ATCTCGAGCG ACGTTCGACC CCAGAGTAAG GCGGCGACAT CGCTGGCCTA CTTTTTGGAA ACCAATGTTT GTCGAGACGC GGACGACGAA CGGGAACCAT TGTCGCGTGC ATTTACGCAA GCAATTCACA TCGCGGCGGA TGCGCAGAAT TATAGACTCA TTCTAAGACT CGTCGACGCT GTGTTGTATT ACGCAAAGGC GGATAGTGAC CCACAGTGTG TGGTAGACCC ACGTATCGTT GGTGAAGCAA TTCAGGGCAT TGCGAGGACT CAGGCCAGTG TTGGGAAAAT TAAGACGCTT TGGAAACGTA TGTACGCACA TCCGGCCTTG GCGGCACCAG TCGGTTCGCG ACAAGTGAAT GCCATGTTGC AGGCACTGGA ATCGCGGGGC AAGAGTAGAG CCGTGGTCGA TTTGTATCAC GAAGCATCGT CTCAATCCAA CGCCACGGAC GCGTACAGTT TGAGTATCGC TCTCAATGCC TTGACCGCGT CCGTCACGGA CGATCAAGCC CCTGTCAAAC CGGAACCTCT TCCGTCTTTG TACCGGAGTC GCTTTGCGGT TTGGCAAAGT ACTTGTTGGC AGTGGAACGA AGCTCTAATA CTACTCGGGC ATGGACTAGA GACAAATAAG GTCAACAATC CCGTCGTTTC GGCGCTTCTC CAGTTGAACG ACCGGGCTTC ACGAGTAGTC ACGGCACATC AAGGGCCCCA GTTGGCGCTT GCCGTCTTGG ATATTATGAG AGAGCACAAT ATTGTACCCG ACACGGTCAC TGTAACGTTG GTGCTGTCAA GTTTGGGCCG CGAATGGGAA ATTGCACTGA CCTTGTTGGA GTCAATGAAA CCCTGCAACT CAACCTCAAC GCCGTCATCT TCCTGGAGTT TACCGGAACC CAATGAGTTT TGCTACTCTG CAACAATGGC AATTTGTGCT CGGTCTCGCC AATATCAACT CTCCTTGGAG TTGTTGAATG ATATGCGCAG AAATTCCGCC CTGCAAATTA ATACGGTTGT TTACAATAGC GTCCTACAGG CACTGGTCGG GAACAACAGG AGACGCGAAG GCAAAGTAAA AAAGGCGCAA CGCAAAAAGG CGCAAGACCG AGTACGCGCA ATGTTTCAGA TTCTCGGCCA TATGGAGGAG GACATACAAA GGCAATTGGA CACGCGACCG AATACAAACA CGTACAATAC CATCCTCTTG ATTCTAGCTA CTTCCGGGAA ACTCATGGGT GAGCGTGAAT GGGTTGACAT ACGCGAGACC TTTCCAGCTT ACTTTCATGA TGATGACGAC TTGGTAGCTA CCTCCAATCC AGGAAGCTTG GCTCGGTACT TTTTGAACGA CATGGCTGCC AAAAGTGTAG AACGGAGTGC ATTGACCTAC CGCAATGCCA TTCTTGCTTC CTCAGCTAGT CACGTCGAAC TGGTGCTAAA TATGTTAAAA TTGGCCGAAG CGGCTATTCC CGCTTTTGCC TCGGATCGTG GGAGTATTTA TAACGCGGCC TTGACAGTTA TGGCTAATGC GGGATACGCG AAAGAAGTTC TAGCCTTGTT TTCTAAGATG ACAAATATCG GTGTGATACC AAATCGCGAA ACCACAGGTG CTATCATCAC AGCGCTTGCA AGAGGCCAGC AAACAAAATC GATTCCTTCT TTTCTGTCGC ATTTGGCGGG GATGCGGAAA GAAGGGAAAG TCACGCTGTT TGCTGGGGTT ACATTGGACC TGACTTCTCT GCCCCATAAT TCACAATCGG ACTTTTCACT AGCTCTATCG CTGTGCCTGA CCCGAAATGA CTACAAAAGC GCGAAAGATA TTCTCGAAGT CATGCGAACA GTCGGTTGTA TTCCAACCGC AGAATCGCTG GAAAGGATTG CAATTGCGTA TGCTCGGGGT GCCATGACAA AACCTTTTGG TGCTCGGAAC TTTCCTGATG GACAGGCACA CCGACTGAAG GAATCGAGGG CGAAGGCACG CAATGCATAT GATCTGACGG TTGGCATGGA AGGCGCATCG CTTGAGCTAC TGGCAATTGT TTCTAAAGCA TGTGCCCATG TGGGATACTT CGACGAATCG ATGTTCCTTC TTCGGGAGAT ACACAGGATG ATTTTAGCTT CACAATCGTT GTCAAACGCG CTTCCGAACT GTGGACTCCA TTCTTCTAAC TTTAGTCTAG AGGCCGTACA CCGAAAGATC CTTCATTCCT GTGCCAACGC CGGGAACGTC ACTGCTGCAC TAGATTTTGT CGGAAACATC CAACAAACAA GTCGCAAGCT GAGGAGAATC AATGGCAGGT CCCAGGAGCT ACCTTTAGCG AAAGGCATCA AAATGGACCG TTTTACATCG AAATCATATT CATGCAAAGG AGAAACGATG TCAAAAGTAC AGCTTGGAAT GCAAGCAGAA GATTGGAAGT CATTGATAAT TGCCGGCTCA AAGTCTGGGC ATTGGCGTGT CTGCTTGAGC ACTCTACAAT TTTTGCAACC TTACTTGGAC AATATCCGTC CGTCGAAAGT CTCGGACGAA TACGGTTTGA ACAGAATGGA AAAAGAGTAC GAGTCGATAT CTAAAGCGTT GACCTGGGCG GTGAAGTGCA TGTCTGTCCG ATCACAGTAT GGATGGGCAA TAAGGGCCAT TCAAGACTGG CTGGAATGGA CTGATCGGCG TCCTCCGAAG GAGGCTGTTT CTGCCGCAGT TAGAGTTCTC AGTACACGGG GACGAGGCGA TGAAATTATT TCACTACTGG AAAAGTGTCT ATCTATACCA TCTACAGGAC TTTCAGCTAC GAATTCCTAC GAAGTGGGAA TATATGTTGA GGCAATCACC ACACTCTACA GAGACGGGTT ATATGAGTCT GCCGACGATG CGTTTATCAG AGCTGTTGCA AACGGAATCC TCCCGCTTCA GCTAGAAAAC TGCGATACGC GTGGCAATCG TCGCATAGTT TTGGACTTAC ACGGAATGAA CGTGGCTGTC GCCCATTCTG CCGTTCGAAT TGCTTTACAT CAAGAAATTC TGACAGCGAG CTGGAATCAT TCGACTGTCG CGAGCAACGA ATTTGTCATC GTTACCGGTA GAGGGCAAAA GTCGGCATTC AAAATGAGGC CAGTACTTCG TCCGGAAGTC CAGCGTATGC TTGTAGAAGA ATTCTATCCT CCTTTGAGCA CTTTGTCAGT ACCTGGAAAC TTAGGAGCGC TTACTATACC GCTCGAGGAC ATATGCAGCT GGTTGAACCA TCAACGTGTA CAAAAAGGAG CCAGAATGAT GAGTATTGCC GCTGTTCTCA GAAATCTCTC TTCGGGCAAA AGGCTTCATG CTGCACTGTC GAAGGCCAAT GAACTAGACC CTGGTGGGCC ATAAGCGATT TATGACGTGG AGTGATTTGC TCTTTCACAT GCCCTTGCCC TTAATCTTGT GTGCAGGCAA AGGCAACACT CAAATGCCTC CTTTGTTGGT TTTCGTGAGC TTCTTCTGCT CGGTATTTAA ACAGCAGTCA GTAATGTTGC CGACCCCTCG GTACCACATG CATACAAGGG CCCACTCGCT AAAGAAGGGA AGATCCAGTT CTACGGCTCG CCATGACCGA CGCAACATTC AGTAGTCACC AATGCAGCAT CTCGATGATA TGGGGTTTTA GGATCAATCC AAATCTGTCT TGTAATAGTG CAGAATGGTA TTGGATTCAA AAAGAATCGA AGCCGACTGC ACTGTCAGAC TTTATAGTCA AGATATCATA GCGGACTTAC ATTAGTTGTT C
|
Protein sequence | MFHFFVLVDL VLWTPSPVTG YILPAARPHT RSSAGRHRVE RTNTRYSCLP ASSRALSKSR VTADMMVLEE KNIPTSERNE ASNEVAGRLW TYASYLQRVR RQEQQREISS DVRPQSKAAT SLAYFLETNV CRDADDEREP LSRAFTQAIH IAADAQNYRL ILRLVDAVLY YAKADSDPQC VVDPRIVGEA IQGIARTQAS VGKIKTLWKR MYAHPALAAP VGSRQVNAML QALESRGKSR AVVDLYHEAS SQSNATDAYS LSIALNALTA SVTDDQAPVK PEPLPSLYRS RFAVWQSTCW QWNEALILLG HGLETNKVNN PVVSALLQLN DRASRVVTAH QGPQLALAVL DIMREHNIVP DTVTVTLVLS SLGREWEIAL TLLESMKPCN STSTPSSSWS LPEPNEFCYS ATMAICARSR QYQLSLELLN DMRRNSALQI NTVVYNSVLQ ALVGNNRRRE GKVKKAQRKK AQDRVRAMFQ ILGHMEEDIQ RQLDTRPNTN TYNTILLILA TSGKLMGERE WVDIRETFPA YFHDDDDLVA TSNPGSLARY FLNDMAAKSV ERSALTYRNA ILASSASHVE LVLNMLKLAE AAIPAFASDR GSIYNAALTV MANAGYAKEV LALFSKMTNI GVIPNRETTG AIITALARGQ QTKSIPSFLS HLAGMRKEGK VTLFAGVTLD LTSLPHNSQS DFSLALSLCL TRNDYKSAKD ILEVMRTVGC IPTAESLERI AIAYARGAMT KPFGARNFPD GQAHRLKESR AKARNAYDLT VGMEGASLEL LAIVSKACAH VGYFDESMFL LREIHRMILA SQSLSNALPN CGLHSSNFSL EAVHRKILHS CANAGNVTAA LDFVGNIQQT SRKLRRINGR SQELPLAKGI KMDRFTSKSY SCKGETMSKV QLGMQAEDWK SLIIAGSKSG HWRVCLSTLQ FLQPYLDNIR PSKVSDEYGL NRMEKEYESI SKALTWAVKC MSVRSQYGWA IRAIQDWLEW TDRRPPKEAV SAAVRVLSTR GRGDEIISLL EKCLSIPSTG LSATNSYEVG IYVEAITTLY RDGLYESADD AFIRAVANGI LPLQLENCDT RGNRRIVLDL HGMNVAVAHS AVRIALHQEI LTASWNHSTV ASNEFVIVTG RGQKSAFKMR PVLRPEVQRM LVEEFYPPLS TLSVPGNLGA LTIPLEDICS WLNHQRVQKG ARMMSIAAVL RNLSSGKRLH AALSKANELD PGGP
|
| |