Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46734 |
Symbol | |
ID | 7204511 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 318848 |
End bp | 322253 |
Gene Length | 3406 bp |
Protein Length | 894 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185681 |
Protein GI | 219120899 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACCGACAA TGGTTATCCA AATATTTCAC ACTTCCGACG GCGGCCTCGA CCATCTCCAG ATGATACGGT GAGTGTTTGA AGGAACCGCA CGGGAGCACT GGGAACAGAG ACGAATCTGC GAAAGTTCCG ATAAGGACAA ACCGCGGTAT ATCTCAAGTC ACCATGCCAC CGAGCATTGA TCGATCTCTC CATTGGATCG GCACGTGGAC TCTTAGCGCG CATATTCTCC CTCTAAAGAG ACGCCGCGGT CTCTCCACAA TTTTCATTTT CATCGTTTTA AGCTGCTGCA AAATTCAGTT GCAATCCTTG GCAAAGGCCT ACACGGGAAA CGGCAAGCCT AGTCGGAAAC GTTTGATTCC TCCCACTAAG CGTGAACCTC GCTATTTCGC GCAGGAGGCG TTGCTTCCGG GCTTAGCAAA TCTCGATGAG CTACAACCTG AATTTGTGGT ACCAGGGCTA CCCGTTCTGT ACACGAATGA TCCCAAGAGA GTCTCGGACT GGCTTGGCGA CCACGTCGGG CCAAATGGAG GCACGTTAGG TTTTGATGTT GAGGTAAGCG AACCAAGTCA GCCCACAAAA TATCCTCGAC GGCGGTTTCT TTGATTCTTG TAGAGGAGCC AAGCTGCTGA GTTTGTATGC ATGGAGAGAC GAGGAGAAGC AATGATTTCT TACCAGCTAC ACTGCAGGGG TTTAGTCGCC CATTCCCCGT GCCAGAAAAT CGTATACTTG CCTTAAAAGG CAGGACTGAG CTTCTCTTTC CTTCACGCAT GCTATTGCCC GCTGACGTGT CGTGGTAATA CGATACGAAT ATGTGTCTTA CAGGTTGTCT TTTTTTTGGT TTAGTCGGTA CCTGAGATTC CGCATATCCT ACGAAAAGCA ACATTCCGAG GACCTGCCCT TGTGCAACTT GCTACACCAA ACGCAAGCCT CGTCATTCAA CTCGCGCGAA ACAATGGTCG CCACAGTCGA GCATGCATCC CAATACTCGA AGCCGTGCTG GCCGATGAAC ATATCATCAA AGCTGGTGTT CAAGTCGATT TGGATATGCT TGAGCTTCAT CAGAAATGGC ATACTATTGA AGCTCGCAGC CGCCTTGATC TTGGGGGACT TCTCATTTGT GAAGACGACG CAAATCGCCG TCCTGGCCTG AAGCGCCTCG CCGAAAGCGT TCTTGGCGTA AATCTACCAA AGAGCAAGAG TTTGGCTAAG AGCAACTGGA GCCAGGTACC ATTGAGCCCC GCTCAAATAG CCTATAGTGC ACGGGATGCC TGGGCGGGGG CTGCAATAGT GGAAGAGTTG GTCCGCCTGG ATCCCATTAT TTTCGAAAGA GTGTCTTTAG TCGAGCGCCT GCGCTCGCAA CGGCCTATAG CCGAGCTATC GGTGCGTTTG AATAAGCGGA GAGAGGCAAA GAGTCTCCTT TCGTCGTTAT TGGCACCATA CTCGTTCCAG AAAGGATCAC AAGTATCAGT TGACGAGCTT CCGCCTTGGA AGCAAAAGAC GGTCAAGAAT CTGAGGCAAC AGGTACGCGA GAACAAGTGG GACGCGATGG AGGTATTCGA CTTTGAGCCA CTACGATTCA TAAACGGATT TAATAGCACC CTATAATTTC TTTTTGTAAT GTCCAGCACG TTTTTTGCTA CAGAGGTTGT GTTACAGGGC ATTGCAAGTT GAACACCGCT CACAGTAAAT AGCTATGTTG CATCAACGGC AGCGCTTCGA CTGGGAAAAA TTACAGTAGA GTTGTTCCAC TATGTAAATC TACATCATTG CTTTTACGAC AGGCACACGA CATCATTTTG TTGTTGCCAG CAATTATGTC GATTTTCACC AGGGAAAGAT CCAAATACAC CGTCTTATTT TTGCCAGAAG GAAAGAATTG CCATGGGGAT CCCTCAGTTT CTTTCCTACA TTCTAGAGAC GGCTGGTAGA AATGTCGATC TCCAGTATTA TCAAGGTGGC ATTGTTCCTC GTCAAGATGA AAGACAAACA CGAGACGGAG CCGACAGTCA GAAGAGGCCT CTACGGATTG GCATAGATGT AAGCTCCTGG ATCTACAAGG CATGTCAAGG ACACGGGAGC ATGCTCGGCG ACGAACGACA CTTAACGAAT TATGGCCGGG CAGCCCTACT GCAGAATGAA GAGCAACAGA AGACTGGCGA TTTAAATGAC ACAAAAATTC GAGACGCACA GAAAAGGGAA ATGGTTCTGA GATACGTCAA TGCTTGTAGC TTGTACGTCA TCGAGCGATT GCAACGTTTG CAGTCTATGA CAGATGCGGA AATTCTTGTT GTTCTAGATG GTGCAACACC TCCGATAAAG CGTGTTGAAG TTCGCGATCG ATCAAACCGA CGGAAACAAG CTGCCCAGGA TCGTGATCGA CCTGCGGATA CTGATGAAGA TGCTCTAGAT CGACGCTTTA AAGCTTTTCG ACGAGCTGGC GCTGGAGAGT ATTATACCGA CGTTGTCGAA AGTATTTTGC AAGGACTCCG TGCAAAATCG ATTCCTTTTC TTGTCTCACC CTACGAGTCG GATGGGCAGT TGGCGTTTCT AGGAGACAAA GGGTACATCG ATCTAATCGC AACGGAGGAT TCCGATTTAG TGGCATATGG AGTGAAAAGT CCTATTCTTT ACAAGTTGGT CAATTCCCTC GGCGACGAAG CAGTACCTCG AGGAGTTCTT GTGCGACGGG AAGATCTCGG AGCAACAACC GAGATCAACC TCTGCGACTT TACCGCTACC ATGCTGGCGG TATTATTCGT GGCTGCGGGA TCGGATTACT GTAAAAAATT GAAGGGTATC GGCGTAAAGG CGGCTTCATA TATCGTCCGA GCGGCCTTCT ATAGTAAGCG AGAAAAAGGA TGCAGCCCTC TAGAGGTTGT TTTTCGAAAG CTGTACTCCG AGACATGGGA CAAACAGACA CTGACAGATG ATTTCAAACG AGACTACGAG AAAGGCTTTT TGGCAGCTCT CCTTATGTTC CGACATCCTG TTGTTTTCGA CTCGGTCCAC GGAGTTTGCG CTACTATGGG AGATCCGTTA GAAGGAGATC CCCAACTCAT CTCCTACCCT CCCTACGCCG AGCTTTGCCG CGACTCTGAA CGCAGAGCCG CTGTTTGCGG TAATCTGCTA CCATCTCCTC AGGCGCTGTT TGTCGCGGAA GGGTGGCTTT CGGCACGAAC GTTTCGTCCG TATCCAAAGA CTGAAATCCC CTCAAATGTA AAAAAGTGGC TTTGGCAAAA TGAAAAGCAA GCGCGTGTGC CTTCTACGAA TCAAAAGCAC AGCAAGAGGA GAAAGGAGAC AGCCACGGAC GAAGGGGATT TAGAAGGAGA CTTCGAGACC GAGGAGCAAG CGATGACGTG GGTGGACGAA GTCAGTGATG GCGACAGCCC AGTAGCCTGT CTCTAA
|
Protein sequence | MPPSIDRSLH WIGTWTLSAH ILPLKRRRGL STIFIFIVLS CCKIQLQSLA KAYTGNGKPS RKRLIPPTKR EPRYFAQEAL LPGLANLDEL QPEFVVPGLP VLYTNDPKRV SDWLGDHVGP NGGTLGFDVE SVPEIPHILR KATFRGPALV QLATPNASLV IQLARNNGRH SRACIPILEA VLADEHIIKA GVQVDLDMLE LHQKWHTIEA RSRLDLGGLL ICEDDANRRP GLKRLAESVL GVNLPKSKSL AKSNWSQVPL SPAQIAYSAR DAWAGAAIVE ELVRLDPIIF ERVSLVERLR SQRPIAELSV RLNKRREAKS LLSSLLAPYS FQKGSQVSVD ELPPWKQKTV KNLRQQLCCI NGSASTGKNY RKDPNTPSYF CQKERIAMGI PQFLSYILET AGRNVDLQYY QGGIVPRQDE RQTRDGADSQ KRPLRIGIDV SSWIYKACQG HGSMLGDERH LTNYGRAALL QNEEQQKTGD LNDTKIRDAQ KREMVLRYVN ACSLYVIERL QRLQSMTDAE ILVVLDGATP PIKRVEVRDR SNRRKQAAQD RDRPADTDED ALDRRFKAFR RAGAGEYYTD VVESILQGLR AKSIPFLVSP YESDGQLAFL GDKGYIDLIA TEDSDLVAYG VKSPILYKLV NSLGDEAVPR GVLVRREDLG ATTEINLCDF TATMLAVLFV AAGSDYCKKL KGIGVKAASY IVRAAFYSKR EKGCSPLEVV FRKLYSETWD KQTLTDDFKR DYEKGFLAAL LMFRHPVVFD SVHGVCATMG DPLEGDPQLI SYPPYAELCR DSERRAAVCG NLLPSPQALF VAEGWLSART FRPYPKTEIP SNVKKWLWQN EKQARVPSTN QKHSKRRKET ATDEGDLEGD FETEEQAMTW VDEVSDGDSP VACL
|
| |