Gene Information |
Locus tag | PHATR_43975 |
Symbol | |
ID | 7204389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 591009 |
End bp | 594757 |
Gene Length | 3749 bp |
Protein Length | 1210 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186378 |
Protein GI | 219113589 |
COG category | |
COG ID | |
TIGRFAM ID | |
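The Gene Length field is consistent with the Start bp and End bp fields if both coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic follows; the function name `span_length` is hypothetical and only serves the illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the Gene Information table above.
print(span_length(591009, 594757))  # 3749
```

The Strand value does not enter this calculation; it only indicates which strand of the replicon carries the gene.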
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACTG CTCCACGTAG AAAGCGGAAA TTCCCGTTGC CCATTATCCC AATCGACCAC ATCCGTCCTC TGCTGTGTCC CGATCAGCTG AATCTACGGT ATCCTGTCCA TGGCGCTGAA CTTGGCACAC TACTCGAGGC AGAGGCGCTG TTTAGTGTAG AAGGAAACGA TATCCTGTTG ATGCCTAATG AACCAGCAGC GAAAGCTCGA AAAGCTTCGC ACAATCCTCA ACAACAAAGT CCTGAGCTCG TAGACTCAGA CAGCTTACGA TCTACGGTAA TGCCCGACCA AAGTCAACGC TCGACGGTCG ATTCCCTATC ACTTAGATCG AACAAAGATA CTGAATGTTT GCGGGAAAAG AGTTCCGTAG TCGACTCTGT TGGAGAAAGC ACGATTGCTC GACAATCGCC CAACATAAAT GTTGGAATCC ATGCTGCAGA ACGAAACGCA TCCAGCAATA CGAACACTAA TTCTCTTCCT GCACATACAA GTGATGCCGA AGAGTACGGC AACTCGCAAT TGCCTCCAAA TTCTCCCGAC GATGCTGCCG TCTCTGAGCT GGAGCAAGCC TGGATGGGAA AGCTCGATCG CGATATTTCG GTGGCCGAGC AAAACACGAG TACGAGTAAG CTTGTCGATC GAATAGCTCA AGCTGTAAAA AGTGGCGTAC AGAAAAAAGT ATTTATATCC AGAAAAGAAA GCTGCGGATC CTCAAACTTG GACTCTTCTT TGGCAGCTCC TGGCAATAAC CGACGCCATG ACGAGCTGTC GAACGACAAG TCTTCCTCTG CCTTAGAAGT TATCGATCTA ACTGGGGAGC TCTCAGATTG CGGGGACGAC TGTGAAGTCT TGGATTTTCC GGAACGTTTG TCTTTGACAG TACCAATTGA TGCAGCCGAA CAAAGTAGAA CAAAGAAACG ACCTTTGGAC CTCAAAGTGG TGCGAGCTAC GAAATTGGTG CCGAAGCGGC TGGAAAGTGG TGGTGACTTA GTGTTTTCTG TTGAAGATTT TCAGCGCTAT CTTGTCGGTC CGTTTAACCA ACGTGAGGGT CGCAAGCTTG AATCTGCGAA AAAGGCGGCC CCGTTTGCTG CAAAGGTAAT CAGTTTGAAA AAGTCTCCTG ATGTTTGGAA GGTCCTGGTT GAGGGCCACC CCGATACGGT ATCTTCTTTT GAGGTTGCTA TGGTACAGTG GGTCGGAGAG AGGATACATG GCTTAAATTT TCAAAAGCTC AAGCTTGAAG GCCTTCCAAA GGATGTTTTG TTGGAGTTCT CTCACCCTAT TGGGGCCAAG CTTAGACCAG CTCGGTACGT CGTAAATAAT CGAGAGTCCC GAGCCTTACT TATTTCAATC AATGTGGATG GTCAGCTTGG TAAGGCACTT GCTCCACTCG TCACTGTATT TGGCTGTGCA GTTGCCCTTT TGGATGGATT GGAATGTCGA ACAATTTCGG AGTTCAAGTC AGTATTAAAA AATGCAGCTA AAAAGGGACA ACCTTTGTAC AAGCTCTCTC TGTTGCTTGC AAAGGAAAGC AATGCAGCTG GGAGAGGGCT TCACCAGCAA CAAAAAAGTT CTTACTTGAG TGGCGGCTTG AGTAGTTTCC CGTCTTCCGC AAACCTACTT GTGGGTGGCC CAAACACAAC CGATGATAGT GCAAACATCT TATCTGGCGC CAACGTGAGC ATTGATTCCA GCGTAAAGAC AAGTTTACAT ATACCTCGCA AGGCACTAGA GAGCAAAGCA GGGAGCAAAC AGGAAAGGAC GTATGAAGTT CTGTTTGATG CACAACAACC CCTAGGATTT 
TATTGTATTG CCCTACCCTC TGGAATTTCG CAAGCTGAAT ACTGTTTGAT TGTTTCGATA TGTCCCGGAG GTCAAGCGTC GAGAGATGGT CGGATACGTC CCGGTTCAGT CGTGCGATCC GTATCAGGGG AGGACAGGTT ATTAGGTATT GAACAATTAT TTGAGATTTA CGAGATGGCC AAGCGAAAGA ATCACATAAT CAGCCTTTCA TTTCTGGACC GCCTTAGTCC GCTGAACGGT TCGATGTCCC AAGCCTTTGG GGAATGGACT GCTAAAGGTC ATTGGAAAGG GCGTGTGAGC CATGGTTGGG CGGGTGGTGC TCTACAGACT CTCGATACTT CAAACAGAGT TAGTCGCGAA AGAGATCCTC ACGGAATTCA GAAGCAGCTC TCCATTGGGA GTGGAGAGAC TGGTCGTAAA TCGATGGATG ATGGAAGGCC GTGTATGGCT GAGCACCCTC CGAATAGTTG CCGAACTAGT ACCGGGAGCT CAGGAGACCG AAGGGTCCGA TTCATGGACG CAATACATGA GCGACACTAT TCTATCGATA GCAAGCCCTG CGAATTTTAC GAGAAGCATA GCGGGTTGGC CTTGTTGCCA AAAGACAAAG GAGCTGGACT GGGACATACA ATACCTTGCG ACAATCAGTC TTTACTCCTT CGATCAATAA AGTCGGGGTC GTTTCGGGAC GTTATTGTGA TTCTTGAGGA AGGGTTGCTG AGTTCAGCAA AATCAACCGC ATCACTTATA GTTGCAAAAT CTTATGTGAA AGAGCAACTT GCCTCACTAC AGCAGAATGG AATAACCGAC GCTGCTTCCG AAAGAGATTG GATGTTGAAA GATGTTTTGA CCAAGATTTT CTTGAAGGCC GCTCATGTGT ATGAGAATGC CAAGTCTCTC AAAGAGTGGT CGCGCTACGA AGTAATCTTT CTCGGACTTG AAGAGGTTCA ATTGTCTAGC AGCGGCGGGC TTCAATTTAA TCAAGATTGC ATTTCCGTTA GGCTATCAGC TCGCTACCCC GATTCCAACC AGCAGTCCGA ATTGGCAAAG TCTCTACCAG TACCGCTCTC GAAGGATATT CTTTTTGGCA AGGAGCTTTC TGTCCCGAGG CACATGCACT ACAACAAATG CGTAGCTTCT AAAAGAAGTG TTGTTGTCGA CATTTGTAAA AACGGGGAGG CATCCGGCTC GGTCAAATGT ATCGGGTCTA CTGTGTTAAA AATCCAAGAT CTTCAGCGCA AGTGTCCTCG GAATGGAACT TGGTTGGAGA GTTCGAAAGC GTTCTCGAAC AGAAATCTGT TTGGAAGCGC ATCTATACGA TTTCGTGCGA GGCGTTTACC AGTTGAGGCC ACTTACCTCG AACGGAAACG GAAAACGGAG TGCATAGGGC TCAAGGATGT CATCAATTGG ATCAAGCGCT TCAACGATGG GCTTTGCCCA GAAGAAAGGG ACGCACAACT GACATTTACT GTTCCTGTTT TCGACAACGC TAGCCTGTTG CATTCCGCCA TTTTAATACA AGAATCACCT CTGGTTGAAG AGTTGCTCTA TCTCGGAGCC GATCCAAAGA GAAGGAGTGT AATTGGATCA CCTGTCTTTT TGGCCCACAA TCTTCGACAC AAATTGATAG AAAGTCTCGC CGAGACGTCG AATAGTGAGA TAGCCGATAC AGACGCATAT CAAGAAAGGG AAAGAGGTCC CGATAGCGCA TCCGTCGTAT CCGAAGGAAG GTGTGCTAAC CCGCGAAAAA AGAGAATCGA GCACATTGCC GCGCTGATAG CGGCAGCGAC TGGAAATAAA CTTCCAAGCG 
AACCACGACG GAAGGTATGC TAACCCGCGG AAAAAGGAAT CAAGCACATT GCCACGTTGA TAGCGGCAGC GACTAAAACG GCACTGTCCA GCGAATTGTG ACAAGTTGCT TACATCATAC AACGCGAGCG GCTCTAAAA
|
Protein sequence | METAPRRKRK FPLPIIPIDH IRPLLCPDQL NLRYPVHGAE LGTLLEAEAL FSVEGNDILL MPNEPAAKAR KASHNPQQQS PELVDSDSLR STVMPDQSQR STVDSLSLRS NKDTECLREK SSVVDSVGES TIARQSPNIN VGIHAAERNA SSNTNTNSLP AHTSDAEEYG NSQLPPNSPD DAAVSELEQA WMGKLDRDIS VAEQNTSTSK LVDRIAQAVK SGVQKKVFIS RKESCGSSNL DSSLAAPGNN RRHDELSNDK SSSALEVIDL TGELSDCGDD CEVLDFPERL SLTVPIDAAE QSRTKKRPLD LKVVRATKLV PKRLESGGDL VFSVEDFQRY LVGPFNQREG RKLESAKKAA PFAAKVISLK KSPDVWKVLV EGHPDTVSSF EVAMVQWVGE RIHGLNFQKL KLEGLPKDVL LEFSHPIGAK LRPARYVVNN RESRALLISI NVDGQLGKAL APLVTVFGCA VALLDGLECR TISEFKSVLK NAAKKGQPLY KLSLLLAKES NAAGRGLHQQ QKSSYLSGGL SSFPSSANLL VGGPNTTDDS ANILSGANVS IDSSVKTSLH IPRKALESKA GSKQERTYEV LFDAQQPLGF YCIALPSGIS QAEYCLIVSI CPGGQASRDG RIRPGSVVRS VSGEDRLLGI EQLFEIYEMA KRKNHIISLS FLDRLSPLNG SMSQAFGEWT AKGHWKGRVS HGWAGGALQT LDTSNRVSRE RDPHGIQKQL SIGSGETGRK SMDDGRPCMA EHPPNSCRTS TGSSGDRRVR FMDAIHERHY SIDSKPCEFY EKHSGLALLP KDKGAGLGHT IPCDNQSLLL RSIKSGSFRD VIVILEEGLL SSAKSTASLI VAKSYVKEQL ASLQQNGITD AASERDWMLK DVLTKIFLKA AHVYENAKSL KEWSRYEVIF LGLEEVQLSS SGGLQFNQDC ISVRLSARYP DSNQQSELAK SLPVPLSKDI LFGKELSVPR HMHYNKCVAS KRSVVVDICK NGEASGSVKC IGSTVLKIQD LQRKCPRNGT WLESSKAFSN RNLFGSASIR FRARRLPVEA TYLERKRKTE CIGLKDVINW IKRFNDGLCP EERDAQLTFT VPVFDNASLL HSAILIQESP LVEELLYLGA DPKRRSVIGS PVFLAHNLRH KLIESLAETS NSEIADTDAY QERERGPDSA SVVSEGRCAN PRKKRIEHIA ALIAAATGNK LPSEPRRKVC
|
| |
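The sequences above are displayed in space-separated blocks of ten characters. The sketch below, in plain Python with no external libraries, shows one way to strip that display grouping and compute a GC fraction that could be compared with the GC content field. The helper names `clean_sequence` and `gc_fraction` are illustrative and are not part of any database tooling.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace used for display grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence shown above.
blocks = "ATGGAAACTG CTCCACGTAG"
seq = clean_sequence(blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.5
```

Passing the full Gene sequence field through the same two functions gives a length and GC percentage that can be checked against the Gene Length and GC content fields.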