Gene Information |
Locus tag | PHATRDRAFT_11441 |
Symbol | |
ID | 7199864 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 83638 |
End bp | 87443 |
Gene Length | 3806 bp |
Protein Length | 1212 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178926 |
Protein GI | 219116262 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.41683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTACA ACGACGGAGG CGAAGCCTTA CCGGAAGAGG AAGGAGATGA AGTTATGGAC ATTGACGAAA TTCCCGTCAC GCAAGAAGAC GCCTGGGCAG TGATTAGGTA CGTGCTGGGA CCGAGACGAT GATGGGAGGT CGACACATTC CATACGGAAA GCACGGTAGG ATCCGGAGCA GCCTGCGTCC AAACGCCTTT CCTCGCGTGT CCAGAAAACG GATGATTCGT CGTTGCAACA TGACACTAGT TTCTCACCCA AGCACTGCCC ACAGTGCCTA CTTTGAAGAA AAAGGCTTAG TCCGCCAGCA ACTCGATTCT TTTGACGAGT TCATCCAAAA TACCATGCAG GAACTCGTGG ACGATTCCGG AGATATTCGC GTCTCGCCGG AAATTCAGCA CATGGTGGGC TACGACGACG AAGCCTACGA CAGAGAACAG GAAAAGGAAA CAAAGTTTGT CTTTGAGATT CACTTTGGCC AAGTTTACCT TTCCAAACCC ACCACGGTCG AGAAGGACGG AACCATCACC AACATGTTCC CACACGAAGC CCGACTCCGA AATATGACCT ATTCCGCACC GCTTTACGTT GATGTGGACT TGAATCAGTA TCAAGTGGGA CGAGAGGTCA ATGTTAACGA CCCCTCGGAG GATTTGGGAG AACCTGTCGC CACCGAACAC GCCAAGAAGG AATTTCTCGG TTACGTGCCT ATCATGTTGC GTTCCCTCTT TTGTGTACTT TCGGACAAGG ACGACGCCGA TTTGAGTGAC CTGGGGGAGT GTGTTTACGA TCAGGGTGGT TATTTCGTCA TTAATGGATC GGAAAAAGTC ATTATTGCGC AAGAGCGATT GTCCAACAAT CACGTTTACG CCTTTAAAAA GAAGCAGCCC AGTAAGTTTT CCTGGGTCAT TGAAACCCGC AGTCAAGTGG AAAACTCCAC TCGGCCTGTG TCAACCCTCT ACATCCAAAT GTACCACAAA GGTGGGCGCG GGGCCATTGA AGGCAATCAG ATTCGTTCCA CGTTGCCCTA CATTCGCACC GATATCCCCG TCGTTATCAT ATTTCGAGCA CTGGGATACG TAGCGGACCG GGACATTATC GAGCACGTCG TGTACGATCT GACGGATGGG GAAATGATGG ACTTGTTCCG ACCTTCGCTT GAAGAGGCTT TCGTCATTCA GCGACAGGAT GTGGCGTTGG ATTTTATCGG TCGGCGTGGA TCGGCACGGG ACGTGACCAA GGAAGACCGG ATGCGGTACG CACAAGCCAT TCTACAAAAG GAAGTTTTGC CGCACGTCGG AACAGAAGAG CACTGCGAAA CCAAAAAAGG TTTCTTTATC GGTTACGCCG TACACAAGTT ACTCATGTGT CGCTTGGGAC GGGCCGAAGA AGACGACCGT GATCACTTTG GTAAAAAGCG CTTGGATTTG GCCGGACCCT TGTTGGGCGG GCTATTTCGC GTCTTGTTCC GGAAACTGAC TAAGGATGTG CGCAAACATT TACAACGATG CTTGGATGAA GGCAAACACT TCAATATTGG AGCCGCTATC AAGAGCAACC ATATCACCGA CGGACTCAAA TACTCGTTAG CAACGGGAAA CTGGGGTGAC AAGGGAATGG CTACCAAAGC GGGTGTCTCA CAAGTGTTGA ATCGTCTGAC CTACGCTTCG TCGCTCTCCC ACCTCCGTCG ATGTAACACG CCCCTGGCCC GCACCGGAAA ACAGGCCAAG CCGCGTCAGT TGCATAATAC GCACTGGGGT ATGGTATGCC CAGCCGAGAC TCCCGAAGGG CAAGCCGTTG 
GTCTCGTCAA GAATCTGGCC CTCATGGCGT ACATTACGAC CGGTACCGCA CAAGTTCCCG TCTTGGAGTT CCTGGAAGAG TTTAGCACCG AAAACTTGAC GGACATTTTG CCCTCGGTAA TTGCCGAACC GAATACTTGC AAAATTTTTG TTAACGGAAA TTGGGTAGGT ATTCATCGTG ATCCGAAAGC ACTCGAAGAG ACATTCCGAT CATTGCGTCG AATGGTGGAC ATTGATGCCG AGGTTTCGAT TGTTCGAGAC ATTGCCGATA AAGAAGTTCG CATTTACACA GATGCCGGAC GCATCTGCCG CCCATTGTTT GTCGTTCAGG AACAAAAACT GGCAATCAAG AAGCACCACA TCATGCAGTT ACAGGGCATG GATCCGAATG AAAAGAAGTT GACCTGGACG GATCTTCTCA TGGAGGGTTT GGTCGAATAT ATTGATACAG AAGAGGAGGA GACCACCATG GTTGCCATGG AACCGAACGA TTTGGATTCC GACGATTCGT ATTCCTCAAC GTATACACAC TGCGAGATTC ACCCGTCTAT GATTCTTGGT GTATGTGCCT CGATCATCCC CTTTCCTGAT CACAACCAAT CTCCTCGTAA CACTTATCAG AGTGCCATGG GAAAGCAAGC TATGGGTATT TATGCGTCAA ACTACCAAGT ACGCATGGAC ACGATGGCGC ACGTATTGCA TTACCCGCAG AAACCATTAT GTACTACACG AGCGATGGAG TTTTTACATT TCCGAGAACT CCCAAGCGGT GTAAATTGCA TTGTCGGCAT TCTCGTTTAT ACTGGGTACA ATCAGGAAGA TTCCTTGATT TTGAATCAGT CAGCGATTGA CCGTGGGCTC TTCCGGTCCT CCTATTATCG CTGTTACATT GACCAGGAGA AAGCCAGCAC TGTCGGCACG ATTGGATCCT TGACGTCGGA ACTGTTCGAA AAACCTAGTA TGGACAGTAC GCGCGGAATG AAACACGGGG AGTACGGAAA ACTCGACGAT GATGGTCTGG TAGCTCCCGG TACTCGCGTG TCGGGAGACG ATGTTTTGAT TGGCAAGACG GCACCAATTG ACGCTACGGC CGGTATGCCT TCACGTTACT CCAAACGCGA TTGCAGCACG TCCATGAAGG CCAACGAGCA TGGCATTGTT GACAACGTGC TGATCAGTAC GACAAAAGAA GGATATCGGT TTACGAAGGT TCGCATTCGG AACGTCCGGA CGCCGCAGAT TGGTGACAAG TTTGCCTCAC GTCACGGACA AAAGGGAACG ATCGGAATGA CTTACCGTCA AGAAGACATG CCTTTCACCG TAGAGGGCAT CGTGCCGGAC ATTATTGTGA ATCCCCACGC TATTCCTTCT CGAATGACTA TCGCACAGTT AATTGAGTGT CTTTTAGGAA AGGTTGTTGT CTTTCAGGGA TGCGAAGGAG ATGCTACTCC TTTTACGGAC GTGACCGTCG AAGATATCAG TACCCGCTTA CACGCCATGG GATACCAAAG ACACGGAAAC GAAGCTTTAT ATCAGGGACA TACAGGACGC CCATTGAATG CTCGTGTTTT TATCGGTCCA ACCTTTTACC AGCGTTTGAA GCATTTGGTC GACGATAAGG TCCACTCTCG TGCTCGCGGT CCGGTTGCCA TGCTTACGCG GCAACCGTTG GAAGGTCGAT CTCGGGATGG TGGTTTGCGC ATGGGAGAGA TGGAACGCGA TTGCCTGATT ACGCATGGCT GTGCCAACTT TATGCGCGAT CGCTTCTTTG TCAATTCGGA CCAGTACCGT ATCCACATTT GCGAACGTTG 
CGGTTTGACG GCACAGGCCA ATCTCAAGAA GATGACGTAC GAATGCCGCA GCCCGATGTG TGTGGGTCGT CCGACGCAAA TTTGCCAGGT CGAGATCCCG TACGCGGCCA AGCTACTGCT ACAGGAACTC AACTCCATGT GCATTCAAAC CAGAATTTAT ACCAAGAACG TGACGACAAG GGATAATTCG TATTAA
|
Protein sequence | MAYNDGGEAL PEEEGDEVMD IDEIPVTQED AWAVISAYFE EKGLVRQQLD SFDEFIQNTM QELVDDSGDI RVSPEIQHMV GYDDEAYDRE QEKETKFVFE IHFGQVYLSK PTTVEKDGTI TNMFPHEARL RNMTYSAPLY VDVDLNQYQV GREVNVNDPS EDLGEPVATE HAKKEFLGYV PIMLRSLFCV LSDKDDADLS DLGECVYDQG GYFVINGSEK VIIAQERLSN NHVYAFKKKQ PSKFSWVIET RSQVENSTRP VSTLYIQMYH KGGRGAIEGN QIRSTLPYIR TDIPVVIIFR ALGYVADRDI IEHVVYDLTD GEMMDLFRPS LEEAFVIQRQ DVALDFIGRR GSARDVTKED RMRYAQAILQ KEVLPHVGTE EHCETKKGFF IGYAVHKLLM CRLGRAEEDD RDHFGKKRLD LAGPLLGGLF RVLFRKLTKD VRKHLQRCLD EGKHFNIGAA IKSNHITDGL KYSLATGNWG DKGMATKAGV SQVLNRLTYA SSLSHLRRCN TPLARTGKQA KPRQLHNTHW GMVCPAETPE GQAVGLVKNL ALMAYITTGT AQVPVLEFLE EFSTENLTDI LPSVIAEPNT CKIFVNGNWV GIHRDPKALE ETFRSLRRMV DIDAEVSIVR DIADKEVRIY TDAGRICRPL FVVQEQKLAI KKHHIMQLQG MDPNEKKLTW TDLLMEGLVE YIDTEEEETT MVAMEPNDLD SDDSYSSTYT HCEIHPSMIL GVCASIIPFP DHNQSPRNTY QSAMGKQAMG IYASNYQVRM DTMAHVLHYP QKPLCTTRAM EFLHFRELPS GVNCIVGILV YTGYNQEDSL ILNQSAIDRG LFRSSYYRCY IDQEKASTVG TIGSLTSELF EKPSMDSTRG MKHGEYGKLD DDGLVAPGTR VSGDDVLIGK TAPIDATAGM PSRYSKRDCS TSMKANEHGI VDNVLISTTK EGYRFTKVRI RNVRTPQIGD KFASRHGQKG TIGMTYRQED MPFTVEGIVP DIIVNPHAIP SRMTIAQLIE CLLGKVVVFQ GCEGDATPFT DVTVEDISTR LHAMGYQRHG NEALYQGHTG RPLNARVFIG PTFYQRLKHL VDDKVHSRAR GPVAMLTRQP LEGRSRDGGL RMGEMERDCL ITHGCANFMR DRFFVNSDQY RIHICERCGL TAQANLKKMT YECRSPMCVG RPTQICQVEI PYAAKLLLQE LNSMCIQTRI YTKNVTTRDN SY