Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45073 |
Symbol | |
ID | 7200161 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 121314 |
End bp | 124769 |
Gene Length | 3456 bp |
Protein Length | 1085 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179365 |
Protein GI | 219117141 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGGTGCCGCG CTCCGTTTCT TTGTAGTAAA ATACCGGCAA GAGCCCCTCG TGAGCATCTT TATTCACAGC ATGCAATCTT CCTTGCGCGC ACGACGTGGT GTGGCTCCCC GGAGAAAAGC TGCCGTCGGT CGGTATCAAC TGGTCCTCGC TATTACGTTG TCCGTCGCCT TGACCGGTCT CGTGACAACT CAATTTGTGC TGTCGACTTC GTTGTATTCC ACTCAAGAAA AGCAGCAACG ACAGCAGCAC AAGAGCCACG CAAAGGGACC TAATCCATCC AATCTAAGGA CCGATTCTTT CCCGAGAGAG CAGTTGCACG TAGTTGTGCC GGAAGAACAG AAACAATCAG AGCGGCGATT GAACGAAAGC AGTCGGGAAC ACGAATCTCG AGAGAATGAC CATGAAGACG AGCAAAAGAC CCGGCCCGAT TCAGAGAAGA AAGATGATCT TCTTACGCAG CAGGACCAAA AGACGACGAA ACGAAACGAA CATGGTCCTG CTCAGGAAGG CATCCGCAAG CACAAGACTG ACCACGAGGC TCCCGAAAAG GTGGAATCGC CAGTGGATGA GGACCATGAA GTCCAAAAGG CACACAGAGA AACAGTGCAA AAATTCGTCG ACGACCGAAG AGTTCGCCTC AAGCACAGAA TCTCTCGACC CAAAGTGGCC AGGGCTGCAT CACCGCCTGA GGTGGAACCG CAAATTGAGG TTGCTCCGCC ACCCGAAAAG CGACCTTATG ACATTCTGGA TGATCCCTTG CAAAATCCAG ATTTCAACAA ACCCTCGAAG CCGTTGAACT TCACGGCCGC TGTACCATAT TTGGGTGTAC TGATTGACGG GGGGCGTCAT TTCTTTCCAA TGGACTGGAT GAAACGAGCC GTTGATCGCC TTTCTGATTT GCGCTATAAT TTGATTCACT TGCGTCTTAC GGACGACCAA GCCTTCAACG TTCTATTGGA TTCTCATCCC GAGCTCGCTT ATCCCGCCGC CGTCAACAAC CCACACCAGC AAGTTTGGAC GGCCAGCGAA TTGCGTGACT TGACCGCTTA CGCGAAATCC AAAGGAGTAA GCATCATGCC CGAAGTCAAC GTTCCCGGAC ACGCCGGTGC CTGGGCCGGT ATTCCTCACC TAGTCGTGCA CTGTCCCGAA TTCATCTGCC AAAGAGGCTA CGGATTGCCA CTAAATGTAA CCCATCACGA TCTCAAACCT ATCTTGACGA GTATCTTGAA GGAAGTCGTC GACATTTTTG ATGATCCGCC CTTTCTACAT TTGGGTGGGG ATGAAGTCAA CGTACGTTTG TTTTCCGTGT GGAACTTTTC TGTGCGTTGT CGGAAGGTAT AAGGCGTTTC TAAACTTTTT TTTCTCTACC AGATGGCCGG CCCCTGTTTT AACGAAGTTC GCAGTCCCGT CTTTAATTAC ACGGCTTTCG AAGTTGTTCT TAAAGAAATC ATTGCCGATG TAGGCTACCC CGAAAAGCAA GTGGTTCGTT GGGAAATGAC CGGGCAGGCT AATTTGGAAC GCGCTGGCGG TGTGGAACAA TTTTGGGAGT CGTATCCGGG AGAACGGCAC AAGGCTGCGG GACCTTTTTT CATTTCGAAC CGTTTGTATT TTGATACAAA CCAGGATCAA AATGCGTACG AAGTTTGGCA GAATACCCGA CGGTTTTATG TAAATGATTA CCAGCCCGAG GCAGTTCCAA CCGCCATTAT TGCCGGCACC TTTGAGTTGT CGACGACTTG GTGGTACGAT CGCAATATTT TGGGACGTTT GTTGGCTGTT GCGTTGGGTG CCCGAAACGA AACTCTACCA AAGACGATGA AGGACCAAGA CCATGAGAAA ATGGTGCTTG ATCAATACCA AGTTTTCTGC GACCAGTTAG GATATAGCCA GGCAATTTGC GAAACCAACG GTGGCCCGAT CATCCCTACC CCGGAGTACA AAAAGAAATG GGGTGACGGT TGGGTAGTTT GGAAGGCGCA CATCTGTGAA CGCATGACGA CGACTGAGGT AACCAAGGCA ATGCGACCCC GCTCTAGCGA CCGGGTCGCC ACGCAGGCGA ACAGCTACTT TTGGAACGTA TTTGGATTTC CTGCGCACAC ACACACGCGA GTGGGCCAGC ATCCAACGTT GCCAGACGAT CTCCAGGCCC TCCAGCGACA TTTAATTCCT CATTGTGGCG TTATGTTGGA TACGACCAGA TCCCTGGTTC CAGCGGATCG GTTGGGAACG ATTTTGACCG ACACCGTTGC AAAATTGGGT TTCAACATAG CCCAGCTGCG TTTGGTCAGC AACAAGGGCT TCACGTTTGC TCCGAGTAGT CTACCGCATA CTGTAGGCCA TTCGTTACTA GCAACGAAGG AGATCAAGGT ATACACTAGG AGTGACTTGA TGGGTACCGT TGCTAAAGCG AGTGCGGTGG GAATCCAAAT GATCCCCGAA ATCAGCATGA CGACAGGAAG TGCTGGTTGG TACGAGTCGG GCTATCTAGC GAATTGTCCA AACCGTCTGT GTGAAATTGG TGACGCGTCG ATTGACGTGA CGAACCCGTT CTTACCACCC ACCGTGTACT CGTTGATCTA CGAGTTGCGT TCCATTTTCA GCAGCAGTCC CTATATTCAT CTCGGTTCGG ACGAGCGTCA AGACGCGGCA GCTTGCTACC AAGAAGCCAA TCCCACGTTC CACGCGGACG TGGGAGCGTT CGAGCGCAAA ATGGTCAAAG TCTTGGAGGC GAGCGGGATT GCGAACGATT CTGTACTGCG GTACGCCAAT TCGCAAGGCG AGGTGTACAG CGACCGGACG GGTGGCGTCA CACACTACGG TCCGGACCAT GCGACGGAGA TTCCTGCGGA CGCACCAATA TTTGTGAGTG TGGATTTGTT GCGGGACGAT GGGTGGACAT TGTACCAGCG GGTGAAGGAA CTCGTATCGA AAAAGCCGTT GGGCATCTTG GCGGAAATCC GTACGTTGAC GGCTCCCCGT TGGGAAGGCC TGGAGATTCC GGAACGTTTG CTGGTGTACG CCATGGCCGT ATCGGAATTG CCCACGTACG CGAACGCGGC GGCACTGGGC GAGCGGTACG GGGAGCTTTG CCGGGCGTTA TCGGATCGAT TGCCGGGATT GGGTCGTCGG CACGATTGCG CGTTGCCGGG TGTCGTCTCG GGGAAGGTGA CGTTTTTGGC CGACACGAGT ACCTTTGTGC AGCAGCAGTG TCAAATGGCG ACGTATCCCG TGACGCAACA CCACGCCAAG TTGGTAGCAC CCCGGTACAA CGCGACCGAG TGGGAGCAAC TGCGCGGGGC GCCCCGCGTG TTTCCGGCGG CGGGTCGGGA TCCCCAGCGG CATCACCCCG TCGTCATCGG GCATGGAAAA TCGGGGGACG AGTCCCCGGT CGAGTCCGTA GCTAGCTAGC TAGCTAACTA ACTAGGAGTA AACGATGAGT GCATGGATGA GAGAGA
|
Protein sequence | MQSSLRARRG VAPRRKAAVG RYQLVLAITL SVALTGLVTT QFVLSTSLYS TQEKQQRQQH KSHAKGPNPS NLRTDSFPRE QLHVVVPEEQ KQSERRLNES SREHESREND HEDEQKTRPD SEKKDDLLTQ QDQKTTKRNE HGPAQEGIRK HKTDHEAPEK VESPVDEDHE VQKAHRETVQ KFVDDRRVRL KHRISRPKVA RAASPPEVEP QIEVAPPPEK RPYDILDDPL QNPDFNKPSK PLNFTAAVPY LGVLIDGGRH FFPMDWMKRA VDRLSDLRYN LIHLRLTDDQ AFNVLLDSHP ELAYPAAVNN PHQQVWTASE LRDLTAYAKS KGVSIMPEVN VPGHAGAWAG IPHLVVHCPE FICQRGYGLP LNVTHHDLKP ILTSILKEVV DIFDDPPFLH LGGDEVNMAG PCFNEVRSPV FNYTAFEVVL KEIIADVGYP EKQVVRWEMT GQANLERAGG VEQFWESYPG ERHKAAGPFF ISNRLYFDTN QDQNAYEVWQ NTRRFYVNDY QPEAVPTAII AGTFELSTTW WYDRNILGRL LAVALGARNE TLPKTMKDQD HEKMVLDQYQ VFCDQLGYSQ AICETNGGPI IPTPEYKKKW GDGWVVWKAH ICERMTTTEV TKAMRPRSSD RVATQANSYF WNVFGFPAHT HTRVGQHPTL PDDLQALQRH LIPHCGVMLD TTRSLVPADR LGTILTDTVA KLGFNIAQLR LVSNKGFTFA PSSLPHTVGH SLLATKEIKV YTRSDLMGTV AKASAVGIQM IPEISMTTGS AGWYESGYLA NCPNRLCEIG DASIDVTNPF LPPTVYSLIY ELRSIFSSSP YIHLGSDERQ DAAACYQEAN PTFHADVGAF ERKMVKVLEA SGIANDSVLR YANSQGEVYS DRTGGVTHYG PDHATEIPAD APIFVSVDLL RDDGWTLYQR VKELVSKKPL GILAEIRTLT APRWEGLEIP ERLLVYAMAV SELPTYANAA ALGERYGELC RALSDRLPGL GRRHDCALPG VVSGKVTFLA DTSTFVQQQC QMATYPVTQH HAKLVAPRYN ATEWEQLRGA PRVFPAAGRD PQRHHPVVIG HGKSGDESPV ESVAS
|
| |