Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36310 |
Symbol | |
ID | 7201634 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 131341 |
End bp | 134474 |
Gene Length | 3134 bp |
Protein Length | 882 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180948 |
Protein GI | 219120419 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.278303 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGAAA ACTTCATTAT CCACCCGTTA AGGCCTGAGT ATCATCGCTT GCAGAGTTCC GGGGGAACTG GCAGCTCGGC TGGTGGAGAA ACGGAATTCG ATGACAGCCA GGTGCTACGT CAGACTTTCA TGGTGTATTT GCCTATACTG GTAGTTGTAG TATTGTTGTT CTCACTGATA CGCCGAAAAT ATCGCCGCGC ATTCAACGTT CGTAGCTGGC TGGATCCAAT TAAGACACCG CTTGCATATA ACCAGTACGG GTTCTTTTCG TAAGTGCCTC ACGCAGAGTG GAGAAGTGCA TCGCAGTGTA TAGATCTTGT CTTACTTTTT TTCAGCTGGT TTTGGAAGCT GAAAAGCATT TCTGATGACA AGCTCATGGA TGAGTGCGGA ATGGATGCTT TGTGCTTTGT CAGAGTACTG CGAATGGGTT TCAAAATAAG CCTTTTGGGC GTGCTTTGCT CGGCCGTGCT CATGCCGTTA TATGCCACCG CTGACGACTC GCAAAACACA CGCTCTATCA CCGATAATAT TGCACAGCTA ACCATAAGCC ATGTTCCTGA AGGATCTCCA CGGCTTTTGG GGGCGGTTAT AGCTGCTTGG ATTATTTTCG GATACACAAT GCGATTGATA TTGAAAGAAT TCGTTTGGTT CATTGAAAAA CGACACAAGT TTCTTGCCAC CATCCGGCCT CGCAATTACG CGGTTTACGT GCGAAATATA CCAAACGAAC TTCGATCTGA TGCCGAGTTA GAAAACTTCT TTCGTCAGTG CTTTCAAAGC GAATCCATAT TAGAAGGGAA CGTGGCGCTT AAAGTTCCTG AATTGTCGAA ACTTGTGGCC CAGCGCGAAG CTGCAATTAC CAAATTCGAG CACGCTGTGG CAGTTGAAGA CCGCACGGGC GAAAAGCCAC AGCACGCTCC TTCTCTCGCG TCGGCAATCA GAGGTTCACT AAAGGGGGGA GGAGAAAAGG TGGACTCTAT AAATTATTTT GCATCAGAAA TCAAGGAATT AAATCAAGTC ATTTCGAAAC ACATTGATGA TCTAAACGAA AACAAGACTT GCTTCTCGCA TGATGTGGAG CAGCCACATG CTTCGGGCTC AAACAGACAG ATAAGTAATG GGGTACGAAA AACCAGAAGC GATTCAGAAA ACGAAAGGTA TGGTAATATG GAAAGAACGG GTCTCCTTTC TGTATCGGAA GGCCGTAGCG ACCACAGCGT CAAGGAGGAT TGCTCTACTT CCCCACGAAA TGACATTGAC ACCCCGAATG GAGACCCGGG CAAAATAGAA ACGGCCGAGA ATGCAGGATA CAATGCCTAT CGCGCAACCA ACGACGCGAC TCCTGGAGAC TCAGAAGGAA AAACAAATCC TGGTATCGTC TTGAAAAACG CCACCAAGAA ATGTAGAGAA TCTGCTTCAT CCATTGGAGG TGCTGTCAAA GACTCTGCCA AGGCAGTGGC GGAAAACTCT ACAAGTTTGT TAAAGAGCGC GGAAGATGGC AAACCCGAAA GCGCCGGATT TTTGTCCTTT CGCAGTCTCC GTTCAACTCA CGCTGCCCTG CAGCTGATAC ATCATGGCAC TCCTTTTACC ATGGAGGTCC AGGAAGCGCC AGCACCAGAT GATGTTTTTT GGTTCAACGT TGGTCGCGGA CATAAGGAGC TGCAAGTAGG TCGACTTATG TCGTTCGCAG CAACTGCTGT TCTTTGCCTC TTCTGGACAA TCCCAGTTAG CTTTGTTGCG TCGCTGTCTA CCATCGAATC TCTGCGAGCA GAAGTTGGTT TTGTCGACGA TCTACTTGAT ACGCTTCCAT TCCTTGCTCC ATTCTTTGAG ATTGCAGCGC CGCTACTTCT TGTTGTCGTA AATGCGCTGC TCCCGATGAT TTTGAGAGTG TTTTCCATGA TGGAAGGTCC AGTCTCTGGA GCTGTTGTGG AGGCCTCCCT ATTCACCAAG CTCGCCGCCT TCATGATTAT TCAGACGTTT TTCGTAAGCG CAATATCAGG TGGATTATTG CAGGTAAGAC ATCCTTCGTC GAGATTACAA ACCATTGCAG AATTCTTGAC TGGTCTCGCT TACACGCACT TGAATAACAT CAATGGAACA GGAACTCTCA TCACTGGTCC AAAGTCCCAC ATCAATAGTT GATTTGCTCT CCACGTCACT GCCTGCGCAG GCGACGTACT TCATTCAAAT TATCTTTGTA ACAACGGTAT TTTCTTGTGG AATGGAAATT CTCCGTGTTG TGCCACTACT CAAAGCAATG CTGCGTAGAT TCCTTGGACC TCGACTCACA GAAAGAGAGC GACAACAACC CTTTCTCACG CTTCGACCGT TATCAAACCC TCTTGACTTC GAACATGCTG GATTCTCTTC AAATATAGTA AGCTTGCCTG TCCCTCTTTT CGGTGCACAA TTTCCATCAT CTCACAAGCT TCTCTCCCAC ATCCCAGGTG TTATACTATA TCGTCTTTTT GGTATATTCC GTGATCTCGC CGCTGACAAG CATCGTTGTT GCATTTTGCT TCGCGTTCAT GGATTCAATT TTTTGCCATC AATTTGTATA CATTTACCCC AACCGTTCTG ATTCAGGAGG AAAGCTGTGG CTGAACTTTA TGCGGGTTCT AATTGCCTGT ATGTTCGTAG CTGAGTTTAC AAGTGAGTCA AGGATGAAAC TTTGTACATG TGACCAGTTC GCCATTTCTC TTATGAACTT GCTTCTTTAC TAAAGTTGTC GGCCTTTTGG CGTTGAAGAG AGCGCCCATA GCCACTCCGC TCATGGTCCC ATTGATTGTA GTTACAGCGC TGTTTTCAGT ATACATCAAC GAACAGCATT TCAAGGTGAC AAAAAATCGT AAGTTCCTGT GCTACGGTGT CCCCATAAGT TTAGTTGTCG GAATCAAACG ATCCTTGTTC ATACTAACTT TTGCTGGCTT CCTTCCTTCT GGCCAGTTCC ATGTCGAGAG TGTACTTTCA AAGACATAGA ACACAGTTCA ACTTTCGATT CCGCTTTTCT TAAAGACGCG TATCTGCAAC CCGAATTACA AACCAAAGAA GGTATGTTCT CACCAGTCAG TAGCTTTATG CTGTGTTTGG TGTCAACGAG TCTCACAGCA AGATTTGGTT TCGCCCGACT GTAA
|
Protein sequence | MNENFIIHPL RPEYHRLQSS GGTGSSAGGE TEFDDSQVLR QTFMVYLPIL VVVVLLFSLI RRKYRRAFNV RSWLDPIKTP LAYNQYGFFS WFWKLKSISD DKLMDECGMD ALCFVRVLRM GFKISLLGVL CSAVLMPLYA TADDSQNTRS ITDNIAQLTI SHVPEGSPRL LGAVIAAWII FGYTMRLILK EFVWFIEKRH KFLATIRPRN YAVYVRNIPN ELRSDAELEN FFRQCFQSES ILEGNVALKV PELSKLVAQR EAAITKFEHA VAVEDRTGEK PQHAPSLASA IRGSLKGGGE KVDSINYFAS EIKELNQVIS KHIDDLNENK TCFSHDVEQP HASGSNRQIS NGVRKTRSDS ENERYGNMER TGLLSVSEGR SDHSVKEDCS TSPRNDIDTP NGDPGKIETA ENAGYNAYRA TNDATPGDSE GKTNPGIVLK NATKKCRESA SSIGGAVKDS AKAVAENSTS LLKSAEDGKP ESAGFLSFRS LRSTHAALQL IHHGTPFTME VQEAPAPDDV FWFNVGRGHK ELQVGRLMSF AATAVLCLFW TIPVSFVASL STIESLRAEV GFVDDLLDTL PFLAPFFEIA APLLLVVVNA LLPMILRVFS MMEGPVSGAV VEASLFTKLA AFMIIQTFFV SAISGGLLQE LSSLVQSPTS IVDLLSTSLP AQATYFIQII FVTTVFSCGM EILRVVPLLK AMLRRFLGPR LTERERQQPF LTLRPLSNPL DFEHAGFSSN IVLYYIVFLV YSVISPLTSI VVAFCFAFMD SIFCHQFVYI YPNRSDSGGK LWLNFMRVLI ACMFVAEFTI VGLLALKRAP IATPLMVPLI VVTALFSVYI NEQHFKVTKN QHSSTFDSAF LKDAYLQPEL QTKEARFGFA RL
|
| |