Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43401 |
Symbol | |
ID | 7197135 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 351284 |
End bp | 354146 |
Gene Length | 2863 bp |
Protein Length | 564 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177918 |
Protein GI | 219112333 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0388666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGTG TCTACGAGTT AGTGATTGAA CGCAACAATC GCGAGACAGC TCCATTCTTG TCGGTTCACG AAGCAAACGC AACACTTTTA ATTCAGGTAG ATGCTCTCCT GACCAAGTAT GAGCTCTCGG AACGAGAAGT TTCTAGTTTA CGACAGCAAC TGCAAGATGC AGCGATCACT TCACCAGGAA CCAAAGTCAC TTCGGCGGCA GCTAAGGCCG CTCGAAAGTA TGAAGTCCAA CTACGCGACA AGCTTGAAAA GTTGCAGGAC GAATACAACG CCAAGCTCCA GTCAGAAGCC AACGATCAGG CTGCCGCACT GAAAACCGCC AAGGACTTGA GCGATATGAA GGATTTGAAC ACAGCGCAGG AAATGACAAT CGCTAATTTG CGAGAAGAAA ATAAAGGTGC TGAAAGAGCC ATTGAACACT TGACTAACGA ATTGTCCGAG GCCAAATCAC GAACCGACTT GGCAGAAAAG CAGTATGATG GACTGAAAAA GACCATTCGC AGTCTCCAAA TAGAAAACGA CCATCTGAAG AAGGAAAGCC GAATAATTGA ACAACGGGTA GTGACGGAAA AGGGGAAAAC AGTTGATGAG GTGAATATTT TGACTGAAAT GGTGAATTCA CTCAAACGGG AAGTTGATAT GCTTCGTGCC TACAATAAAG GTTCTTGGTT TAGCAAGAAA GTCTCCAAGG AAAAAGCTGA TAACGAAGAC AAACAAATTG TAGCAAAAGG ACAGGATAGC GGACGGAAAT GGGGCACTTT AGGAACGGTC TTGCCAAGCG CTCCCAAACA AACCATCGAC GCGCATACTA TGGACGGAAC TTGCGTGAAG TACGATGCAT CCGGGACCAA TTTGGTGGCA ACATCAAGCA GCGATTCAAC CGTGAAAATA TGGGATACCA ACACGGGTAC TGTACGGGTA ACATTCCGGG GAACAGTCGG ACATTCCATG ATGTGTTGCG ACATTAACGG TAGCACTGTG GTGGGAGGAA GCAGTGACAA AACTTGTAGG GTTTGGAATT TGAGGACTCA ACGAATGGTG CGTACGGACG CGAACCTGGC ACATAGTTCG GCCTTGTCGC TAACGATGCC ACTTTTTTTT AGATTCATCA TCTCGTTGGC CACGCCCACA AAGTTACTTG TGTTCGATTA TTTGCCAATG AGCACGCCGT GGTTACTGGT TCTGCCGATC GTTCTCTCAA AGTGTGGGAT ATTTCACACA AAACCTACCG ACAAACGACC ACGTTACGCC ACAGCTCTAC ATCAAATTGT GTTGATGTCG GAACCGATTC CCAGACGGCT GTTTCGGGGC ATTTGGACGG AGGACTACGA TTCTGGGATT TGCGCTCCGG AGAACGCACG GAAGATATAT CCGGTGAGTG AGGGAGGGCA GTGAAGGATT GATTGACGGG GGCGCATAAG TACACAATTT CTCATTCTTG GCATTGTCTG TCGCTACAGG TCTCCACGAG GGCGCCATCA CGTCTTGCCA GTTCTCTCCC ACGGATTCCA GTTTGGTATT GACAAACGGA GCTGACTCGT GCTTGAAGAT TACCGATCTG CGTACAGGCC TACCCACACA TACGCTACGC GATGGTGCCT TCACTACGGC CCACAGCTGG TCTCACGCCG TATTTTCTCC GGACGGACGC TACGTGACGG CGGGCTCGAG CAGTCAAGGA GCAGTTCTGG TCTGGAGCGC AACCGACGGT AGTTTGGTCA AGACACTACG TTCACACGTT GCTGGCGTTT GCGCTATTGA TTGGTGTCGC GGTGGATCCA GTGGCCAGCA AGTGGCATCG TTGGATCGGC AAGGAAAACT AATTTTGTGG GCGTAACTTA CATTGTCAGA TACCTCCGGA TCTGGACGTA CTATTGTTGA TTTGTGTTGC CTCGTTTGTC TTCCAGGGGG TACGCGTCGG CGGCACCATG CTCCTGACTG CGATCGTTTC TACAGTGCAC AAAAGAAATG CCGCCCAATA TGAACGTCTT TGGGAGACGC CTAGTTCGCT ATTCGGACAA CACACAGAAC ATCAAGCTGT AAAACCAGTA TTTTTACCGT CCAAATTAGT ATTGCCAAGG CGAAGAGTAA AGGTTTTTAT ACTTAGGACA TATTGACGCC GATTGCGAGG CCCATGGCAC GATTGACATC CATTTCAATT TGTGCTTCAG CTTGTTCGAT AGTCCCGGCA GCCGCGGCGC CAAAGGCTGT TTTGGCGGCT TCAAACTGGC GGGAGACTGC CGCGGAATCA ATGTCGTCGA GCTTGACGGC TTCCGGGCAA ATGACGTCCT TGGCGAGGAT TATATTGGAT CAGAATGCGG CGAATATGGG AGAATAAATG TGAGTATCTC GTCATTCACG ACGTCACAAA TTGCGAAAGC GAGCGTTCCT ACGTTACGTA CCGTCGTCGA GTCGGCGTGG GTCAAGGCGT AACCACCCGC CACAAAGTAC TTTTCCGGTT CGGCCGTGTT TTCTTCGTGT AGAATCTGCA AGACACCAGG CTTGAGCTGT CCAACGTAAG GAACGTGATT AGCAGTAATA CCGTATTCTC CCTCCAAGCC GGGGCAGATG ACGGAATAGA CCGACGTGCC GTTGTAAATA GTCTCGTGGG GGAGGGCAAA GTTGAGTTTG ACCAGGGAAG ACGCCGCGGC GGATTCCGAG CTCATGCTGC GGACCGCAAC GACGCGGCGA AGGGCAGGAC GAGCTGTTCG ACTCAACATG ATAATACAGA ATTCGAACAA GTAAGAGTCG TTACTGGGAT GATCCGAAAG GGTCGGACTC GTCAGCTGAA GGACACACAG AGAGCGAGGA TCGCTACGGT TTGGCGGACT TGCGCGTTGG GATGGCAACT GTT
|
Protein sequence | MNSVYELVIE RNNRETAPFL SVHEANATLL IQVDALLTKY ELSEREVSSL RQQLQDAAIT SPGTKVTSAA AKAARKYEVQ LRDKLEKLQD EYNAKLQSEA NDQAAALKTA KDLSDMKDLN TAQEMTIANL REENKGAERA IEHLTNELSE AKSRTDLAEK QYDGLKKTIR SLQIENDHLK KESRIIEQRV VTEKGKTVDE VNILTEMVNS LKREVDMLRA YNKGSWFSKK VSKEKADNED KQIVAKGQDS GRKWGTLGTV LPSAPKQTID AHTMDGTCVK YDASGTNLVA TSSSDSTVKI WDTNTGTVRV TFRGTVGHSM MCCDINGSTV VGGSSDKTCR VWNLRTQRMI HHLVGHAHKV TCVRLFANEH AVVTGSADRS LKVWDISHKT YRQTTTLRHS STSNCVDVGT DSQTAVSGHL DGGLRFWDLR SGERTEDISG LHEGAITSCQ FSPTDSSLVL TNGADSCLKI TDLRTGLPTH TLRDGAFTTA HSWSHAVFSP DGRYVTAGSS SQGAVLVWSA TDGSLVKTLR SHVAGVCAID WCRGGSSGQQ VASLDRQGKL ILWA
|
| |