Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48492 |
Symbol | |
ID | 7203715 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 665804 |
End bp | 667422 |
Gene Length | 1619 bp |
Protein Length | 488 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183007 |
Protein GI | 219125476 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.439098 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATATTCTG GACTATCAAC AAAATTAGCG AGAGTATCTG AAAATTCGAG AAGGAAGAGC AACAAACAAC ATGAATCTTC CCCTCTTCTT GATTTTTCTT TTTATGCCTT TGGTTTCGGC CGCTTTGGGT TCGCATCGCA AGCTTAATCG TGCATTAACG GGCCGTTCAA TTCCAGGACA GTACATCATC GAGTTAGACC CCAGCATCCC GGATGCAAAG GGTTTTGCAA CCCGCATCCT CAAACGAGCC TTCCGGAACA ACCTTATCGA GACCTACGAC TACGCGTTGA AAGGATTTGC TGTTAAGGAT CTCCCCGACA TGCTAGTGAA CTTCATGCTG AATATGGACG ATGTACTCTC GGTGTCAGAG GACGCCATTG TCGAGGCTGA CGCAGTGCAG ATAAATCCGA CATGGGGCCT CGACATCACG GACGGAGAAG ATGACAGCCT GTACACTTAC GCGTATACAG GGCAAGGCGT GAATGCGTAC ATTCTTGACA CCGGAATTCA AGCGAATCAT CCTGAATTCC AAGGCCGTGT AGAGAGTTGC GTCTCCTATA CCGGAGAAGG TAAATATAAT TATCGCATGC TGATTGTTCT TCATGTTGTC CGCAATGCAA CAACCCACAC CATCATTTCT TCTTTGCTCA GTGTGTGGAT CTGATTTGAA TGGGCACGGT ACCCACGTGG CCGGAACTGT CGGGTCAAAA ACCTACGGTG TGGCCAAGAA GGTGTCGCTG CACGATGTCA AGGTACTTGA CCGGCGGGGG AGTGGATCCT TCAGCGGAGT TATTGCTGGC ATCGACTACG TGGCCCAGAT CAAGAAGACA GATCCCAGTC GCAAAACAGT CCTCAACATG AGTCTGGGGG GAGGCCGTAG TACGGCCTTG AACAACGCAC TCGATTCTGC AGCGGCTTCC GGCGTGGTAG TAGTTGTCGC TGCCGGAAAT AGCAATCACA ATGCCTGTAA TTACTCTCCT GCGTCTGCGT CGGGAGCGCT GGTAGTTGGT TCTATTGATA GCAACAACCG CCGTTCGAGT TGGTCCAACT GGGGTAGTTG TGTCGACATC TTTGCAGCTG GATCCGGGAT CCTGTCGCTG TCGCGAACCG GTGGCGTAAC CACAAAGTCG GGTACGTCCA TGGCTGCTCC ACACGTGGCC GGTGTTGCAG CATTGTACTT GCAAGCGGGT AGAAATCCCA ATACTATAAC TTCCGATGCT TTGAAGAACC GGGTGACCCG TACTCGAGGT TCCCACAACA AACTATTGAG TACATCTGCA CTACCTTCTG AACAAACGCC ATCTCGGGCG CCCACCCATG TCCCAACTGT TGAGTCCGTG CCTCTGGAAG ATTCAGATTC GTTGGCTCCG ACCAAGGCTC CCGTTTCTTC CCCCACCAAA GCTCCAGTGC CACAACCTAC TCTCATTCCA ACCAGGAAAC CTTCTCGCGC TCCAACCAAA AAACCTACTC GCGCTCCAAC CAAAGCTCCA GTCCCTCGGC CGCCACCTCA GTGTCGGTCT AGAGGTCAAG TTTGCAATCG ATTTCGGCGA TGTTGTAGAG GGTTGAGATG CTTTCAAACC TGGTCACCTC GGCAAGGACG TCGCTTGGCT TGTCGCTAA
|
Protein sequence | MNLPLFLIFL FMPLVSAALG SHRKLNRALT GRSIPGQYII ELDPSIPDAK GFATRILKRA FRNNLIETYD YALKGFAVKD LPDMLVNFML NMDDVLSVSE DAIVEADAVQ INPTWGLDIT DGEDDSLYTY AYTGQGVNAY ILDTGIQANH PEFQGRVESC VSYTGEVCGS DLNGHGTHVA GTVGSKTYGV AKKVSLHDVK VLDRRGSGSF SGVIAGIDYV AQIKKTDPSR KTVLNMSLGG GRSTALNNAL DSAAASGVVV VVAAGNSNHN ACNYSPASAS GALVVGSIDS NNRRSSWSNW GSCVDIFAAG SGILSLSRTG GVTTKSGTSM AAPHVAGVAA LYLQAGRNPN TITSDALKNR VTRTRGSHNK LLSTSALPSE QTPSRAPTHV PTVESVPLED SDSLAPTKAP VSSPTKAPVP QPTLIPTRKP SRAPTKKPTR APTKAPVPRP PPQCRSRGQV CNRFRRCCRG LRCFQTWSPR QGRRLACR
|
| |