Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50116 |
Symbol | |
ID | 7198918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 91815 |
End bp | 93596 |
Gene Length | 1782 bp |
Protein Length | 546 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185046 |
Protein GI | 219129754 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.433548 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TATTGCTACC CCCACGGCCT ACCTATCCAT CCCTATTCCC AGTCAATGCA TGTAATTGCC TTATTATTGT CAACCGTAAC TGTGAATTGT TGTGTCCAAG AGTGAGGACC GTTTCTGCCT CTCACGTAGC CTCATTAGTC GATGAATGTC GAAGACGTGG TTCCGAGCCA CGGGAACTCT CGCAAACGTT CGCGTTCCAG TACTCCCGTT CCGACCAATC CTGCGCAAAC CAAACGCCAA CTCCAGGCCG CCGTCGAACG CGATGCGCAG CTCATCCACA AGCTATCCAA CGAGGAAACT CGCAACGATA CACTGAACGA TATGCTCAAG CTTTCGCTCA GTCACGATCC AAGTTTCGCC ATGTCTTCGG ACTCTCTTTT GCAGACTTTG GCGCAGATTG TCAAGGAATG TCTGGAATGG AACGAACCGC ACATTATTCT ACCGGAAGAC AAAGACAGTA AAGACGTCCA GGACAACGCG CAAGCGAAAC AGGAGCTTTT GCTCAAATCC AAGCTCACCT GGATTCAAGC ACCAACCCCG CGATTAACGG CTTGGTTCCA CCACTGTCGT CAAATGCTGG GAACGCGCCG TGTACTGCTC GACCAGGCCA GTTTGCAGAC ACTCGATGTG ATCCTCGTTA TTCTTCGCAA TCTCAGCTAC GTAGGTGCCA ATTTGCGTCT CTTTATATAC GTTCCGGACA TTCTGGCCAT TCTGGGGGGC TGCCTGTACG AGCGTCCCCT CGAATACAAA GGTACCGACT CCTCCTTGGC GGGCACCGGT ACCCACCTGG CGTTGGCCGC TGCGCACGTC CTGCTGCACT TGGCTCCGTA TTGGGATGTT TCCGGTCAAC GACGTACGGT GGATCGACTC TTTTATCGAC CCCACACAGC CGATGGCGGT CCGGTGGTGC CGGATCCGGA ATCCTTTGGC TGGACAGCCA ACGGTGGTTG GGGATTTGGT GGAGCGTACC TGGCCAAGCA GCACGACTCC AAGGAAGACA CCATGGAAAA CATCTCCAAG GCATTCTTGC TCGCCGTTGC CAGTACGTAC TTGGAATCTG CCTGGAGCAT CTTTGGTCCG CTCGGACATG CTTTGACGGA TCCCTCGACC CCTCGCAACG TGCTGCTCAT GGTTCTAGAC GTCTTGCAAG AACTCATCAA CGTGGCCCGG ATCGGAGTCG TCGGTAATAT CCACGAAGAT GACGAAGAGA TTCCCACGTT GCGCGCAATC CTAGTACACA TGCCGGATAA TCTGTTGTCT CGCCTGGTCG ATTGCCTCTA CATTCCACGA CTGGGACCGG ATGCAATCGA CTATGTTGAT CCGGTGCACA ACGTTGTAAC GCGAGTCAAT CCACTCAAAC TACTCATGGG CTACGAAGCA ACAGTCGACA CCGAAGTGCG GGACCGCGTG CTGGACATCC TCGTGCCCTT GGTGGAATTG GACGCGCCAC GGATGGCGAT TCGGCTGGGA CACGACACGG GAGCAATCCG CGTGCGCTTG TTCGACGCCC TCGTTCCGGC CGTGACAACT ACTGTCGGTC GCAACGACGC TAGTTTGCTC GCGACGCAAC TGCTGCGAGA ATTGTCCAAG ACTGATGCGA ATCGAGTCGC CTTTCAATAC ATTCAATCCA GACTGGTAAC CCTGGCCAGC AAGGATACTC GTGTGGCCCA TTTGGTATGG AATCATTTGT ATAAACCTGC AGACCGTGCG TCGGCAGCAG GTTCGGAAGA AGGCGACGGT AGAGATGTTT CCTCCAACGG GAGTGGCGAC GAAGACGAAT GA
|
Protein sequence | MNVEDVVPSH GNSRKRSRSS TPVPTNPAQT KRQLQAAVER DAQLIHKLSN EETRNDTLND MLKLSLSHDP SFAMSSDSLL QTLAQIVKEC LEWNEPHIIL PEDKDSKDVQ DNAQAKQELL LKSKLTWIQA PTPRLTAWFH HCRQMLGTRR VLLDQASLQT LDVILVILRN LSYVGANLRL FIYVPDILAI LGGCLYERPL EYKGTDSSLA GTGTHLALAA AHVLLHLAPY WDVSGQRRTV DRLFYRPHTA DGGPVVPDPE SFGWTANGGW GFGGAYLAKQ HDSKEDTMEN ISKAFLLAVA STYLESAWSI FGPLGHALTD PSTPRNVLLM VLDVLQELIN VARIGVVGNI HEDDEEIPTL RAILVHMPDN LLSRLVDCLY IPRLGPDAID YVDPVHNVVT RVNPLKLLMG YEATVDTEVR DRVLDILVPL VELDAPRMAI RLGHDTGAIR VRLFDALVPA VTTTVGRNDA SLLATQLLRE LSKTDANRVA FQYIQSRLVT LASKDTRVAH LVWNHLYKPA DRASAAGSEE GDGRDVSSNG SGDEDE
|
| |