Gene Information |
Locus tag | PHATRDRAFT_44481 |
Symbol | |
ID | 7197759 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 697894 |
End bp | 699957 |
Gene Length | 2064 bp |
Protein Length | 583 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178284 |
Protein GI | 219114977 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
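Note: the coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record. The helper names `span_length` and `coding_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 697894
END_BP = 699957
PROTEIN_LENGTH_AA = 583


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_bp = span_length(START_BP, END_BP)    # matches the "Gene Length" field
cds_bp = coding_length(PROTEIN_LENGTH_AA)  # coding bases including the stop codon

print(gene_bp)           # 2064
print(cds_bp)            # 1752
print(gene_bp - cds_bp)  # 312
```

Under these assumptions the span is 312 bp longer than the coding sequence requires. The remainder would be non-coding positions inside the span (for example introns or untranslated flanks), which is consistent with the record marking the gene as spliced.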
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00684975 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATTAGTATT GTTCAGCTTC CGTGAGATAG GACAAAACGC TTACTCTAAT CCACGGTTCT TTTGATCTTG TAGGGACCAA AACCAAAGAT GCTAACAAAC GCAGTGTATG GTTCCGAAAA CGACGACGAA GCGGGAGAAA GTGGATCCTC GGCTCGACAG GGCGAACAAA GCCAGAGAGA CTCAAGCGGA CCCCATGATG AGAGCCACAT CGAACACAAG GATGAGGGCA TGGCCTTCTT TCGACCTATC ATTCAGCGTC AACGATGGGG CGATGATCAA TTGCCGGCCC ACACTGACTG GGGGAATATT TTCTTTGACT TGTTTTATGT AGCAGCGTAA GTGAAGATCG AGCGGGTCAC AATATCTGGT GGCATCATGC ACATTTCCTC AGCCCAGCAA TCTACCTCTG GATTATCAGT GCTTACAACC TAGGAAGCTT ACTGCGAGAG GATCCATCAA GAAGAGGGCT CCTTTACTTG ACCGGCTGTT TTTTGCCGAT TTTGAATCTC TGGAATTACA AAATGGGCTT CTTGGCCCGC TTCCGGATCC CTCACGATGT CTACCACCGT GTATTCGAGG TGATGTATCT CATCCCATTA GCGACAGCCG TTACACACAT TCATCACGTA CCAATTCTTT CGGATCCGGA GCGCCATCTC GATATGTTCG TGTTTTGCTG TAGTATCACG CTCGCCGGTG CAATGTACTT GATCGTTTTG GTCGAAGTAA TGATTTCGCA ACGAATGGGA GCAAAGGAAC TCTACCCCGA ATCCTGGTAC ATGGCCCAGA AATTGATTTG GATTTCAGCC GTTCCCACAC TACTGGCTTT GTGTGCGACG GTGTATACTG GCAAGATGTA CTTTGATAGT TCAAAGAAGC ACGCAGTCTA TAACGCCTCG TCGGGGACCG GCGACTTCAA TGGTACCTCC AACGTAACTG GTGACGTTCC TCATCTGCGC GATTTGGCAT CGTCGGAAAG TCACGAATAC AATGGGGGAG CCGAAGACGA CGTCGCCATC TGGCTTCTCA TTGGGAGCAG CTTTGCAAAT ATTGCAACCA TGGGGTTTAT ATTCCTTGGC AATCGTTGGT GGAAGCCTAG CGTGCCAACT ACAGATTCTC AAAAGTACGA TACTTTAAAT AAAAGTGTAC AGGAATATAC AGTGATTTGG TGACCTCACG TCTTTCTTTT ACTTTTAGGT ACACTGTCCC TATGAATATC GAATTTTCCA TTCATCGGTA TGTTTTTTGT CCGACTTTCA CGCTGGCTCT TCTCTTTCTC ACTTTATACT TTCTTTAAAA TTAGATACGG CGAGTGGACA ATGTTGCTTC TAGGCGAAAG CATTCTGTCG CTATTGATAG TGGATCTTAG TGAAGGGGTT CAGTACTACG AGATATTCTT TTGCGGCGTG ATCACTGTTG TCTTTCTCGA GTATTTACAT TTTCGGTCGC AGCCATCAGA AGCAAATGAC CATGCGCTCA GGCGAAGCGC TAGTAGTGCA ATGGTCTTTT TCATGTTTAT GCAGGTATAT TCTATTGCAC TCGTAGTTCT TGGTACTTCG TATAAAATGT TCCTCTATGA ATCCGTTTAC GCCGACGAAA GTGCTGGGAC CAAAAGCCGG GCACTTTTGC CCATGTTCGA ACGTTTCCTT GCAGGAAAAT CGACGGCTTC CCGTTTCGAA ACTGGTGATC GGCAGCAGAG GATTGCTCAT TTCTTCAGTG GGAGTCTCGC GTTGGTTTGG ATATGTTTGG ATGCTATGAG TATTGCGCAC CGTGGCGTAA AATCAAATCT ACATCGACTT 
GAAGGCAAGA GGGGAAAGAA AATCGCACGT TTTTTGTTGT TGCTGCGTAC GGTATTGATT GTGTTCATCG GGACGCTAAG TCTTTATGTC ACGTCCCCTG TTTACCTCGC ATTTACTGGC CTTGGTGGCG TTGTCGCTCA GATTCTACTC CGCTTCTTGG GTTCGGCTAT TTTTTCTGTA GACGATGAAT TTCATGAGGA AGAGACCATC GAAAAAATTG CGACCTATGC CAACGCCAGG TTGACTGAAT CAGGCAGGGA ATAA
|
Protein sequence | MLTNAVYGSE NDDEAGESGS SARQGEQSQR DSSGPHDESH IEHKDEGMAF FRPIIQRQRW GDDQLPAHTD WGNIFFDLFY VAAAYNLGSL LREDPSRRGL LYLTGCFLPI LNLWNYKMGF LARFRIPHDV YHRVFEVMYL IPLATAVTHI HHVPILSDPE RHLDMFVFCC SITLAGAMYL IVLVEVMISQ RMGAKELYPE SWYMAQKLIW ISAVPTLLAL CATVYTGKMY FDSSKKHAVY NASSGTGDFN GTSNVTGDVP HLRDLASSES HEYNGGAEDD VAIWLLIGSS FANIATMGFI FLGNRWWKPS VPTTDSQKYT VPMNIEFSIH RYGEWTMLLL GESILSLLIV DLSEGVQYYE IFFCGVITVV FLEYLHFRSQ PSEANDHALR RSASSAMVFF MFMQVYSIAL VVLGTSYKMF LYESVYADES AGTKSRALLP MFERFLAGKS TASRFETGDR QQRIAHFFSG SLALVWICLD AMSIAHRGVK SNLHRLEGKR GKKIARFLLL LRTVLIVFIG TLSLYVTSPV YLAFTGLGGV VAQILLRFLG SAIFSVDDEF HEEETIEKIA TYANARLTES GRE
|
| |