Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42749 |
Symbol | |
ID | 7196376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 958597 |
End bp | 961687 |
Gene Length | 3091 bp |
Protein Length | 1002 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176696 |
Protein GI | 219109886 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCCATTGTAT TTAGAAATGG ATTCTAGTCG TTCATCAAGC AACCGAGCTT CGAACTCATC TGCTTCGCTA AGTGCCAATA GTTTGCCTTC CCAAAAGGCG GACCAGACGG CGGCAGCAGC CGCGTCACTG AACTTGTTGT TGCGTGAAGA TCGTCTCCTG GTGATCCATG GTTTGACCGA ATCGGGTACT CAGCCTGACA AGGTGGCCTT TCCGTACGCG GCGGCACTTT TAGCAGACGG AGATGATAGC AATACTGTCA ATACGAAACA ACGTGCTGAT GGAGCGTTGC AGGATGTCGA GCGGAAACTA GCTTTGGTCG AGAGCCTTGC CGTGAAACTC AGTCGCACCA GCCCTGAGGC CGTTGCAGGC CATCTACTCA GATTGCATGG ATATCATCTT CCTAAAGAGG GCCTTAAAGA AGATAAGCCA TCCAGCACGA CGCTATCAGC GGTTAGAGAC AAAGCTGATC GCCTGGAACG GCAATCCGAA GTTCTGGAGA ATGTAGCTCG ACGAGTGGAG GGATCATTGT CACGGGGCTT GAAACGTATG GAGACTGCGT GTACTCGTCT TGAACGGGTT CTGTCACTGA GCAATACGCT AAAAATGATA CTGAGACTTC AGTTCGAAAA TAGCAAGTTG CAAAATTACG ATCTGGAGGA CTTACGTGAC CTGACTCGTG CCGCCGCCAG CGTCTCGGTA GTCGAGGATT TGCTGAAGCG CTCTGAACTA CAAGCCTCGA TTGAAGCCAT ACAAAAGATA CGCCCCGAGG TCGAACGCAC TGCAACTGAT GTACGCCAAT CAGCCGCTAT GCTGTTGCAG GATCAGTATC ATCAAAAAAA TGCTATTCAT CAGCTGGGCG GTACTCTTCA AGTGTATTAC CATTTAGGAG AATTACCGGG GGCCGTTTGG AAAGTAGTGG AAAACGCGCA CGGTAAAGCA GAATCTACAT CTCGAGATTT ATGGAACGCA TTGACCCTAA TGAATTTGAC AGAACAGGCC AAAAAGACAG CCAAAGATAG CCGGTCCGTG GAAAAGAAGC TCAAGCAAAT GCGGGCAGAA GCTGCATCTC AATGGGCGAA TGGTATCTAC GACGTGTCAA CACAGGTGCG AAATTTACAG CGAGTGCTTA TGCGCAAAAG CGATCCAATA CAGCGCCAAT TTTTTGTGGA CGTCGTGGCC GCCGCATCAA TTCCAGCCGC CTTCAGAGAT TCGTCTCTCG GAAAAGACTT TTCTTTGTTT GGTCTATTTT GGGGGCGCTT TTGCAAATCC CTGGGAATTA TTTTGGAAGA TATTTTGCAA CAGGACAATG GAAAACATCG CTCGGACGTC GCAAGCCTGT ATCCGTCTGT GCGTAGCGTT TCGAACGACA TGTTGAGCAC TTTGCAGGAT AATTTGAATG CAGGCAATTC AGCATTGGAG GACCTTGGAA CTGCTGCAAC CCCAGGTATT CTCGGAGGAT CTGCTCTCTT AGATGACACA TTTCTGGACT GGACAACGGG CCAGTTCGAT GTAGAGGAGA ATCCGCAATC TGCCACCACA CCTGATTCCT GGACCCATAC TACTCAACGC AGCGCATCGG CGAAACACCC TTCGCAACGT TTTTCGGCCT CAGGTGGGAC AGGCTCTGCT ACGATGTCTC AAATATATCA ATCGATGGAG TGGAATACTT TACAGGGAGA TAAGAAGGGA CGCCATGGGC TTTATCCATT ACAACAAGCA TTTATTGAAG CCTGTACGGA CAGGCTATGT TCTCCACTTC AATTCATGTT CCCAGAAAAC GTTGCTCTTG ACGACGACGG TGTCGCCATT GCTTCCGGGC TCAGTATGTT GCCCAGCAAG TACGATATTC AACGCTTTGA CGAAAACATC CGTCAGGAAA TTTCGTTGGC TGACCCGAAA GAAGGCGGCG GTGATCTAAG CAGTGTTACT ATGATAGCCA ACTGTGTCGT GTCTATGATT TCAGAGCTCT GCCTCCGAGC AAAGAATGCG TTGAGTGGTA TTGGAGAATC AGGATATCTG AATAGTGATT GGTCAATGAC GGAATCGCTG AAGCATGATC GAAAGGTGAC AGTGATTCTT TTCACTGTGG CAAATTACTT GCGTATCGCG CCTGATACAG TGTTTTTGGC ACCATACCGT CCGTCCATTT CATTGCAACA AGAAGAAGCA GCGAGCGTCT GCCAAGTCGC ACTGCAACCG GCTCTCAAGG AGATTGAGAA AATGGTTAAA AATTCTGTGA CCTCACCTTT AGGACGAGCA ATCAATAAGC GAATTGGTGA CACCATGGCA AAAATGCATC AAGGTGTCTA TCTTGGTAGC AATGTGGGTA TCGACGAAGA CTCCCCTGCC TTTGTGCAGA AACACTTGAA CGGCATTTAC GAAATCATTT CGAAAGAAAT TCTTTCGAAG TTGCCTCCAG AATATGGGTC GGCTGTGGCG ACATCTGTGG CAATGTTTTC GATCTATAAT TTTGTGTCAA ATTTTACTCT GCTTCGACCT TTGGGTGAAT CGGCTCGTCT GCATATTACG CAGGACTTGG CCGACCTTGA GCTTGCACTG GAACAGCTCA TGTTGAAGAG CGGAAATTCT GTTTCTTTGC ATTTTATTGG AAACGGCAAG CCGTACTTGG AACTCCGTGC CGTTCGCCAA ATGCTGTTTT GGACGGGGTT GGACAGCGCT GATAAACAAG CCGTGGATGT CGCCAAAAGC TTGTTGCGCG AACCGTGGAT GAAGGATGTA CGTCCGTCAA CTATCTTTCA CTATTTGTAC TCGTACGCGC CTTCGTTTTT GTCATCCCCA TACCATACGA GACGTATGAA GCCAGAAGCT TACGTTCGGT TGTTGGTGAA GCCAGATGGC TCCGTAGAGG AGACAGAAGA CGATGCTTGG ATGACAGTTA TGGCGAGCTG CGATGCTTAC CAACAAAGAG CAAGCTCTGG AGGCTCAAAT ATGGATGGAG ACATTCGAGT GGCTGAAAAG CTTTTGACTA TGGGACCGGA TGTTATGCGT CGGCGAGGAC ATTAGATGTA AATGTCTCAT CCCTCCTGGT GCTTATTGGT TTATTTAAAC TAGTTCGTCT TGTGTTCATT G
|
Protein sequence | MDSSRSSSNR ASNSSASLSA NSLPSQKADQ TAAAAASLNL LLREDRLLVI HGLTESGTQP DKVAFPYAAA LLADGDDSNT VNTKQRADGA LQDVERKLAL VESLAVKLSR TSPEAVAGHL LRLHGYHLPK EGLKEDKPSS TTLSAVRDKA DRLERQSEVL ENVARRVEGS LSRGLKRMET ACTRLERVLS LSNTLKMILR LQFENSKLQN YDLEDLRDLT RAAASVSVVE DLLKRSELQA SIEAIQKIRP EVERTATDVR QSAAMLLQDQ YHQKNAIHQL GGTLQVYYHL GELPGAVWKV VENAHGKAES TSRDLWNALT LMNLTEQAKK TAKDSRSVEK KLKQMRAEAA SQWANGIYDV STQVRNLQRV LMRKSDPIQR QFFVDVVAAA SIPAAFRDSS LGKDFSLFGL FWGRFCKSLG IILEDILQQD NGKHRSDVAS LYPSVRSVSN DMLSTLQDNL NAGNSALEDL GTAATPGILG GSALLDDTFL DWTTGQFDVE ENPQSATTPD SWTHTTQRSA SAKHPSQRFS ASGGTGSATM SQIYQSMEWN TLQGDKKGRH GLYPLQQAFI EACTDRLCSP LQFMFPENVA LDDDGVAIAS GLSMLPSKYD IQRFDENIRQ EISLADPKEG GGDLSSVTMI ANCVVSMISE LCLRAKNALS GIGESGYLNS DWSMTESLKH DRKVTVILFT VANYLRIAPD TVFLAPYRPS ISLQQEEAAS VCQVALQPAL KEIEKMVKNS VTSPLGRAIN KRIGDTMAKM HQGVYLGSNV GIDEDSPAFV QKHLNGIYEI ISKEILSKLP PEYGSAVATS VAMFSIYNFV SNFTLLRPLG ESARLHITQD LADLELALEQ LMLKSGNSVS LHFIGNGKPY LELRAVRQML FWTGLDSADK QAVDVAKSLL REPWMKDVRP STIFHYLYSY APSFLSSPYH TRRMKPEAYV RLLVKPDGSV EETEDDAWMT VMASCDAYQQ RASSGGSNMD GDIRVAEKLL TMGPDVMRRR GH
|
| |