Gene Information |
Locus tag | PHATRDRAFT_47575 |
Symbol | |
ID | 7202799 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 155497 |
End bp | 157859 |
Gene Length | 2363 bp |
Protein Length | 744 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182012 |
Protein GI | 219123398 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.759441 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACTGGAGTCG TCGCTCTTTT GACCGGCATC CCAGTCTCGT CTTGTCTTGT CTATACAGCA GAACTATGAA ATCAGTACAA TTGTTCCTGT GTAGTTTAGG ACTGGCGTTA CTCTTTCTCG GCACCACGAC GGAATGGCAA CGGCGCCTTC CAGACTTTTT GCGGCTCCCG GGGGAAGCGC AAGCACACCG TCGCGTCCAA ACGGGTAGTA CGGCTTCGAC TCCGCCCGGG CCCACCGCGC CGACCCCCGC AATCCGCAAA CCGTCCGTCG GGAGTCCCAC GTCTCCGACT CCGGTGGTGA CCACAGTCCC CGGCGGGACG ACCCAGACTC TTGCACAAAA CCAAACGACT CCCGAACAAG CGACTCCCAC GTCACTCCCG ACGGAATCAC CGGCGCCGTC GCCTTCACCG TCGGACGTAC CCACTGCCAA GCCTACCATC ACGCCCATGC CCACCGACAA GCCCACGGCG TCGCCGACCG AGTCTCCCGC CCCGTCCCCT TCGCCGTCGT CGATCCCCAC CGTGGCACCG ACCGAATCTC CGTCGAACGC TCCCTCGCAA GCGCCCACCA CGTCGCCGGC ACCGACCCAA GCGCCTTCGA CTGCTCCCAG TACGGCGCCC TCGTCTCCTC CGACCGACGT ACCCAGTGCG CATCCCACCG TTTCGGTACA ACCCAGTTTC CGTCCCAGTC GGTCTCCGTC GGATCAACCC AGTAATACTC CCACCGTGTC GCTGGCGCCG TCCGCATTGC CTAGTCTCGC ACCCAGTATC AGTCAACGAC CGTCGCAAGC CCCGTCTTAT ATTAGTGCAC AACAAGCCAT TGTCAACGTG ACGCTTGATC TCAACGCGCT CTTGACCAAC GCACAGATCG AAGCCTTGGA AGCGGCTACC ATTGTCTACA TGGAACAAGA GGCCGTTCCG GGAGGCTACC TCGAACAGGC GACTGTCAAC GTCATTGCTC AGCAAACGCG TCAACCAACT CCCTTTGGTC GAAGCCGTCG GGCTCTCCAG GAATATACCG TTCTGGAATT GGCACTGGAT GTGGCCGCAA CCTACACGGG ATCGGCAGTC GACTTTGATC TGGGTCTCTA TATGGCCTCC CGGTTGGATC CACCCAATCC CGTATGGATT CATTTGTTGG GCAACGAAGA CACTATTTTC TTGCCTTTGC GTCCCCCCAC GCCGGTAGGG ACCCGCAACG TTACTACCAG TCGAGAAGAA AGTGCCGAGA CACCCTCGGG CATGACCAAA GGCACTTACG CCGTGGTCAT TCTCACGGCT CTGGCGGCAC TAGCCCTCGG CGCAGTCTCC AGTGTCTATG CGGTCCGACA GCATCGACTC GAAACCTTGG GGACGGAACT CAAGAGTCCC CGGATGGTCG CCAACGCCAC GACGTGGAAT ACGGCCGATA GTAACGAATG GCACCAATCT TGTACTCTAG CGGAACAGGA AGAATCGGGG GAGAACTAGG AGGAAAAAGT AGAAGAAGAC CAGGTCAGTA CATCGGCATC CTGTCAAGTC TTGCCCCTGG GACTCGACTC GTTGTGTGCG ACCGATACCG TGGACCATGA AAGCTACTCC CGAGCCACCT CGACTGTCCC GTCTCCCATG GAAAAGGCAC AGGCGTCCGT GTTTCGACAT ACCGATCCGC CCGTCCGGAG GATGAGTCCG CGGGAATCGT CCGAAATTGA CTTTGGCCGG AACCGGGCCC TCTTGGATCA GGATGATTCC CTACTGTCCG GATCGTATGG TGACAGTCGC ATTTCGGCAC GTCCGCCTCC CCCACACCAA ACCACGGGTC 
GGTACCCCGG GGCCGCGGCA CATTTTCCCC ATTTCCAACC ACAACGAACA CACCATGCCG ATTTGGATCA GGAAATTCTT TTGACGAAAA GCGTGGGTGG TACTTTGGGT GCCAAGGCTG TGGACGATGA CAAGGCCAGT GCGTCGGATT TTTCGTCACA GGCCAAGTTC TATTTGAGTC GACTCTTGGG AACCAACACA GCGTCCGGTC CACTTTCACA AGCATCAAGT CGCGACCACG GGACGACGAT TCAAAAGACG GTGTCGTACG ATTCGGTATT GCGTCGACCG GGGTTGTACG ATGTCTTTGC CCCACCCGGT CCCATTGGTA TCGTCGTCGA CACGACCAAG GATGGTCCAG CCGTGCACGC TCTGAAGACG ACCTCACCCA TGTTGGGATT GATTCAACCA GGCGATTTGA TTGTCGGTTT GGACGATCAG GATACCCGCA GCATGACGGC CGCGACTCTG ACGCGCCTCA TGGCGGCGAA AGCCCAGGAG CACGAGCGTA AAATTACGTT GCTTACGAAC GAGCATGTTC AAACAGCCTA TCCTTTGTAC TAA
|
Protein sequence | MKSVQLFLCS LGLALLFLGT TTEWQRRLPD FLRLPGEAQA HRRVQTGSTA STPPGPTAPT PAIRKPSVGS PTSPTPVVTT VPGGTTQTLA QNQTTPEQAT PTSLPTESPA PSPSPSDVPT AKPTITPMPT DKPTASPTES PAPSPSPSSI PTVAPTESPS NAPSQAPTTS PAPTQAPSTA PSTAPSSPPT DVPSAHPTVS VQPSFRPSRS PSDQPSNTPT VSLAPSALPS LAPSISQRPS QAPSYISAQQ AIVNVTLDLN ALLTNAQIEA LEAATIVYME QEAVPGGYLE QATVNVIAQQ TRQPTPFGRS RRALQEYTVL ELALDVAATY TGSAVDFDLG LYMASRLDPP NPVWIHLLGN EDTIFLPLRP PTPVGTRNVT TSREESAETP SGMTKGTYAV VILTALAALA LGAVSSVYAV RQHRLETLGT ELKSPRMVAN ATTWNTADRR IGGELGGKSR RRPVLPLGLD SLCATDTVDH ESYSRATSTV PSPMEKAQAS VFRHTDPPVR RMSPRESSEI DFGRNRALLD QDDSLLSGSY GDSRISARPP PPHQTTGRYP GAAAHFPHFQ PQRTHHADLD QEILLTKSVG GTLGAKAVDD DKASASDFSS QAKFYLSRLL GTNTASGPLS QASSRDHGTT IQKTVSYDSV LRRPGLYDVF APPGPIGIVV DTTKDGPAVH ALKTTSPMLG LIQPGDLIVG LDDQDTRSMT AATLTRLMAA KAQEHERKIT LLTNEHVQTA YPLY
|
| |
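The Gene Length field above is consistent with 1-based, inclusive coordinates: End bp minus Start bp plus one. The following is a minimal sketch of that arithmetic, together with a generic GC-percentage helper. The function names `gene_length` and `gc_percent` are hypothetical and are not part of any tool that produced this record, and the record does not state how its GC content value was calculated.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Start bp and End bp as listed in the record above.
print(gene_length(155497, 157859))  # prints 2363

# Toy input only; this is not the gene sequence from the record.
print(gc_percent("GGCC AATT"))  # prints 50.0
```

Whitespace is stripped before counting because the sequence fields in this record are shown in space-separated blocks of ten.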