Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40962 |
Symbol | |
ID | 7198689 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 429766 |
End bp | 431815 |
Gene Length | 2050 bp |
Protein Length | 648 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184875 |
Protein GI | 219129394 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGAAG GGGATTCCCG CCGAAAGATT GGTGCTCAGG TGACAGCGAA GGCCTGTCAT GTTGTCCATT TGAGTGAGTG TGCTCGGCGA TACGGTGCTT TGAGGACCAC CAAGGTCGTT GTGGGGACTG TTGTGGAGGT CAACAATACC AGAAAGGCGC CAAACAACCG TGTATCAACC TTCATTACTG CTGACTTTGA TATTGGTGGA GGATCAGTCA AGCGGAGCAC TCTGAACATC CGTAGCGTCA AACTTTTCAA ACCGGACCAG TCGACAGTAC CAGCCAGTCC CGCAGCACCA ATACCGGCAG TAGACAACGC AGACACAGAT TTGGCCGTTC CAGAGCAAGA GGAAGGAGAA GCGGTCTTGC AGGAGACTTC TCCTGATGAA GAATTGGAAT TTCCAGCACA ACCGATGATG AAAATTGGAA TAGCTGCGGG GGAACAGGTA GCAGGACCTA CCGCACAAGT AGCCACGCAG GTTTGGGGTG TTGAAGACGC TTCCTTTGTA ATGGCTCATG AAACAAAGTG GTATGCTGAC GAGCAAGCTA CATTGATTGA TATAAATGGC AGTGTCCAAA GTAAGCAGTT TGGCATCAAT ACACCAATTG GCGACCTTCT TGGTCCAGAC TCTGACATTG ATGGAAAATA TTCGCGGCTG CAATTTTTTC TTCTCATGTT TCCACCCGAC CAACTGAGCG CCATGTGTCA GCTAACAAAT GTGCAGCTTG CCCAACAGAA CAAGCACTGC ATGTCAACAG GAGAGCTGCT TTGATTCTTT GGCATTCTAA TTCTTGCGAC AAAATTTGAA TTTAGCAGTC GATCGCAATT GTGGTCCACA ACCGCGCCGT CAAAATACAT TCCTGCCCCT GCATTCGGAA AAACAGGAAT GTCGCGGCAG CGCTTTGATG ATCTTTGGCG AAATATCCGA TGGAGCAACC AGTGTCCTGA ACGGCCGGAA GGTATGAGCT CCCATACGTT TCGGTGGCAA CTTGTCGATG ATTTTGTTGA AAGATACAAC AATCATCGAG CCAATACTTT CAAACCATCT CATCTTATTT GTGTGGATGA ATCAATGTCG CGATGGTATG GACAAGGGGG GGAATGGATA AATCATGGAC TCCCCAATTA TGTGGCTATT GACCGAAAGC CCGAAAACGG TTGTGAGATT CAAAATGCCG CATGCGGCTG TTCGGGTATC ATGCTTCGAT TGAAGGTTGT AAAGGGTAAG ACAGCAACAG AAAATGATGG GGACTACAAT GAACAGTTGC TGCATGGAAC AAAGATCCTC AAAGAGCTTG TCCTTCCTTG GTGGTGGACG GATCGGATTG TTTGCGCTGA CTCGTATTTT TCATCTGTCG GTACAGCTAT GGAGTTGCAG CGACATGGTT TGAGATTTAT TGGAGTTGTA AAAACAGCAA CAAAACAATA TCCGATGAGA TACCTTTCGA CTTTAGAGTT GAACCAGAGA GGCGAACGGA GAGGGCTTGT GATGCGAGAT GTTGATACAA ATTATAGCAC TCTGTTGGCT TTTGTGTGGA TGGACAGGGA CCGCCGATAT TTTGTGTCGA GTGCTTCCAG TCTGGATGCA GGCAAGCCCT ACGTACGCTA TCGTTGGAGA CAGATTGACC AATCTCCGGA TGCAGATCCA GAGAGGCTGG AAATTATCAT TCCACAGCCC AAAGCAGCGG AATTATACTA TTCTGCATGT GGGATGATTG ACAGGCACAA TCGAAGTCGT CAGGATACAC TGATGCTTGA ACGAAAGTTG GGTACAACAA ATTGGTCGAC AAGGGTTAAC CTCTCAATAT TTGGAATGAT TGTTGTTGAC ACTTGGTTGG CCTACAGTCT GTGTACAGGA ATAGGAAGAG CTAACGGGAG AGAAGAAAAG CAGAAAGACT TCTACACTGC CTTGGCTGAG GAGCTAGTGG ATAACCAATA CGACAATGTT GGAAGTCGCA GAGTTTTCGT GGAGACAAAT TTGGACAATG ACAGCCCAGC ACTTTCAAGG ACTACAGGAG AACCAAGAAG TGGCCTGTAC GCACATCTAA
|
Protein sequence | MSEGDSRRKI GAQVTAKACH VVHLSECARR YGALRTTKVV VGTVVEVNNT RKAPNNRVST FITADFDIGG GSVKRSTLNI RSVKLFKPDQ STVPASPAAP IPAVDNADTD LAVPEQEEGE AVLQETSPDE ELEFPAQPMM KIGIAAGEQV AGPTAQVATQ VWGVEDASFV MAHETKWYAD EQATLIDING SVQSKQFGIN TPIGDLLGPD SDIDGKYSRL QFFLLMFPPD QLSAMCQLTN VQLAQQNKHC ISRSQLWSTT APSKYIPAPA FGKTGMSRQR FDDLWRNIRW SNQCPERPEG MSSHTFRWQL VDDFVERYNN HRANTFKPSH LICVDESMSR WYGQGGEWIN HGLPNYVAID RKPENGCEIQ NAACGCSGIM LRLKVVKGKT ATENDGDYNE QLLHGTKILK ELVLPWWWTD RIVCADSYFS SVGTAMELQR HGLRFIGVVK TATKQYPMRY LSTLELNQRG ERRGLVMRDV DTNYSTLLAF VWMDRDRRYF VSSASSLDAG KPYVRYRWRQ IDQSPDADPE RLEIIIPQPK AAELYYSACG MIDRHNRSRQ DTLMLERKLG TTNWSTRVNL SIFGMIVVDT WLAYSLCTGI GRANGREEKQ KDFYTALAEE LVDNQYDNVG TQHFQGLQEN QEVACTHI
|
| |