Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21789 |
Symbol | |
ID | 7202678 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 352157 |
End bp | 354042 |
Gene Length | 1886 bp |
Protein Length | 559 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181891 |
Protein GI | 219123145 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.207672 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGCAAGGAAC GACGGTAGAT GCATTCCTAG TCTTTTCATT GTCCTTACAA AACCATTCAT CGCACACCTA TCTCATAATC ATGATGCAAG GAGGAGGACC CATGGGACAG GTGATGGTCT TGAGTAAGTG TAAAGCACGG GGGGCGGTGA ACACGAATTA GTTTAGGTAC CCACCGCAAA CACACAGACA CAAACTGGCC TCACACGTTT CTGGCCTTTT TCATCCTTCG TCTTTTTCGA TTCCACCAGA CACCAAAACC CAACGCCAAA CGGGTCGCCA GGCCCAACTC GCCAACATCC AAGCCGCCAA AGCCGTCGCC AACATTATCC GCACGACCCT CGGTCCCCGC AGTATGCTCA AGATGCTACT CGATCCCATG GGTTCCATCG TATTGACCAA CGACGGGCAC TGCATTCTGC GCGAAGTCGA CGTCTCCCAT CCCACCGCCA AGTCCATGAT TGAACTCAGT CGCTCGCAGG ATGAAGAAGT CGGGGATGGC ACCACGTCCG TCATTGTCCT CGCCGCCGAA GTACTCGCAC AGGCCGAACC CTACCTGCGA CAAGACGCCA TGCACCCGAC CGTCCTCGTG TCGGCCTACA CCAAGGCACT CGCACAGGCC ATGATCATTC TGGAGGAACA AAGTGTCACC ATTGATGTCG AAAAAGACCA CGAACTCATG AAACTGCTCG TCCAAAGCTC CTTGGGCACC AAGTTCTCCT CCCGCTGGAA CGATCAAATG GTCGAAATGG CACTGCAGGC CGTCCTCACC GTTTCCCAAA AACGAGCCAC CGCAGCCGAT GGTGTACTCA AACAAGTCGA GATTGATATC AAACGCTACG CCAAGGTGGA AAAAATACCC GGTGGCGAAA TACAGGAGTG CGCCGTACTG GAGGGTGTCA TGTTCCAGAA AGACGTGGTG CACGCCAATA TGCGCCGCCG CATTGAAAAT CCCAAAATAT TGTTACTGGA TACGCCGCTC GAGTACAAAA AGGGCGAGTC ACAGACGAAC ATGGAAATCA CGGACGAAAA TGATTGGAAT ACGCTGCTCA AATTGGAAGA AGAGTACGTT GCCAACATGT GCGCGCAAAT TATTGCGGCG CAACCGGATA TTGTGGTGAC GGAAAAGGGC GTCAGCGACT TGGCCCAGCA CTATCTGCAC AAGGCCAACA TCGTCGCCTT TCGCCGGGTA CGGAAAACGG ATAACAATCG GATTGCACGA GCCGTGGGTG CCACCATTGT GTCCCGCACG GACGAAATCG ATGATTCCGA CATTGGAACC GGCTGTGGCT TGTTCGAAAT GCGACAAATT GGATCGGATT GGTTCTGCTA CCTCACCAAG TGCAAAGAAC CCAAGGCCTG TACTATTGTA CTCCGCGGTG GCTCGAAGGA TGTTTTGAAC GAATTGGAAC GCAATTTGCA GGATGCCATG CAGGTGGTGC GTAACGTTGT CTTTTCGCCC AAGCTTGTAC CAGGCGGCGG GGCCATCGAA ATGGCTCTCG CCGTGGGCCT CAAACGCACC GGCCAAAAGG TCCAAGGCAT TCAGCAAGGC CCCTACATGG CGGTCGGCGA AGCTTTGGAA GTCATTCCCC GCACACTGGC ACAAAATTGT GGCGTTTCCG TCATACGCGT GCTGACTGCC CTGCGGGCCA AGCACGCGGC CGCCTACGAC GAAGCCCAGG ATAAGACCGG AAGCGACGAC AGCAAGCAGG CCGCCTTTTG TTCGTGGGGA ATTAACGGTA CGACGGGTGA ATTGGTAGAT ATGAAGGAGT TGGGTATTTG GGAGCCGTTT GCGGTGAAAG CACAAACCAT CAAGACGGCC ATTGAAAGTG CCTGTATGAT TCTACGCATT GACGACATTG TATCGGGGTC CAAAAAGCGC GGGTGA
|
Protein sequence | MMQGGGPMGQ VMVLNTKTQR QTGRQAQLAN IQAAKAVANI IRTTLGPRSM LKMLLDPMGS IVLTNDGHCI LREVDVSHPT AKSMIELSRS QDEEVGDGTT SVIVLAAEVL AQAEPYLRQD AMHPTVLVSA YTKALAQAMI ILEEQSVTID VEKDHELMKL LVQSSLGTKF SSRWNDQMVE MALQAVLTVS QKRATAADGV LKQVEIDIKR YAKVEKIPGG EIQECAVLEG VMFQKDVVHA NMRRRIENPK ILLLDTPLEY KKGESQTNME ITDENDWNTL LKLEEEYVAN MCAQIIAAQP DIVVTEKGVS DLAQHYLHKA NIVAFRRVRK TDNNRIARAV GATIVSRTDE IDDSDIGTGC GLFEMRQIGS DWFCYLTKCK EPKACTIVLR GGSKDVLNEL ERNLQDAMQV VRNVVFSPKL VPGGGAIEMA LAVGLKRTGQ KVQGIQQGPY MAVGEALEVI PRTLAQNCGV SVIRVLTALR AKHAAAYDEA QDKTGSDDSK QAAFCSWGIN GTTGELVDMK ELGIWEPFAV KAQTIKTAIE SACMILRIDD IVSGSKKRG
|
| |