Gene Information |
Locus tag | PHATRDRAFT_50588 |
Symbol | |
ID | 7199368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | - |
Start bp | 240266 |
End bp | 242873 |
Gene Length | 2608 bp |
Protein Length | 698 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185545 |
Protein GI | 219130801 |
COG category | |
COG ID | |
TIGRFAM ID | |
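
The Start bp, End bp and Gene Length fields above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here, not something the record states. A minimal sketch of the check, using a hypothetical helper name `interval_length`:

```python
def interval_length(start_bp: int, end_bp: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
print(interval_length(240266, 242873))  # 2608
```

The strand field does not enter the calculation, since the length of the interval is the same on either strand.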
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.106084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCGTTCGTG CAGGATTCTG TCAACAATCT CGCGTTCGTC ACGGTAGCTG TCAGCATCTG TCATCGACAA TCGTTCACTT CCGAGTCCGT TTTCGGCTTT GAAACTTTCA GCATTTCACC AAAGATCCTC CATTTGTCCC GTACGAAACA CTCTTGGTGG TTACGTTTCT ATCCTTCCAA CAGAGAAGTG CTTGCTACGC AATCTGTAAT AGCTCCGTTT CCAGGCAGAA CACTTTCGAC TCTGAGTTGT CGGGATCTAT CGACGGACGG AAGGAACATC CATACACGTA TCGCATTTAC GTATCAAATA CACGTATCGC ATACACGCAC CCGTCGTCGA CACGAAAATG ATGAACGCGT TTACTACGGG ACCGGAGTCA ACTGCTGCCG ACCGCCACAA CAGACGCGCC ACGGGAATTT CTTCCGAGGC AGAAGAACGC CGGATGGAAA CCTTGGAAAT CATGAACACA CCAGTCGAAA ACGATGCGGA AGAGGAGAAT GATGATGACG ACGACGAAGA AGAAGAGGCG CTGGCGCCTG TTGCTGGATG GAGATTCTGG TGGATCTTTG CCAGGTTTCC GTTGGGTCTG CTCCTCGTTA ACACTATCCT TCTCGCTTTA ATCACCCTAT ACAATCCACA ATACTCGCCC TCATATATTG TGGGAAACTC GACGGACGTT GAAATGAACG ACGCCTTCGA TTCCGACGAG CGAAGGGATT TCGGGCTTGC GCAGTTCAGT GCGCTCGAAC TGCAATCATG GATTGTCAAA TGCGGGATGG TAGCCCTCTT AGCTTGCCTG GATGCTATTG TCTTTTACTG GTTCACAGTA CGTCTCAAGA AGGGCATGGA ATTGCTTGCG CAAAAGGCTC AGCAAGAAGA GCCTCAAGGA CGGCCAGTCG CAAATGAGAA TCTGAACAGG ACAAGAACGG CACCGTTGCA CAAGTTGGAC TTGAACTACA GCGAACTGGA CCTGGTACAC GCTCGGGTGA CCCAAAAGAT AGACTCGTAT CTATTTAGGG ACCCAGCGCG GCAACCGACC GTCCCAATCT TGGTCAGCCT GTTGTACCTA CTGACGGCAC TCACAGTGAC TGCTTGTCTG ACTACGCTAT CACTTTTGAT ATTTTTGTAC AGCTCCGAAG GATCTTCCAT GTGCTTGGAG AGCATCACAT CACAGTCAAG TGAATCCGTG GAATTTGATA TTGACTCGAT TGAAAATATC CCGCAAGAGT TACAGGAATG GGCAAGTCCG AGATATTCAT ATTCTGATGG ATCATCATAC ATTCATATGA GTGACGGAAC TACTTACTTT CGCGGTAGAA TGGCCGAAAA GGAACATCAT GGTTGGGATT CCTATATGGA TACGGAAACA CTAGTTGCGA CCAACGTAAA TGGAAATCTT ACTGTGTACA GTCAACTCCA CGAACCGCAT AGCTTCGTGA GTATTAACGA AGGTTCTGGT GAAACAATGG AAGGATTCTG TTTCCTCTAT ACGGAGTTTG CTGGACACGA CGACGAGGAA ATATATGAAT ACAAAAAGCA AGCAGTTGCA TGTGTGACAT CAGATGAAGA TAAAAGTCAG GGATTTCGAA ACGCAACTCT TCTAAACAGT GACAAATGGA ATCGGATTAG ATGGTCCGCT GGCAAAGCTC ACGATGGACG ATACTGGATC AGGCTGCAAG AGGATAGATT TGTTGATGAG TGGTCGTCGT ATCAAGAGGT TTTGCATATC ATACAGCTGG ACCCGCAGTC GATGATGTAT ACTATAGTTG CAAACTCAAC TTCCTTCCCC GATTTTCAAC 
AACCCTTGAG GAACGAAGGA AGTAGATGTT TCCGCTGGAC TAGCGGCATT GGGTTCGTTG CTGCCGCAAT ATCTCTGTTT CTTTCGGCGC TGGTGCTCCT ACTATTCATC AAGACAAAGT CGGGAGCAGG TTGCTTGGCG TTGTCTATCT TTGCAGTCCG ACTTTGGCTG GAAGAAACTT GGCTCGACGA AACTTGGCTA GACATGCTGA GTCCAGTATT GCTTGTTTTT ACATTCATAT GCTTGTGCAC GGCATCACTC AGCCTTGCCG TCCGAGAGAT GTTGCTATGG GGAATGTATA GTGTCATTGT GATGCAATTG GTTTTAGCTA TTCAATTTCA GGTAATGGGG ACTATTGGAC TAGGTATGGG CCTCGCGCTG GACCATCCTG TACTTCAGCT GGGTGGATGG ATTGGAGGAC CATTTGCTGT CTTTCTTCTT TTGTTCTACT CAATCACTGA TTCCATTGAT TGGTTGGAAC TGGTGGCGTT TATTCCATTG AGTGTCCTTA TTGCTTGCGG GATGGTGACG GCAGGGAATC AACTCACAAG ATACCGCCCA TTTCTGCTGT TCTACCTGAG GCGTTTGTGG CGATCGCTTT GCCTGAAAAT ACGGCCGCAA ATTCGACAAC AAAGCAGAAG CTGAGAAATG GACTTTTATA CACAGATTTA AAGGCATGTT GACTAGCTGC TGACTGTGAG ATTCTATTTT TACGTTGTAT CTTGTGGACG TCGAACAATC CATCACAGTC ACCACTAAAA CTCGGCAAAC CATAATCACC AAATCTTAAT GAGTATTGCT TTACTTGC
|
Protein sequence | MMNAFTTGPE STAADRHNRR ATGISSEAEE RRMETLEIMN TPVENDAEEE NDDDDDEEEE ALAPVAGWRF WWIFARFPLG LLLVNTILLA LITLYNPQYS PSYIVGNSTD VEMNDAFDSD ERRDFGLAQF SALELQSWIV KCGMVALLAC LDAIVFYWFT VRLKKGMELL AQKAQQEEPQ GRPVANENLN RTRTAPLHKL DLNYSELDLV HARVTQKIDS YLFRDPARQP TVPILVSLLY LLTALTVTAC LTTLSLLIFL YSSEGSSMCL ESITSQSSES VEFDIDSIEN IPQELQEWAS PRYSYSDGSS YIHMSDGTTY FRGRMAEKEH HGWDSYMDTE TLVATNVNGN LTVYSQLHEP HSFVSINEGS GETMEGFCFL YTEFAGHDDE EIYEYKKQAV ACVTSDEDKS QGFRNATLLN SDKWNRIRWS AGKAHDGRYW IRLQEDRFVD EWSSYQEVLH IIQLDPQSMM YTIVANSTSF PDFQQPLRNE GSRCFRWTSG IGFVAAAISL FLSALVLLLF IKTKSGAGCL ALSIFAVRLW LEETWLDETW LDMLSPVLLV FTFICLCTAS LSLAVREMLL WGMYSVIVMQ LVLAIQFQVM GTIGLGMGLA LDHPVLQLGG WIGGPFAVFL LLFYSITDSI DWLELVAFIP LSVLIACGMV TAGNQLTRYR PFLLFYLRRL WRSLCLKIRP QIRQQSRS
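
The sequences above are displayed in space-separated blocks of ten letters, so the spacing has to be removed before computing a length or base composition. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace the display inserts between sequence blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C letters in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used only as a short demonstration.
fragment = clean_sequence("CTCGTTCGTG CAGGATTCTG")
print(len(fragment), gc_fraction(fragment))
```

Applied to the full gene or protein sequence, the results can be compared with the Gene Length, Protein Length and GC content fields of the record.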