Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44838 |
Symbol | |
ID | 7199555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 392159 |
End bp | 393547 |
Gene Length | 1389 bp |
Protein Length | 406 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178764 |
Protein GI | 219115938 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000100834 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACAGATCG TCACACGTTG TTCACAGTCG CGCGAAACGG TTCTAAGATT GCCTTCATAC AGGGATTTCG AATATTCGCT CACACAAATG AAGACTTCGT CGCGTGTTCA CATCCGAAAT CGTGTGGTCG TCGAACCCGA ACTTCATCTC CATACCGTCG ACATCACCGA AGCCATGAGA TTACGTGGGG ATTCGGGGAC CTCACGTCGA GCAACAACGC CAATGACTTC GGACTTTGAT CTTGCCGTAA TGCCGCGCGA GAAAATTTTT TCTCGATCTC TTCATTCATG GTCTGCACAT CATTCTCTGA CGACGAACAA CTGGATTGCT ACCATAGCTC CCGCGTCGAG CCCTTCCGAA CCGGGCAGTA GCAACAAGGC CCGCTACGTA CAGATTCCTT TTCCTTCGGA GCGGGAAGCT CGTAGGTTTT GCAAGGCGTA CTCTCCCCCC AGACTCAGTA CCGCTGTTCT CTGTCAGCTT TGTCAGCTTA CGCCGCAAGC CGCTCGTCAC TGCCGAAATT GCGGGGTCAC CGTCTGTGAC AGCTGTTCCA CGCGTTGGGG CATTCGGATG GTTCCCAAGA CCTACAACCC ACAGTTGCTA ACTACTACTG TCCGGGTGTG CAAGTCCTGC GATTGGCTGT CCAACGCGTT CTGCATGGCG TTACTTCAAG GTCGTTACGA AGATGTCTTA ACAATCGTCG AAACGGGCAA CGTTAATCTC CGTACCTGCT TTGCGGACAT TCATCAGGAA GCCATGTTTC CTGTCCACTG TGCCATTCTT GGGGGGTCGC TCGCCACGCT GCAGTGGTTG GTGGAGATGC AAGGATGTCC CTTGTCAGTC AAGAAGGATC CCAAAACGAA CCGGGCCTTG TCGTTGCAAA CATCCGCTTC CCGCACCTTG CTCGATCTTG CGATGAAAGG ACGTCCCAAA ATTGACATCC TAGTGTACCT CATACAAAAC GGATTGAGCA TTAGTGATGT ACATGATCCA TCGCTGGCCC CCAAAACCCT CGAAGTCGTT CTAAAAGCGG GCTTTCCCAT TCACTCTATC GATACGCTCA TGCCCGATAT GCCTGTCATA ACCGACGAAT CGGACAAAAA ATTCGATGTC AGCCGCAACA AATCTTTGGT GTACGAGGAA TCCGTTGCGA CACTTGAAGA CGCTTGCGTC TTGTGCTGCG AACGCTCTAC GGATTGTGTG CTTATCCCTT GTGGACATCA GATTTGTTGT ACCGATTGCG GTCACCAACT CACTTCTTGT CCGGTGTGCA AGATAAATTG CAGCGTACTG CGAGTCTATC GTCAGTAGTG GACGTAGGAG CCATCAGCAA CGCAGCGCAC TGGAAACCGT TTCTGAGTCT AGATTTGCAG TCCGTACTCA TTTTTAAAT
|
Protein sequence | MKTSSRVHIR NRVVVEPELH LHTVDITEAM RLRGDSGTSR RATTPMTSDF DLAVMPREKI FSRSLHSWSA HHSLTTNNWI ATIAPASSPS EPGSSNKARY VQIPFPSERE ARRFCKAYSP PRLSTAVLCQ LCQLTPQAAR HCRNCGVTVC DSCSTRWGIR MVPKTYNPQL LTTTVRVCKS CDWLSNAFCM ALLQGRYEDV LTIVETGNVN LRTCFADIHQ EAMFPVHCAI LGGSLATLQW LVEMQGCPLS VKKDPKTNRA LSLQTSASRT LLDLAMKGRP KIDILVYLIQ NGLSISDVHD PSLAPKTLEV VLKAGFPIHS IDTLMPDMPV ITDESDKKFD VSRNKSLVYE ESVATLEDAC VLCCERSTDC VLIPCGHQIC CTDCGHQLTS CPVCKINCSV LRVYRQ
|
| |