Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46389 |
Symbol | |
ID | 7201646 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 188447 |
End bp | 190542 |
Gene Length | 2096 bp |
Protein Length | 647 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180775 |
Protein GI | 219120056 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.105715 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAATTCCGTG CGTAACCAAA CGCAATTCAC CGGCTTATAG ACGAGAAGTT CGTAGTGTGT GGTAACCATG ACATACTCGC GAAGAATTGT ACTATGCGTT TGGTTGGCCT GGTTGTTGGT GGTCACGGAA GCCGCGACGA TCCCGTCTTC GCACGACTCG AGAACAAGCA AGCTCTCCAC GGCGAAGAAA GGCGACGATA GCTTTGTCTC GATTCCACTC ATTCCGCACC ATGTGCAAAG GCGTCGTCGT ATGTTGGAGA CTGGTGTCGA AGATGAGGCC CTCCCCCGTC CCACTCGCTC GCGTCGGGAC TTGGCATCTA CGAACACGGA CCGGGAAATT CAGCAAGTCG GCGCCTTGTA CCAGGGATAC GGAACGCACT ACACGGACTT GTGGTGCGGC ACACCGCCGC AACGACAGAC CGTTATTGTC GATACAGGCT CCGGTGTTAC GGCCTTTCCC TGTAGTGGAT GCGGTGATTG TGGTGTACCG AAATATCACG CGAATCCCTT GTTTGTGGAA GGGGATTCGA GTTCCTTTCA CGAGCTCAGT TGTACTGAGT GTTTGAAAGG AACCTGTCGC TCGGGGGCGA AGCAATGTCA CGTGGGAATG TCGTATCAGG AAGGAAGTAG CTGGAGCGCG TATGAAGCAC AGGATCGCTG TTATGTTGGT GGCTTTCACA ATACCGCGGC AGTGGATAGC GGGTCCAATA GTCCGTTGGA CCTCAACCGT GCCGAGGCCT TTGCCTTTGA TCTCAAATTT GGCTGTCAGA CCCGTCTGAC AGGCCTGTTT AAGACACAGC TTGCCGACGG AATCATGGGC ATGGATATTG CGAGTAAGTT TCTGCCCAGC ACGATTTTGT TTTTGGCTGA TTTTTATTCG TCAGGTGGGC TAATCATCTT TTCCTCCGGA ACTTTTGCAG AGGCTGCCTA CTGGCAGCAG ATGTACGATG CGGGAAAGAC TGCCAGCAAG AACTTTGCGC TATGCTATGG TCGTCAAGAC ATTGTTGAAC GGGAAGGAAC TGAAGCCGGG GCCATGACCT TGGGTGGTTT AGACACCCGT CTACACAAGT CCGACATGGT CTACGCCTCC ACCGGTGGCA CAAGCCAAAG TTCAGGGTTC TACAGCGTTC ACGTCCGCAA AATTCATCTT CGTGCCGGAA ACGGTGGAGA CTCGGCCGTC AGCAATTCTG AAGGACTCGA GGTTCGTGCT CTCGATCTCA GTGAAAGCGA CCTCAACAAT GGACGAGTCA TTGTAGATTC GGGCACCACG GATTCCTACT TTTCGCGTAG AGTGGCGTCC GAGTTCAACC GGGTCTACGA GGAGATAACG GGGCAATCTT TCACGCATGC GGCTCTTAGT CTCACGGAGG AACAAATCAA TGCCATGCCT ACAATCCTTT TCCAGTTGGA AGGAGACGAA GAAGCAAACA AGGCGCTCGT AGAAGAACAT CCCGACCGCC AAATTGTTGG TCTGGCAAAC ATTGTGGATC CTGAGCATCC TTTCGATATT TTGGTCGCCA TGCCACCTAT GCACTACATG GAATACGACT CTTCCAAAAA ATTGTGGCAA GCTCGCTTTT ACGTTGATGA CAGTAGTGGT GGTGTTTTAG GTGCCAACAC AATGATGGGA CATAATGTCT TCTTCGACAT CGATAATGGT CGCGTTGGCT GGGCCGAAGC ATCGTGTGAT TTTACAGCCC TTGAAGCCGA ATACGGAACT GACGACTTCG CCAGCGATTT TACCGATCAT ACTAGGCAGG AACGACCGGA ACCGGATGAT CACGTTTCAG CGCAAGAGGC TAAGTTCGAA CCCGACGACA CCCTCCCGAA CAGCAACTCG GGCTTTGATA TGGGTCTGCC GAATCAGTTT TGCTCTACTA TGCAGTGCCA GTTGGGTATC GTGATTGGGG TGCTGGCCGC CGTTGTCTTC GTTGTCGTGC GTGTGGTGCG ACGCAACACA GGAATTGCCT ATGAAGCGGC TGAATTAGAG CTTCAAATGA CTGCCTTACC AGAGGATGAC GACGACATGG CCGGCTACCG AGATCACGTG GCACCGGCAA ACACGGGATA CGAGAAAACC GATGAGGAAA TGCCCC
|
Protein sequence | MTYSRRIVLC VWLAWLLVVT EAATIPSSHD SRTSKLSTAK KGDDSFVSIP LIPHHVQRRR RMLETGVEDE ALPRPTRSRR DLASTNTDRE IQQVGALYQG YGTHYTDLWC GTPPQRQTVI VDTGSGVTAF PCSGCGDCGV PKYHANPLFV EGDSSSFHEL SCTECLKGTC RSGAKQCHVG MSYQEGSSWS AYEAQDRCYV GGFHNTAAVD SGSNSPLDLN RAEAFAFDLK FGCQTRLTGL FKTQLADGIM GMDIAKAAYW QQMYDAGKTA SKNFALCYGR QDIVEREGTE AGAMTLGGLD TRLHKSDMVY ASTGGTSQSS GFYSVHVRKI HLRAGNGGDS AVSNSEGLEV RALDLSESDL NNGRVIVDSG TTDSYFSRRV ASEFNRVYEE ITGQSFTHAA LSLTEEQINA MPTILFQLEG DEEANKALVE EHPDRQIVGL ANIVDPEHPF DILVAMPPMH YMEYDSSKKL WQARFYVDDS SGGVLGANTM MGHNVFFDID NGRVGWAEAS CDFTALEAEY GTDDFASDFT DHTRQERPEP DDHVSAQEAK FEPDDTLPNS NSGFDMGLPN QFCSTMQCQL GIVIGVLAAV VFVVVRVVRR NTGIAYEAAE LELQMTALPE DDDDMAGYRD HVAPANTGYE KTDEEMP
|
| |