Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42548 |
Symbol | |
ID | 7196254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 385208 |
End bp | 386280 |
Gene Length | 1073 bp |
Protein Length | 346 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177081 |
Protein GI | 219110659 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0379002 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGACGG TGAGACTTGT GTCGCTTTTG TTGGTGTTGT CGACAGCCAA CAGCTTTCTA GCACGACTTT CCAGTACAAG GAATCGTAAC GAGGTTGCTG TGGCTTGTCG GAGCGAACCG GAAACTAGCA ATTACTGGGG AGATGACACT GATGACCAGG ATACCCTCCA GCCCAGCTTG ACGCCTCTGT CGTCGTTCGC GGCGTCTCCC GCCTTGTTTG AGCTAGATCC GGCTTCAGAC CAAGCTAGAG ATATCGTCAT GAACGATTTG AAACTCTCTG GTGCCCAACA CGAACAACTG GTGTCGTTGT GTCAAGCCGT TGTGGACTGG AACGACCGAA TAAATTTGGT TTCGCGGAAG GATTGCACGG TGGCAACCGT ATTTGGCCGG CACGTACTAC CTTCCATTGC CTGCTGTGCC TTTTCAGAAG ATCAAAATCC CTTAAACACT GCTAAAACAC TGGTCGACGT TGGCACGGGC GGTGGCTTTC CCGGATTGCC GTTGGCGATT GCCTATCCCG ACGTTCAGTT TGTCCTCCTT GACAGTGTAG GCAAGAAACT GACTGCGGCC CAAGACATGG CAAACGCTCT GGGACTCGAC CACGTTCGTA CACATCACGG GCGTGCCGAA GATTTACGGG ACGAGGTCTT CGATGTCGCC ACGGGTCGTA GTGTGTCGGC CATCCCACAA TTTTGTGCGT GGATGCAGCA TTTGGTCAAA CCCACGGGGC ATCTTCTCTA CTGGATTGGC GGCGACGTCG ACGCGAGTAT TCTGGAACAA ACTGTTTCGG ATACCCCCAT CGAGTCGCTA GTACCCGACA TGGAATCGGA TAAGAGAATA TTAATACTCC CCCAGCTTGC TGTGAAGAGG ATCGCCAAGG CTAGTGGAAT TTCTGTGCAA CCGTCACCAA CCAATCGATC ACAAAGAAAG CGCCCATCGT CCCAGAGAAA GACGACAGCC AAAGGATCTT GGAGCCGCCG AAACTCGGAA GAGCCCAAGC AGCGCGGCTA CGAAGGCTTC AAGCGGTATT CTAGCTCGTA ACCTACTTTT ACGATACCAG GACAACAAAA CCC
|
Protein sequence | MSTVRLVSLL LVLSTANSFL ARLSSTRNRN EVAVACRSEP ETSNYWGDDT DDQDTLQPSL TPLSSFAASP ALFELDPASD QARDIVMNDL KLSGAQHEQL VSLCQAVVDW NDRINLVSRK DCTVATVFGR HVLPSIACCA FSEDQNPLNT AKTLVDVGTG GGFPGLPLAI AYPDVQFVLL DSVGKKLTAA QDMANALGLD HVRTHHGRAE DLRDEVFDVA TGRSVSAIPQ FCAWMQHLVK PTGHLLYWIG GDVDASILEQ TVSDTPIESL VPDMESDKRI LILPQLAVKR IAKASGISVQ PSPTNRSQRK RPSSQRKTTA KGSWSRRNSE EPKQRGYEGF KRYSSS
|
| |