Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_19438 |
Symbol | |
ID | 7199823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 618180 |
End bp | 619853 |
Gene Length | 1674 bp |
Protein Length | 529 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178813 |
Protein GI | 219116036 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0420618 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGGTATACG GATGCAAAAC AAACTGTCAA GCTCGTTTTC TTTCACGTAC ACCTTACAGC AACAGTCAAG TGCAAAGCAA GGTCACGATG CCGCGCAGCC AAGGAACGTC CCGCAAGGGT AAGCCCGGAA AGAAAGACCG CGACGCGGGA ATGGGGAAGT CACTCCAACG CGCTCAGGTC CAACGCTACC GGCCTCGAGC GGATGGTAAA TCGCGACGTG GCGACGGAGG CATGCACATG CAAGCTGGTG TAGATTCAAT CGGACTTGAG GAGGCGCAGC AAGATCAGCT CAAGACACGA TCCGTCTTGG AAATGCAGGA TTTGGATGAC TTCTTGTTGC AAGCCGACAT GGCGAATCGC GAGTTTGTTA GTGAAAAAGA AGGACTTGTC GTTGTAGACC AGACAGGTCA AGCATATCGG CCTCCAGCAG TGCAATGGGC GGACCAGAAG TCGTCGTTTT TATTTACAGA GCTCTCTGTG CCCCGTCGTC CTGCCTGGGA CGATACCACG ACTGCCGTGG AGCTGGACAT GAATGAACGA AAGTCTTTTC TAGAGTGGCG CCGAGCCATC GCCATCAAAG AAGAAGAACT CGCGCGGACG AGCTCGTTGG CTGCAGCAAC GCCATTTGAA AAGAATCTCG AAGTTTGGCG CCAGCTTTGG CGTGTATTGG AGCGGTCGGC GTGCCTCTTG CAACTCGTGG ACGCCCGAAA CCCGATGTTT TATCTTTCCG ATGATCTTCG AGATTATGCT TCTACTTTGG GCAAGCCAAT GATGGTTCTT GTCAACAAAA GCGACTATCT ATCACCCTCA CAGCGCGCTT CCTGGCGAGA ATATTTAATG GAGAAGGGCT GGGATCCAGT CTTCTTCTCT GCCGTCAAAG AACAGCAAAA GCTGGACGCT ATGGCCAATC GAAAGCGCAT CCAAGTACAA CTCGGGACAG GTCACGACAA CTCTTTAGAC CATTTAGGCG AAGAAACTGA TGTATCCATC GACGCTCACG ATCAAATTGA AGAACCAGAA GATGAACGAG GAGTGTCTGT TCCCCTTAGC CGTGAGCGAC TCATGGAAAC TATGCTAAGT TTTGCTCGTC AACACAATTG TCAGCCAGAC CCCAGGTACG ATAACCGAAT CCAGTTTGGA ATGGTGGGAT TCCCCAATGT CGGAAAGTCT TCGGTGATCA ATGTTCTTTT TGGAAGTAGC AAACATGAGC ATGGTGTGGT ACGCGTTGCT GTCGCCAGTC AACCGGGAAA GACAAAGCAT TTCCAAACAT TGATGTTGCC AGATGCCGAA GAGATGATGT TGTGCGATTG CCCAGGCCTG GTCTTTCCTT CGTTTGTGTC AAATACTGCT GACCTGATTG CGGCTGGCGT TTATCCAATT GCACAGATGC GAGATCACTG GCCAGTCACA AACCTTATTT GTCAGCGGAT TCCGCGTGAG GTTATCAATG CTCACTACGG TATCGTACTT CCGAAACCTA GCCAACTGGA AATGAATGAG CGCGGACTAA CAAAGCTGCC GCCTCCTAGT GGCGAGGAAT TTCTTGGAAC GTTCTGTATT GCCCGAGGCA TGTTGGCAGC AAGTAGCGGA GTCCCCGATT ACACACGGGC TGCACGAACG ATCATTAAAG ACTACGCGGA TGGCAAATTA TTGTATTGTC ACCCTCCTCC AAGT
|
Protein sequence | MPRSQGTSRK GKPGKKDRDA GMGKSLQRAQ VQRYRPRADG KSRRGDGGMH MQAGVDSIGL EEAQQDQLKT RSVLEMQDLD DFLLQADMAN REFVSEKEGL VVVDQTGQAY RPPAVQWADQ KSSFLFTELS VPRRPAWDDT TTAVELDMNE RKSFLEWRRA IAIKEEELAR TSSLAAATPF EKNLEVWRQL WRVLERSACL LQLVDARNPM FYLSDDLRDY ASTLGKPMMV LVNKSDYLSP SQRASWREYL MEKGWDPVFF SAVKEQQKLD AMANRKRIQV QLGTGHDNSL DHLGEETDVS IDAHDQIEEP EDERGVSVPL SRERLMETML SFARQHNCQP DPRYDNRIQF GMVGFPNVGK SSVINVLFGS SKHEHGVVRV AVASQPGKTK HFQTLMLPDA EEMMLCDCPG LVFPSFVSNT ADLIAAGVYP IAQMRDHWPV TNLICQRIPR EVINAHYGIV LPKPSQLEMN ERGLTKLPPP SGEEFLGTFC IARGMLAASS GVPDYTRAAR TIIKDYADGK LLYCHPPPS
|
| |