Gene Information |
Locus tag | PHATRDRAFT_47395 |
Symbol | |
ID | 7202539 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 521912 |
End bp | 523185 |
Gene Length | 1274 bp |
Protein Length | 331 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181744 |
Protein GI | 219122837 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTTCGATT CTTTCGTTGC CTTACTTCCA GCTTTCAAAG TATCATCATG AAGTTCACTT CCTTTGCCTT CCTCGCACTT GCGGCTTCCG CCAATGCCTT CACACAGCCC GCTTCCTTCG GGGCCGCTCG CAGCTTCTCC ACTTCTTTGG ATGTTTCCAA GAAGGATTTG GAGGGCGCTC AAACGATGAT TGACAAGATC ATTGACGATA CGAACGCCAA CCCAGTCTTT GTTCGTCTTG CTTGGCATGA CTCGGGTACT TTCGACGTCA ATGTTGAAAA AGAGTGGCCG GCATCGGGTG GGGCTATTGG CAGCATCCGC TTCGACCCCG AAATCAACCA TGGCGCCAAC GCTGGTTTGT CGGGAGCCGT CAAGCTTTTG GAACCCGTTA AGGAAAGCTT CCCCGATGTC AGTTTCGCTG ACATTTTCCA AATGGCCTCC GCCCGTTCGA TCGAACTTGC CGGAGGTCCC AAGATTGACA TGAAGTACGG TACGTTGTGA CTAACCTGCC TAGTGTGATG CCAATTTTCG AATCGTGTAT GGCTTTGGCT TCGAATGCTC ACCCTTCTGG TTATGTTGCC TTTAGGACGT GTTGATGCGT CCGGTCCTGA AAACTGTTCC GCTGAAGGAA ACCTTCCCGA CGCGGAACCG GGTCCGGACG GAAAGTATGG TGGTCCGGGA GGCAGCGCTT CAACCGAAGA CAAAACCCCC AACGGTCACT TGCGCAAGGT GTTCTATCGC ATGGGCTTGA ACGACGAGGA AATCGTGGCA CTATCGGGTG CACACAGTTT CGGTCGCGCG TACAAGGACC GCTCCGGACT TGGCGCTGAA AAGACCAAAT TCACTGACGG CAGTAAACAA ATTCGAGCGG ACGGGAAGGA AGCCAAGTAC AACCCCGGTG GTAGTGCATG GACCAAGAAC TGGTTGGTTT TCGATAATAG CTACTTCACA ACGATCCCCG ACGAGTCCGC TGATCCAGAA CTTCTCAAGC TTTCGACTGA CAAAACTCTC TTCGGCGATG AAGACTTCAA GCCCTTTGCT GAAAAGTTCC GTGATTCACA GGATGAGTTC TTCGCTTCGT ACGCAAAGGC GCATAAAAAG CTTTCCGAGC TCGGATCCAA GTTTGAAGCC GTCGAATAAA GATCCAGTAC AAACAAACTA CTACCCTTGG CCACCGAGGA TGTTTAAAAA GTACGCCAGA AGGCGTCATT ACGAAATTCA AGACAGATCA TCTTCAAAAT GAGAGATAAA AATAAAACTT GGCTAGTCGA TGGG
|
Protein sequence | MKFTSFAFLA LAASANAFTQ PASFGAARSF STSLDVSKKD LEGAQTMIDK IIDDTNANPV FVRLAWHDSG TFDVNVEKEW PASGGAIGSI RFDPEINHGA NAGLSGAVKL LEPVKESFPD VSFADIFQMA SARSIELAGG PKIDMKYGRV DASGPENCSA EGNLPDAEPG PDGKYGGPGG SASTEDKTPN GHLRKVFYRM GLNDEEIVAL SGAHSFGRAY KDRSGLGAEK TKFTDGSKQI RADGKEAKYN PGGSAWTKNW LVFDNSYFTT IPDESADPEL LKLSTDKTLF GDEDFKPFAE KFRDSQDEFF ASYAKAHKKL SELGSKFEAV E
|
| |
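The coordinate and sequence fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the record: it assumes the start and end positions are 1-based and inclusive (which matches the listed values, since 523185 - 521912 + 1 = 1274), and the helper names `gene_length`, `ungroup`, and `gc_percent` are illustrative, not taken from any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def ungroup(sequence: str) -> str:
    """Remove the whitespace that splits a displayed sequence into blocks of ten."""
    return "".join(sequence.split())


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (0.0 if empty)."""
    bases = ungroup(sequence).upper()
    if not bases:
        return 0.0
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates from the record: Start bp 521912, End bp 523185.
print(gene_length(521912, 523185))

# First two displayed blocks of the protein sequence, joined into one string.
print(ungroup("MKFTSFAFLA LAASANAFTQ"))
```

Passing the full "Gene sequence" field to `gc_percent` gives a value to compare against the listed GC content; the result depends on pasting the sequence exactly as displayed.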