Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39949 |
Symbol | |
ID | 7195564 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 432782 |
End bp | 433799 |
Gene Length | 1018 bp |
Protein Length | 300 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183995 |
Protein GI | 219127548 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGAAG AAACGGCAGT GGAATTTCTT CCACTCCAGC CGCGAGTCTC ATTCAACACG TCGAACACGG CCAAGAACCG TCTCGCGCGT TCTTCCTCGA CCGAGTCGTC GTCGCCATCG TGGGACATTG GAACTTCTAG TCGGCCTTTG GTTCAGCAAG CCTTGTGCTC CATCTTGTTC GGATTATTCG GCTGGAACTT TCCGCGATAC TTGATTAGTA TCGAAACAAC GATCCAACAC AACGTCCCGC CCTTTCAAAA AACCCAAGCG GGTGATGTCA TACTCGACTT TTTGCTGAAT CAAAACCTGA CGTATCCACC CACAGTTGGA TGTAAGTAGG AATCGACGGG ACCAAGGATG TTCCTTACGT ATAAATTTCT GACCTTGCAA CATACAAGCT TCGTTACTAA TTTGGGGGTC CATCTGGATA CCTCTGGTTT TGGTAGTGAT GGCGGCTTGG TCGTTTGCGC CGCATCGACC CGCTCACGTT CGCTGGCATG ACGTACACGC CTCGTTTTGT GGTCTAATTA CGGCCGTTGG ACTGTCGGAA GGTGCCACGG TCTTGCTCAA ACTCTACATC CAACGGCGAC GACCCAATTT CTACGCACTC TGCGGATTCG ATAAGCAACT TTTACAGTGC ACGGCAGATT TGGAAAAGAT TCGGGAGGCA AGCTTTAGCT TTCCTTCGGG ACACAGTAGC TTGGCCAGCT GTGGCATGAC TTTTTTGGTG TGGTTCTTTT TGGGCAAGAT TCTGTTGTCC CGGTGGAATC GCTCGACGAC CCGCATCTTG TGCGCTGGTG CGTGCGTTTT GCCGTGGGGA TGGGCAGCCT TCGTGGGGGC CAGCCGGCTT GTGGACCAAT GGCACCACCC TTCGGATGTA TTGGCTGGCC TCCTGCTGGG AGGCTTGTCG TCGACGATTG TATACCACAT ATGGTACCAC CCGACCTGGT CGGATGCTGC TGGACACCCG TGGTCACTCC AGCCTAACGA ACGGAAACTC GAATCATTTC TCGAGTAA
|
Protein sequence | MDEETAVEFL PLQPRVSFNT SNTAKNRLAR SSSTESSSPS WDIGTSSRPL VQQALCSILF GLFGWNFPRY LISIETTIQH NVPPFQKTQA GDVILDFLLN QNLTYPPTVG LMAAWSFAPH RPAHVRWHDV HASFCGLITA VGLSEGATVL LKLYIQRRRP NFYALCGFDK QLLQCTADLE KIREASFSFP SGHSSLASCG MTFLVWFFLG KILLSRWNRS TTRILCAGAC VLPWGWAAFV GASRLVDQWH HPSDVLAGLL LGGLSSTIVY HIWYHPTWSD AAGHPWSLQP NERKLESFLE
|
| |