Gene Information |
Locus tag | PHATRDRAFT_50558 |
Symbol | |
ID | 7199387 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | + |
Start bp | 108755 |
End bp | 109904 |
Gene Length | 1150 bp |
Protein Length | 319 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185486 |
Protein GI | 219130678 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTTCT ATGGAGTTTC CAATAAATGT AGGTATGTCT CTCTCTTCAC AGTCCAGTGA CCGCATACCC GTTTTTGTAT TTATGATCTT CTTGGATTGG TTCGAATATA TCCTCCTGAC GTCGTCGTCC TTCCGACTCG ACCCCAACCT CGACGAACAA AACACAGTCC AAGCCAAGTC TCGGGAACGG TTTGTTGAGA GCGGGGGCGC TAGCTAGCTA CCGCTGTGAA ACCCCCATAC CTATTCTCCA TCCACATCGA CACTGTGTTC ACACAGTCAT TGCTTTCCGC ATTTAGTGTT TTCCCACCAC CGCTCGATGA CGTTGCGTAC GACTCCCGTG ATAGAGACGC GTTCGTCGCT GCCCGACACG CCGGTGAGTG CGCTCACGAT CCCCACGTCG CACGCGTCGT CACTATCGTC GCCGTCAGCA GCGGCAACAG CGAACGATTC TTTCGCGTTG GAAATGAAGG AACGGGCTCG TACCCGTCGC GCCCGACAAG ATCAGGTCGT CTCGGACCTC AAAGTCCAAA TCCGTCGTGT CGAAGCCGCA CTCCAGGCGG AAACCAAACG ACGGGTACAC GGACTCCAAA ATCTCCAACA ACAGACGGAG GCCAGTATCC GCACACTGCA AGAAAATTTG GAAGAGTCCT GGCGACTCCA ACAAACAGCC ACGGACGATC GCTGGCAACA ACTGGCCGAC CGATTGACCG CTTTGGAAGA GTCCTGGCGT GTCCACGTGG CGCGCTGGGA AGACCGCACC GGTTCGGAAA GTGCACAGGC GCGGGAGATG CTACGGGAAC TGCAGGAGCA GGCGGAAGTG GCGCAGAGGG AGCGAGAGAT TCGTGAAGAG AGTTTGCGAC AGCGATTGGA AAACGTCGCG CAAGAAGCGG AACAAGCCTG GGACGAAGCC CGACGCGAAC GGCGCGCGGC GACCGACGAA CTGCAAACGC GGCTCGAACG ATACGACGAC AACGTGGAAG CGCACGTTCA GGGATTGCAA CAAACCCTAC GGCAAGAATT AGCCGCCCTG CAGGCCGATC TGCAAAGGGA ACAAACGGAA CGAGCGACGG CGGATCAAGA TATTGCCAAT GTCTTGAATC GCTACACGGA AACGATACAG AACAGTTTGG CCGTCGTTTC GGACGTATAG
|
Protein sequence | MAFYGVSNKC SDRIPVFVFM IFLDWFEYIL LTSSSFRLDP NLDEQNTVQA KSRERVFPPP LDDVAYDSRD RDAFVAARHA AATANDSFAL EMKERARTRR ARQDQVVSDL KVQIRRVEAA LQAETKRRVH GLQNLQQQTE ASIRTLQENL EESWRLQQTA TDDRWQQLAD RLTALEESWR VHVARWEDRT GSESAQAREM LRELQEQAEV AQREREIREE SLRQRLENVA QEAEQAWDEA RRERRAATDE LQTRLERYDD NVEAHVQGLQ QTLRQELAAL QADLQREQTE RATADQDIAN VLNRYTETIQ NSLAVVSDV
|
| |
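The derived fields in this record (Gene Length, GC content) can be cross-checked against the coordinates and the gene sequence. The following is a minimal sketch, not the database's own method: the helper names `gene_length` and `gc_percent` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that GC content is the share of G and C among the A/C/G/T bases of the gene sequence.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence.

    Spaces and any other characters (such as the 10-base block
    separators used in the record) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp rows of the record.
    print(gene_length(108755, 109904))  # prints 1150
```

To check the GC content row, pass the full Gene sequence string to `gc_percent` and compare the rounded result with the reported value.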