Gene Information |
Locus tag | PHATRDRAFT_37892 |
Symbol | |
ID | 7202674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 336494 |
End bp | 337795 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182050 |
Protein GI | 219123476 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
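The coordinate and length fields above agree with each other if the start and end positions are read as 1-based, inclusive coordinates and the CDS is taken to end in a stop codon. Both are assumptions about this record, not statements from the database. The following minimal sketch checks that arithmetic; the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(336494, 337795)
print(length)                  # 1302
print(protein_length(length))  # 433
```

Under these assumptions the results match the reported Gene Length (1302 bp) and Protein Length (433 aa). The strand does not affect the length calculation.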
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCTTA TGCCGGCCAA AGTTGAGAAT GATTTCCGCC TTAGTCCAAA AGTACACAAA CACTTTCGGC CTACGATGCC AATGTCGCCA CTGAGCAAGA CCTCCCCACC TCGTCTGGGT TACAAATTAG GCACTCGCTT GTACAGAGCT CCGCAGGCCG CGATTCGGAA CCATTCTGTG GTTAGTTCCA GGGATGACGT GATGAAGGGC GCACGTCGGA GACGGCCCAG TCGTGGCATT AGCCATGGCA GCCAACGTCA GCAGCAGCAA GAAGGCATGG GATTTGCGTT AGAGCGACAA TCGTCGGATA GGTCGTTGGG GTTGGAAGAA ATACTAGAAC CGCCCTTGTG GCATGAGCAA AAGGATCCAT ATCGAAGACT CAACGGAAGT GGAAGAGAGC TAGGTGGTAG TGTGCCAGAA ATAAATTGGA AACGGACACC ACCGGGTCGC AAGCTTAGAG CCTTGCACAT ACTTCAAGGC AATGAAGATT ATTCAGAGTA TTTGGAGGAT GAGTCTTTCC GAACTTACAA CTCCTTTGAA TGCCGGAGTT CAAAGGAAAA GGTTGCCAGT CCCGTAAAAT CACTATGGAC TATTACAAAT TTATCCCAGC TCCTCCTTAT TGTTATGCTG GGTGGCTTTA TTTTTGACTC GCGCCGCAAA GGAAAAACTC ACAAGGCTCA GCTGCAGCAG TATGACGAAG AACGAAGCCA TTTGCTTGAT CAGATGATGT GGATTGACAA GGCTGCCAAA AAAGTCCATC AACGGTACCC CGTTCAATCT CCAATAGATT TGGATCAAGA GACTAAAGAG CAGCTCAAGC AGGAAGTTAG GGATGCACAA GATTCTTTGC AAAAGCTTCA GCTACGGGTC CAGCTGAATG ATAGACAGTT TTTACACGAA AAATTTGGAG ACAAGCCGTT GCAAGTGGGC TTGAGCTTGG ATGCGACGGG GACGGAACGT ATCTCCATTG CGTTGTCTGA TGACACTCCC CACGCTGTCT CAATATTTGT ACAGCAGGCT GACAAGAATT TGTGGAGTGA CCTTCGTTTC GAACGACTTC TTTCAGGATC TATAGATGTA TTCTCTACCC AAGCAACCAC AACTCCATTG CTAGAATTTC TCGAGCGCTC GCGTGGTTGT CATGAACGCG GCGCTGTTGC CTTGAGACAA GAGGAAGACC GCGACATCAT GTTTTTGGTT CTTCGGATAA ATCTCGAAGA TCAGTCTCCG CTCTCGAACA CGGACGTGTG TATTGGACGC GTAGTTAAAG GCCTAGATTT GCTGACAAGT CGCGTTTCCT GA
|
Protein sequence | MQLMPAKVEN DFRLSPKVHK HFRPTMPMSP LSKTSPPRLG YKLGTRLYRA PQAAIRNHSV VSSRDDVMKG ARRRRPSRGI SHGSQRQQQQ EGMGFALERQ SSDRSLGLEE ILEPPLWHEQ KDPYRRLNGS GRELGGSVPE INWKRTPPGR KLRALHILQG NEDYSEYLED ESFRTYNSFE CRSSKEKVAS PVKSLWTITN LSQLLLIVML GGFIFDSRRK GKTHKAQLQQ YDEERSHLLD QMMWIDKAAK KVHQRYPVQS PIDLDQETKE QLKQEVRDAQ DSLQKLQLRV QLNDRQFLHE KFGDKPLQVG LSLDATGTER ISIALSDDTP HAVSIFVQQA DKNLWSDLRF ERLLSGSIDV FSTQATTTPL LEFLERSRGC HERGAVALRQ EEDRDIMFLV LRINLEDQSP LSNTDVCIGR VVKGLDLLTS RVS
|
| |
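The sequences above are displayed in space-separated blocks of ten characters. Before they can be used for length or composition checks, the spaces have to be stripped. The sketch below shows one way to do this, assuming GC content is defined as the share of G and C bases over the full sequence length (the database may compute its rounded value differently). `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(display: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(display.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence in this record.
example = clean_sequence("ATGCAGCTTA TGCCGGCCAA")
print(len(example))         # 20
print(gc_percent(example))  # 55.0
```

Applying the same two functions to the full Gene sequence field would give a value to compare against the reported GC content field.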