Gene Information |
Locus tag | PHATRDRAFT_25379 |
Symbol | |
ID | 7197473 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 923829 |
End bp | 925069 |
Gene Length | 1241 bp |
Protein Length | 289 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177719 |
Protein GI | 219111935 |
COG category | |
COG ID | |
TIGRFAM ID | |
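
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding region ends with a single stop codon; the function names are hypothetical and not part of any database API. Under those assumptions the span matches the listed gene length, and the bases left over after the coding region are consistent with a spliced gene (introns and/or untranslated sequence).

```python
# Sketch: sanity-check the coordinate and length fields of this record.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the coding region carries exactly one stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Number of bases covered by 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


span = gene_length(923829, 925069)   # values from "Start bp" / "End bp"
coding = coding_length(289)          # value from "Protein Length"
noncoding = span - coding            # introns and/or untranslated bases

print(span, coding, noncoding)       # 1241 870 371
```

The computed span of 1241 bp agrees with the "Gene Length" field, and the 371 bp not accounted for by the 289 aa protein fit the "Is gene spliced | Yes" entry.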
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TATAAACCCT TTTGTCCCTC ACAATATCCA AGGCAAGGCA AACCCTTCCC CAAAACATGA GTTCCGTACA AGACAGCAAG GTCCCCACCC TTTCGTCCGT CGGCTTTATC GGCGCCGGGA AGATGGCGAC GGCAATCATG GTAAGAAATG CGTGGTCGAT ATTACATTGG TTGTATGGAT AAATCGGATA TTAGCCTCAC CCGCTGCTTG TTCCAATGGT ATTGCGTAGG ATGGACTAGT TGCCAAATCG GTGGTATCCA CGCCTGAATC TATTGCTTGC TCCGACGTCT TTGAAATGGC TGTCACGGAT GCCTCCAAAA AAGGATATCA TGCAACCAAA TCAAATCAAG AGGTCTGCCA ACGTTCAAAA GATGCCATTA TTCTAGCCGT CAAGCCCAAC ATTATTCCTG ACATTTGTGC GGATGTCATG GATGCTGGTG GTAGCGCACT GATCATTAGT GTCGCTGCTG GTGTAACGCT GGAAACTCTG GAGAAGAATC TCCCTGGTCG TCGTGTGGTG CGGGTCATGC CCAACACCGC TTGTTTGGTC GGCGAAGCGG CGTCCGGGTA CGCCATGGGA TCCTTGTGCA ATGCGGACGA TAACAAGATT GTACAGTTGA TCTTTGGTTC CTGTGGTCTG GCCCGCGAAT TTAAGGAAGT CTTGCTGAAC GCCGTAACTG GAGTTTCCGG CAGTGGGCCG GCGTACGTCT TTCAGTTCAT TGAGGCGTTG GCTGATGGCG GCGTGCGGGC TGGTTTGCCT CGCGAAGACG CTGTTCTTCT GGCTGCTCAG ACTTTGAAGG GAGCAGCAGA AATGGTTCTG GTGACCGGCA TGCATCCGGG CCAGCTCAAA GACATGGTCT GTTCGCCGGG AGGAACCACG ATTACCGGTG TCGACGAGTT GGAAAAAGGG TACGTGCGAC GACAGGGTGA ACGGGCTACT TGGCTTACGT TAACGCTGGA CTTACAATTT GATTTGGGTT TAACCATACA GAGGATTACG GACGACTGTC ATGCAAGCTG TCAAGGCTGC GACTCGACGC AGTATGCAGC TAGGCGGCAT TACCGAAGAG GAAATCACCA CGAAATACAA CCTTTAAAGA AGCACGATGT TATACGCCTA CAGAACTATT ACTCTCAATG CGAAAGAAAG TCCTACTTGC TCTTCATGAA TGATAGCAGA ACATTGAGGA CTGCTGTAAC TCTCTAGTTA ACATCAGATG TAAACACAAA AATGTGCGAT G
|
Protein sequence | MSSVQDSKVP TLSSVGFIGA GKMATAIMDG LVAKSVVSTP ESIACSDVFE MAVTDASKKG YHATKSNQEV CQRSKDAIIL AVKPNIIPDI CADVMDAGGS ALIISVAAGV TLETLEKNLP GRRVVRVMPN TACLVGEAAS GYAMGSLCNA DDNKIVQLIF GSCGLAREFK EVLLNAVTGV SGSGPAYVFQ FIEALADGGV RAGLPREDAV LLAAQTLKGA AEMVLVTGMH PGQLKDMVCS PGGTTITGVD ELEKGGLRTT VMQAVKAATR RSMQLGGITE EEITTKYNL