Gene Information |
Locus tag | PHATR_28219 |
Symbol | |
ID | 7204540 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 452658 |
End bp | 454280 |
Gene Length | 1623 bp |
Protein Length | 447 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185707 |
Protein GI | 219120951 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.473905 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAATGTAAC ATTTCATCGA TCTAACAGTA TAGCTAGCAT TCCAAGTAAT TGAATTCTTC CAAAAGAAGT GTGTTTCTTG GAAAATCTGC TGAACTGTGC AACGACTATG AGTTCCGGAG AATGTCAGAC GGGAAACAAG GGCGATTATG ATGAGGTAAG GAAGCCTTGA TTCATATTTT CCTGACGCAC CGTAGTTTTC TCCTTTTTGT TCGGCACTCT TCCTACGTGC ATGCATACTT GCAGCAACAC ATGAATTCTT TATTTACCTG ACGTCGTGAA AAACTCATCA TTTCGTGCTC TCTTCTATGT AGGAGGAAAA AGAGGACGCC CCGCCCCCGC TCGACGAAGG CGATATTGTC TTACTGAAAT CATACGGTTT AGGTCCTTAC AGCACTAAAA TTAAAGACGT CGAAAAAGAA ATCAAAAAAC ACCAGCAAAC GGTCAAGGAT TTGATTGGTA TCAAAGAATC GGACACTGGC CTTTCTCCTC CGAGTATGTG GGATTTGAAT GGCGATAAAC AAATGATGTC GGAAGAGGCA CCCTTACAAG TTGCACGGTG CACCAAAATT ATTACGGGAG ACTCCGACGC GAGCAACGAA GGCGCGTCCA GTGCTGTTTC TACCAGCACC AAGTATGTCA TTAACGTCAA GCAAATTGCC AAATTTGTGG TGGGACTGGG TGAGACAGTG GCACCTACGG ATATCGAAGA AGGTATGCGT GTAGGAGTTG ATCGTTCCAA GTACAGTATT CAAATTCCTT TGCCACCCAA GATCGATCCT ACCGTGAGTC TTATGACAGT CGAAGACAAG CCAGATGTCA CGTATGATGA TGTTGGAGGT GCTAAGGATG CCATGGAGAA GCTTCGCGAA GTTTTGGAGT TGCCTTTACT GCACCCGGAA CGATTTGTGA CGCTGGGAAT CGATCCTCCT AAGGGTGTAT TATTGTACGG TCCTCCGGGA ACCGGAAAAA CATTGAGTGC TCGTGCGGTT GCCAATCGTA CCGATGCTTG CTTTATACGT GTCATTGGTT CCGAGCTTGT ACAAAAGTAC GTCGGCGAAG GCGCCCGTAT GGTGCGCGAA CTCTTCACTA TGGCCCGGTC CAAGCGTGCG TGCATCATTT TTTTCGACGA AATCGATGCG ATCGGTGGCG CTCGTACTGG ATCAGACGAG AATGGAAGTG ATAATGAGGT ACAGCGAACC ATGTTGCAGA TTGTGACCGA GCTTGACGGA TTTGATGCCC GTGGAAATAT CAAGGTTTTG ATGGCGACGA ATCGGCCCGA CACACTAGAC CCGGCCTTGT TGCGCCCGGG ACGTTTGGAT CGGAAGGTTG AGTTTGGGTT GCCCGACTTG GAAGGGCGCG GACACATTCT ACGCATTCAC TCAAAGCGCA TGAACTGCGA CCGGGATATT CGGTTTGAAC TGATTGCGCG CTTGTGTCCG AATACTACCG GAGCAGAGTT GCATTCAGTT TGCACGGAAG CAGGTATGTT CGCCATTCGA GCACGACGTA AAAATGTAAG CGAAAAAGAC TTTTTGGAGT CTGTCAACAA AGTTGTCAAG GGATACAAAA AGTTCAGTTC TACACCTAAA TATATGGTAT ACAACTAAAA CGATAACTTT GTG
|
Protein sequence | MSSGECQTGN KGDYDEEEKE DAPPPLDEGD IVLLKSYGLG PYSTKIKDVE KEIKKHQQTV KDLIGIKESD TGLSPPSMWD LNGDKQMMSE EAPLQVARCT KIITGDSDAS NEGASSAVST STKYVINVKQ IAKFVVGLGE TVAPTDIEEG MRVGVDRSKY SIQIPLPPKI DPTVSLMTVE DKPDVTYDDV GGAKDAMEKL REVLELPLLH PERFVTLGID PPKGVLLYGP PGTGKTLSAR AVANRTDACF IRVIGSELVQ KYVGEGARMV RELFTMARSK RACIIFFDEI DAIGGARTGS DENGSDNEVQ RTMLQIVTEL DGFDARGNIK VLMATNRPDT LDPALLRPGR LDRKVEFGLP DLEGRGHILR IHSKRMNCDR DIRFELIARL CPNTTGAELH SVCTEAGMFA IRARRKNVSE KDFLESVNKV VKGYKKFSST PKYMVYN
|
| |
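The length and GC content fields in this record can be cross-checked against the coordinates and the sequence text. The following is a minimal Python sketch, not the database's own method. It assumes that Start bp and End bp are inclusive coordinates (consistent with 454280 - 452658 + 1 = 1623) and that the sequence is pasted as space-separated 10-base groups as shown above. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Strip the whitespace between base groups and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates taken from the Gene Information table.
print(gene_length(452658, 454280))  # 1623
```

To check the GC content field, pass the Gene sequence text through `clean_sequence` and then `gc_fraction`, and compare the rounded percentage with the value in the record.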