Gene Information |
Locus tag | PHATR_21006 |
Symbol | |
ID | 7204601 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 189507 |
End bp | 191284 |
Gene Length | 1778 bp |
Protein Length | 466 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185837 |
Protein GI | 219121218 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.540977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGAAC TTCCTATTAC CGACCCTTCC ATCCATGCCA ACCTCCTACT CGAATCGAAG GAGGATGTCT TCAAAAAATA TTCCGTCGTC CAAGTGCTCG GTAATGGATC AATGGGAACC GTTTCCAAGG TCAAAATTAA GAAGCACAAG GTCGGGGGAA GCGCCTTTCA GCCGAAATCC AAGGGAATTT TTGGCTTTTT GAAGAAACAG AACAACAAAA GGAAGGAAGG TGAGACCAGA GAACACAATA GTCAGGACTA TATATACGCA CTCAAGTCCA TCATTCTGGA TCGGGTCTCT TCTGTCTTCC TGGACGAGCT CCGTAACGAA ATTCTTATCC TTAGATCATT GGATCATCCC AATATTGTCA AAGCGCACGA GGTTTACTAC ACGAGGAAGC AGATTTATCT CGGTGCGTGA TGAGGAATGT GAAACGTGAT TGGTCAGAAG TCTGAACTTT TTCTCTTTGG AAAGGAGACT AACGCATTTC CGTCATCGTT TTCTTATATC CGTGTGCTGT TTCTGGAAGT ATTGGAGTTG TGTGATGGCG GAGACCTTTA TACCAGGTCG CCTTACAGTG AAAGGGAATC GGCAAGGATT CTGCAACAAA TATTGTCGGC AGTGCGGTAC ATGCATGGTA CGCTATACCG ATACGCATAT AGAATTCCCG CAGCAAACTT TGGGAAGCTA ACAATTCTTG TTCTGCTTTG TCTAGATCAC GGAATTGTTC ATCGGGATCT CAAGTTCGAG AATATCATGT TTGAGAACAA TAGCCCCAGT GCTCGGTAGG TTACTGATCA ACTCTCGAGG CTACACGTAG ATGAAGCGCT CTTACAGTCA ATAATGTTCT ATTCATTTTG CACAGAGTCA AAATTATAGA TTTTGGATTG TCTAAAAAGT TCCTTGGCAA ACCGTCGTAC ATGACCGAAC GCGTTGGTAC CGTCTATACG ATGGCCCCGC AAGTCCTGCA AGGAGTCTAC TCATCGCAAG CTGATCTTTG GTCCGCTGGA GTGATAGCCT ACATGCTGTT ATCGGCTTCA AAGCCTTTTT ATCACAAACG ACGGCGCAAG ATGATTGACC AAATCATGAG GGCCGACTTC GGATATAATG CACCGGTCTG GAAGCAAATA TCAGAAAGTG CGCAAGATTT TGTAAGTCGA TTACTAGTGG TGGATCCAAA GAAAAGACTG AATGCAGAAA AAGCATTGGA CCATTCTTGG ATTGTGAATC GCGAACGCTT GCCAGATGAG ACACCATCCG AGGATTTGTT GGCCGCTGTC GATGATTGCC TCGTGAATTA TCGACAAACG TCGGAGCTGA AAAAGCTAGC TTTAAACATG ATCGCCCATC GTTCTACCGC GGAAGAGATC ATGCAACTTC GGAAAGTTTT TGACAGCTAC GACACCTCGA ATGATGGAAT TATTACATTT GATGAATTCA AAGCAGCTTT GCACAAAATG AAATATCCGG ATGAGATTGT ACAGGAAGTT TTTAGCAGTA TTGATGTCAA CCGAAATGGC CATATACAGT ACACGGAATT CATTGCATCG ACCGTCTTGG CACAGGGACA TATCGCAGAG GATCGGGTCG CAGTAGCTTT CGATCGCTTG GACTCTGATG ACACCGGCTT TATTTCCAAG AAGAACTTGC AAAACGCATT GGGCAAGGAA TACACTCCAG AACTCGTCGA AAATATAATG GAAGAAGTTG ACAAAGATAG GGATGGCAAA ATATCATATA CCGAGTTTCT GCAATACTTT CGGAAGGAAA CGAGCAAT
Protein sequence | MDELPITDPS IHANLLLESK EDVFKKYSVV QVLGNGSMGT VSKTREHNSQ DYIYALKSII LDRVSSVFLD ELRNEILILR SLDHPNIVKA HEVYYTRKQI YLVLELCDGG DLYTRSPYSE RESARILQQI LSAVRYMHDH GIVHRDLKFE NIMFENNSPS ARVKIIDFGL SKKFLGKPSY MTERVGTVYT MAPQVLQGVY SSQADLWSAG VIAYMLLSAS KPFYHKRRRK MIDQIMRADF GYNAPVWKQI SESAQDFVSR LLVVDPKKRL NAEKALDHSW IVNRERLPDE TPSEDLLAAV DDCLVNYRQT SELKKLALNM IAHRSTAEEI MQLRKVFDSY DTSNDGIITF DEFKAALHKM KYPDEIVQEV FSSIDVNRNG HIQYTEFIAS TVLAQGHIAE DRVAVAFDRL DSDDTGFISK KNLQNALGKE YTPELVENIM EEVDKDRDGK ISYTEFLQYF RKETSN