Gene Information |
Locus tag | PHATRDRAFT_2032 |
Symbol | |
ID | 7198685 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 379755 |
End bp | 381175 |
Gene Length | 1421 bp |
Protein Length | 442 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184871 |
Protein GI | 219129386 |
COG category | |
COG ID | |
TIGRFAM ID | |
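The coordinate and length fields above can be cross-checked with simple arithmetic. The Python sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the listed protein corresponds to three bases per residue with no stop codon counted; neither assumption is stated in the record. The function name `length_summary` is hypothetical.

```python
def length_summary(start_bp: int, end_bp: int, protein_aa: int) -> dict:
    """Relate the genomic span, the coding length, and the non-coding remainder."""
    # Assumption: coordinates are 1-based and inclusive.
    gene_len = end_bp - start_bp + 1
    # Assumption: three bases per residue, no stop codon included.
    coding_len = protein_aa * 3
    return {
        "gene_len": gene_len,
        "coding_len": coding_len,
        "noncoding_len": gene_len - coding_len,
    }


# Values taken from the Start bp, End bp, and Protein Length fields.
summary = length_summary(379755, 381175, 442)
print(summary)
# {'gene_len': 1421, 'coding_len': 1326, 'noncoding_len': 95}
```

Under these assumptions the span equals the reported Gene Length of 1421 bp, and the 95 bp not accounted for by the 442 residues is consistent with the gene being marked as spliced.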
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.688536 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACGAGCGGT ACGATGTCAC CGTAGACGAT GCGGAGGCGG ATCGTGCGAC GGAAATCAAG ATATTCTCTA TCCGACGCCC GCACATGCGA GCCTTTCACG TGGCCTGGTT TTCCTTCTTT TGGGCCTTTA CCATTTGGTT CGCTCCCGCG CCACTACTAA AAGAAATACA AAAGACACTC GGATTGACCA GAAAAGAGAT TTGGACGAGT TCCATTACCA ACGATATCAC CGCCATTTTC TTGAGAATTT TGATTGGCCC CTTGTGCGAC GTCTACGGCG CGCGCTTGCC CATGGCGGCC GTCCTGGTCC TCGCATCGAT TCCTACCGCC ATGGTAGGAC TCATTCAATC GGCGGCGGGG CTTTCCGTCA CACGCTTCTT TATCGGTATT GCCGGAAGTT CCTTCGTCAT GGCACAGTTT TGGCCTTCCC GTATGTTTAC CCGGGAATTG GCGGGCACCG CCAACGGGAT CGTTGGTGGT TGGGGGAACC TGGGGGGTGC CTTTACACAA CTCCTCATGG GCACAATTTT GTTTCCGGCT TTTCGGAATC TGTACGACGG GGACTCGGAA AAAGCATGGC GCGTTATTTG CGTCATTCCC GCTGCCGTCG CCTTTTTGTG GGGTATCGCC GTCCCGTGGA TTTCCGACGA TGCCCCGATG GGAAATTATG GAGAAATGAA AAAGCGTGGC GCCATGGATC GAATTCTGAT GACGACGGCC CTCCGACAGG GCGCGGTCGT CAATACGTGG ATACTGTACG TCCAGTACGC CTGTTCCTTT GGAGTCGAGC TCGTCATGAA TAATGCAACC GTGCTCTACT ACACGGATGA GTTTGGATTG AGTACGGAAG ACGCGGCGGC TCTCGGTTTT ATTTATGGTT CCATGAATTT GTTTGCTAGG GGCATGGGTG GATATCTCTC GGATCAGCTT AACCTCAAGT TTGGCCTACG GGGTCGCTTA TGGCTTCAAA CCTGTTTATT GGTAGTCGAA GGCATCGTCA TCATTATTTT TCCATTTGCT GATACACTCA GAGGAGCCAT CGTTACCATG TGCATTTTTT CTATTTTTAC GCAAGCCGCA GAAGGTGCCA TTTTTGGTAA GCCTTGACAT AAGACGCTAT ATCTCTTTTT GTGTTTGAGT CCTATTGCGA CTAATAGTTT GCTAATCTCT TGCTGCTTTT GACTACATTA GGGGTGGTCC CATACGTGAC CAAATTGTAT TCGGGCTCGG TTTCGGGTTT GGTCGGCGCT GGAGGCAATG CCGGCTCCGT CATTTTTGGT CTCGGATTCC GGTCGCTTTC GTACCGGCAA GCTTTCATCA TGATGGGGTG CATTGTGATC GCTAGCTCTG GTTTAAGTGC CTTCATCAAC ATTCCGTTGT ACGCGGGCTT ACTCTGGGGT AAGGACAATC ACTCCGTTAT T
|
Protein sequence | YERYDVTVDD AEADRATEIK IFSIRRPHMR AFHVAWFSFF WAFTIWFAPA PLLKEIQKTL GLTRKEIWTS SITNDITAIF LRILIGPLCD VYGARLPMAA VLVLASIPTA MVGLIQSAAG LSVTRFFIGI AGSSFVMAQF WPSRMFTREL AGTANGIVGG WGNLGGAFTQ LLMGTILFPA FRNLYDGDSE KAWRVICVIP AAVAFLWGIA VPWISDDAPM GNYGEMKKRG AMDRILMTTA LRQGAVVNTW ILYVQYACSF GVELVMNNAT VLYYTDEFGL STEDAAALGF IYGSMNLFAR GMGGYLSDQL NLKFGLRGRL WLQTCLLVVE GIVIIIFPFA DTLRGAIVTM CIFSIFTQAA EGAIFGVVPY VTKLYSGSVS GLVGAGGNAG SVIFGLGFRS LSYRQAFIMM GCIVIASSGL SAFINIPLYA GLLWGKDNHS VI
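The Gene sequence and Protein sequence fields above are displayed in space-separated blocks of ten characters. A minimal Python sketch for joining such blocks and computing a GC percentage is shown here. The helper names `join_blocks` and `gc_percent` are hypothetical, and the record does not say how its own GC content value was calculated or rounded, so this is one plausible approach rather than the database's method.

```python
def join_blocks(grouped: str) -> str:
    """Remove the display whitespace between sequence blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Demonstration on the first two blocks of the gene sequence only.
demo = join_blocks("TACGAGCGGT ACGATGTCAC")
print(len(demo), gc_percent(demo))
```

Applying `join_blocks` to the full Gene sequence field and passing the result to `gc_percent` gives a value that can be compared with the GC content field; `len(join_blocks(...))` can likewise be compared with the Gene Length and Protein Length fields.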