Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_28689 |
Symbol | |
ID | 7202429 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 473036 |
End bp | 474758 |
Gene Length | 1723 bp |
Protein Length | 453 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181564 |
Protein GI | 219122463 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGTTCCAAA GTCACAAATG AAGCCTTTGA TGACTTGGAT CTTATCGACG AGCTAATTGG CGATGTCGAC AGTGCCAACA TACTGACCAG ACTAGACGAA AATGACAACA AGCGCGCTGC CGAAGACGAA GAATTTAATC AGAAAGCATT GCCCCAATCC GAAGTCGAGG TGGCATTGCG TATGGTTAAA CAACCAATAG ATCAAGCCGC CACCGCCTGG AAAATAGCCC CCTTCCTGAC AAACAACCTA AAGCGGGACG GATTTGCAAA CTTCTTTCCT ATTCAAGCAT TGGCGATTCC AGATGTAATT GCGAGTGAGC GTCATGCATA TATTCAAGCA CGAGACGTTT GCATCACGGC ACCGACTGGT TCAGGGAAGA CGTTGGCTTA TGTGCTACCC GTTCTCAATT CATTGGCTAA TCGCAAAATT CGCAGATTGC GCGCCCTAGT TGTGCTGCCG AGTCGCGACC TCGCCAATCA AGTCTTCAAA GTCTTTAAGT CGTTTATGGA AGGCTCCGAT CTCAAGGTCG GCCTCGCGAT TGGTCAGTCC GACTTTGTGG CAGAACAGAT GGCGATGTCC GTAGATCCCG ACATCAATTC ACAGGATTCG GATTCGGCTC GACGGCGGCT GGCCTTTGAT ACAGGGAACG TCAGTTTGGC TTTGCAGGCG TTTGAAGATG GCAACGACCA CGACTTACCG GAAAAAAGAG ATCCGCAAAG CACAATTGAT GTCCTGGTAT GCACACCAGG ACGCTTAGTG GATCACTTGG ACAATACACC CGGCTTCTCT TTGGAGCATT TGCGTTTTCT CATCGTAGAT GAGGCAGATC GTCTACTCAG TCAAACTTAT CATAATTGGA TTGGGCGGGT GATACAGAGC GCAAATTCGG GGTCGGTTGC CGCATGGAAA CGAATTCTTG CGAATGATAA CGAGCTTCCG ATGCCACAAG TATCGAAAGA CGGAGCGTCC TACTGTATCA CGCCTACAAC CTGGAGGCGC GGTGGAGTTG TCGGAGACGA TACCGACTTT AACACGAACG ATTCGTACCG TAGCGTCGCT TCTTCCGTTT GCCGACCGGT TCAGTTGCGC AAGTTTCTGG TTTCGGCTAC GCTGACTCGT GATCCGCAAA AATTGGCCTC CCTCAAGCTT GTGAATCCCA AGCACTTTGA CGTCCATCAG CTCAGAACCG GCCATCAGGG GTTCTTCAAC ACCAACACAA AGAAGTATTC AATGCCGGAG GGCCTTCACG AACACACAGT CGAGTGCACC GCGGAGCAAA AGCCTATCGT TCTTTTAGCG CTTGTGCTGG ATCAGCTTAC GCCCCAGCAA TCGCAATCCA GTAGCAAGCA GAGCGTGATA GTGTTTACGG CCAGTCTTGA TTCAACCCAT CGTTTAGCTA GGCTCTTACA GCTACTCTGG GTTTCGGCCG GCTACGGCGA GCCCGATTCA GTTGTGGAAT TCTCTAGCGC CCTCAACCAG CACGAACGAT CGGCGCTCAT GAAGCGCTGT AACGACCCCC AGGACAAAGT CTCCGTCGTA GTATGTTCGG ATGGCATGTC CCGCGGTATG GACATTGACG CGGTCCGAGC CGTGATCAAC TACGACGTAC CAGGTTTGGC CAAAACCTAT GTTCACCGCT GCGGACGGAC AGCTCGCGCC GGCAAGGAAG GACACGCAAT CAGCTTACTG AAGGGCGGAC AAACGCGACA GTTTGACAAA ATG
|
Protein sequence | MVKQPIDQAA TAWKIAPFLT NNLKRDGFAN FFPIQALAIP DVIASERHAY IQARDVCITA PTGSGKTLAY VLPVLNSLAN RKIRRLRALV VLPSRDLANQ VFKVFKSFME GSDLKVGLAI GQSDFVAEQM AILALQAFED GNDHDLPEKR DPQSTIDVLV CTPGRLVDHL DNTPGFSLEH LRFLIVDEAD RLLSQTYHNW IGRVIQSANS GRGGVVGDDT DFNTNDSYRS VASSVCRPVQ LRKFLVSATL TRDPQKLASL KLVNPKHFDV HQLRTGHQGF FNTNTKKYSM PEGLHEHTVE CTAEQKPIVL LALVLDQLTP QQSQSSSKQS VIVFTASLDS THRLARLLQL LWVSAGYGEP DSVVEFSSAL NQHERSALMK RCNDPQDKVS VVVCSDGMSR GMDIDAVRAV INYDVPGLAK TYVHRCGRTA RAGKEGHAIS LLKGGQTRQF DKM
|
| |