Gene Information |
Locus tag | PHATRDRAFT_38559 |
Symbol | |
ID | 7203306 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 395130 |
End bp | 396356 |
Gene Length | 1227 bp |
Protein Length | 386 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182527 |
Protein GI | 219124473 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAA AGGTGGGTAT CTTGGGTCTT CCGAATGTTG GAAAGTCAAC CCTCTTTAAT GCACTTGTGC AGAAATCCAT TGCGCACGCC GCTAATTTGT AAGTCGTGGT ATGCATTGGA TTGAGTTGCG TTCGAGGACA GCGTCTCACT GCGGTTACCA ACAGCCCATT TTGTACCATC GATCCCAACG TGGCACCAAT CCCTGTCCCC GACCCATACT TGGAACGTCT CGGTCGTGTT GCGCAAAGCA AGGCAGTTAA ACCAGCAACG ATGGAATGGG TCGATGTGGC TGGTCTCGCC AAGGGTGCCC ATCGTGGAGA AGGTCTTGGC AACCGCTTTC TAGCATCATT GCGGGAATGT GATGCGATTT GTCATATAAT TCGTGCGTTT GAAGACCCAA ATGTTGTGCA TGTGGACGGA CAGGTCGACC CAGTTTCCGA TGCCAACGTG ATTGGACTGG AACTGATCCT GGCGGACCTG GCTCACGTCG AGCGTCGTCT AGAGAAAACT ACGTGCAGTG GTGTCGAGCG AGCGACGCTA GAGTCCATTG CCGAATCTCT CGAAAAGGGT ATTCCGGCAC GGGATCTACA ATTGTCGAAA GAAGACTTGC TTTCCATCAA ATCTATGGGA TTGCTGACGC TCAAACCATT CTTATATGTG TTCAATGTTG ACGAGGTTGA CTTTTGCTAC CGAGAACAAG CTCTCGAGAC GGCCAGAGAA AGACTGCAGT TAATTCCGTA TTGTGATCTT GAAACGAAGG ATAACTTTAC GATCGTGAGC GCAAAGATGG AAGCGAATTT AGGGGAAAAA CCCAGAGAGA CCCAATTGAG TTACCTCCAG GATATGGGAA TGGAGTTTGA GAAGGATGAC CAGCTTGAAG GCATACGGAG TTACAATGTA ATGCCCACCA TGGTTCAAAG ATTGCTAAAC CTGGGTTTAG TGTACACGGG ACCCGGCGTT GCGTCCAGTA GGTCGCAAAC CACCAAAGCG TACATGATCG ACAGCGGCGG TGGTGCCAGT ACCGGACGAG CAACCACCGC CCATGACTTT TCCGGTCGGT TGCATGGCGA CATTCGCAAG GGATTCACCC GAGCCGAAAT CACCAAGGCG GAAGCGCTGT TAAAATACGA CTCCTACGTA GCCGCCAAAG ATGCTGGCAT TGTCCGTACC GAAGGCCGTG ACTACATCCT CCAACCGGAC GAAGTCGTCT ATATCAAATG GAAATAG
|
Protein sequence | MKIKVGILGL PNVGKSTLFN ALVQKSIAHA ANFPFCTIDP NVAPIPVPDP YLERLGRVAQ SKAVKPATME WVDVAGLAKG AHRGEGLGNR FLASLRECDA ICHIIRAFED PNVVHVDGQV DPVSDANVIG LELILADLAH VERRLEKTTC SGVERATLES IAESLEKGIP ARDLQLSKED LLSIKSMGLL TLKPFLYVFN VDEVDFCYRE QALETARERL QLIPYCDLET KDNFTIVSAK MEANLGEKPR ETQLSYLQDM GMEFEKDDQL EGIRSYNVMP TMVQRLLNLG LVYTGPGVAS SRSQTTKAYM IDSGGGASTG RATTAHDFSG RLHGDIRKGF TRAEITKAEA LLKYDSYVAA KDAGIVRTEG RDYILQPDEV VYIKWK
|
| |
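The numeric fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database's own tooling. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene span includes the stop codon. The function names (`span_length`, `coding_length`, `gc_fraction`) are hypothetical helpers introduced here for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue, plus a stop codon if requested)."""
    return 3 * (protein_aa + (1 if include_stop else 0))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T bases; spaces and other characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# Values copied from the record above.
gene_len = span_length(395130, 396356)   # Start bp, End bp
cds_len = coding_length(386)             # Protein Length in aa
non_coding = gene_len - cds_len          # bases in the span that are not translated

print(gene_len, cds_len, non_coding)     # prints: 1227 1161 66
```

Under these assumptions the span length agrees with the listed Gene Length of 1227 bp, and the 66 bp not accounted for by the 386-residue protein plus stop codon is consistent with the record's "Is gene spliced: Yes" flag. Passing the Gene sequence field to `gc_fraction` gives a value to compare against the listed GC content; the blocks of ten bases can be passed as they are, since spaces are skipped.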