Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39786 |
Symbol | |
ID | 7195642 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 34222 |
End bp | 35446 |
Gene Length | 1225 bp |
Protein Length | 352 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183797 |
Protein GI | 219127136 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATG CGGACGGCGC CGAGCGAGGG TATCGCGACT ACGCTGGAGA GGAAGATGAT GCGACCGAGC TCGGCGTCAA GCAAACGCAA ACCCACGCTC CAAAGATGCT TCGTAGCGAA CATCATTTTC CAGTACGTCT ACATGCTCTT TTGAATGAGC TAGAAAACGA CAACTTGGAT CGAATCTTGC GTTGGCAGCC CCATGGACGA TGCTTTGTTG TATGCGATCA ACGGCTTTTC GCGACTCATG TGCTCCCCTA GTAAGTCGAC AGGGTGATTG GATGCGGAGA CGTCTGTATC CAATCTCATC AACTGATAAT ACTTCCTTCA GTTGGTTTCG CCAGACCAAA TTTTCATCGT TTCAGAGGCA GCTCAACCTA TACGGCTTCA AAAGAATAAC GGCGGGTAAG CGTAAAACGA TAACTTTTGG TTTGCATATT TTCAGTGAGC GATCCTTTCT GACATTCATC TGCTTCTTTT ACTGAAACCT TTTTGAACAG GTCGTGACAA AGGCGGCTAC TACCATGAGC TCTTCCTTCG AACAAAGCCT TTCTTAGCGC ACCATATCCT CCGCACGGAG AAAAAGGGAC AAGGGTACCG CAAGCCAAGT TCACCCAACA CGGAACCCAA CCTTTACCTA CAGACTTTTT TGCCCACTAC AAAAGGGAGT AACGTTGCCG AGAGAAAAAA TGCATCTCCC CTCCGGGAAA GCAATCCAGC ACTAGGCAAT AGAGAGAACC GTACCCAAGC TTTGTCCAGT CAGTCAAGAA TCCATCCTAG TGGATTGCTG AGGCCAGCTT TCCCAAACAC AACTTGGCGT CAAAACCCTA GATTCTTTCA AAATTACGAA TCATTCGAAG TTGTTGAAAA TACGCACCTT CTTGCGCCTC CTTCCGTTAG GAGTTTGTTG ACGAATACGA TGAGGTGTCC TGAAATCTCG ATGGCGCGAG TGCTTCCCAA TACAACTATC GAAGCGCTGG TGTTGCAGCA GCAGCAACGA CAGGCGGACC AAATTACACA GACCCGGCTC CTGATGCGAG AGCAGCAACT TGCTGTAACC CTTATTTTAG CTCACCAGCA AGATGTAGAA CGAGACACCT TGTTTGAAGG TGCTTATGCT CGAGGAGATC ATAACTTGCC ACATGTCTCC GAAATCGATT CTCTTCGACT GGCTTTACAC CTGCCCTTTC CAAGCTTATC ACTCCAGGGA CCATTTGGTC ACTGA
|
Protein sequence | MSDADGAERG YRDYAGEEDD ATELGVKQTQ THAPKMLRSE HHFPVRLHAL LNELENDNLD RILRWQPHGR CFVVCDQRLF ATHVLPYWFR QTKFSSFQRQ LNLYGFKRIT AGRDKGGYYH ELFLRTKPFL AHHILRTEKK GQGYRKPSSP NTEPNLYLQT FLPTTKGSNV AERKNASPLR ESNPALGNRE NRTQALSSQS RIHPSGLLRP AFPNTTWRQN PRFFQNYESF EVVENTHLLA PPSVRSLLTN TMRCPEISMA RVLPNTTIEA LVLQQQQRQA DQITQTRLLM REQQLAVTLI LAHQQDVERD TLFEGAYARG DHNLPHVSEI DSLRLALHLP FPSLSLQGPF GH
|
| |