Gene Information |
Locus tag | PHATRDRAFT_21557 |
Symbol | |
ID | 7202423 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 448509 |
End bp | 449795 |
Gene Length | 1287 bp |
Protein Length | 348 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181728 |
Protein GI | 219122803 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCTATCTGC AATCCAACAT GGCCAAAAAG AAAGATAGTT TCAAACGCCG TCGAATTAGC GGTGATCTTG TCGAAAAGGC TGGAAGCCTG GATGCAGGGC CTCAACAAAA AGTCGACGAA AAGGTTGTCA CGGAGGATAG TTCTTCAAGG AGAAAAGCTG AACTTCTCCA AGCCGAAAGT GAAGTACATA GCGACGGTGA GGATATCGAC GATACTACCG TGCGGAATGA TGGTCGGTAT CGAAACAAGC AGCGTTGTTT GACCCTTTGC TCTCGGGGTG TCACGGCACG CTACCGCCAT CTATTGGAAG ATCTGCGCAC GCTCATGCCG CACCACAAGA AGGAGTCCAA ACTTGATCCA GGTGAGGACG GGGTTGGCCA GGCTGTCAGC GATATTTGCG AAATGCGTTC CTGCAACACG ACTATGTTCT TGGAATGTCG CAAAAGACAG GATGCGTACA TGTGGCTGGG CCGCGTCGGT GGTCAGTCGC CCGGCCCGAG TGTGCGTTTT CATGTCACAA ACATTCATAC GATGGATGAG CTGCGTTTGA CGGGGAATTG TATGAAAGGA TCCCGGCCGA TTATGACGTT TGATGAGAGC TTTGGGCGTG TGGATCACTT GAAGCTATTA AAGGAACTCT TCATTGACAC ATTTGGTACC CCGCGGGGCC ATCCGAAGAG CAAGCCATTT GTTGATCGGG TGATGGCGTT TTGCTATGCG GACAACAGGG TAAGTCCTTC TATTACACTC TACTGTCTGC TACGTCCGTA AAACAGAGAT CATGCTAGTG ATTACAAAAA TGATGAGCGT ATGTCAGAGA TCACTGATCA ATGAGAATAC CATCATTATA GACTACGATG GAATGAAGCA ATGTCGGGCA ATCAATCCTT AGCCGCTTCT CTGTTGCAGC ATGGCACTCA CATCTGCATT CACTTTTGCT AAAACTTTTA GATCTGGGTG CGGAATTATC AAGTTATTGA AGAACAGCCG TCAAACGCCA AGGAAGCTCA TCAAATTAAG AAGAATTCAG GAAGAGAAGA GGCTACTTCC ATGGTGGAAA TTGGCCCGCG TTTTGTTTTG AACCCGATCC GCATTTTTCG AGGATCGTTC GGAGGTCAAA CCTTGTTCCA GAATCCTGAT TTTGTGTCAC CTAACGAGAT CCGTTCCTTG GAACGAAAGA GCAAAGGAAG TCAATATGAT CAGCGGAAAA ACTCGCAGAA GGAGCGACAC GAGCGGAAAT CACAACTGGT TTTGCCGGAA GACCCGTTGA AATCCGTTTT TCGGTGA
|
Protein sequence | MAKKKDSFKR RRISGDLVEK AGSLDAGPQQ KVDEKVVTED SSSRRKAELL QAESEVHSDG EDIDDTTVRN DGRYRNKQRC LTLCSRGVTA RYRHLLEDLR TLMPHHKKES KLDPGEDGVG QAVSDICEMR SCNTTMFLEC RKRQDAYMWL GRVGGQSPGP SVRFHVTNIH TMDELRLTGN CMKGSRPIMT FDESFGRVDH LKLLKELFID TFGTPRGHPK SKPFVDRVMA FCYADNRIWV RNYQVIEEQP SNAKEAHQIK KNSGREEATS MVEIGPRFVL NPIRIFRGSF GGQTLFQNPD FVSPNEIRSL ERKSKGSQYD QRKNSQKERH ERKSQLVLPE DPLKSVFR
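The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive (which agrees with the listed Gene Length of 1287 bp), and the helper names `gene_length`, `ungroup` and `gc_percent` are hypothetical, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def ungroup(seq: str) -> str:
    """Remove the whitespace that splits a sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    dna = ungroup(dna)
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Values copied from the Gene Information table of this record.
START_BP, END_BP = 448509, 449795
PROTEIN_LENGTH_AA = 348

length = gene_length(START_BP, END_BP)
# Bases needed to encode the protein plus one stop codon.
coding_bases = 3 * (PROTEIN_LENGTH_AA + 1)

print("gene length (bp):", length)
print("coding bases incl. stop:", coding_bases)
print("non-coding bases within the gene span:", length - coding_bases)
```

To check the GC content field, paste the full Gene sequence value into `gc_percent`; to check the Protein Length field, take `len(ungroup(...))` of the Protein sequence value. The gene span being longer than three times the protein length plus a stop codon is consistent with the "Is gene spliced | Yes" entry.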
|
| |