Gene Information |
Locus tag | PHATRDRAFT_11990 |
Symbol | |
ID | 7200454 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 878350 |
End bp | 879369 |
Gene Length | 1020 bp |
Protein Length | 330 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179932 |
Protein GI | 219118309 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GACGGTAGCC AGAACGCCGA AGCCAGCGCC ACGCATGCGT TTGACCGTGG TTTCAAGCGA CTACAGCGTA ATAATGCAGC CCGGATGCAG CAATCTTGGC GAAATGCGTC GGATGATGCC GCAAACTACG ACTACGTTCG GGAAGAAATC GCCTCGCGCT TGATTGATCG CCTAGACGAC ATTAAGCGCG ACGAGGGATT TCCGCTAGCG CTCGACGTGG GATCCGGTCC TGGTTATGTT TACAAAGCAA TCTGTGCCGA TGAGGCCCTT CAAGGTGAAG GGGGCGTGGG TGGCGTCAGG AAATTAGTTC AACTGGATTC CGCTGGGGAA ATGTTGTACC GCGATGCCGA TCTCCTCGTG CCCGGCAGTG AACGCTGCGA TTCTTACCGT CTGGAATTGG ACGAAGAAGC CATTTTGCCG TTTCCGGATG GGACTTTCGA TCTAGTTATT AGCTCAACGT CGATGCATTG GGTCAATCAA TTACCGAAAC TTTTCAAAGA AATTCGGAGG GTGTTGAAAC CGGATGGCTG CTTTATGTTT GCCATGATTG GTGGTACTAC GTTACCCGAG CTGCGAGCTG CAATGGTGAT GGCAGAGATC GAGCGCGAAG GCGGAGTCAG TCCGCACGTC GGACCATTTG TAGAGCTTTC GGACGTGGGC GCTCTTTTAC AACGTGCAGG CTTTGCGTTG CCCACAATCG ACGTGGATTC CATGAAGATC GCGTTTCCCA ACGCAGCCGT CCTGATGGAG CACTTGCAAC GTATGGGGGA GAGCAATGCT TGTATTAAAC GAAGAGAACG TATTGGACTG GATACGTTTT TAGCAACGGC ATGTCTCTAC GACGAAATGT TTCCTTTAGA AGGACATGAC GACGGAGGGG AGCCAGCAGT GGAGGCATCG GTTCAAGTGA TCTACGCCAT TGGATGGACA CCTCACGTAT CTCAGCCGGC ACCGCTGGAA CGGGGTACTG CTACTCACAA AGTGGGCGAC ATTGTTGAAA AACACCAGAC AAATCCTTAA
Protein sequence | DGSQNAEASA THAFDRGFKR LQRNNAARMQ QSWRNASDDA ANYDYVREEI ASRLIDRLDD IKRDEGFPLA LDVGSGPGYV YKAIWGVGGV RKLVQLDSAG EMLYRDADLL VPGSERCDSY RLELDEEAIL PFPDGTFDLV ISSTSMHWVN QLPKLFKEIR RVLKPDGCFM FAMIGGTTLP ELRAAMVMAE IEREGGVSPH VGPFVELSDV GALLQRAGFA LPTIDVDSMK IAFPNAAVLM EHLQRMGESN ACIKRRERIG LDTFLATACL YDEMFPLEGH DDGGEPAVEA SVQVIYAIGW TPHVSQPAPL ERGTATHKVG DIVEKHQTNP