Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20034 |
Symbol | |
ID | 7200633 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 591475 |
End bp | 592575 |
Gene Length | 1101 bp |
Protein Length | 306 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179678 |
Protein GI | 219117779 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCAGCTTCTA AGCAACCCCG AAACCAGTCT GAATTTGCTT TCATCAACCT TAACAAGAAG AGCTATGTAC GGACAGGTAG TTGTCGGACC TCCAGGAAGT GGCAAAACCA CTTTTTGCGA CGGGTACGTG AATCGGGGTC TGCGCACTTA CTGTTTTTCG TTTCCTCACA TCTCATATTT GAGCAAAGCA TGCAACAGTA CCTTCGGCTT CTGGGTCGGG ATGCGTGGGT TTTGAACTTA GACCCTGCGA ATGAAGGGGG ATCTGTAAAT GGCGGAACGG GAACGACCGA AGAAAATGTG GACGAAATCG AATCAAAATC ACAATTGCCG TACGAAACTA TTTTTGATGT CTGCGAAGAA GTTGTTAATC TTTCCTCCGT TATGAAAAAA ACAGGCCTGG GGCCCAATGG AGGACTAATA TATTGCATGG AATATATGGA GGCGCATGTC GGCGATATTA TTCTCAAAAT TAATGAGAAA CTGAAAGAAA AGACATACCT TCTCATTGAT CTCCCTGGAC AAGTGGAGCT GTACACACAT TCCACATGTG TACAGCAGCT ATTGAGTAAA ATGATTAAAG CTTGGGACCT GCGATTGTCA GCGGTGCAGC TTATTGATGC GCACTACTGC ACTGATGCAT CCAAGTTTCT TTCGGCGGCT ATGTTGGGAA CGACAACCAT GCTGCGGCTC GAGCTTCCAA CCGTGAACGT ACTTAGTAAG GTGGATTTGT TGTCCCGATA CGGTGATCTA CCGCTGCAGC TAGAATTCTT CACTGAGTGC CATGATCTTG AAAGACTGGT CCCTTTTCTC GAGCATCAAG CCATGAATCA TTCAAAACAC GACAACGAAT ATTCGAGTTC CGGAACGTCG GACTACGTTG AAGATCCCGA CTATCAAAGG GCTCGCACGA AGCGGCGGAG CTCTATATTT TTCCAGAAGT ACGCCAAGCT TCACAATGCT TTAGCCGAAG TTGTAGAAGA TTTCGGTCTA CTCTCGTTTC TCCCATTAAA TATAACAGAT GCTGGAAGCG TTGGCCGTGT ATTGGCGAAA ATCGACAAAT GCAATGGCTA TGTATTTATG GAAGGGTCCG TCCCGGGGGA T
|
Protein sequence | MYGQVVVGPP GSGKTTFCDG MQQYLRLLGR DAWVLNLDPA NEGGSVNGGT GTTEENVDEI ESKSQLPYET IFDVCEEVVN LSSVMKKTGL GPNGGLIYCM EYMEAHVGDI ILKINEKLKE KTYLLIDLPG QVELYTHSTC VQQLLSKMIK AWDLRLSAVQ LIDAHYCTDA SKFLSAAMLG TTTMLRLELP TVNVLSKVDL LSRYGDLPLQ LEFFTECHDL ERLVPFLEHQ AMNHSKHDNE YSSSGTSIFF QKYAKLHNAL AEVVEDFGLL SFLPLNITDA GSVGRVLAKI DKCNGYVFME GSVPGD
|
| |