Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46171 |
Symbol | |
ID | 7201378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 473067 |
End bp | 474476 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180441 |
Protein GI | 219119358 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTTCCT CCGTGGACGA AGTTGCTTCA TATCGGGACG GCGATGTTGT GGGCGAAGGG CCCGACGCCT TTTTCGACAA CTCCGGCGAC AACACCATGA TTATGAAAAA CGATGCAGCA GTGAGCCACG TCTCTCGACG CAATCGCTGC TGCGGCGGAC GTCATCGTAC TCGGATGATC GTCACTGTAG GGCTGCTCTT GTTTGTGCTC ATTTTAGTCA GCGTGTACCG TAGCAAGAGC CAGACACCCC GGGAACCCTC TATGTATACA GCGTCGGGGC TGTTGTTGGT GCCTTCCAAC CAACAGCCTG GAAGAGAATC TGCAAAATCT TGGGATCAAC TCAGCTCCGA GATTGTAGGA CCACTGGTGA AGCGTTCTGG GGAAGGATAC GGACACGCCA TCGCCCTTTC GGAGTGGGAG TACGGTCCTC GTCTTGCGAT TGGACTGGGG GGCAACGCAC AACAACCCGG GTTTGTGCAG GTATTCCACC ACAACAAAAC GGCGGGATGG GTGCTCGAAG ATACAATTTC TATTCCTGGT AATGTCTACG CCCAACAAGA AGGAGAAGAG CGACAACATC TGGCCATGGC GGGTGACGCC CGTCGAGTAG TCTTTACCCA GGGCAACTAC GCTTTTTTCT ACTATTTCAA ATCTACCTTT ACGGAATTCA GGTGGAAACC TTTGAACGAT CCCATACTCA TTGATTCCGA GTTGACAGCA AACGGAGAAG CGGATCAATT TCTTGAAACT AAGCTGGCTT TGAGCAATGC AGGAAATGTG GTTGCGATCG CCTCCGAAAC GGTCAACAGA GCCCAACTCA AAGTCTACAA AGACGACACG TCATGGACAC AAAATACGTC TCCGGTTCAC AAATGGAAGG TGCACAGCAC AATTCCAATC GACCAGCTCA TAGGTGATAT TTCCATATCT GGTGACGCCA CTCGTCTCGC AATTGGGAAT GTGGGGTCCA CCGTCGACGA CAATGGCGAC GATTCCGGAA AGGTTCAGGT GTACGGATGG CAAGGTGGCG ACTGGTATGA GCTTGGACAA ATGCTGCGAG GCAATAGAAC GTTAGATCGA TTCGGTTCTT CGATCGCCCT TAATTTGAAT GGAGACGTGT TGGCCGTAGC GTCTAACGGC TCCCATCGTG TCCAGGTATA CCGGCTGGTT GGTGACGACT GGGAGCAACT CGGCTCAGAT TTGTATGCCA TGTCCATTTA CGAAAAATTT GGGATAGGAT TGAGCTCAGT AGGGACGGAT ACACGATGGC AAGTTCCTCA CCAGGGACCC CGGCGCTTAA GTCTTCGGAA CATAGTAACG ACCCGGAGTA CATGGAATAC GCGTACGGCC GCGTGTACAT TATGCAGTTC AACCTTGACG AGCTTCAATG GAAATCGGTG GGTTTTATAA
|
Protein sequence | MVSSVDEVAS YRDGDVVGEG PDAFFDNSGD NTMIMKNDAA VSHVSRRNRC CGGRHRTRMI VTVGLLLFVL ILVSVYRSKS QTPREPSMYT ASGLLLVPSN QQPGRESAKS WDQLSSEIVG PLVKRSGEGY GHAIALSEWE YGPRLAIGLG GNAQQPGFVQ VFHHNKTAGW VLEDTISIPG NVYAQQEGEE RQHLAMAGDA RRVVFTQGNY AFFYYFKSTF TEFRWKPLND PILIDSELTA NGEADQFLET KLALSNAGNV VAIASETVNR AQLKVYKDDT SWTQNTSPVH KWKVHSTIPI DQLIGDISIS GDATRLAIGN VGSTVDDNGD DSGKVQVYGW QGGDWYELGQ MLRGNRTLDR FGSSIALNLN GDVLAVASNG SHRVQVYRLV GDDWEQLGSD LYAMSIYEKF GIGLSSVGTD TRWQVPHQGP RRLSLRNIVT TRSTWNTRTA ACTLCSSTLT SFNGNRWVL
|
| |