Gene Information |
Locus tag | PHATRDRAFT_47361 |
Symbol | |
ID | 7202513 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 391967 |
End bp | 393556 |
Gene Length | 1590 bp |
Protein Length | 489 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181718 |
Protein GI | 219122782 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCTTTGAAA AATTTGACTC TCCTCTGTCC CATTCACTAC TGCATGAAAT CCTGCATTCA GTTGGGACAT TTCCGTCAGT CTTTTCGAGG GAAGAAATCC TGCGGTTGGG TCGGCCTATC GGCATTCTCC TTGAATACCT TTGCCCTTTT CTCGCGAGGC ACCATCGCAC TCTGTCCACC ACTTTCGGCT GCGCCCTATC GCATACGTCC GACATCGTAC CGTAGGTTAT ACGTAACTAC CGCTCTCCGT ATGGCTCCCC GCAGCTCAAA GAAGGAGCCG GATGCAGCTT CGGGAACTCC GAACAAACAC GATCGGGTTA CCCGCGGTAG CACCAAGACG CCGCCCTCCG CAGCGAGCAT TGACTCCCAG AAAACATCCG AAGCGCCGCT GAAAAAGTCA GCGAAGAGAA CCTTGCCCGA AGATACTACC AAAGAGAACA GTCCAAAAAA GAAGGCACCA ACGCATCAAG TCTTGACTGA ACGCGATGAC ATTCCTAGGC TTTGGAGTGA CGAGCAGGCA GCCAAAAACG GATCCTACAG TACGTGGCAT TAGGTTACGG AGAATCTGGA TGTATTTGAT CGTCGAGATG CTTACACTTC TGCTCACTTT TCGCAGCCAT GAGAATTGCT TCGTGGAACG TGGCCGGTCT GCGCGCTCTC ATGCGCAATT CCCCGCATGC CCTGTCTGAT TTCGTCCGAG AACACAATGT GGACGTTCTA TGCTTACAAG AGACAAAGCT ACAAGAGTCC CATCTGGACG ACCCGAAGCT CAAGATCCGG GGCCATTTGC TGGAGAAAGA AGGCTTCGAT TCATATTATT CCTGTTCAAC TGCTAGAAAA GGGTATTCTG GAACATCAGT CTTTGTCAAA AGGCGACAGC TCATTAAAGG AAGCAAGGTT GCAAAAAAAC AAAAGACTTT GGGGAGCTAC TTCGGCAAAA ACGATGAGCG AGAGACATCA TCAAATTCGC TTAAGGGAAC GGAAGAATTA TCGATCGATC CGCATCTTTT AGTGCCTGAA GGGGTTTCTT TCCAGATGAA TGTTGACAAG CACGACTCGG AAGGACGAAT AGTGGTCGTT GATTTCCCGT CGTTTACGAT GTGCAATGTC TACGTGCCAA ACTCTGGACA GAAGCTGGAA AGATTAAGTT ATCGCACCGA GGAATGGGAC AAAGATTTTT TGTCATTCAT TCAAAAAAAG CAGAAAGATC GAGGCGTTCC CGTCTTGTGG TTGGGAGATT TGAACGTTGC GCATACAAAT CTGGAGGTGT GGAATGATGG TGCCAAACAC TTGGCGAAAC AAGCTGGTGT TACAGCCGAG GAAAGGGCCT CATTTGAGGC ACAATTAAAC GCAGGGTTTA TCGACGCTTT TCGGCGTTTG CACCCAACGG CCAAGGGACA CTATTCTTAC TGGAGCCAGC GCGCAGGCAA CCGAGAACCG AACAAAGGCT TGAGACTAGA TTATTTCATT TGTGATCCAT CGTTATTCGA CGAAGAGTCC AAGACGATAG TACGCGATAG CTATGTACTC CCTCTGCAGC AAGGAAGCGA TCATTGTCCT GTTGTGTTGG AGTTGGAGAT TAAAGCCTAA
|
Protein sequence | MKSCIQLGHF RQSFRGKKSC GWVGLSAFSL NTFALFSRGT IALCPPLSAA PYRIRPTSYR RLYVTTALRM APRSSKKEPD AASGTPNKHD RVTRGSTKTP PSAASIDSQK TSEAPLKKSA KRTLPEDTTK ENSPKKKAPT HQVLTERDDI PRLWSDEQAA KNGSYTMRIA SWNVAGLRAL MRNSPHALSD FVREHNVDVL CLQETKLQES HLDDPKLKIR GHLLEKEGFD SYYSCSTARK GYSGTSVFVK RRQLIKGSKV AKKQKTLGSY FGKNDERETS SNSLKGTEEL SIDPHLLVPE GVSFQMNVDK HDSEGRIVVV DFPSFTMCNV YVPNSGQKLE RLSYRTEEWD KDFLSFIQKK QKDRGVPVLW LGDLNVAHTN LEVWNDGAKH LAKQAGVTAE ERASFEAQLN AGFIDAFRRL HPTAKGHYSY WSQRAGNREP NKGLRLDYFI CDPSLFDEES KTIVRDSYVL PLQQGSDHCP VVLELEIKA
|
| |
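The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the source database: the function names (`gene_length`, `min_coding_bp`, `gc_percent`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the 1590 bp gene length listed above. It also assumes a standard triplet code with one stop codon when estimating the minimum coding length for a 489 aa protein.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_coding_bp(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string.

    Spaces and other non-ACGT characters (such as the 10-base
    grouping used in the Gene sequence field) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Values taken from the record above.
    print(gene_length(391967, 393556))   # 1590
    print(min_coding_bp(489))            # 1470
    # First two 10-base groups of the Gene sequence field only.
    print(gc_percent("TGCTTTGAAA AATTTGACTC"))  # 30.0
```

Passing the full Gene sequence field to `gc_percent` would give a value to compare against the listed GC content. The gap between the 1590 bp gene span and the 1470 bp minimum coding length is consistent with the record being marked as spliced, although the record itself does not list exon boundaries.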