Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45636 |
Symbol | |
ID | 7200399 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 783249 |
End bp | 784403 |
Gene Length | 1155 bp |
Protein Length | 154 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179912 |
Protein GI | 219118267 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAACG TGGTCGGCCT GTGCCTGATC CAGATTCTAG CTCTCTGTTG GGGAGATAGC TTGGCTTCAA CGGAGCAATC TAGGTACTGT GTAGGAGTTT CCGATTCTGA GTTCGTGTGC TCCGATGATC CGATCGGTAC CCTTTTGAGT GCACTGTCTG GAGAAGACCC GAACCTCCGA GCGCCTCAAT TTATCCCAGG CGTCCCCCAA CGAATCGACG GTACTGATGC CGAACAGGCA GCAATCAAGG ATGTGTTGGA TAGGATGGAC AAATACTTCT TCGATGAAAT TTTTTCAAAT CCTGAGTACA AAGAGGTGCG ATCTGAGTGG TAAGAATTCT GGTTGAGTTA TTCTCTTTCG CTGCGCTGTG TCACGAATTC TCTCTATCTG ACTTGTCTTT CTTTACATTA GTTGGAACAC CAACGAGCTT TGTGGCTTTT GGGCGGCAAT TGGTGAATGT GAATCTAATC GAGTCTTCAT GCTACCAAAT TGTGCGGCTG CATGTCGCTT TTGTTTGCTG CTACATACGA ATATTGGGGA AGAATAGGGG CATTGGTACC GAGCAGGTCG ACTACTATTG AATAGCTCCC TTTCTTAGAT AAATGGACCT CCGACGTCGA ACAAGTACGG CACCCTTTTT TGGTAAAAAT CTTGTCATGT TCACAGTCAG TCTGGTTGAA TTCCAGGATT ACGACTACAA GACTCAATCC CTGACCTGAA GAGGCCTCGT CCTTGATAAT TCGCCCCTTT ATTTCTGATC TCAGCTTCGT GAAGATGAGC ACAGTGATCT TTTCTTTGTC GAAAGCGCCT TCCTTATCCT GGGCCTCTCT GCTCTAGAGA AGGAAATACG TTTTTTTTTA CCATGGACTC GCAATTGATC GATTCTCCTC GAGGTGAAAC TCGTTCGGCT TTCACATGGC TCATTATAGA GATGAGAGGT GCGACTAGCT AGCATTCCCC TTTGTCCGGA TATCCTTTGC TTACATAGCC TGTAACCAAC AGGATATACC CCGCGTTCGT TGACAGAATA GCTGTCTTCT TTGTTGCACT AGAATACAGC TGTGACACGA GAGTTGGACT TTTTCCCGAC AGTTTCAACA GGATAAACTT CGCACTCATC GGTAGAATAG GTCGATGTCT TTTATGTTGC ATTTT
|
Protein sequence | MKNVVGLCLI QILALCWGDS LASTEQSRYC VGVSDSEFVC SDDPIGTLLS ALSGEDPNLR APQFIPGVPQ RIDGTDAEQA AIKDVLDRMD KYFFDEIFSN PEYKEVRSEC WNTNELCGFW AAIGECESNR VFMLPNCAAA CRFCLLLHTN IGEE
|
| |