Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44027 |
Symbol | |
ID | 7204219 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 735023 |
End bp | 736529 |
Gene Length | 1507 bp |
Protein Length | 472 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186119 |
Protein GI | 219113071 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.361974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGCTGTCCG AATGCAACGA CTTCTTCAAT TCGAGCTCGT CCATTTCTGG GGCTACTCTC ACGATGTCAG AGCAGCATGC TCATGAAAGT TATCCGCTCC TGCCACGGCA AGGGAGCCTA GCGTCGGTTC CCCTTTTTCA AGAACGACCC GTACCCGACG CTCAGCCTGA TGCCGCAATC CCTTTGAACG AACGTCATTC TGTTCTAGAA GTCATTGCCG AAAATGTGGA AGGCTTTGTA GAGGGAGCTC ACGATGCTGC AGTGGATATC CTTGAATCAG CTCAGGAATA CGCTGGAGAC ATCAAAGATG CTTTTGTAGA GGCTGCTGTC GACACCAAAG AAACTCTGGT TAGTATAGTA GAAGATGCGA GCGAGGCCGT TGCGGAGGAG TTCCAAGAAG TTGCTGACGC TTTCATTGAA GAGCTCGAGG ACGCCGACGA AGAGATGGAC AAAACTTTCT TGCTTGAAAT GACTCTGACC AGAAATCTTT CCATTCTACC GGCTGACATG GTAGATTCAG CCGCTATGGT TCCCTCAATG ATTCCGTTTC CAAATCCGGA CTGTCAGGCT GAGGACGAGG AAACGGGGGA GGGAGACGAA GAAACTTTAA AAGACGAAGA CAATGAAATC GAAAAGGCTC CAATGAGTGC ATATTTTCTG TTGGCGTCGG CCGTAATATC ACTGTCATCG ATCGGCCCCT TACTGGACCT ACAAAATGAT GTTAGCGGAA CGATGAAGAT TTACTGGCGA ACAACGGCAA CAGCATTGCT TCTTCTACCG TTCGCCGTGA GTTCCATTTG TCGCGAAGGC TTCCCGCGAC TGAGTTGGCC ACAATGGGTT GTGTTGTTGA TGACTTCCGG AAGCTACGCC GCTATGTGTG TCTTCTTTGT TTGGGCATTA GATTATACTG CCGTAGGAAA TGCGGTGATT TTCAGCAATT CCCAGGCTCT AATCTTACTG GTTGGGAAGG CATTTATTGG AGAAGCCGTT TCGTTGCTTG AAGGCTCAGG AGCCCTAGTT GCCTTTTCGG GAGCAATAAT GTGCTCTAAG GATTCCTCCG ACACAACTCC CGAAGACCCG GGAGGTTTTA CGACAGTGCT AGGCGATTGT TTCGCCATTT CGTCGGCATT CTCCGGTGTA GTATACCTCG TCCTTGCCAA AACCGTTAGG ACAAGCATGG ATCTTTACGT TTTCATGTTT TTCATTATGT TTATCGGGTC GCTACAGACT TTACTGTTCC TTTTCATCGC TCGAGAGCCG TATAGCATTG ATCGCGACCC AAATACCGGA GTCTTTGGTT GGACAGCGTT CGAACAAGAC CGCCTTCCTC TGGAATTGTT CATGGTGGTA ATTTGCAATC TGTTCGGAGC GATGGGATAC GTCCGTGCTA TGCACTATTT CGACAACCTT GTGATATCGG TCGCCGCTCT GATGGAGCCC GTCGTTGCAG AGTTCCTGGC TTTTACGTTT GGAGTGGGTT TTCTACCCGG CTGGTTAGGG TGGTTAG
|
Protein sequence | MSEQHAHESY PLLPRQGSLA SVPLFQERPV PDAQPDAAIP LNERHSVLEV IAENVEGFVE GAHDAAVDIL ESAQEYAGDI KDAFVEAAVD TKETLVSIVE DASEAVAEEF QEVADAFIEE LEDADEEMDK TFLLEMTLTR NLSILPADMV DSAAMVPSMI PFPNPDCQAE DEETGEGDEE TLKDEDNEIE KAPMSAYFLL ASAVISLSSI GPLLDLQNDV SGTMKIYWRT TATALLLLPF AVSSICREGF PRLSWPQWVV LLMTSGSYAA MCVFFVWALD YTAVGNAVIF SNSQALILLV GKAFIGEAVS LLEGSGALVA FSGAIMCSKD SSDTTPEDPG GFTTVLGDCF AISSAFSGVV YLVLAKTVRT SMDLYVFMFF IMFIGSLQTL LFLFIAREPY SIDRDPNTGV FGWTAFEQDR LPLELFMVVI CNLFGAMGYV RAMHYFDNLV ISVAALMEPV VAEFLAFTFG GG
|
| |