Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48855 |
Symbol | |
ID | 7194941 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 486371 |
End bp | 489395 |
Gene Length | 3025 bp |
Protein Length | 903 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183493 |
Protein GI | 219126498 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.105124 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCCGAC TTGTTTGGTC CATAGAAGTC TTGACGACGA CTTTTCTCGC AGGCAACTGC GTTTGTTTCC TGTTATCAAT AATACCTTCC GTGGCCGCTT TTGCCGAAAC ACTCCGCGTA CGACGCGTTC TACCATCGGT CTCGCCACAA GACGCGTATG CGCGCTGCTT GCGGGGATGG AAGCGGGATA ACCTTGGTCT CGTGGGGGTA CCTCCGCCCA TTCTACTAAG AGACGGCCAT CCTGATACCG GCGTCGGTAT GCTTCTTTTG CGAATACCGC CCTTCGGCTT GAAAGAGGGC ATCGTCGGCT GCCAGCGGGA CGAGTGTTCT ACCACCATGA ATTACCAAGT TTTGAATCCG GGATGGTTTA CTTGGCCGGT TGCTGAGCAT GAAGGATGGA TTCGGTTCGC CTCTGATGGG GAACAGGGAT GTATTTTGGA ATGGACGGTC CAGTGGACAC CGCTACCATT GCCGCTATCC TTTTGGAACG GTTTTCTCAA AGCACTAACA ACAGTGATAG TGGAAGCCGC GGCAAGCTAT GTCGCCAGAG GGGGCTCAGG CGCCTACCAA TCAGTTGCTA TCCGCAAAAA AGAGTAGCCA AAAGATACTG CCCCTATCTG GTACACGTGT AGATGAATGT AATTAATGCA GCAACGCAGT ATAGATTAAG AAAGATCGTA AACATCGAAG TCATCCGGAA GAGCTGCGAA TTTAACAAGA ATATCGTGAT GAGATACTCC TGTAGCTCTA GTGCATGGGA CAACAAAAAA TCCTTCGGTT TGAACAGATC AGTCCGCGTA CCTACGATGA TGTTCTCGTG TGCTACGGGA GGAGCTTTCG GCGCGGCATT CCCCTTGGCG GAGGTTGATT CTTCTTGCAC ACGCCGAGCA GGTCGAGCCT GCACAGGGAC GCTTCCGGGA TTCTGTCTCG TTTGGGCGGA TCCTTCGCCA CCCGGCGTCC GGAGACTGAT GGGTCGCACT GGCGGCCTGG AAAACTCAGG CTCCATCACT TTGGGGGCGA TCGAGAAAAT TGATCCTATT GTCTGCAAAG GATGCAAATA ATGTTACATG GTCCTCGAAG AGCTCACAGT CAGGCATTCG CCAACTAACG TTGGATCTGT TCTGCGTTTT CCAGTGCGAT AGTATCTTCG GCTCTCGATT ACAGAACTCC GGGTTGTATT GTCACATCAT ACTCACAGTC GATGGGTAAA GGGGTCTGTC CGGCTGCCCC CGATCACGGA TGTACGGGAC ATCTTGTCCA GGGACCGTCA AACGAATTTG GGGCACGCTT CCGACATGCA TCACACGACT TCGCGATTGC AATGATCTGT GCACCCTCCG TTGGTTCATT TCACAGCTCG AGCTCGGAAA TACTGAGGTC ACTCGAACAA GACAAGATGA GGGAATCTAG TCGAAAACGA CCACTTAGAG CAGAAGATGG CCTACAATCA TCGTCGTCGG CGTCTGACTT GGACCAACAG CAACAACATC ACCAAGACGA CGTGGATACC CTGGGCGACG CGACGATTGC TTGGTACGGT TCAAACGAGG CGACGGTAGT ACTAGGGGCA ACACCTCTGT GCCTTTTAGG ACGTGGAAGA ATACGTCTTA TCTCGGGAAC TGTCAGCTTG CACGGCTTTT GCCTTACCGA TGAATTTCGA GATTTCGAAA GTCCTATTTA TGCGAGCTGG CTTACTATTG TCGCCGAAGA GCACGGTAAG AAGGGAGGAG AGGTAGCCAA GGTAGCAGTG GAATCAACAC GGCGTGTTGC TGGTGTTCTC GTACCAACTT TTGAAATTAA GCTGGCGACC GAGCCCCATT CACGACCGAC CGTTATTCCC CGTCGATGGA CAGAGTCACT AGACTCAATT CTACAGGAAC AATTGTGTCG ACAACTGGAA AACGAGAAGG CAAGCCAGCG GAAAAGACCC ACCGCCTTCA CACGTTTACT CGAAAAGGAG AGCAGTAGTG AGAAACTTGC TGCCAAGGAA ATGGAACAGT CAGCTCCTGG GTTCAAAGTC GTCGTGGTGG GTGCCAAGAA CGTCGGCAAG TCCACCTGTT TACGCTACGC CATCAATCGA CATCTGTCGA TGTGCAGCGA AGTAGCGGTA TTGGACGGAG ACTTGGGACA GCCGGAACTA TCGCCTCCAG GGATGGTGAC TTTAACACGT TTGCGACAGC CAATCTTTAG CCAGCCGCAT TTACACCTAG TAACGAACGA AGATAATGCG TCAGCTGCAG CGCCTCGACA CGAAATGGCG TATTGGTTTG GGGCATCTAC TTCACAAGGG GATCCCGAAA AATACGTGAG CAGTCTCACA AAGCTAGTGC GCTACTATCA CGAAAAGCTT CTTCCTCAAA AGCCTACGTT GCCTTTGCTG ATCAATCTCG ATGGGTGGGT CAAAGGACTT GGGATGCAGA TCTTGGAAGC TATCCTGCTA CAAATTCAAC CAACGCACGT GATTCAGATA CTAGGAGATC TCAATTCCAA AGTCTTCGAA TTGTCGTTAC CTGACGAAAT TCATCTCCAT ATATGTCACG CCTACCACGT GATACCACCA GAGGAACAAC AAAGAAAAAC AAAAATGTCT ACGCCCACTT CCGATTCCGA GTCGAAGGAA ACATGTGCGC TGGTCGACAG GCAACACGAC ACTGGTCGCG TTACTTCTCT GTCCGACATT GCGAGTCTCC CAACAATCCC CTCGTCCATA CCGGCATCAG CGCTCCGGAC ACTCCGCTTT GTCTCCTACT TTTTGGACGA TGTTTCCATT TGGGATCGCA TACGCTTCGG TCAAAAGGAA CTGATCGTGG ATATTAATTG TGTGATTGCC AAACGTTTTG CCTCGCAGAA GCCGTACATA GTACCGTTTG AAGCAATTGC GGTGGATTTT AGTTCCGACG AGTTTCGACG TGACATATGG ACACCAGAAC GGATTCTGGA CAGTTTGAAT GGATCCATAG TAGGCTTATG CTGTCGAACG GGCAAATCAG ACGACGAGTT TGACTGTTGT GTTGGTCTCG GTATT
|
Protein sequence | MVRLVWSIEV LTTTFLAGNC VCFLLSIIPS VAAFAETLRV RRVLPSVSPQ DAYARCLRGW KRDNLGLVGV PPPILLRDGH PDTGVGMLLL RIPPFGLKEG IVGCQRDECS TTMNYQVLNP GWFTWPVAEH EGWIRFASDG EQGCILEWTV QWTPLPLPLS FWNGFLKALT TVIVEAAASY VARGGSGAYQ SVAIRKKDAW DNKKSFGLNR SVRVPTMMFS CATGGAFGAA FPLAEVDSSC TRRAGRACTG TLPGFCLVWA DPSPPGVRRL MGRTGGLENS GSITLGAIEK IDPISMGKGV CPAAPDHGCT GHLVQGPSNE FGARFRHASH DFAIAMICAP SVGSFHSSSS EILRSLEQDK MRESSRKRPL RAEDGLQSSS SASDLDQQQQ HHQDDVDTLG DATIAWYGSN EATVVLGATP LCLLGRGRIR LISGTVSLHG FCLTDEFRDF ESPIYASWLT IVAEEHGKKG GEVAKVAVES TRRVAGVLVP TFEIKLATEP HSRPTVIPRR WTESLDSILQ EQLCRQLENE KASQRKRPTA FTRLLEKESS SEKLAAKEME QSAPGFKVVV VGAKNVGKST CLRYAINRHL SMCSEVAVLD GDLGQPELSP PGMVTLTRLR QPIFSQPHLH LVTNEDNASA AAPRHEMAYW FGASTSQGDP EKYVSSLTKL VRYYHEKLLP QKPTLPLLIN LDGWVKGLGM QILEAILLQI QPTHVIQILG DLNSKVFELS LPDEIHLHIC HAYHVIPPEE QQRKTKMSTP TSDSESKETC ALVDRQHDTG RVTSLSDIAS LPTIPSSIPA SALRTLRFVS YFLDDVSIWD RIRFGQKELI VDINCVIAKR FASQKPYIVP FEAIAVDFSS DEFRRDIWTP ERILDSLNGS IVGLCCRTGK SDDEFDCCVG LGI
|
| |