Gene Information |
Locus tag | PHATRDRAFT_16704 |
Symbol | |
ID | 7199008 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 46189 |
End bp | 49368 |
Gene Length | 3180 bp |
Protein Length | 1045 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185112 |
Protein GI | 219129893 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGTTTCCG ATACCGAATC GGTTGCCAGC GCAAAAAATA AGTGGGACGA TGTCATATCC ACGGGAGCTT CGTCACGTCG AAGGAAAAGA TGGGACGAAA CTCCTGTGAT GGCGGCGTCA TCATCAGTAG CAGCCACGCC TCTTGTCACG ACCGGGTGTC GATCAAAATG GGACGAGACT CCTGTATTGG CTTCAGGTGG CGTAGGAGTG ATAAAGACCC CTACGCTTGC TGGCGCTCGG AACCGTTGGG ATGCAACTCC GTTGTCTACA CAGCCTTTGG CAGGTGCGTC GCAAACTCCC ATGGGAACGC CGCTAGATAA AGCAATGTTG TTGGAGCGAG AAATGGAATC AAGAAACAGA CCATGGACTG AAGGTGCTTT GGACGCAATT CTTCCTTCTG AAGCATACAA CATTGTTCGT CCACCGTCAA CCTACATTCC TCTCCGGACA CCCGGTCGAA AGCTCCTGGC TACACCCACG CCGATGTCGA TGACCCCTGC GGGCTTCCAG ATGGAAGTGC CCGCTGAGCA GCGCATAGAC GCGTCAGTTC AAGATATTCG AGAGGCGTAC GGTATCCCTT TGGCCCCGAC TGCCGACGAG ACGGGTGCTG TAGGATCATT ACCATATATT AAACCCGAGG ACATGCAGTA CTTTGGGCGG CTTGCCGAGG AAGTCAACGA AGACGATATA TCAAAGGATG AGTTGAAGGA ACGTCAAATT ATGACAATGC TGTTGAAAAT CAAGAGTGGC ACGCCCCCTC AACGAAAGAC AGCTATGCGA CAGATTACGG ACAAAGCGCG CTCCTTCGGC GCAGGTCCTC TGTTTAATCA GATACTTCCA CTTTTGATGA GTCCAGCTTT AGAAGATCAG GAGAGGCATT TGCTTGTCAA AGTTATCGAC CGGGTCTTGT ATAAGCTTGA CGACTTGGTA CGTCCTTATG TTCACCGGAT TCTCGCTGTG ATTGAACCGT TGCTAATCGA CGAAGACTAC TATGCCCGTG TCGAAGGTCG GGAGATTATC AGCAATTTAG CGAAGGCTGC AGGACTTGCT ACGATGATTG CTACGATGAG ACCGGATATC GACAGTCCCG ATGAGTACGT CCGAAACACG ACTTCACGTG CATTCGCGGT AGTGGCAAGT GCCCTTGGTG TTCCGGCCCT GCTGCCTTTC CTAAAGGCTG TCTGTCAATC ACGAAAATCC TGGCAGGCGC GCCATACGGG CATAAAAATT GTGCAGCAAA TAGCTTTGCT CATGGGCGTT GCTGTGCTCC CCTACCTGAG AGAGTTGGTC GAGATAGTCA GCCATGGTCT TGTTGATGAT ATGCAGAAGG TTCGGATTGT CACAGCATTA ACCTGCGCAG CTTTAGCGGA AGCCGCTCAT CCGTACGGAA TCGAGAGTTT TGATCCTGTA ATTCGCCCGC TCTGGAAAGG TACTATGGAG CAACACGGAA AGGCCCTCGC TGCTTTTCTC AAAGCTGTTG GCTTCGTGAT TCCTCTCATG GAAGAGAACT ACGCTAGTCA CTACACCCGG CTCGTCATGC CCATTCTTAT TCGTGAATTT CACTCCCCCG ATGAAGAAAT GAAAAGGATT GTGCTCAAGG TTATCGAACA GTGCGTTGCC ACGGCTGGGG TCGAACCTGA CTATATTCGC ACAGAGATCC TGCCCGAGTT TTTCCGAAAC TTTTGGATCC GCCGTATGGC ACTAGATCGT CGCAACTACA ACCAAGTTAT TGAAACCACC GAAGAATTGG CAAACAAGGT TGGTTGCTCT GATATAATCA TTCGTATCGT GGATGATCTC 
AAGGATGACT CAGAACCTTA CCGACGGATG GTGATGGAAA CTTTAAAAAG AGTCTTGAAC AATTTAGGCG CTAGTGATAT TGACGAACGC CTGGAGGAAC GGCTCATCGA CGGTATTCTA TATGCCTTCC AGGAGCAAGC TGTGGACGCC AGTAGTACGG GTTCTAATTC TTTTGGCAGG GAAAGCCAAG TGATGCTGGA GGGCTTCGGA ACTGTTGTAA ACGCTCTCGG CGAGCGATGC AAGCCATATC TTAAGCAAAT TGCCGGTACC ATCAAATGGC GACTCAACAA TAAGGCGGCA TCTGTTCGTA TGCAAGCTGC TGATCTTATC GGACGAATCG CAGTTGTGAT GAAGGCCTGT GGCGAAGACC AATTGATGGG GCATCTGGGC GTTGTCCTTT ACGAATATCT TGGTGAAGAA TACCCGGAAG TGCTGGGATC TATCCTGGGT GCTCTTCGCG CCATTGTGAA TGTCATTGGG ATGACAAAGA TGACCCCACC TATCCGCGAC TTGCTTCCGC GTTTGACACC GATTCTGCGG AATCGCCATG AAAAAGTGCA AGAGAATGTC ATTGATCTGG TCGGCCGTAT TGGGGACCGT GGTGCAGAGT TTGTGTCGGC TAAGGAGTGG ATGCGTATCT GTTTTGAGCT ACTTGAAATG TTGAAGGCCC ATAAGAAGGC CATTCGACGT GCTGCAGTTA GTACGTTTGG TTTCATTGCG AAGGCAATCG GTCCGCAGGA TGTGCTGCAT ACTCTCTTGA ACAACTTGAA AGTACAGGAT CGGCAGATGC GCGTCTGCAC GACTGTCGCC ATTGCTATCG TTGCAGAAAC GTGTGGGCCA TTTACAGTCT TACCCGCTTT GATGAACGAA TACCGTGTTC CGGAACTCAA TATACAAAAT GGTGTATTGA AATCTTTGAG CTTCGTCTTT GAATACATCG GGGATATGGG TAAGGACTAC GTCTACGCTG TTACCCCGCT CTTAGAAGAT GCCTTGATGG AGCGCGACCC TGTGCATCGT CAAACGGCAT GCTCCATCGT GAAACATCTT TCACTCGGCG TTGTTGGCTT GGGATGCGAA GATGCACTAC TTCATTTGTT CAATTACGTC TGGCCAAACA TTTTCGAAGA AAGCCCTCAC GTTATCCAAG CAGTGTTTGA TGCAGTACAA GCACTCATGG TGGCATTGGG ACCCAACGTA ATCTTGGCCT ACACAATCCA GGGGCTCTAT CACCCCGCAC GGCGCGTACG GGATACATAT TGGCGTGTAT TTAACATGCT CTACATTTAC AATGCCGATG CTCTCGTAGC GGGGTACCCT TCCATGAGGG ACGAAGGTGG AAACACTTAC AAGCGCACTT CTCTTGAACT CTTTATTTAA
Protein sequence | MVSDTESVAS AKNKWDDVIS TGASSRRRKR WDETPVMAAS SSVAATPLVT TGCRSKWDET PVLASGGVGV IKTPTLAGAR NRWDATPLST QPLAGASQTP MGTPLDKAML LEREMESRNR PWTEGALDAI LPSEAYNIVR PPSTYIPLRT PGRKLLATPT PMSMTPAGFQ MEVPAEQRID ASVQDIREAY GSLPYIKPED MQYFGRLAEE VNEDDISKDE LKERQIMTML LKIKSGTPPQ RKTAMRQITD KARSFGAGPL FNQILPLLMS PALEDQERHL LVKVIDRVLY KLDDLVRPYV HRILAVIEPL LIDEDYYARV EGREIISNLA KAAGLATMIA TMRPDIDSPD EYVRNTTSRA FAVVASALGV PALLPFLKAV CQSRKSWQAR HTGIKIVQQI ALLMGVAVLP YLRELVEIVS HGLVDDMQKV RIVTALTCAA LAEAAHPYGI ESFDPVIRPL WKGTMEQHGK ALAAFLKAVG FVIPLMEENY ASHYTRLVMP ILIREFHSPD EEMKRIVLKV IEQCVATAGV EPDYIRTEIL PEFFRNFWIR RMALDRRNYN QVIETTEELA NKVGCSDIII RIVDDLKDDS EPYRRMVMET LKRVLNNLGA SDIDERLEER LIDGILYAFQ EQAVDASSTG SNSFGRESQV MLEGFGTVVN ALGERCKPYL KQIAGTIKWR LNNKAASVRM QAADLIGRIA VVMKACGEDQ LMGHLGVVLY EYLGEEYPEV LGSILGALRA IVNVIGMTKM TPPIRDLLPR LTPILRNRHE KVQENVIDLV GRIGDRGAEF VSAKEWMRIC FELLEMLKAH KKAIRRAAVS TFGFIAKAIG PQDVLHTLLN NLKVQDRQMR VCTTVAIAIV AETCGPFTVL PALMNEYRVP ELNIQNGVLK SLSFVFEYIG DMGKDYVYAV TPLLEDALME RDPVHRQTAC SIVKHLSLGV VGLGCEDALL HLFNYVWPNI FEESPHVIQA VFDAVQALMV ALGPNVILAY TIQGLYHPAR RVRDTYWRVF NMLYIYNADA LVAGYPSMRD EGGNTYKRTS LELFI
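The coordinates and lengths listed under Gene Information can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the gene span includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
# Minimal consistency check for the record's coordinates and lengths.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the gene span includes the terminating stop codon.

def gene_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons

# Values copied from the record above.
start_bp, end_bp = 46189, 49368
protein_aa = 1045

span = gene_span_length(start_bp, end_bp)   # matches "Gene Length"
cds = coding_length(protein_aa)             # coding bases incl. stop codon
non_coding = span - cds                     # bases inside the span that are not translated

print(span, cds, non_coding)  # prints "3180 3138 42"
```

Under these assumptions the span of 3180 bp agrees with the listed gene length, and the 42 bp not accounted for by the 1045-residue protein is consistent with the record marking the gene as spliced.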