Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_11088 |
Symbol | |
ID | 7197725 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 756420 |
End bp | 759840 |
Gene Length | 3421 bp |
Protein Length | 1086 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178570 |
Protein GI | 219115549 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAATCG AAACAGAGTC TGGTGCCGTA TCGGGAACTC GAGGAACTGC CAAACGTGAT ACGTTGCGAG CGAACGAAGT CGCTGTGCAA AAGATTTGGG ACGAAGAGAA GGTGTTTGAA ACGGAACCCG ACGATCGTGA AAGTTTCATG GTGACTTTTC CATACCCTTA TTCGAATGGT CATTTGCACA TTGGACATGC ATTTTCCTTG ACGAAGGCAA TTTTCCGAGC TCAATTCGAG CGCCACCGAG GCAAGAATGT GCTGTTTCCA TTTGCCTTTC ATTGCACAGG CATGCCAATT CAAGCGGTAC GTTGATGAAA TCCAATCCTT TTCCTTTGCA GATTCAAAAT TCTGACAAGT TTCCGCGGTT TACCAAAACT AGGCAGCGAA TAAGCTAACA TCGGAGATTG AATTGTATGG ATGCCCTCCT CAGTTTCCGG AAGCCGATCC TGAAGTCCGT GCCAAGATGG AAGCAGAAAT TGCCGCCTCG AAAGAAGCCA AGGCGGCCCA ACCTGAAAAT AAGTCAAAAG GGAGCAAGAC AAAGCTTGTG CAAAAGACCG GAACGGGAAT TGTGCGACAG TGGAACATTC TCCTCAAAAT GGTTCCGGAA GATGAGATTC CGGCATTCCA GGATCCTTTG CATTGGCTGA GCTACTTTCC TCCAATTGGT GTAGACCACT TACACAACTT TGGAGCAGGC GTGGATTGGC GGCGGGCGTT TATAACTACG TATGTCAATC CATACTATGA TGCATTCATT CGTTGGCAGT TCGAGGTATT GAAAGAGAAA GGCAAGATCC TTTTTGGGAA ACGTAACAAT GTGTTCTCGC TGGTTGACGG ACAAGTTTGT GCAGATCACG ATCGTTCGGA AGGTGAAGGT GTAGGACCTC AGGAATATGT CCTGATTAAG CTTAAAGTGC TGCAGCCAGG CCACGGACAG TCCCGCCATG CGAAGATGGA AGCAATTTTG GCGAAATATG ACCAGCCTGT ATATTTTGTC CCTGCTACCC TGCGCCCTGA AACTATGTAC GGACAAACCA ACTGCTTTGT CCTCCCTGAC GGTGAATACG GTGCCTATAT GATTGATGCA ACGAACGAAA TTTTTATCAT GAGTGCGAGA TCGGCTCGGG GACTCTCATG CCAATCATAC CAGGGAAATG AATACTTTAC CAAAGAGTTT GGTAAAATCC TGTGTCTCGA AACGTTTACA GGTAGCGAAT TGCTGGGCTT GCCGTTAAAA GCTCCGATGG CGAAGTACGA TAAGATTTAT ACCCTTCCAT TGTTGACGAT AAGCATGGGC AAAGGAACTG GAGTGGTGAC ATCCGTCCCT AGTGACGCTC CCGATGACTT TGTTTCACTT AAAGCGCTAC AGGACAAACC CGACTTTCGC GCGAAATACG GCATTACGGA CGACATGGTC ATGCCGTACG AAGTTGTTCC AATCATCACA ATAGAAGGAT ACGGCGATGC GTCTGCTGTC TTTATGTGTG AAAAGCTAAA GATTACAAGC TTCAACGATA AGGCTAAGCT TCAGCAAGCC AAGGATGAGA CATACCTTAA GGGTTTTAAT ATGGGAATTA TGAAAGTCGG AAGCCATTCC GGAAAGAAGG TCAGCGATGC CAAACCAATT ATCAAGCAGG AATTGATTCT TGCTGGCCAG GCGTGTCTGT ACTTTGAACC AGAATCTAGG GTGGTTTCCC GAACGAGTGA CGAGTGTGTT GTCGCTTCTA CAGATCAGTG GTATCTGGCA TATGGAGAGG AGTCGTGGAC TAAAGCTGTT AAAAAGCACG TCTTGAACTC GGATAATTTC AACGCATATG ACCCGGCTGC TTTGCACAAG TATGACTACA CCATTGGCTG GTTGCAAGAA TGGGCGTGTA CTCGGCAGTT CGGTCTGGGA ACATTCTTGC CATGGGATAG AGCCTGGGTT ATTGAGAGTT TGTCGGACAG CACTATCTAC ATGTCCTTTT ATACTATCGC ACATTTTCTC CAGGGAGAAG GCAATTTGAC AGGTGACAAA TCAAAGTCTC CTTGCAGCAT TGATCCTGCA GATTTGTCAA ATGACGTATT TGATTTCATT TTTCGCAAAG GACCATTGCC AAGTGACTCA AATATCCCGG CCAAGACATT AGAGAAGATG CGTACCGAGT TCCGTTATTG GTACCCGATG AATTTACGTG TGAGCGCGAA AGACTTGATT CAGAACCATT TGACTATGGC GCTTTTCAAT CATGCCGCTG TTTGGGAGGA GGAACCAGAG CTATGGCCCA AAGGATACTA CTGCAATGGG CATGTCCTTG TGGATGCCGA GAAGATGTCA AAGTCGAAGG GGAACTTCTT GATGATGAAT GATACAATAC AAACGTATGG TGCGGATGCC ACCCGGTTTG CCTGTGCCGA CGCTGGCGAC TCGTTAGATG ATGCGAACTT CAGCCGAGAG ACAGCCGACG CCGCCATTCT ATCTCTTATC ACGGAAGATG CATGGATAAG TGAAACGCTC ACATCTGTGG ACCTTCGATC CGGCGAAGAA AATCTGATCG ATAAAATTCT GTTGAACGAA ACAAACCGCC TAATTGCTAG TGCTGGTAGT AATTTTGCTC GAATGCAATT CAAGGAAGGA CTTAAGGAAG GATGGTTTGA GATGCTCAAC GCCCGCAATG ACTATCGAGC CTGGTGCAAG GATAGCGGTG TTCCAATGCA CAAAGGTGTA GTTCTACGGT GGGCAGAGAC CATTGTAATT TTGATTTGCC CGATTTGTCC GCATTGGTAT GTATCTCGTT GAAAACGAGA ATTACGATCG CTTCATTGAA TTTGCATCGT CTTACTCACC TTCTGTTGGT GTATCTACAG GTCCGAGAGG ATATGGAAAC AGATCGGAAA CATCGGGTTG GCCATTCGGG CACCATGGCC GGTGGCGGAG GAAGAAGACA AAATTTTGAC TCGACAAGCC AAGTTTTTGA GAGATTCTAT AAAGCACTTC CGTTCCCAGG CTGGGAGAGC AAAGAAGGGG TGGATGAGGG CTTCTATTCT TGTAAACGAT AGCTATCCTC AGTGGAAGAT TGATACGCTT GTATGGATGC AAGGCCAGTA CGATGTATCG TCTGGATTTT CTCCAGGATT CATGAAAGAC CTAAAGGATT ATACTGCAAA GTTTGTGAAA GACAAGAAAT TGATCAAGTT TACAATGCAG TTCGCCTCAT TCATGAAGAA AGAAACAGAG GATGTAGGTG ACGCCGCGCT CGACGTTCTG CTTCCCTTTG ACCAGAAAGA GATACTTCAG GTAAGTATTG AATACATTAA GGCGCAGCTC AACATCGAAG AACTTGATAT TATCCAGCTG GGTGTGGAGG AAGCACCTGA AGTCCCAGAA AGAGTCCGGG AGAACGTAAC GCCAGGAAAG CCGTCTCTTT ATATCCGCTA A
|
Protein sequence | MTIETESGAV SGTRGTAKRD TLRANEVAVQ KIWDEEKVFE TEPDDRESFM VTFPYPYSNG HLHIGHAFSL TKAIFRAQFE RHRGKNVLFP FAFHCTGMPI QAAANKLTSE IELYGCPPQF PEADPEVRAK MEAEIAASKE AKAAQPENKS KGSKTKLVQK TGTGIVRQWN ILLKMVPEDE IPAFQDPLHW LSYFPPIGVD HLHNFGAGVD WRRAFITTYV NPYYDAFIRW QFEVLKEKGK ILFGKRNNVF SLVDGQVCAD HDRSEGEGVG PQEYVLIKLK VLQPGHGQSR HAKMEAILAK YDQPVYFVPA TLRPETMYGQ TNCFVLPDGE YGAYMIDATN EIFIMSARSA RGLSCQSYQG NEYFTKEFGK ILCLETFTGS ELLGLPLKAP MAKYDKIYTL PLLTISMGKG TGVVTSVPSD APDDFVSLKA LQDKPDFRAK YGITDDMVMP YEVVPIITIE GYGDASAVFM CEKLKITSFN DKAKLQQAKD ETYLKGFNMG IMKVGSHSGK KVSDAKPIIK QELILAGQAC LYFEPESRVV SRTSDECVVA STDQWYLAYG EESWTKAVKK HVLNSDNFNA YDPAALHKYD YTIGWLQEWA CTRQFGLGTF LPWDRAWVIE SLSDSTIYMS FYTIAHFLQG EGNLTGDKSK SPCSIDPADL SNDVFDFIFR KGPLPSDSNI PAKTLEKMRT EFRYWYPMNL RVSAKDLIQN HLTMALFNHA AVWEEEPELW PKGYYCNGHV LVDAEKMSKS KGNFLMMNDT IQTYGADATR FACADAGDSL DDANFSRETA DAAILSLITE DAWISETLTS VDLRSGEENL IDKILLNETN RLIASAGSNF ARMQFKEGLK EGWFEMLNAR NDYRAWCKDS GVPMHKGVVL RWAETIVILI CPICPHWSER IWKQIGNIGL AIRAPWPVAE EEDKILTRQA KFLRDSIKHF RSQAGRAKKG WMRASILVND SYPQWKIDTL VWMQGQYDVS SGFSPGFMKD LKDYTAKFVK DKKLIKFTMQ FASFMKKETE DVGDAALDVL LPFDQKEILQ VSIEYIKAQL NIEELDIIQL GVEEAPEVPE RVRENVTPGK PSLYIR
|
| |