Gene Information |
Locus tag | PHATRDRAFT_47270 |
Symbol | |
ID | 7202357 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 140437 |
End bp | 143782 |
Gene Length | 3346 bp |
Protein Length | 897 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181665 |
Protein GI | 219122672 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACATAGATG CAAGGCAGAA AAGTACCGTG ACGACCATGG CAACTCCGGA CCAAATATTG GCAGCTTCTC GCTACCGAGC GGGAGGTGGT GCTCTTTTTA CGAGTTCCAA GTGCCCGCCG GGTCCAGGCC TTGAAAATAG TTGCCAACTT CCTTTCGGGT TCATCTACAC TCCACTTTCT CCTCCCGATA ATATTCAAGT CGTGCCTATC CACGACGAAA ATCTGCCGCC TGTAATCTGC TTAACGTGCC TTTCGTACTT AAACTTGTAC TGTGATGTGG ATGAAACAAC AGGCGTTTGG ACGTGTGCTC TTTGTGGATG CAAGAATGCT GCGCCGCCGG AAAGCTTTCA CAACGGAACG CTTTCCCCAA TATTGATATC TCCCATTGTA GAGTTTCGGC AGCCCATTGC GGAGGCGCAT GATCGTGTGA ATACCATTTC GGTGGTGGTT GTGATGGATG CCAATCTTCC ACGAGCTGAG GCGCAGGCGG TAGGATCGGC GTTGCAAGCT ATCCTTCCCG AAATGGCCGA CGCGAAAACA CGAATCAACC TTGGATTTAT TGTTTTTTCT AAACATGTGT CAATTTACCA ACTTAACTCA ACCGGTGTTG CTTCCGCTGA CATATTTTCA ACCCATGAAG GACTCACCGA GAAGCACTTG GAATCAAGAC AATACCTCAC CGAAATCGGA CAGGATGGCA GCCTAGAATG CATGTGGCGA TGCCTGTCCG CTGTGTACGG AGTTGTGTTG GATAATGAGG AGGGAAGCGA AATAAACGTT TCCAAGGGAA AGCAACTATC TCGATTGGAA CAACTAAAAC AACGCAAGGA GACACGAATG CGTAAAGAGC TTGAGAGAGA CGACGACGAT CCAGATGTTG TAGTCAAGTC GCCTTGGGTG TTGGCGAAGG AAAACAGTGC ATCCAGGCAC CCTTTGCGGT GTACAGGAGA AGCTATACAG TGTGCCATTG ATCTCGTCGC CTCGCCTTCC AATGTCGATT TGATCGAGTC GAGAATTTTG GTCTTTACTA ACGGATGTCC CAACTATGGG GATGGCAGTG TCGTTTACGA CGATAGAGAC ATGACAACGA CAGCACGAGC CCGTCCAACA GCAGATGTGG TTGATCCTCT CAAGCTGTCC GGGGCGGTGG AGTATTTTAG CATTATCGCA AAGGCAGCCG TGGAAGGGGG TATTGCCATT GATGTATGTT GCTCTGGTGC GTCTGAGCTT TGCCTTCCAG TGTTCCAGGC GTTAGTCGAA CCAAGCTCGG GCTATGTTTT GCCTCACGAA ACCTTTGCGG GACCGCATCT GAAACACAAT ATGAACCACT TTCTGAAAGA AACAAATATG ACAATGGCAG CGTGTAGCGA AAGTCAGGCG GAAAAGTCGA AAAGCCTAGC GCCTTCTGGC TGTACAATCG ATATCCGTAT GCCAAGGTAA GTTGCCGGCC AGTGAGTGTT CGATCTCCTT TTGAGCGGTA GTATCTAAAT GAAATTGAAC AGCTTTGTGA ATCCCACACA CCTGGTTGGT CCGGGTGAGA TTCTTGATGA CTTCAAAGGT TTGTTGCTGA ATGAGCGTTC TGCTTTCGCC GCCGGATGTA AGCTAGCCGC TCGTATCGGA ATGAGAACAA ATCATTTACC CCAGAAAGAT TTTGTTGACG ACGCAGTGAC CCGACTATCA ATGGGAAGAA AAGATCCGTT GTCAACTTTT TCCGTTATGC TTGAGATCAA CAATTTCTTC CAGAAAGACG CCTTTGCTTT CGTTCAGTGC ATTGCTCGTT TCGTCGACCG CAGAGGACAA 
ATTCTGATAA CGAGAGTGTT TTCACATCGA ATTTCTATTG CAAACGACGT CGGTGAGTTC TTGGATTCTA TTGATGAGGA GGTGGTGCCC GTGGTCCTTG GAAAAGAAGC TGTCTATAGG TCAATGTATG GAAGAGAAAT TGACGCCAGA AACGAAGACG AAACAGAAGT TGCAACCTCT GATGAACTCG ACGATCTTGC GTATGATGCA CAAAAAGATC TTGACGCGAC AATTCATCGC ATATCCGTTG CTTTTCGCTT GCTTGGACTG GAACAAGGAA ATCGTGGGTA AGTGGACATT TGTGTAAATT TGCATTGAGT CGGCAACTAC TCTTACATGT TGCTACTTGC GACGATCCTC CAGATTGGAT CTTACCGAGG AAGGGGGAAT TCGCACAGTA GGGTCTTCGA TTGATTTCGC GTTTCCCCCC GAGCTTTCGG ACGCACTGCG TCGTCTATAC CACCTGCGAC GTGGTCCTCT GCTGAGCCCC GGCCCGATGC GATCGGATGA CGATCGGGCG CAGATCCGAT CTCTTTTCCT TCGCTTACCC TTGGAGGATT GTCTGTGCAT GTGTGCGCCC TCTTTGTGGC GCACGGAGGT TACTCCAGAG TGCAAGTCTG ACTCTGTAGA GTGGATCGCT GTTCCACCCG AATCTCTAGC TTTATGGGAC AAAGTGGGTA ACGTTGAGAT GGAGTGCGTA TTTACAATGA TGTGGCATGA GAACTAACAC TCATGCTTGT CGGTCTCTTC GTATAGACGG CTATTGTGGC AGACTGCTAT CATAGTCTTT TCATTTGGTT TGGGAGGGGG GTACCAGAGT CTTTCTTTGA TTCGATTCGG CAGCAGGCGA GGACATATTT ATTGGATCGT TCTGTTATAC GCTTTCCGAT GGCAGAGATA TACACTGTTT CCGAGGGCGA ATCCATGGAT CGTAGATTCA CTGCTCTTCT AGCTCCCTCG TACGGGGATC CTGTCGACCA TCAAGTAGCA AATTTTCCGG CGCTTGGTCA GTTGTCACCA CAAGAGCTCG AAAGTCTTCG TTGTAAATTC AGATTTTACG ACCCTACTTC AGACCCAAGT TTCCGAACGT GGTTCTGGGA CGTTGCGAGC GCAACTAGCT CAAGTAAAGA GTTCGGTCTG TCGCTCTGCG AGTAAGGAAC TTACTTAAGC CTATTCACTT CCACACGTGA GGGTCCGATA AGAAGATCTC TTTTTGGTCT AGCAACATTT TCCTCGCAAG TCGGCTCCCG TAGCTAATGA TGGCTGTAAG TGGTACGGAG CTTGAAGTTG CCGTGTACCT GAAAAGCATT TTACCTACTG TGTGAGATGG AAGTCTCGCA CACGTAAATG GGTAGAACTA AAGCTTCATA CGCACAACTC AAATCCTTGT CCATTCATTC CAAGCATAGT TATCTTTCCG CTCGTAATGT ATGAGAGACC TTGCAGATTT GGAGGTGCTT CGGCAAAAGC TATTGCTGGA CAGAAGCAAC CGTTTTCTGA GGAGGTCAAA ATATCGGATG GCGCGGCATC GTCCGAGATC ATATCC
|
Protein sequence | MATPDQILAA SRYRAGGGAL FTSSKCPPGP GLENSCQLPF GFIYTPLSPP DNIQVVPIHD ENLPPVICLT CLSYLNLYCD VDETTGVWTC ALCGCKNAAP PESFHNGTLS PILISPIVEF RQPIAEAHDR VNTISVVVVM DANLPRAEAQ AVGSALQAIL PEMADAKTRI NLGFIVFSKH VSIYQLNSTG VASADIFSTH EGLTEKHLES RQYLTEIGQD GSLECMWRCL SAVYGVVLDN EEGSEINVSK GKQLSRLEQL KQRKETRMRK ELERDDDDPD VVVKSPWVLA KENSASRHPL RCTGEAIQCA IDLVASPSNV DLIESRILVF TNGCPNYGDG SVVYDDRDMT TTARARPTAD VVDPLKLSGA VEYFSIIAKA AVEGGIAIDV CCSGASELCL PVFQALVEPS SGYVLPHETF AGPHLKHNMN HFLKETNMTM AACSESQAEK SKSLAPSGCT IDIRMPSFVN PTHLVGPGEI LDDFKGLLLN ERSAFAAGCK LAARIGMRTN HLPQKDFVDD AVTRLSMGRK DPLSTFSVML EINNFFQKDA FAFVQCIARF VDRRGQILIT RVFSHRISIA NDVGEFLDSI DEEVVPVVLG KEAVYRSMYG REIDARNEDE TEVATSDELD DLAYDAQKDL DATIHRISVA FRLLGLEQGN RGLDLTEEGG IRTVGSSIDF AFPPELSDAL RRLYHLRRGP LLSPGPMRSD DDRAQIRSLF LRLPLEDCLC MCAPSLWRTE VTPECKSDSV EWIAVPPESL ALWDKTAIVA DCYHSLFIWF GRGVPESFFD SIRQQARTYL LDRSVIRFPM AEIYTVSEGE SMDRRFTALL APSYGDPVDH QVANFPALGQ LSPQELESLR CKFRFYDPTS DPSFRTWFWD VASATSSSKE FGLSLCE
|
| |
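The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends with one stop codon that is not counted in the protein length. All variable names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 140437
end_bp = 143782
reported_gene_length_bp = 3346
protein_length_aa = 897

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one stop codon follows the last amino acid and is not
# counted in the protein length, hence the "+ 1".
coding_bp = (protein_length_aa + 1) * 3

# Whatever remains of the gene span is not translated: introns
# (the record marks the gene as spliced) plus any untranslated flanking bases.
untranslated_bp = gene_length_bp - coding_bp

print(gene_length_bp == reported_gene_length_bp)
print(coding_bp, untranslated_bp)
```

Under these assumptions the computed span matches the reported Gene Length, and the difference between the span and the coding bases is consistent with the record's "Is gene spliced | Yes" entry.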