Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45854 |
Symbol | |
ID | 7200960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 472830 |
End bp | 478008 |
Gene Length | 5179 bp |
Protein Length | 1689 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180245 |
Protein GI | 219118957 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0900508 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGGAC AACAGCCCAA CACTCCGTTC GGCGGAGGAG GTTTTGGACA ACAGAACCCA ACATCCGGTT TTGGAGCGCC AGCTCCAGCC ACCGGGGGTC TGTTCGGTAG TCCAGCGGGG GCTCCAGCGT TTGGACAAGC TCCTGCTCCA GCGTTCGGAC AGGCTCCTGC CCCAGCATTT GGTGCCCCTT CGGCTCCAGC GTTTGGACAA GCTCCTGCTC CCGCCTTTGG TTCCCCTGCA CCCGCATTTG GAGGTGGAGG GTTTGGACAA ACTCCTGCCG CCCCCACTGG AGGACTCTTT GGACAACCGT CTCCCGCACC AGCCTTTGGG GGCGGTTTTG GACAACCCGC TCCCGCACCG GCACCCACCT ATGGTGGTAT GTTTGGACAA CCCGCTCCCA CTCCCGCGCC TGGACTCTTC GGAGCTCCAG CTCCACAAAC CAATACTTCA CCTTTTGGCG GAAATCCGGG TGGTTCCGCT TTCGGGGCTC CGGCGCCTTC GGGCTTTGGT GCACCCACAT CCTCGCCCTT TGGTTCCACG GGGACTACTG GTGCCTTTGG AGCAAACACT ACCAGTTCGT TTGGCGCACC CGCACCTGCC GCTGGTGGCT TATTTGGACA ACCCGCACCG GCTCCAGCCT TTGGCGGTGG TACGTTTGGA TCGCCGGCTC CCGCTCCCGG CGCCTTTGGC GCACCACCAG CCTCGGGTGG GCTGTTTGGA CAACCCGCAC CGGCACCCGC AGCCGGCGGC TTGTTCGGAT CCCCCAGTCC CAGCGCTCCC GGAGCAGAGG GAACCCGGGC GGTCCCTTAT CAAGCAACGA ATCGACAAGA TGGCACCGCC ACCATTACTC TACGTTCTAT TACGGCCATG CCACAGTACG AGAACAAGTC GTTTGATGAA TTACGCATGG AAGACTTTTC GCAAGGCAAT CGCGGATCTA CTGTGACTCC ATCCACGAGT AACGCTTTTA GTGGAGGGTT CGGAGCACCG GCTCCCGCAC CTAGTGGCGG TTTGTTTGGC GCCCCCGCAC CAGCTCCCTT TGGCGCTCCC GCACCAGCTC CCTTTGGCGC TCCCGCACCA GCTCCCTTTG GCGCTCCCGC GCCGGCGGGA GGATTGTTTG GCAGTCCTTC TCCAGCACCG GCTCCGTTCG GGGCGCAGTC GAGTAGCTTA TTTGGCAGCA ACCCTGCACC AGCGCCCTTT GGTGCACCGG CTCCGTCGAG TGGTCTATTC GGCTCCAATC CAGCACCTGC CCCGTTTGGA GCCCCCGCTC CTTCCGGGGG TCTCTTCGGG TCTTCCCCCG CACCGGCACC TTTTGGTGCT CCAGCCGGCG GCGGGCTCTT TGGTAGCAAT CCAACGCCCG CGCCCTTTGG AGCTCCGGCA CCCAGTGGCG GTCTGTTTGG TGCCACCCCG GCGCCCTTCG GAGCTCCGGC TGGCGGTAGT CTGTTTGGAT CCAATACAGC TCCGGCACCG GGAGGCTTTG GCTTTGGATC ACCAGCCCCA GCGCCCGGTG GTAGTCTCTT TGGAGCACCA GCTCCCGCAC CGGGAGGCTT CGGGTATGGA GCGCCGGCTC CGTTTGGTGC TCCCACACCG GGTTTGTTTG GAGCCCCCGC GCAAGCTCCG CCGCCTGCCG CTTTGCCGCA AAACGCTGCT ATTATACCAC CTGTTGTCAA TGAAGTCATG GAACAGCAAT TGCGGGCAAT TGAAAATAAG CAGGCCGAAC TTCAGAAGAG TGAAGCCTGG AAGGGTAGCG CAACGAAAGA TCCAGTCACG ACACCTACGA GTTTGTCTGA AGCGGACGGA CTCTTTGCGT CGCGCTACTC GGCTTCGCCT TATGTTACGA CGACCCCTCG ATCAGCCGTC AAGATTCGAC CCCGTGGATT TCCCAGAAGT GAACCGAGCA AGTCGACAGC CCTTTCGCTC AGCGCTGTTG GGCGCGACAA TAGCGGTCTC TTGTCACCCG AGTCTCACTT GCGTTCATCC GTTATGAGTT TGCATATCAA GCCAGAGAGT ATGAATCGCA AATCCAGTTT CCGTTTACAG ATCAACAAGC CCTCAGCCTC TTCACCGGTG CCGACGCCAT CTGATCCAAA ACCGCAACAA CCTTCGTTCC TTTCTCCGAC CTTTGCTGAG ACTCTCACGT CGCCTCCACC TGATGTCTCT CCAACTACCG GGGCCTCTCT GGTGCATGAG TCGCCCCAGC CTTTTACAAC TCCAAATGCA TCCGCAACCC CCAAGAGCCC TGCCTATGAG TTGTACCAAC AAGTTATCGG CAGTGGAGAG GCGTCCAGCA AGCCTCAAAG TCAGGTGAAG AAGCCTACCC GTACCTCGGT ACCCACACTA ACGCGAAAGG GATATATCAT ATCGCCGACT TTGGAGGAAT TGGAAAAGAC TGAGGACGCC GATCTGGCTG CCGTTAGTGA CTTTAGTGTC AAGCGCCCGG GATTTGGAAT GGTGGAATGG GAAGGCGACG TGGATGTACG GGGGGCAGAC CTGGATCGGA TAATCACCAT TGATCAAGCA GATGTATCGG TATATCATGC CGATGAAGCC GAAGGTAGCA AGCCCAAGGT GGGCTCCAAA TTGAACCGTC CCGCTATTAT TACTTTCTAC AATATTTTTC CGAAAAACGG TGGGGCCAAT GCATCCAAAG AAGAAAAAGA AAAGCACGCG AAGAAAGTAC AGCGCAGTAC TGCCAAGATA GGCGCCGAGT TTATGTCCTA TGATCGCAAC AATGGTGTCT GGAAAATCCG AGTTCTCCAT TTCAGTCGCT ACGGTTTGGA TGACGACTCA GACACCGAAA ACGAAGTGCC CCTGCCGGAG CAAAACAGCG TGCAGTTTCA ACAACAGACA CCTCCAAACG CTCAATCTCT CCTGCGTCGA AATCCCACAC CATACAAACC GAGTCGAATT CAATTCGACG AAATGGAAGT TTCCGAAAGC GCTGATGACA GTGATGTAGT TTGCGTCCAG GACATTCAAA TGACGGACTC GGAAAAAATC GCCTTGGTTC AAAAACGCGC TGATGAAGCG GCTAAGGAAG TCTTTCACAT TGTACCACAA CAAGATCCTG ACTATGAGGT GCAACATCCA CTACGGCCCG CGAAGATCAC AACGTTTGAG AATGTCGGCG ATTGCGATTC CGAGGAGGAA TCTGACTATG TGGTACCACC CGATGGCGAA GACTGGCATG CAGCTCGATT GGCATCAAGC TTCTGCAGAG GTATTGCAAT TGAATCAGGC ATGCATTCTT CCTCTACTGA TATGGGCCTG CGTATGGGGC GAGTCTTTCG ACCTTGTTGG CTTCCTAACG GTTCGCTTCT GAAGTTAAAG CCAAGCAGTT TCAATCGGTC GCCTACACTT ACCTCTTTGC GTCCTGTCCT TTCTGACTCA TATATTTCTC AACTCACTAG TGAACACCTT CTGGAAATTC ACCGTTCAGA GTCAGTAGCA CTCGAATCGC AAGACGGCTG TCCCTTGTTC AGCCTCCCAC GAGCTCTTCA GAACAAAGGC TCTTTGATGT CTCATAAAGC GTTGTATGAA ACCGTCACCA AATTCCGCTC TGTCCGCAAC GAAAATAATG AAGTCCAGTC AGCTTTTGAC CTGATAGCGC GTTTGATGGA CAGTGAATCT TTTCCTCCCA CAGAATCAGT AGATGGTGTT CGCTATATTG CAAACTCAAT ATCCTTCGAC TCTCGAAAGA ACACTGCCGT GCTCGCTTGG CTTGTTGACG TTTGCGCGCC ATCTGTTGAC TCTGAGATTG CTGAAGCAAA GCTTCGAAAT TTCAATATTC TTGCTATATT TGCGGCTTTG GCTGGTGGAG ATGTCGACAA AGCCTGCATT GTAGCAATCA GCTCTGGTCT CAACAATCTC GCTGCGATTC TCGCAAGTGG TTCGGAGGGC AGGAAGGATG TACTTTTGAG TGTGAGTAAG CTTGCCGAGA GCAATCATGC GTCATCAGTG CCAGCGGAAT TGATTCGCTT AATGAAAGAA TCTGGGGGAG ATGTTCACTC AGAGTGCACT TTGTACAAAC AAGGCTCTAG TTCTCTCGAC TGGAAACGCC GTCTTGCTCT TCGCCTCTTG CAAGATAGCG ATAAAAGCCT CGTGCAACTG TTGTGTCAAT ACGAAAACGA TATTTCGTCT AACCACGCAC CACCACCAAA TCTTCCCCAT TCACAACAAA CAGACATGAG AAGCCTCACT TTCCAGCTTC TTAAGTCATA TTCGGTTCCA CAGTCCATGG AAGTTTCTGA TGTAGTACAT CCTCTTGGGT TCTCTTCAAT GAGCCACGAT TTTTCACTGG TCTTTCATCT GACAGCCTTG ATTTGTGCGA CGGGCGCTAC GAAAAACGAT TCATTCAAAA CTGAATATAT CCTAAACAGT TTTGAAGCCC AGCTGATACA AGCTGGGCGC TGGGATCTGG CTGTGATCGT TTGCTTGTCG GCGATAGGTG AAATGTCTGA AACCTTGCAT CATTGGAAAG CTCACCGCGC AAAAAGTCTG ATTTGTCGAT TTTCATCGGA CAATGATGGT AAGCGTCTTT TTCTCGAGGA GACTGGAATC CCTCGCCGGT GGTTTGAAGA AGCCCTCGCC TACCGAGCTC TATACCGAGA GGACGCCTTC GGCTTTGTTG TGCACGGATT GGAATGCGAT CTCAAATCTG CGCAAGATGT GCTTGAAGGT TGTTGGTTGC CCAACCTATT TTTCTTGAGC TTGAAGGACA TCCGTACGTT AATGGAGCGA ATCGGTATGG CTTTTTCTCC CAATTCCCTT TCGGCTGCAA TGCACCGCTT CTTTGACTTG AATGACGGGG TAAATCTACT CATTGGGAAA AGTCAGGACG AAATAGAGAG CATTGTGCCG TCTTTGATCG AGTCGTGCCA GGGTATTGAA CGTACGCTTG TCTCTACAAA ACAGCATGGT CTAGAGTCTA ATCGTACCAC AAGCTTGCTA ATGCGAGAAA ATTCGATACC GCTTCAGTCG ATGATTTCTG AAGCTTTGGA GCATCTCAGT TTTCTACGTC TCCAGTTGAG AGCTATCGGG GAAGCCAGGC TGACCTCGAA GTCGACGTAA ATCCGAGGCT GGAGCCAAAG AAAGATGCGC AATGCGCAAG GAATAGCACA TAGTTACAAT CAGTCAGACA TCGCATTATG CGGAAATTCT CTTAAAGACC TATCTCATC
|
Protein sequence | MFGQQPNTPF GGGGFGQQNP TSGFGAPAPA TGGLFGSPAG APAFGQAPAP AFGQAPAPAF GAPSAPAFGQ APAPAFGSPA PAFGGGGFGQ TPAAPTGGLF GQPSPAPAFG GGFGQPAPAP APTYGGMFGQ PAPTPAPGLF GAPAPQTNTS PFGGNPGGSA FGAPAPSGFG APTSSPFGST GTTGAFGANT TSSFGAPAPA AGGLFGQPAP APAFGGGTFG SPAPAPGAFG APPASGGLFG QPAPAPAAGG LFGSPSPSAP GAEGTRAVPY QATNRQDGTA TITLRSITAM PQYENKSFDE LRMEDFSQGN RGSTVTPSTS NAFSGGFGAP APAPSGGLFG APAPAPFGAP APAPFGAPAP APFGAPAPAG GLFGSPSPAP APFGAQSSSL FGSNPAPAPF GAPAPSSGLF GSNPAPAPFG APAPSGGLFG SSPAPAPFGA PAGGGLFGSN PTPAPFGAPA PSGGLFGATP APFGAPAGGS LFGSNTAPAP GGFGFGSPAP APGGSLFGAP APAPGGFGYG APAPFGAPTP GLFGAPAQAP PPAALPQNAA IIPPVVNEVM EQQLRAIENK QAELQKSEAW KGSATKDPVT TPTSLSEADG LFASRYSASP YVTTTPRSAV KIRPRGFPRS EPSKSTALSL SAVGRDNSGL LSPESHLRSS VMSLHIKPES MNRKSSFRLQ INKPSASSPV PTPSDPKPQQ PSFLSPTFAE TLTSPPPDVS PTTGASLVHE SPQPFTTPNA SATPKSPAYE LYQQVIGSGE ASSKPQSQVK KPTRTSVPTL TRKGYIISPT LEELEKTEDA DLAAVSDFSV KRPGFGMVEW EGDVDVRGAD LDRIITIDQA DVSVYHADEA EGSKPKVGSK LNRPAIITFY NIFPKNGGAN ASKEEKEKHA KKVQRSTAKI GAEFMSYDRN NGVWKIRVLH FSRYGLDDDS DTENEVPLPE QNSVQFQQQT PPNAQSLLRR NPTPYKPSRI QFDEMEVSES ADDSDVVCVQ DIQMTDSEKI ALVQKRADEA AKEVFHIVPQ QDPDYEVQHP LRPAKITTFE NVGDCDSEEE SDYVVPPDGE DWHAARLASS FCRGIAIESG MHSSSTDMGL RMGRVFRPCW LPNGSLLKLK PSSFNRSPTL TSLRPVLSDS YISQLTSEHL LEIHRSESVA LESQDGCPLF SLPRALQNKG SLMSHKALYE TVTKFRSVRN ENNEVQSAFD LIARLMDSES FPPTESVDGV RYIANSISFD SRKNTAVLAW LVDVCAPSVD SEIAEAKLRN FNILAIFAAL AGGDVDKACI VAISSGLNNL AAILASGSEG RKDVLLSVSK LAESNHASSV PAELIRLMKE SGGDVHSECT LYKQGSSSLD WKRRLALRLL QDSDKSLVQL LCQYENDISS NHAPPPNLPH SQQTDMRSLT FQLLKSYSVP QSMEVSDVVH PLGFSSMSHD FSLVFHLTAL ICATGATKND SFKTEYILNS FEAQLIQAGR WDLAVIVCLS AIGEMSETLH HWKAHRAKSL ICRFSSDNDG KRLFLEETGI PRRWFEEALA YRALYREDAF GFVVHGLECD LKSAQDVLEG CWLPNLFFLS LKDIRTLMER IGMAFSPNSL SAAMHRFFDL NDGVNLLIGK SQDEIESIVP SLIESCQGIE RTLVSTKQHG LESNRTTSLL MRENSIPLQS MISEALEHLS FLRLQLRAIG EARLTSKST
|
| |