Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48134 |
Symbol | |
ID | 7203482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 293094 |
End bp | 295934 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182508 |
Protein GI | 219124433 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.252126 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGTTT CATCGAGCTG TCCTTACCCA TCTTCGGCTC ACGGAACGGC GAGTCGACGC GGCCACAAAA AGATGCGCCA CATCAAGAAA GAGCAAGCAC CAGCTGTACC GACCATAGAA ATTCAGACAG TCCTGCGTCT CCGCCCGCTC CTGAAAAAAG AACGGGACGA CCACATCGTC ATTGAGCCGC TACTTCAAAA CAACGTTCCA TCACGGTCCG TTGCCTTGCA TCCAATTCCT CGTCCAGAAG TAAGTGCAAG CAGTCAGCTT GTGCGCCAGA ACTTTTCGCC AGAAACAGTC GTCACGTCGC ACGATTTGGA GTTTACGACA GATCATGTGT TTTCCTCTGA CACCTCTCAA GAAAAACTGT ACTTTTCAAT CGGCTCGCCC ATAGCATTGT CATCTATGGA TTCGCTGCTG AAAGGCTCTA AGAAGACACA CATGATAGTA ATTACTGGAT CTAAGAATAG TGGTAGATCT TTCTCGACCT TTGGTCCAGT TTCCAAAAGG AAACATGCCT CAGACGGTTT GGTACCTCGG ATACTCGACA GTCTTTTTAG TCAGGCGCGT CATCAAATGA AACACGCGGC AGGGAACTTC GCAGTCAATA TTTCCATCAT GCAAGTTCAT GATGAGAAGA GCGACGAATG TACGTTGCAC GATCTCCTAC AGCCCATCAA AAGCGGAGGA ACTCCAAGAA AGCGGGCGGC GACTGGCTTT GCACATTGGA GTGGTACCGG AAACGCAAAA GCCAACGCCG TGAAATCTCT TGTGGCGAAC TTTGAGCGGC AACAATGCGC TACGCGTGCT ACCAAGACCA TTCCCATGAG GCGTTCGCCG GCTCGTGGAT ACAATTCAAT GAAAGAACCG GTCTTTGTTG AACAGGATGC CAACACTTCC GATTTTCAGA TAATCAATGG TCAGATCAAA GCATGCTACA ATACCGAAGA AGCTCGCGAA ATACTGATTG CTGCAACAAA GCGGTCCGAA AAGATGTCAA AAGGCCATTT GCTCTCATCG CACGTACTCG TCACTATGGA AACATCTTGG ATGGGCAAAA ACGAGACAAA ATTTGGAGGA ACGATTGTCG TTCTCGATCT CGCAGCACCC GCTTTTCAAG ACAATACCAC ATCGATGCCC AATCATAGAC GCGTCAAGGA CAGTATCCAG ACTAGACACG ATGCGTATTG GGCTGTTCGT AAATGCTTTG AAACTCTGCA GTATAATCAA ACTGTAGCCT CGCTTATGTG CGGTGATGGC CGGCAAGAGA GCTTGAAAAA GGTCCCGTAC CGTCAGCACA TGTTGACCAT GCTTCTACAG CGCTTTTTTA AAGGAACCAC GTCCATCACG TACATGCTCA CGGTGTATCC AGGTCACACC GACTATGCTG AAAAGAAAAA TCTTTTGCAA GACCTACAAA TTTTGTCTAC ACGCGAGCCG GGGCAATCGG CTATGACCGG ATTGCGTAGT CGGCGTCCCA GTGTGGAATC CGGAGGAGAT CAAAAGCCAA ACAACCAGCT AGCACCGCCC TTGTCCATGG CTTCACATGC TACCAACCGT GGATCGCAGC CTGTAGATCG GCAGATATTA AAGAAAGCAA TCTTGTTGAC ACCGATGAAG GAAGACATGA TTGAGATCCC AAACATTGCC AAGAATCCTT CCCTGACGTA CAGTGACTCA ACACAAGAGA AAATTATGGA CGACGGGGAT GAGCGGAGAT TTATTGACGA TGAATATCTG GTGCCTTTGC CTCCCCCACC TATTGCTCCG AGTTTTTCCG CTCGGCAATT GCCTCAGTAC ATACTGTCGC CCGACGCCAG CGCGCCACCG GAAGACGGAA TCGAAGCAGG CCTAGCCCCG AGTTTGGTGA CAACCCCAAG ATTGGCCACG GTACCATCCA CTTTTGACTT TGCCCGCGAG CCCAAATATG ATCTCTGTAC TGCTGAACAT TCTCCTCGTC CCGCAGAGCC AGACTCGCAT CAGACAGACT TCGTCTGTGC AGTAGAGGAG CGCCAGTCCC CTACGGCTCA TCCAGATCCA TCCTCACTGA TCGCAAACCA GCCGCGCTCT CCAATACAAG AATTGGCACA GTCGGTTGAC AGAAAGCACC GCAACGTATT CAATTCCAAG AGCTTTCGCC CAGTAAAGAC ATTCAACAAA GTTGTACTTG CATCAAAAAA GAAGGCCAGC CAAGTAATTG CATCGCTGGA GCAACGTCCA TTTGTGGATC CGGAAGTAAA GGTTGGCCCT TCGAATTTGT CTCTTTCTGA CACACATGCA GCGCCAGTTG ATGACAGAGA ATACATACTT GAGCTCGAAC GAAAAAACTC CATGTTGGTG GAAGAAAACG AGAGACTTCG AGCACGCAAC GAACAACTTG AGATCGAAAA TAAGTCAATT GTGTCACGCG GCTGGGACTC AGTACTTTCA AATGACGAAA ACATTAAGCC AATGGATGAA AGGCGGCCAC CGAGCATAAC GAAGCCCTTG GCCCAAAAAC TGTCACAACT GGTGCAGACC AAAAGTTCCA ACACTGGTTC CAGAAGTACA GACAACCCAT GGAATCGACA TGCCAAAACT GCAATCCCGC CTGTTCCTTA TTCTACTCAT AACGATGCCC ATGGACAGCA CTCCCCGAAC TTTCGGTCCG CAGATTTACA ACGCACTGAG CAAAACCCTC GTTTTCGTCA CATTTCCGAA ATGTCTAGTG GTAAAGGTAG TAAACCCCCA CCTAGCAAGT GGATAGATGG CAACAAAGGT AGCAAGTCTC CCAGTTTTGT GTTGAATCTG CAGGGGGGCT TTCACGGTGG CCACCCTGGG GCTGTTGTGC CCGGAGGCTA G
|
Protein sequence | MVVSSSCPYP SSAHGTASRR GHKKMRHIKK EQAPAVPTIE IQTVLRLRPL LKKERDDHIV IEPLLQNNVP SRSVALHPIP RPEVSASSQL VRQNFSPETV VTSHDLEFTT DHVFSSDTSQ EKLYFSIGSP IALSSMDSLL KGSKKTHMIV ITGSKNSGRS FSTFGPVSKR KHASDGLVPR ILDSLFSQAR HQMKHAAGNF AVNISIMQVH DEKSDECTLH DLLQPIKSGG TPRKRAATGF AHWSGTGNAK ANAVKSLVAN FERQQCATRA TKTIPMRRSP ARGYNSMKEP VFVEQDANTS DFQIINGQIK ACYNTEEARE ILIAATKRSE KMSKGHLLSS HVLVTMETSW MGKNETKFGG TIVVLDLAAP AFQDNTTSMP NHRRVKDSIQ TRHDAYWAVR KCFETLQYNQ TVASLMCGDG RQESLKKVPY RQHMLTMLLQ RFFKGTTSIT YMLTVYPGHT DYAEKKNLLQ DLQILSTREP GQSAMTGLRS RRPSVESGGD QKPNNQLAPP LSMASHATNR GSQPVDRQIL KKAILLTPMK EDMIEIPNIA KNPSLTYSDS TQEKIMDDGD ERRFIDDEYL VPLPPPPIAP SFSARQLPQY ILSPDASAPP EDGIEAGLAP SLVTTPRLAT VPSTFDFARE PKYDLCTAEH SPRPAEPDSH QTDFVCAVEE RQSPTAHPDP SSLIANQPRS PIQELAQSVD RKHRNVFNSK SFRPVKTFNK VVLASKKKAS QVIASLEQRP FVDPEVKVGP SNLSLSDTHA APVDDREYIL ELERKNSMLV EENERLRARN EQLEIENKSI VSRGWDSVLS NDENIKPMDE RRPPSITKPL AQKLSQLVQT KSSNTGSRST DNPWNRHAKT AIPPVPYSTH NDAHGQHSPN FRSADLQRTE QNPRFRHISE MSSGKGSKPP PSKWIDGNKG SKSPSFVLNL QGGFHGGHPG AVVPGG
|
| |