Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48186 |
Symbol | |
ID | 7203319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 445998 |
End bp | 448976 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182536 |
Protein GI | 219124492 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTCCA TTGTGTTGCT ATCTGCCACT GCCCTGATGG GCTGTGACGC CTTCCAGGCT TGGAGCATTA CCGCTGCACA GAATTATCTT GCTCGCTCTT TCTTCGATGC GCATCGGATC GATACTGAGT TTCGAACAAG ATATCCAAGA CACAACGTTT TGTCCCGCTT ATCCGGGTCC ACTTCTGGCA TTGACCCTGC TCCCCAGGTC TTTGCTTCTG GCTTTTCCAC GAAAACCGAT CTCGTCGAAG CTTTACAGGA GGCAGTGGAA ATGGCTGTTA GAGCTTTGCC TCCGGCAGCA GCAGAAAACC CGATCATCGA TTTGTGTACA GTATCGGTGT CGTCTTTATA TGACGGAGGT TCAAGCCCAC CGACAACGGT GGTAATCCCG ACAATTGTAG AAACTGCCCG GTCAAAGTAC GGGATCATTC AACACTTGAT TGGTAGTTCA GTGGCGGGGT GCATTGCGAG CGTCGCAACG ACTGAAGTTG ATAATGCTTT GACTGCTTGT CAACCTGTGG AGCTCGATGG AACTCCAGCT GTTTCAATTT CTCTGGCGAT CCTTCCAGAT GTACAGCTAC GCACCTTTTT TTGCCAATCG GCTTATGTTC CTGACGATAT TGGACGCATC AGTCCTGCAG AATGGAAGCG GGCCGTTGGA CTCAGTGGCT TTTTGGAATC AAACAAGAAA GATGGTTCTG ATACAGAACT TAGTCAGCAG GACTCGGTTG TCATGCTTTT ACCGAGTCCG GCCTTCTCGA CAGAGCTCGA TGACTTCTTG CTCGGTCTCT CCCTCTACTT ACCCCGAGCC CAAACTTTCG GCGGCATAGC AAGTACGGTC TCATCTCTCT CGCGCGCCAA GCTATACCGA TATTCAGCTG CCAGTCACTT ATCCGAGTGT TTGTCCGATG GTTGCGTTGG TGTTGCAATG ACAGGGGATA TCCAGATTCA AAGCATGGCT GCGCACGGCG CCAAACCTGT CGGTGGTATT TATCAAATAC TCAAAGGCCA AGATTCAACA ATCGGCGTTA TTGTCCTGGA CGAGACTGCA ACTCAGGCCC TGAAGGACGA AGAAGACAAC GTCGATAACG ATAGCGACAA TGATTCAGAA GAGTCAGAGC CATTGGACAA GAAAGCTGCT TTAGCACAAG CGTACGCCAA GGCCCAGATT CCGAAACCTG TTTTAGCCGA AGCCAATTTT TTGATGAGAA CACTGTCAGA TGAAGATCAA GCTTTTATGC GTCGACAGCT TTTAATTGGG ATAGATAAGG GTGGTAGTAT CGGTAGGTCT GCGAGCGAAC TTGCGAGACT ATCGGAAGGC GAAGGACACA GGTTCACCGT GCATAAAGTA GCCACTGCGG GCATGAAGGA TGGAAGTGTC ACGTTCTCGT TGGGAAGTAT TGATGTTAAG ACTGGTACGC GTATGAGATT CTTTGTGCGC GATTCGGAAT TTGCCAAGAA GGAAGTCGAA GCTTTATGGT TCGGATACAA GAAGCGATTG TTAAACCAGC AGTTTGGGAA AGGCGAGCAT ACGACAGATT CCACTTTCAC ACGGTCGGGA TGCTTTGTAA TTCCAACTCT TGATCGAGGG AACAAGTTCT TTCAAGGCAA ACCTGGGTAT GAAAGTGGAA CCGTTGCTCG CATTCTACCT ACCCTTCCTA CAATAAGTGG ATTTTTTTCG AACGGCATCA TTGGAATTAC GGAGGGCGAC GGTGATACTA GCACGGGTGT GCAGGGCAGC GCGACAGGTT ACACATTGAT TGGCAGTAAA ACGGATCGAC CAATATTTTC ACCAGCAGCT GCAGCCGCTG CTCATACTGC AGCACAAGAA GAGAAGGAGG CCCAGGAAGC GGAAGCTGAA GCTCAGGCTC TTGTTGCGGA AGCTAACAGT AAGGTCGGGG AAAGTTATAC CCAAGGAAGC AATGGCGTCG TGAAGACAGC ACCTCGTTCC GAAGATGGAG AACTCATAAT CAAACGCCGA GAGGTTCACT CCGGGCGAGC CATGACAGTT TCAGCCGTCG AATGGAGTGT AGCGGAAAAG GCGGCAATTC CAACGAGCAC ACTGGAAGGA TTTATGTGGG ACAAAGAAAC AGAAGTTGAC CGCTTCCGAG AGCGGGTACC TCTGGTCAAT CTGGTATCTC AGTGCAGATT ATCACAAATG GATCCAAAAG CTCCTAAACC TCGAGGCTTC GTTGTACCAA TCCAGCAAAT GGTTTCGGAA GGAAAATTTG TTGTTATTCC AGAATGCAAG CGAATGGAAC CCACGATCGG GAGTTTGCGA CGTCGCTATG ATTTGAGTAA GCTTGCTCGC GATTTCACTT TTGACGGCGC TGTAGCCATT AGTGTGAATT GCGATGCAGT CCTCTTTGGC GGGTCTCTGG GCGACGTCAC TGCAGCACGT GAAGCTGCTG GTAGCGCCGT GATTGATAGC ATATCAGAGG AGGGAGTCGT CGTCCCCCCA ATTCTCGCAT CCGACTTGAT CCTTTATCCG TATCAGCTTT ATAAGCTGCG TTTGGCTGGC GCCGATGCAA TTAACCTCTT GGTTGGAGCT CTAGAGAAGA AAGATCTGTC ATACCTTACC AAGATAGCGT CTAGTCTTCA GCTTCAGTCA TTTGCCACCG TAACTTCCGA AGTGCAATTA CTGGAAGTGG CAAGTCTGCA GGAAGGGACC ATTGACGGAA TAATCGTATC CAATCGTGAG CTTGAAGACT TTTCCTTCGA TATGACTGGG GAGCAAGCAT TGTACCTGTT GAAAAGCAAT GCTCTGGCGA AAGTCCGCGC AAAACATGGT GAAGACCTTC TTATCTTGGC TGAAGGAAGA GTCGGTATAA TCGATCGTCC TCAGGCAGAC AGCACAAGAA GTGCTAAGCT TTATATTACC GAATTAAGGG AAGCTGGCGC AGTGGGTGCG ATAATGGGTG GTGCATTGGC AGTGGACGGA GGGGGGTATC AGCAGGTAGC GAAAATGGCG CAACTGTAG
|
Protein sequence | MKSIVLLSAT ALMGCDAFQA WSITAAQNYL ARSFFDAHRI DTEFRTRYPR HNVLSRLSGS TSGIDPAPQV FASGFSTKTD LVEALQEAVE MAVRALPPAA AENPIIDLCT VSVSSLYDGG SSPPTTVVIP TIVETARSKY GIIQHLIGSS VAGCIASVAT TEVDNALTAC QPVELDGTPA VSISLAILPD VQLRTFFCQS AYVPDDIGRI SPAEWKRAVG LSGFLESNKK DGSDTELSQQ DSVVMLLPSP AFSTELDDFL LGLSLYLPRA QTFGGIASTV SSLSRAKLYR YSAASHLSEC LSDGCVGVAM TGDIQIQSMA AHGAKPVGGI YQILKGQDST IGVIVLDETA TQALKDEEDN VDNDSDNDSE ESEPLDKKAA LAQAYAKAQI PKPVLAEANF LMRTLSDEDQ AFMRRQLLIG IDKGGSIGRS ASELARLSEG EGHRFTVHKV ATAGMKDGSV TFSLGSIDVK TGTRMRFFVR DSEFAKKEVE ALWFGYKKRL LNQQFGKGEH TTDSTFTRSG CFVIPTLDRG NKFFQGKPGY ESGTVARILP TLPTISGFFS NGIIGITEGD GDTSTGVQGS ATGYTLIGSK TDRPIFSPAA AAAAHTAAQE EKEAQEAEAE AQALVAEANS KVGESYTQGS NGVVKTAPRS EDGELIIKRR EVHSGRAMTV SAVEWSVAEK AAIPTSTLEG FMWDKETEVD RFRERVPLVN LVSQCRLSQM DPKAPKPRGF VVPIQQMVSE GKFVVIPECK RMEPTIGSLR RRYDLSKLAR DFTFDGAVAI SVNCDAVLFG GSLGDVTAAR EAAGSAVIDS ISEEGVVVPP ILASDLILYP YQLYKLRLAG ADAINLLVGA LEKKDLSYLT KIASSLQLQS FATVTSEVQL LEVASLQEGT IDGIIVSNRE LEDFSFDMTG EQALYLLKSN ALAKVRAKHG EDLLILAEGR VGIIDRPQAD STRSAKLYIT ELREAGAVGA IMGGALAVDG GGYQQVAKMA QL
|
| |