Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43654 |
Symbol | |
ID | 7197364 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1111876 |
End bp | 1113991 |
Gene Length | 2116 bp |
Protein Length | 629 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177754 |
Protein GI | 219112005 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.769463 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCTACCTTA CCCCGTCACT TGTTCCAAAG ACGCCGAGCA TTCTACGATT GAATCACTAA AATGGAGAGA TCCGGTAGAG ATGAAAATGC TGAACCGGAA GTCCGCAAAC GAAATCGTCG AAGGTTGCTT TGGTGTTCTT CTTCGGCGCT GTGTGTTGTT GCCGCAGCTC TTAGCAGCAA CGGTTTGTCG GCCTCGCTTA CAGGAAAGTC ACTACCTTTT CCAGCGTTAG GCCATCTGCT TCAGTCTTCG ATGCCGATCT TGGAAGACGA TGATGAACGG TTGGGTATGA AATCGTCCAA GCGAATGCTA GAAGATGCTG CTAATGGGAA TACTAGTGGA CAGGATGATG CACAACAAGA CGAAAGTCAG CAAGTCACCG ATGACCAAGC AGGTATCAGC GCCCAAGATC AGGATGACGA CAGCCAAACG GCTGGAGACG ACGACCAGAA GCAGACGGAA GAAGAGGAGC ATCGAGGTCC AGACCAAGAC GATTGGTTTC GAGATGGCTT TGCCGACGAC ACTTTTACCG ACGATGATGC GAAAAATAGG TACAGTGAAG TAGACGATTT TTATGCCTAT GTTGGCGACC CGCGTCCTCC GAAGCTAATG CCACTTTCGA GTCGAGAAGT AATTGGATAC TCTGTAGTGG CAATGGCGTT GACTCTTGGA GCTAGTGGAG GGATCGGTGG TGGTGGAGTC GTCGTACCGG TTTATCTCCT CGTCATGGGA TTGCATCGTA AGTTTACGTG TGGGTTCAAT AGCCATTTGG ATACTCAAAC CTCACCTATT TATTTTGTTG TCTGAACTTA ATCACCAGCT CATTACGCCA TACCTATCGC CAGTGTCACC GTTTTCGGAG GGGCACTCGC TAGTACTATC GTGAACATGC AACGTCGACA TCCACTAGCG GATCGTCCCA TTATTGACTG GGACCTTGTC TTAATGATGG AACCATTGAC ATTGATTGGG ACACTACTGG GTACCCTGTT TCATCGGATC TTGAGCGAAA AGATTTTGAT TGTTTTACTA GTCTTGCTTT TGAGCATAAC AGCTCACTCA ACGTTGAGCA AAGCCATGCG CATGTATGAA GCTGAAAAAC GCTATATTCG GCATCTTATA GCCGCCCAGG CCGATTCTCC AAGAGGAAAC CCATCACTCG GAGGCTACGT GCTCCCGTTC GGTGACGAAG ATGACTCTCG GGCTGATACT GGTTGTAAGG AAGAAGCCAG AATGGCGGCA GAAGAGCGTC AACGTATTTT GATTCTTAAT CCAGACTTTC GAACGATGAA AACAGATTTG CTAGAGCAAG AGAAAGTGAC CCCTCGAAGC AAGATCATAG CGCTTTGCTG CATGTTTTCC GTACTTATCT TTTTGAATCT CATGGTTGGT GGAGGTTCTT TCGATAGTCC ATGGGACATC AAGTGCGGCT CGACCGCATT TTGGGTGGTG CATGTTGTAA TGATTGCATT TTTGATGTCA TCAGCGTGGA TGGCACAAAC ATATCTCATT GCTCGACACG AGATCAAGGA TATGGTTCGA TTTGATTATG TCCACGGAGA TATCAAGTGG GATACTCGCA CATCCATTAT CTATCCAGCT GTATTCACCA TCGCTGGGGT TTTCGCTGGA ATGTTTGGCA TTGGTGGAGG TGTCGTCATT GTGCCGCTCT TACTGCACTC CGGAGTGCAT CCTGGCGTTG CGTAAGTTCT ACATATCCTT TTACTCTTTT GTCGGTGATG GCTACCTTGA CTTAAGTTCT AATTTGTGCT TTCACCATTG AGCAGATCCG CAACATCTAG CGCCATGATT CTGTTTACAA GTCTCGCATC TGTCTCCACC TACTTCGTTT TTGGTTTAAT CGTTGCCGAC TTTGCCATGG CCGGCTTTGT CATCGGTTTC ATATCTTCTA CTCTAGGACA AATTCTCATG CGTCGAGTCC GCCAAGCCAA AAGTGCCAGC GGACGCAAGT TTGAGCGCAA CTCTTACCTC GCTTTTGTAA TTGGTGGCGT CGTCTTAGTG TCTGCCTTGC TGATGACAAT TCAATACGTC TTCATGATTG TCGATCAGCC CGACGAAGAT ACGTTTGGTG GTTTGTGCGA TGGACTGCGA TTCTAA
|
Protein sequence | MERSGRDENA EPEVRKRNRR RLLWCSSSAL CVVAAALSSN GLSASLTGKS LPFPALGHLL QSSMPILEDD DERLGMKSSK RMLEDAANGN TSGQDDAQQD ESQQVTDDQA GISAQDQDDD SQTAGDDDQK QTEEEEHRGP DQDDWFRDGF ADDTFTDDDA KNRYSEVDDF YAYVGDPRPP KLMPLSSREV IGYSVVAMAL TLGASGGIGG GGVVVPVYLL VMGLHPHYAI PIASVTVFGG ALASTIVNMQ RRHPLADRPI IDWDLVLMME PLTLIGTLLG TLFHRILSEK ILIVLLVLLL SITAHSTLSK AMRMYEAEKR YIRHLIAAQA DSPRGNPSLG GYVLPFGDED DSRADTGCKE EARMAAEERQ RILILNPDFR TMKTDLLEQE KVTPRSKIIA LCCMFSVLIF LNLMVGGGSF DSPWDIKCGS TAFWVVHVVM IAFLMSSAWM AQTYLIARHE IKDMVRFDYV HGDIKWDTRT SIIYPAVFTI AGVFAGMFGI GGGVVIVPLL LHSGVHPGVA SATSSAMILF TSLASVSTYF VFGLIVADFA MAGFVIGFIS STLGQILMRR VRQAKSASGR KFERNSYLAF VIGGVVLVSA LLMTIQYVFM IVDQPDEDTF GGLCDGLRF
|
| |