Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42735 |
Symbol | |
ID | 7196125 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 931544 |
End bp | 934006 |
Gene Length | 2463 bp |
Protein Length | 796 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177192 |
Protein GI | 219110881 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTCG TTCTAGAAAC ACTGCTAAAA GACGCTTCCA GAAACGAAGT ACCGCCGGCA AAACGGCGAG CAGCACTGAC GAAGGCGTAT GACTTACTGA GTCGGATAGA TAAAGAACTC ATTGCTCTTA ATGCGCGAAA CACTCCATCT GAAAACGATA GTGAGCAATC GGATACAGAT GTCGAAACGA AATCAACAAA CGGAATCGCC GTGGCGGTTG CGGGCGCGTC GTCCAAGGCG TCCATCGATA CCGGACTACA CTCACCTTCC GGGGGGCGAT CCAAGAGTAC TTTGAACGGT GGGATCACAA CTTCGAGAGC TGCGAGCCCT TCTGATTTGA GAAGTAGTGG GGAAAAATCA GCGCCGGCGA TACTTGATGG CCCCGTTGCT GCTACAAATG TTCCATCTAC GACGAAAGCC ACCACCTCCA GTTCTGGGTC GGGACTGTTG GCTACAACGT CGTCGGAAAA GGCACCTTCT GACGTAAACA TCGCCGTTGT AGCCAAGAAA TCATCAACGA GATGGGGCGA GCCCCTTTCA AAGAAGATAT CTGCAGGCCG GAATGGTGCA CATCATGAAG TGGTCGAGGC AAATTATCCC TCTGCTGTAC CAAATAATGA CGCACCGAAA CCTCTACATG AATCTTCTGC AAGAATACAA ACTGATGGTA AACCGAGATC TGAACATCTG GAAAATGAAT CTCTTCATCT GCCGGGGTCA GCTTTAAGGG TAGATCCGAT AGCAACAGGG GACGAAGCGA CGAGACCAGC AAGCCCCGTT CAGACTTCCA AACATTCATG TCGTACCAGC CTACCTCCGT TCATTCCAGA AACAACAAGT AAGCGCGCTA CCAGCGGTGG TCCTTCTTCA GTGCTCTTGG ACAATACATG TAGGTTGGCA TCACGTGATT CGGACAAAGA ACAACGTCCT ACGTTGGCAG AAAAGGGAGG CCAAAACATT GGCGTGCTTG TCTCTAGCAG TACTGTCACA GCGAAGAACG TACAGAACCC CAAACACAAG GTGGATTTGG ATGAATCCGG CTCATCTAGT AAACTTTCTA GTAAAGGAAT TCAACGACCT CAAAGAGATC TGTCTCTACG ATGGGAACCG CCGAAACGGA GATCTTTTGA AGTCGAAAGG AGCGCATCGA ACGATACATT TGATCGACAA ACCAGGAATC GCGAATCGCA TTTGAATGAA AGAATGGATG ACGCCATATC CCAGAAGGAT GTGACAGCTA ATTTTTACAA TGAACAACGC TCTAAGCAAG AAAGAGTGAG ACAAGACGCG AACTCGATTT TTCAATCCAA TAGAAAGCCG CTGGTGAACG ATGTTTGGAT CAATTCCGGC GTTCCAAGTC ATGCAGTAAC AGTCAAAATG GCCGCTGACA GAAGTCCAAC GTTTTCAGGG TCTTTGTCTG AAAGAGGGGA AATAAACTTG AAACTCAAAC TGGATCCCTT ACCAACAAGC ATATCCAAAA GCCTACCGGT CCGACTGAAA AAATGGGATC CTTTTTTTAC TTTCGCGGGG CCTTGCAGAT GCACGTTGAC GGTGCCTGCA GGCAGCACAG ATGCAACTAA AACAGGATCA TCCTTGAGAA TCAATCTCAA CACACCTATG TACCAAGAAT TCATCAGAAA AATTCAGCCC GAAATGTGGG GAAAACCTCG CCAGGGTGCT TTGAAATGGA AGAAGGGAGA TTGGGGATTA ATGTTACGCG CTCTTCCCTT ATCAAGGACC TCAAAAAATC GAGCTGATTG CCACTTATGG CCTAAGGGAA CTTTTCTGCA ACTTAACGGA AAGCCACTGC GATTGGCTCA AAGGCAGCAG CAGAGTCATG ATAAGTCTCT ATGGAAGAAT CAATGCACGC AGTTAGACTT GACTGAACAT GTATCTATGT CTGACCCGAA TGTTTCGATT GAGATATGTT GCTATGATGA AGAACCGTTT ATCTTAATGG TGGGGTTTTG CAGATATGAA TCTGCCGATT CGATATTCTC CACGATACGA AACCCAAATA ACGGGCTACT GAACCGAGTC ACTGTAAAAG AAGGGATGCA GCGAGCCATT CAAAGAGCAT CTGGACAGAT GCACATTATT GATGGGAGCG ATGGTGAGAA GGTAGAAGAA GTAGGCAAGT TTGTTTTCAG CTTGACTTGC CCTATTTCCA AGGCTTTAAT GAACTCTCCT GTTAGAGGAA GGTCATGCAA ACACTGGCAG GTATGTGACT AGATGGTCTG CCTTCTTCGT AGTATTCTAC AATTCTTATA ACTCGATTTG TATTCACCAC AGTGTTTTGA TTTGAAGACA TACTTGGATG CAAATCAAAG AGTCACGGGC AGCCGTTGGC GTTGTGCTTC ATGCGAGTTG TTTGTGCCAT ATGATGAGCT CGAAGTGTGT GAGTTTACGC TAGCCGCGTT GCAGCGATAC AGAAATGAAG CATCCACGGC CGAGATCGCA TAG
|
Protein sequence | MEFVLETLLK DASRNEVPPA KRRAALTKAY DLLSRIDKEL IALNARNTPS ENDSEQSDTD VETKSTNGIA VAVAGASSKA SIDTGLHSPS GGRSKSTLNG GITTSRAASP SDLRSSGEKS APAILDGPVA ATNVPSTTKA TTSSSGSGLL ATTSSEKAPS DVNIAVVAKK SSTRWGEPLS KKISAGRNGA HHEVVEANYP SAVPNNDAPK PLHESSARIQ TDGKPRSEHL ENESLHLPGS ALRVDPIATG DEATRPASPV QTSKHSCRTS LPPFIPETTS KRATSGGPSS VLLDNTCRLA SRDSDKEQRP TLAEKGGQNI GVLVSSSTVT AKNVQNPKHK VDLDESGSSS KLSSKGIQRP QRDLSLRWEP PKRRSFEVER SASNDTFDRQ TRNRESHLNE RMDDAISQKD VTANFYNEQR SKQERVRQDA NSIFQSNRKP LVNDVWINSG VPSHAVTVKM AADRSPTFSG SLSERGEINL KLKLDPLPTS ISKSLPVRLK KWDPFFTFAG PCRCTLTVPA GSTDATKTGS SLRINLNTPM YQEFIRKIQP EMWGKPRQGA LKWKKGDWGL MLRALPLSRT SKNRADCHLW PKGTFLQLNG KPLRLAQRQQ QSHDKSLWKN QCTQLDLTEH VSMSDPNVSI EICCYDEEPF ILMVGFCRYE SADSIFSTIR NPNNGLLNRV TVKEGMQRAI QRASGQMHII DGSDGEKVEE VGKFVFSLTC PISKALMNSP VRGRSCKHWQ CFDLKTYLDA NQRVTGSRWR CASCELFVPY DELEVCEFTL AALQRYRNEA STAEIA
|
| |