Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46518 |
Symbol | |
ID | 7201673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 583907 |
End bp | 587152 |
Gene Length | 3246 bp |
Protein Length | 1081 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180861 |
Protein GI | 219120236 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.42154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTGG GTGAAAGCCT TAAACTTTGT GTTGCTGATA CTATGAAAAC TGACTATCGA CAAACTCTCG ATCTCGAGGC GAGCGGAAAT CAGGTTGCCT TAGCCGGGAA GTCTATCGGT GCCAGTTTCG AGACGTTCAT CATGGATGAT ATAAATGGAA ATTTACCCAG TTATGATTGC CGGCTCCTTA ACACTAATCG TGTCTACGAC AACGACGGTG AGGATCTACT GGGTCAAGGT TTTTTCATCC TCGAGGTCAT TGTGGAACAA AGGGTATATG TAACGGCTAC CGACACCTTG ACAATAGAGC AAGTAACAAG TCTTGTCGGA TTGGGTTTCA ATCGAGGTGA TGGCGGTCGA GTCAAGTTTC GGTCTTTGCT ACAACAGTAC GAAGCGTTCG AAAGCATCTT GAATGTTGAA GTATTAGAAA GTGTATCGTC ACCACAGGCC GCCCCTTCGT CCATACCGAG TACTTTTCCA TCCGGCAAGC TCTCCTTTGC TCCATCTCCA GCACCAACTA TAACGCCATC AGCTGTACCT ACGGATCAAC CGTCGCATCG TCCCTCGATA TTCTCGACTG CTATGACTAC GAACTCACCA TCTTTCCCGC CATTCAATGT ACCAATGTCA TCAACACCTT TGAAAGCTCC AACACTACTG CCTGCGGCTT TGCCCACAAC TTTGGCCCCA AGTTCTCCGA GCTCGTCCAA GTCCAACAAT GATGGTTTAC CTTTAATTAT CGGCACAACT ATTGGGGGGG TGTTGACTAT GCTGGTGTGC TGTTTTTTCG TCTTTTGTGT ATGGTTTCCC TATTGGCGTG AGGGGAACAC CGGCGACGGC AATGGGCCGA ATCGGCAATT TGAAACGAGC AGTCAAATGT CATCGGGACG TGACACTATT GTCCCGGGTG TGGTTCAGTT GGATGATGCG TCCTTGGCTA ATACTACTTT GGGAGATGAG ACAACAGATG GTGGCTTTCG CAACAATTCT GCCGAGAGCA AGAGACCGCA ATATTCGTTT CTCAAACCAG CCCCAATCGC ATCAATGGAC AGTTTTGACG AGAGCTCTCT GTACACTTCA CCCGGACCAC CCGCTAGCAA CGCAAACCAA AGCAGTTTAC GCATAAGTTC CATAGCCATG GCTGCGGTAA AGCCCCCGGT TATGTCCAAG ATTGACACAT CCATGGAACT GAACCCCAGT CTGAGCCTTG CCCTTCACTT TGAAGACGAC ATCGTCTTTC CCTTGTCTGA GAGTAAAACG GATTGGAGCA CCGAAAAGCG AGCTACAGTG CACAAGGATT CCATGCATTC GGTGCTCACG GGCGGCGGAG TCGTGGATCT AGATGAGGTT TATTTCTTTG ATGACGACGA GTCGAAGGAG GAAGGCTTTG AGCGAGTGCC CGGTATACCT GGCAAACTGG CAATTACGTC TTCTAAGAGT GACGAAGAAA AAGCTCGCAA AGTGTCTCAA TCAATGGAAG AAGGTACTAG AGGTTTCGAT CCATTTGACG AAGAGAAGTC TGGGTCGTCG TCGTCGTTCA CTTTTGACAA CGGAGTGGAG ACGGCTGAGA TCTTCAAAGA AGACTCCTCG TCCGATACGG AAGAAGACGA GAGTTTGCCC GTGACTAACC CACACCAGTC CAAACGAGTG ATGCCCCTGC TGTCAGGGAG TACTCCCGAG GCCAACGATA CCCAAAGAGG CATTCAGCCT TGTGATGTGC ATAGACGAAA GTTGATCAAC AACATTGGAG AAGGCGCTCT GGATACAATA GAACTAAACA GGAATGGGCA CGATAGCAAA GGCGACGATA CCCAAAGAGG CATTCAGCCT TGTGATGTGC ATAGACGAAA GTTGATCAAC AACATTGGAG AAGGCGCTCT GGATACAATA GAACTAAACA GGAATGGGCA CGATAGCAAA GGCGACTTAA AAGAGAAGCA GAGAGAAGCA CGCCTCTCAA GGTCAGCAAG AAAGAACAAT TCGTTGCTCC GGAATGTTCT AGAGGATGCC CGTCGTTTAG CTGAGGCTGC AACTTCTAAC AATCGTTCCA GAGCTTCTCG AAAGACGGCA CCACCGCGAA TTGTCGACAA AATCAAACGC AACTCGCACG ATAGCCAGCC TTTTGATTTA CTAGCTGATA CATTAGACGT GAAGGAGTCT CTTTCTCTTG CTTCGGCAAA ACGCCCCGCG CATTCCACGA AAGGGAAGGT TGTCTCAAAG ATGACTGCTG CGCGGTCGAG AGGAAGCGGA GATTTGACAA GCGACAATGA TCATGCTAGA ATTAAGGCCT TTGCAAGTAC ACATCTCTTC CGCAGTCGTC TTCTGGGCAA AAGAGAGGCA GGGACTGGTC AGCATAGTTT GCCTTCATCA ACCACTTCGA ATTCTTCATT GCGGCTAACT GTAAATGTTG AAGCTGAAAT CACTGATGCG ACGTCTGCTT TCGATGCCAG CTCGGTCCTA ACCTCTGCAA ATCCAAGAGA TTCTCCACAA GGCAAGTATG CTGGTCAATG CCCGGAAAAC CAGAGTGTAT TGTCACCCCC CAACAACATA GAAATTTTAT ACCCAAACGG AAAAATAACC GTACAGGACG ATTTGTCATG CACACGGGAA GGCGCTCCAA AACCTTTGTT AACTGATGCG GTGGATAAAG CTCCTGAAGC TTGTTCTACA ACTGATATTA AGTCGACAAG TGTATGGTCG ATCGCGCAAT CATCTCAACC ACAACGGTCT CGAAATTCAC GGTGTGGGTC AATATCAAGT ACGGGACGAC AGCTAGAAAG ATCGCCGGTA TCTACCCAGT CACGGCGGCA GAATCGGCGC CTTTTGCGTG ATGAGATTGG GAGCTCCAGT CGCACCTTTT CTTCCGCCCC AGAGAGGTCC CAAGGAGAAG AAAAGAGTTT GGGTGACGAC ACCCTGCCTC TGGCATTTGA ACAGGATTTG GAAAGGCTTA AGCTGCAGCT CGTAGATATC GTGCGCACCG ATGCATTCAA GGTTGGTCCC TCTTCAATAA CAGCGTCTAA GACAAATCGA TCCATTGCCT TCGTCAGGAA GAATAAGAAA GACCAAATTG TTGTCATCGT CCCCCCAGGG AAGGTTGGAG TGGTTCTCGC AAATCGGTAC GATGGAAAGG GAACGATGGT GTCGGAAGTT CGGCCTTCTT CAGCTGTTCA TGGGGCCATT TTCCCCGGAG ATGAAATTGG TACGTTAGTG ACTGTGCTTA CGATTGTTGT CGCCAACATT GCTTGA
|
Protein sequence | MQLGESLKLC VADTMKTDYR QTLDLEASGN QVALAGKSIG ASFETFIMDD INGNLPSYDC RLLNTNRVYD NDGEDLLGQG FFILEVIVEQ RVYVTATDTL TIEQVTSLVG LGFNRGDGGR VKFRSLLQQY EAFESILNVE VLESVSSPQA APSSIPSTFP SGKLSFAPSP APTITPSAVP TDQPSHRPSI FSTAMTTNSP SFPPFNVPMS STPLKAPTLL PAALPTTLAP SSPSSSKSNN DGLPLIIGTT IGGVLTMLVC CFFVFCVWFP YWREGNTGDG NGPNRQFETS SQMSSGRDTI VPGVVQLDDA SLANTTLGDE TTDGGFRNNS AESKRPQYSF LKPAPIASMD SFDESSLYTS PGPPASNANQ SSLRISSIAM AAVKPPVMSK IDTSMELNPS LSLALHFEDD IVFPLSESKT DWSTEKRATV HKDSMHSVLT GGGVVDLDEV YFFDDDESKE EGFERVPGIP GKLAITSSKS DEEKARKVSQ SMEEGTRGFD PFDEEKSGSS SSFTFDNGVE TAEIFKEDSS SDTEEDESLP VTNPHQSKRV MPLLSGSTPE ANDTQRGIQP CDVHRRKLIN NIGEGALDTI ELNRNGHDSK GDDTQRGIQP CDVHRRKLIN NIGEGALDTI ELNRNGHDSK GDLKEKQREA RLSRSARKNN SLLRNVLEDA RRLAEAATSN NRSRASRKTA PPRIVDKIKR NSHDSQPFDL LADTLDVKES LSLASAKRPA HSTKGKVVSK MTAARSRGSG DLTSDNDHAR IKAFASTHLF RSRLLGKREA GTGQHSLPSS TTSNSSLRLT VNVEAEITDA TSAFDASSVL TSANPRDSPQ GKYAGQCPEN QSVLSPPNNI EILYPNGKIT VQDDLSCTRE GAPKPLLTDA VDKAPEACST TDIKSTSVWS IAQSSQPQRS RNSRCGSISS TGRQLERSPV STQSRRQNRR LLRDEIGSSS RTFSSAPERS QGEEKSLGDD TLPLAFEQDL ERLKLQLVDI VRTDAFKVGP SSITASKTNR SIAFVRKNKK DQIVVIVPPG KVGVVLANRY DGKGTMVSEV RPSSAVHGAI FPGDEIGTLV TVLTIVVANI A
|
| |