Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44362 |
Symbol | |
ID | 7197838 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 334874 |
End bp | 338357 |
Gene Length | 3484 bp |
Protein Length | 1083 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178211 |
Protein GI | 219114831 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTAATGTAG CCTCACCGAC TGGTAAATAT ACACCAGCAA GTGTCTGGTC TCCATGATGC AAGAGAACGT TGGCGGTTCG GATGGCAACA AGCGCCGCCG GCGGGTTTGG ATCGCCCAGG AAGAGCTCCA TTCGCTCGCT GCCTATACGG ACGCAACGGA TCGACGTCTG GAGCAGACAG AGATGAGTTT ACGAAAGGAA GTTACGCAAA GATTGGTGCT GCAGAAACGC CTCCGTGATG CCGAATTGAA TCGGCAACAC TTGCTGGCGG TCCAAACCCA ACAAGAAATA CTGCGCGACG CATTGGCCGA CTCGGTAGCG CAATTGGCGA AAAAAGTACT GGTGGAGGAG TCGCCCTCCA ACGTGACAGA GTCCGACGGA GGGGCGGACG ATCTCGTGCG AATAATTCAT ACGGCAGGAG ATAGGACTCC GCCTTCTGAC GTACATGGAA CAGGCAGTGC GGACGAGATT GCTCCACACT CGCCACGGGA TGCAGCAAGG ACGGCACGCG ATGAACCAGA GATCTGGCCG GACAACAACG ATACCAACAA CAACAACGGC GTGACGGTGT ACACTATTAC GGAGGCACGG CCACGCGGTA CGGTACATAT TGACGCGTAT CCTCGCCATC ACGAAACGAC CACATCGCAC GGGCACCGTA AATGCGGAAC CCCCCGAGTC TCCTGCTTTA TTCCGTGCCT GGCATCGCCA AACGCGGTGG CGCGAATACT CGCCGTTTCA ACGAGAGAGG TCTTTGCTTT AGATGCTGTC ATGCGCGACG CCGGTCATCG TCGTCAGCTA CTACGGTGGA CGGCACTGGA CTTTGGACTG TGGAGCTCGA ATACCGAATC TTCCAAGGAT GGGTCCGGGG CGAAAAATGG TAACGGCGAT CTCCGTAGAG ATGCTTCGCC GAAACGACTT GACCCACATC TTCCGCTTTG CCCCTACGAA TTGGCAGGAG AGTGTGCCGA TCCTTTTTGC TCCTATCAAC ACATCACCCC GCGGTCGAGT ATAGCAAACG TGATGCCGCG CGAGTTTTTG CCTCTACCAA CCATTACAAT CTCCAAAAAA GTGGGCATTT TGTCCGATCG GAGGCCTCAT GATCAAAGCG AGGCAACTAG GAAATACGCG GCAAAAGTTC GAACGAATGC GGTCGACTCG GAAGATGGAT ATATTCCGTT ACCACAACCA GCCAAGCCCC CAATTGCTGC GGCTCCGCAA GATAGTAGTT GCAAAAACGA ACATTTAGGC TGCCTTGCAT GGTGGGATAC CTCTGGTATC ATTTCCAACT TTATGCAATG CCCTCATCCT TTCTCTCAAA TACGCGAGAT CTTTTCGTTT GATACTGATG AAGTGAGATT TCTTATTGAA GACTCGGATC GCATTCCTTC CGCTCTTTAC ATATGGCTGG GTAAAGTTTC GGGAATATGC GCTCTTTCAA CCCATGCCGG GCGCTTTGAT GTCGCATGTT CGCTACTGCA GGACATCAAT CGGAGGTTGC GAGCAAAACA GCAGGAAACA CAGGAAACTG CCCCTAACAA ATCTCCGATG GTACTGACGA GGTCGAGGAT TGCGGTGGGT ATAACGACTT GTTTTAGTCG CCTTTTGGAA AAGTCTTTTC TATACGATTC CAATCCCGGG GACGACATTT TTTTGTTTGC GTTTCAAACG CAATTGAAGA TTTCTCTTGT TTCTGCCTAT CTCCATAGTC TCTACACACA GGAGATCGAT CCGACATTTT CGTCAACACA AACACTCCAT CACTTCGAAG AGATATGGGT TGGTTTGGAA GCTGCACTTA CATCCGTACC GCCCGCTTAT TGTGAGGTGA TGGAATGGGA AAAGCTGCAG CAAGTACTCT TTTCCGCTTC AAGGGATGAG GACAGGGGGG AGAAGCCTCG GAAACCTAGG CAAAACGATA TCAACCACGG CACCGCTAGC CTATTCTTTT TGTCGTCAGG CGGAACTTTG GAGTGTTTGA GCCTGATTAA TCGGGTTCAA TTTTCATTGA ACGCAATCGC TTCCCTTGAC TCTCTGATGA ACACTGCTCT TCGTTCTTCT TGGAGCGCGT TCAACCACTT GTTCGAAGAC AAGAAATCAG GATCAAGCTC CTCTCGACAC GATCTCCTAT TTTTTTCGCA GATTGGATCC ATTGTTCTCG CTTGTTTGCG TCGAGCTTCG ACCGCTATCG AGTTGAGTGA TACAGGATTT GCAAAGTCGC TGATGGACTA CATCGAACTG TACAATTTGA CCGAGACAAT TCTTTGCTGG CTTGAATCGA TTCCGTCCAC TGAGTCTTGG ATCGATCTTC TCCTTGCACC GCTATTTGCG GCCAACATCG CTCTCGGCTG TAGGCTGCAG CAATACGATA AGATTTGTCG CCGGCTTCAA GATTTTTTGA TGCATAGGCC AAGAAATGAA AGCTGTGCAG GGCTTTGTAC ATTTTCAGAG CTTTTGTGGT CCCAATATAT TCAGCTTCAC TTCACTTTAC CCTATAACAT AGCAATGGGT AACATGGACG GTCAAAATGG ACATTCATCA CTGACGTGGG AGATTCCAAT GGAGGTCAAC AAATCTCATC AGGCCATGTG TCAGGTGTTA TCATCTCGTG AAGTATCTCC CCATCATGTG GTTGCACCTC ACGATCAAAC AATGGTTTAC GAGGTGATAC CTGTACTATC AAGTGAGTCA AAAATCTCCT CCGAACGGGA AGACAAATTT TCCGAGAACG AGAAAGCGGA AGGCGGTCGG GTATTCACGT CGTCATGTAG TTCCTCAATC CGTGACTTCT ACAACCTCGA ACACGAGCCA ATGTCCTTTG TGTTCCACCG AGACAGTGCA TGTATGGAAA ATGGCGGTAG TCGGTTGTAC CCAGGTATTC CACTTTCGAT ACCAACGGGT TTGCTTTTTG CCGGCTCATC ACTGAAGAGT CTTTCTCTGA ATGGTTTTGG GCTGGAGCGG CTACCTGATC ATATGGATAT GTATTTTCCG AAACTTACAG TACGTTCGCT GGAGTCGTCA TTCGTTTTTA GCTATCCGAT CTTTCTCACA ATATTTGTTT GTCCAAGAGT CTTGAAATCA AGGAGAATTC GTTGGTCATT TTGCCCGAAT CAATTTGCAA CCTAACGCAA CTAAGAGTTT TAAGGATCGA CCGGAATATG CTAGGCTCTC TACCGTCAAT ATTTCAAATG GTCAGCCTCG TGGAGTTGAC TGCTGCCAGG AACAAGCTGA CGACCCTTCC GGCTTCGCTG GCTGAATGTC TGAACTTGCA AGTGTTGGAT GTGAGTGGTA ATCCTATCAA TTGCGATTTG CTCTGGTTTG CAGCCAGACT GAAGCATCTT CGAGTCCTCC ATCCTATGTC GGAATTAGTA TAATATAGGC ATGTATAGCA GTATGTGAGT GTTGCGTTGT ACTTCGCTCA ATATTAGATT CTAAATCAAA TTGAGACGGG CGAAGACCAA GAAAGTCGCT CGTGTTTGTG GTAA
|
Protein sequence | MMQENVGGSD GNKRRRRVWI AQEELHSLAA YTDATDRRLE QTEMSLRKEV TQRLVLQKRL RDAELNRQHL LAVQTQQEIL RDALADSVAQ LAKKVLVEES PSNVTESDGG ADDLVRIIHT AGDRTPPSDV HGTGSADEIA PHSPRDAART ARDEPEIWPD NNDTNNNNGV TVYTITEARP RGTVHIDAYP RHHETTTSHG HRKCGTPRVS CFIPCLASPN AVARILAVST REVFALDAVM RDAGHRRQLL RWTALDFGLW SSNTESSKDG SGAKNGNGDL RRDASPKRLD PHLPLCPYEL AGECADPFCS YQHITPRSSI ANVMPREFLP LPTITISKKV GILSDRRPHD QSEATRKYAA KVRTNAVDSE DGYIPLPQPA KPPIAAAPQD SSCKNEHLGC LAWWDTSGII SNFMQCPHPF SQIREIFSFD TDEVRFLIED SDRIPSALYI WLGKVSGICA LSTHAGRFDV ACSLLQDINR RLRAKQQETQ ETAPNKSPMV LTRSRIAVGI TTCFSRLLEK SFLYDSNPGD DIFLFAFQTQ LKISLVSAYL HSLYTQEIDP TFSSTQTLHH FEEIWVGLEA ALTSVPPAYC EVMEWEKLQQ VLFSASRDED RGEKPRKPRQ NDINHGTASL FFLSSGGTLE CLSLINRVQF SLNAIASLDS LMNTALRSSW SAFNHLFEDK KSGSSSSRHD LLFFSQIGSI VLACLRRAST AIELSDTGFA KSLMDYIELY NLTETILCWL ESIPSTESWI DLLLAPLFAA NIALGCRLQQ YDKICRRLQD FLMHRPRNES CAGLCTFSEL LWSQYIQLHF TLPYNIAMGN MDGQNGHSSL TWEIPMEVNK SHQAMCQVLS SREVSPHHVV APHDQTMVYE VIPVLSSESK ISSEREDKFS ENEKAEGGRV FTSSCSSSIR DFYNLEHEPM SFVFHRDSAC MENGGSRLYP GIPLSIPTGL LFAGSSLKSL SLNGFGLERL PDHMDMYFPK LTLSDLSHNI CLSKSLEIKE NSLVILPESI CNLTQLRVLR IDRNMLGSLP SIFQMVSLVE LTAARNKLTT LPASLAECLN LQVLDILNQI ETGEDQESRS CLW
|
| |