Gene Information |
Locus tag | PHATRDRAFT_43038 |
Symbol | |
ID | 7196237 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1871969 |
End bp | 1875641 |
Gene Length | 3673 bp |
Protein Length | 1112 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177391 |
Protein GI | 219111279 |
COG category | |
COG ID | |
TIGRFAM ID | |
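Note on coordinates: the reported gene length matches the start and end positions if both are treated as 1-based, inclusive coordinates. The record does not state its coordinate convention, so that reading is an assumption, and the helper name `gene_length` is hypothetical. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the + 1.
    return end_bp - start_bp + 1


# Start bp and End bp values taken from the record above.
length = gene_length(1871969, 1875641)
print(f"{length} bp")
```

The minus strand does not change this calculation, because the start and end positions are given on the replicon with start below end.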
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00196864 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTCGT CTCGTTGGAA GCGCTTTGCC TTTTTCGAGC GAAGCACGCT CGACGTCCCA CCGGAAGTCA TCGACGACCT TATTCCCGTT GATGGAGTCA GCCGAGACAA CCGTCGCTCG GTCAAATCCT TAAAATTGGC TGGTAAAGAA GTGAGCAACG ATTCCGTTAT CTTGATAGTA ACAACTGCCG CGTTGCCCTT GTATTCCAAG CCAACGGAAG TCCGTACAAC TCATATTTCC GCTGGTCAGA GAAAAAAACA AGAAACGAAA GATGACTCTA TAAGCGCCAT GTGGTCGAGT CTGACAGCTT GCACAGTACC TGCTTTTGCT GAAGAGGGCA ACAATTCATC TGGGGCTGCG GGGGAGGCGC TCGCTCATGG TATCTCCATG CCGAGCCAAG CGCAGCTTCC TAGACAGGGC ATGAATGGGA GAATTTCGAA AGCATTAAAG GGCGCTTCCA TGGATGGACT CGTACTCTCG TTTGTGACTT CTCGAAGTAC AGAGTTTGTT CACTGTGTCG ATGTTACAGT CCGTTGCAAT CCCCCTCGGG GCAAAGAAAG TCTCGAAGAT CTTGACGGTT GGCGTGGTTA TTTTAGCCCA TTTTTAAGAA CCCCGGGCGA TCGATCTGGG ACGACGTCTG AAACTCCTAT GGTAGAAAAT ACGAACCACT CTGACGTTTC GGTCTTAGGC ATAGCGGCCT GTCGGATGTC AACAGGCGGT CAAATCGTCG ACGAGCACGA CGTTGTTCAC TTGGTTTGTA TTTCTGCTCA GCAGATTTGC GTGTGGGAAG ATCCCCACAT TCATCTTTCT TGCCGGCGTC CATTGACGCC ACCTCCTGTC CCCAGTGAAG CGAGAACAGT TTATCTTCAG TCATCTTGGC GACCTACTGA TGGGAATTGC CGTGTGGTAG ACATTATTCC AGGCATCGTC GCCGTGGGTA CCGATACAGG AGCCGTTCTT ATATTAACGT ATTCGCCGGA CCTCTCAGTG AGCACTTCGC GACCTCAGGG CCTTCGGACG TATTTGCGTA TTCCCCCACC ACCGACTCAA AATTTGGAAG TAGTGAGTGT CAAAATATCA TTGGTAAACG ATAAGGCTAG TGTTTTTGTG GCCTACAACT TCACGGGATC AGTTTCGGTA TCTCAGATGT CTACAGCAGG AGTTTGTTGC TATGATTTTC CGGTACCAAC ACCCAGCTCA CCCTCTTTGT CGGCCCCCTC CGCCCGCCAC GATTTGGACG GACGATATGT GGGTTCCTCG ACATTGGTGG ATGCTCTTAC GACTCAGCGG GGACTAGAAC TGAGTGTGGT ACGTACGTGC AACTGAAATA TTAGAATCTG TTACGTCGTG TTGCGTTCTT CTCACACAAT TGTCGTTGGG TCTATTGTTC CAGGCGCGTC CTGACGGACT GTACTCCTAT TCCCAGACGG AACGTATTGG AGTCGCTCCA GTTGACGGTA CAAAGCTTGC TCTCTGCTTG ATTCCACCGA CGATCCCAAT GGAAAGGACC CGAGAAGTAG ATCTTGAAGG TATACCTCAT GGTTATGCTC TAGTAGCGTC AACAGACGCA AAATCTGGTC GTGATGCTGT GGACATATAC GATTCCACCA ACAAGCTTGT TGCGTTCCAT TTACTCCTTT CACCGGGACA CCGCGCCGTG CGTACGGTAG GAATTACCAC TCCGCCAGTC CAATGCACTG ACGGAGGCAT CAAAGGTGGT CGATCGTCCG CGTTGGTCCT TACTTCGGGA GGCTCATTTG TGACTTTAAC CGAGAAACTG ACAGATGAAA AGATCTCATT 
GTTAGTGCAA AAAAATCTTT TTTCGGCAGC CATTGTGGTT GCGTACGCTG ATCCATCTTA TCAACCAGAA GAAATTTCTC AGCTTTATCG ACAGTATGCG GAACATTTGT ATCACAAAGG CGACTTTAGT GCTGCAATTG ATCAGTACAT ACATACAATC GGATCGCTAG AGTCGTCGCA TGTCATTTTT CGGTACCTCG ATGCACCAAA GATTCCTCTT CTTGTCAAGT ATTTGGAGAG TTTGCGATCA CGAGACCTTG CTACAGCGGT GCATAATGAG CTTTTACGAA CGTGCCATCT CAAGATGAAC GACCGCGAGG CAGCAGAAGC AATTATCACC ACATCGAGTT CGATCGACAA GGCGTCATTG TCTTCTATAC TAACAAATAT TTCCAGCAGT CCTAAGGAGG CTCTGGCAAC CATATGTTCT CTGGATGCAA GGCAGACAGC CGAAATCGTT ATTCAACACG GAGCATCATT GACCCGAGTA CTTCCACGAG AGATGGCTGG AATTGTTATT TCACTATGCG TTGGAACCTA TTCACCGAAG GGTCTTGCAG ATGCGGCATC CGCTGCATCG ACTGACTTAA ATCGAATGAT TGCGTATGCC ACGGACGACA AAAAAAAAGC CTGTGAACCC TTCCCTGCGA ACTTATTTGC GTCAGCCTTC GTTGAGCACC CAAAAATGTT GCGACTCGTC CTTGCTCATT GCAATCGCAA CAAATGCGTT CTGACACCAT CTTTACGTCG TACCCTGCTT GAATTGACGC TGGCGGAATG GAACCAAGCG AAGCGCTCGG GTGATACCGA GGCTGAAAAG CTGCGCCACA AGGAAGCAAT AGCGGTACGT TGTCTATCTC TCCCTTCGTT TTAGGTGCAC ATGGTCTCAC ACTGATCGTA TCGATTGCAC AGGCACTGAC AGACTCTCAT TCTCGTGAGA TCGGAGACTA TGATGCACTC GTTATTGTCC AATCAGCTAG CTTTGACGAA GGTGAACTGC TCTTATATGA ACGATTACAG ATGGCACCAT TGCTTCTAGA TCGATACGCT AAAGATGGAG GTGAAAAGTC GCGGCGCCAA ATGCTCGCCA TGTGCCAAAG CGATCCAGAG ATTTTGGCAG ACGTGTTGGG TCGCTTCGTC GATATGGCAG GGAGAAGGCT GTCGCAGTCC AGCGTAAAGA TTAGCAGTGA CTACGATTCG GATTTTGATG AATCGGAAGA AATCTTGAAT GACATTCAAG AGGCCCTTGC ATTAGCTCGT CGGCAACGAG TGATTCCGCC AGTTCGAATT ACCCGAATCC TAGCTGGTGA GGGAATAGGA CAATTTACAG ATAGGAATGA TAGCAGCTCG GAGAGCAATC TGAATAAGAG AACAGTTCCC CTCTCGGTCG CTTTAGAATA CGTTGGGACG ATCCTTGAAG AATCAAGAAG AGAAATCTCT CGCTTGAAGG CCGAGGTCGA GGAGTACAAC CAGATGTGCA ATTCCATGGC AACTGAAATC GAATCTTTGC TGCGAGCGTC TCATTCTCTT CCTTCGTTGG CTCCCACCAG CAGTGCGGCT CCAAGGCGTC TGAATATCGA CGGCCTGTAC GCGAAGATTC GGTCTGGGGA GAATGAGAAC TATTCTCCAG GTCAACCCCG CGAAGCCTTT TGGAGGGATA TGGAGCAGAG TGAAGACAAA TTTGATACTA TTGCGCGTTT TTTTGCCAAA GGAGTCATCA GCTAGTGGGT TAAGGTTCCA GCATGTCGTT TCCTTACAAA CATACGCTAT ATTAAGAACG GGTGGTGCAA CGCAGTAATC TTCTGGGTCG 
TGTTTTAAGG TAGCGATTTT AATTGCATCG ATCCAGTATA TAGCCAAAAA AAACGAAGCC ATT
|
Protein sequence | MASSRWKRFA FFERSTLDVP PEVIDDLIPV DGVSRDNRRS VKSLKLAGKE VSNDSVILIV TTAALPLYSK PTEVRTTHIS AGQRKKQETK DDSISAMWSS LTACTVPAFA EEGNNSSGAA GEALAHGISM PSQAQLPRQG MNGRISKALK GASMDGLVLS FVTSRSTEFV HCVDVTVRCN PPRGKESLED LDGWRGYFSP FLRTPGDRSG TTSETPMVEN TNHSDVSVLG IAACRMSTGG QIVDEHDVVH LVCISAQQIC VWEDPHIHLS CRRPLTPPPV PSEARTVYLQ SSWRPTDGNC RVVDIIPGIV AVGTDTGAVL ILTYSPDLSV STSRPQGLRT YLRIPPPPTQ NLEVVSVKIS LVNDKASVFV AYNFTGSVSV SQMSTAGVCC YDFPVPTPSS PSLSAPSARH DLDGRYVGSS TLVDALTTQR GLELSVTERI GVAPVDGTKL ALCLIPPTIP MERTREVDLE GIPHGYALVA STDAKSGRDA VDIYDSTNKL VAFHLLLSPG HRAVRTVGIT TPPVQCTDGG IKGGRSSALV LTSGGSFVTL TEKLTDEKIS LLVQKNLFSA AIVVAYADPS YQPEEISQLY RQYAEHLYHK GDFSAAIDQY IHTIGSLESS HVIFRYLDAP KIPLLVKYLE SLRSRDLATA VHNELLRTCH LKMNDREAAE AIITTSSSID KASLSSILTN ISSSPKEALA TICSLDARQT AEIVIQHGAS LTRVLPREMA GIVISLCVGT YSPKGLADAA SAASTDLNRM IAYATDDKKK ACEPFPANLF ASAFVEHPKM LRLVLAHCNR NKCVLTPSLR RTLLELTLAE WNQAKRSGDT EAEKLRHKEA IAALTDSHSR EIGDYDALVI VQSASFDEGE LLLYERLQMA PLLLDRYAKD GGEKSRRQML AMCQSDPEIL ADVLGRFVDM AGRRLSQSSV KISSDYDSDF DESEEILNDI QEALALARRQ RVIPPVRITR ILAGEGIGQF TDRNDSSSES NLNKRTVPLS VALEYVGTIL EESRREISRL KAEVEEYNQM CNSMATEIES LLRASHSLPS LAPTSSAAPR RLNIDGLYAK IRSGENENYS PGQPREAFWR DMEQSEDKFD TIARFFAKGV IS
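Note on GC content: the sequences above are printed in space-separated groups of ten residues, so the grouping has to be stripped before any per-base statistic is computed. The record does not say how its GC content figure was derived. The sketch below assumes the usual definition (G plus C bases divided by total bases), and the helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for grouping."""
    # split() with no arguments drops spaces and line breaks between groups.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Example input: the first two 10-base groups of the gene sequence above.
fraction = gc_content("ATGGCCTCGT CTCGTTGGAA")
print(f"{fraction:.0%}")
```

Passing the full gene sequence, including its line breaks, works the same way because all whitespace is removed first.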
|
| |