Gene Information |
Locus tag | PHATR_43826 |
Symbol | |
ID | 7203959 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 149284 |
End bp | 152986 |
Gene Length | 3703 bp |
Protein Length | 1003 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186287 |
Protein GI | 219113407 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.911661 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACAA TGCGCCCCGA AGCTCGAGCA TCAGTTTCGG ATGGATTAGA ACGAGTAGAC GAGGACGATA TCTCCTTGTC CAGTACTTCC GAAAGTGGGT CGTCGGGTGC CCAAGTCATT CTCACCTCCG ACGAACGAGA TAAAAAGATC CGAGATCAGA TTATCAAAAA AGAAGAGGCA GATGTTAGGA AAGCGAAACT AATCGTCGGC TCTGCTCTGA TTCTGTCCAC TATCTTAGTT AGTGTCTGCA TCTACATTTT CGCATCTAAA GCAGAGGTCC TCAATTTCGA GCTCGAGGTA AGCACCATTG TTGAATCATA GAGGTAGCGG TTGGATAATG CTATCTCACA ACTCACTACA CCACTCTCAC AGCACGAAGG ATATGCTAAG AACATTGTGA ACTTGGTAAA ATGGGAAAAC CAGTATAATT TTGCTCTTAT GCAACAATTA AGCGCATCAG CTACGGCATC TGCCGCCATG ACTGGCTCGG TGTTCCCCAA CGTGACCCAG AAGTACTTCG AGATCACAGG TGGCTATGTG GATGGCTTGG GCGGTATAAT GGCAACTGCC TACGCTCCAA TTATTGCAGC CGAGGAAGTG ACTCAGTGGG AAACATATTC ACAAGAGAAC CAAGGTTGGA TTGGAGACAG TACTGTACTT CGGCAAGTCC ATCCGGGACA TAGACAACCC ATGGAAGGTA CCATTCAAGA CCACGAATTC GATCGACGAC TTGATTCCGG GTCCATCAAG CCATATATTT GGCGTTGGGA AGATGGCGAG CAAGTCCAAG AAACCACCTT TTCGGGCAAT GTACTGGCTC CTTTTTGGCA AAGTTCTCCG GCCGATGCCG CTTCTGTAAA CCAAAACTCC TCGCAAACAA GGACATAGCT CACCTATTTT CTCTGGTTGT AGAAATGAAC CATACAGTCA TCTCCCACGC CGTCCAAATT GACAGGCTCT TTGATTTCGT CTTTGACCTC CATGAAAAAG AAAGAAAGAA AGAAGAACCC GCATTCATTT ATCATGGAAC CTGTCTACGC AGAGTTCTCC GAGAATCCCG TTCTTGTCGG TATTTTGATT GCCATTTCTG CATGGGAAAA TTTATTTGAT CGAGTGTTGC CAGAGGGAAC AAACGGGCTA GTCTGTGTTG TAAAGGATAC CTGCGGCAAC GTTTTTACGT ACGAAATCAA CGGCGAAATT GCAACTCATC TTGGATATCT TGACCTCCAT GACGAGAGAT TTGATCAATA CCAAAGAACA ACACCTATCG AGTTGTATGA TTCCGAGGCA GCCAGCCTTT GCAAACATGA CCTTTACATT TATCCATCGT CGACTTTCCG AAGTGCATAC AATACCAACA GACCAGCCAT CTACAAAAGT GTAGTCGCGT TAGCGTTTGC TTTCACTGCA CTACTGCTTC TCATGTACGA CAAGCTAGTA AGTCGACGTC AGGAGAAGAC AATGACGTCC GCCATTCGCA CCAACGCCTT GGTATCATCA CTTTCCCCGA AAATATCCGC GATCGACTTA TCGGTTACAA CGGACTTGAC CACGGTTCAA AAATGATTTC CGGGAATGAA AAAACAATGG AAAACTACGG GAATAAAGCT ATTGTAAATG CACCATTCCA TTCAAGACCC ATTGCAGATT TTTTCCCGCA AACAACGGTA CGTATATTGA AAGACAAAGA GCGACATTTG GCTTCCTCCT TCTGACTTTT ATTTTTAAAG ATCATGTTTG CCGATATCAC GGGCTTCACA TCCTGGGCCA GCGCAAGAGG TAAGTGAGGG AGGGTTAGAT 
CTGACGAATG ATGAAGAGAC CTCATCTTGG TAAATTATTC CACGCACAGA GCCATTCCAG GTATTTGAGC TCCTTGAAAC AATATACGGC GCCTTTGACG AGATTGCAAA GAAACGTAGG GTTTTCAAAG TTGAAACTGT TGGGGACTGC TATGTGGCTG TTGCCGGTAT CCCGATGCAA CGCAAAGATC ATGCTGTTAC AATGGCCCGA TACGCTCGTG ATTGCCACCA CAAAATGAAC GAGCTGACTC GTCGGCTGGA ACTCGTCTTC GGGCCTGATA CCGCTGATCT TGCCTTTAGA ATTGGTCTTC ATAGCGGGCC TGTGACTGGC GGGGTACTAC GTGGGGAAAA CGCCAGGTTT CAGCTCTTTG GAGACACCGT CAATACAGCT GCTCGGATGG AAAGCACAGG TGTTCGCAAC CGTGTGCATA TCTCTGAGAC CACTGCCGAT CTACTGGTTC AAAGCGGAAA AGAACATTGG CTCAAACAAC GAGATATGAA GATCATTGCA AAAGGGAAGG GAGAAATGTC TACGTTCTGG CTCCAACTTG GGACCGAGCA CAGCGACGGA ACGTCAGTCT CTGGTACCAA TCACGTTGCT GACAAGAATG AAACATTGGA GGAAGAAAAA CATAAGCTTC AATCACTAGC TTCCGATAAA ACAAGACGAT TGATTGATTG GAATGTCGAA GTGCTCTTGC GTCTCCTCGG TCAAATCGTT GCGTGCAGAA TTACACACCC GGTCAAGATT TCCGGAGTTT TCGTTCGGAA TAGCGCGTCT CCGAAGGGGC AAACAGTTCT TGAGGAAGTT AAAGAAATCA TAACTTTGCC TAATTTCAAC GCCAAAAGCG CAGAGCTCAG AAAAAAAGAT TCGGCAACAA CACAGCTCAA TGATGATGTG GTCCAGCAAC TACGTGAATA CGTAGCGAAT GTTGCAGCTC TGTATCGCTG TAATCCTTTC CATAATTTTG AACACGCTTC GCACGTGACC ATGTCAGTAG TCAAACTGCT CAGCCGAATA GTAGTCCCAG CGGATGTTGA CTATGAAAAT CTTGATACGG ATAAAATTGC GTCAACCCTG CACGATCACA CCTACGGAAT CACTTCAGAT CCTTTGACTC AATTTGCCTG CGTCTTTTCG GCTTTAATTC ACGATGTCGA CCCATAGTGG CGTTCCGAAC TCGCAACTAA TTAAGGAAGA CACGAAACTT GCTGCATTTT ACAAGGGCAA GAGTATCGCC GAACAGAATG CGGTTGATTT GGCATGGGAT CTGCTCAACG AAGACTCATA CAGCAGTCTG CGGGCGGCGA TATATCGCGA CGACATCGAG CGAAAACGAT TCCGACAGCT GGTGGTCAAT TTGGTCATGG CCACAGACAT AATGGATGCG GATCTCAAAA TCCTGCGCAA TGCTCGATGG AACAAGGCAT TTTCCGAAGC AAGTTTGCAA GAATCCATGG TCCAATCAAC AAATCGTAAG GCAACAATTG TGATTGAGCA CTTGATTCAG GCATCAGACG TTGCTCACAC GATGCAGCAC TGGCATATCT ATCGCAAATG GAATGAGCGA TTGTTCGAAG AAATGTACAA CGCGTTTATT GATGGTCGGG CAGAGAAGAA TCCGGCGGAG TTCTGGTACC AAGGGGAGCT GGGATTTTTC GACTTTTACA TTGTTCCGCT TGCAAAAAAA CTGGAGGAAT GTGGAGTCTT TGGGGTGTCG AGCGAAGAGT ACTTGAATTA TGCGCTACGC AACCGTCAAA AATGGTCAGA CAAAGGGCAA CAAATGTAGG GGATATGATG CAGAAATTGT 
CTCAAGGAGC GAGCCAAGTC AAAAGAAATG ATTGCAAGCA AGACATGCTT TTCCTTTAAC CGTGGCGATG TGTATGCACA TGGCCGATTT CGT
|
Protein sequence | MTTMRPEARA SVSDGLERVD EDDISLSSTS ESGSSGAQVI LTSDERDKKI RDQIIKKEEA DVRKAKLIVG SALILSTILV SVCIYIFASK AEVLNFELEH EGYAKNIVNL VKWENQYNFA LMQQLSASAT ASAAMTGSVF PNVTQKYFEI TGGYVDGLGG IMATAYAPII AAEEVTQWET YSQENQGWIG DSTVLRQVHP GHRQPMEGTI QDHEFDRRLD SGSIKPYIWR WEDGEQVQET TFSGNVLAPF WQSSPADAAS SSPTPSKLTG SLISSLTSMK KKERKKNPHS FIMEPVYAEF SENPVLVGIL IAISAWENLF DRVLPEGTNG LVCVVKDTCG NVFTYEINGE IATHLGYLDL HDERFDQYQR TTPIELYDSE AASLCKHDLY IYPSSTFRSA YNTNRPAIYK SVVALAFAFT ALLLLMYDKL IMFADITGFT SWASAREPFQ VFELLETIYG AFDEIAKKRR VFKVETVGDC YVAVAGIPMQ RKDHAVTMAR YARDCHHKMN ELTRRLELVF GPDTADLAFR IGLHSGPVTG GVLRGENARF QLFGDTVNTA ARMESTGVRN RVHISETTAD LLVQSGKEHW LKQRDMKIIA KGKGEMSTFW LQLGTEHSDG TSVSGTNHVA DKNETLEEEK HKLQSLASDK TRRLIDWNVE VLLRLLGQIV ACRITHPVKI SGVFVRNSAS PKGQTVLEEV KEIITLPNFN AKSAELRKKD SATTQLNDDV VQQLREYVAN VAALYRCNPF HNFEHASHVT MSVVKLLSRI VVPADVDYEN LDTDKIASTL HDHTYGITSD PLTQFACVFS ALIHDVDPYI AEQNAVDLAW DLLNEDSYSS LRAAIYRDDI ERKRFRQLVV NLVMATDIMD ADLKILRNAR WNKAFSEASL QESMVQSTNR KATIVIEHLI QASDVAHTMQ HWHIYRKWNE RLFEEMYNAF IDGRAEKNPA EFWYQGELGF FDFYIVPLAK KLEECGVFGV SSEEYLNYAL RNRQKWSDKG QQM