Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37482 |
Symbol | |
ID | 7202382 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 243467 |
End bp | 247886 |
Gene Length | 4420 bp |
Protein Length | 1398 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181516 |
Protein GI | 219122364 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.594296 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGGAA GAAAATCACG CTCTTTGTCA TGGACTCTTC CTTTATGGTA CGCCTTCTTC GGAGCATGCC ATGAATTTGC CCATCTATTG ACTGCAGTTA TTTTGGGCTT AGGACGCGAC GTTTGGGTTG AGGGAATAAC CACTGTTCTG GGTCGAGCGC TAATCGAAAG ACACTGTGTC CTGCCGGGGC TGGAAGACTC CAATGCTAGT GATTTCAATG TGGGGGTGGT ACGCCATATC GGTTGGTTCT TCAGCGTGCT TTTGGCTTTC GGCGTTGCTC GGACGGCAAT TCCGACGAGC AAAGAAGTTC GATTCGCGTC ATTCGTAACG GCTTTAGAGG CTTTGTCAAC AGACCTCTTC AGCTTAAATG TTCTGCCGAT TTTGACAAAG ATAAATTCTT CAGGCTCGAT TTCCTCTGTC TTGTTTTGTG GTAACTTTGG AATAATCCTT CTCCATCAGG CTTGGTTGTC CGATAATGGC AAATCGGCGA TGGACATTTT GGAACGGATG ATAAAAGTGA CAATGATGAG AGGCGCTCAG TCAGGTGGCG TGGTCACCTT TCAACCGCTG AAAAATTCGC TTCGAGGTAT GCGTAGTCGT GTTGTGAACA AGAAACGGAC GGATTTGAGT GTTTTGGTGC GTAAGGAAGT GGAACGAGAT GCGATACGTT CCGTTTCGCG ACCCTTTCCA GAATCATTCG TGCAAAGCTT TTCGGGTCAC ACACGATTTG CCACCTCAAG TAAAGCGAGT TTGGATGGCA CACATCCTCA CCTGTGGACC CCAGCAAGCA ATCGTCGGCT TTATAATTTT TCAGTCCGAC GTTCTGGTCC ACACCAATAC ATGCCTGAGA CCACAAAAGT TGAAAACTAC ATAACTCACA ACGGTGATTT TGATTTTTAC ATTGTCAATG GACAAACTTT CGACTTGGAA GTTATTCAAA AATGGCTTCC GGTTGTCACC GGCTTTCCAA TGCCGTCTTC CGTCGACAGC TGCGCGGTTG CCGGAGTAAT TGATTTACTT CGAACTCAAG GCCGCTTCGG CCTCAGCGCC CGCTGCGCAA TCTGCTTTGG GCTATCAACA AGCAAGATGG AAGAGGAGCT ATTCTCTTTT CCGACCTATG AGCAGTTTGA ACAAATTGGA CTTGTGTTTG AAGAAGCCAT GGATGTAATG CTCCAGACGA CAACATTTTC CACTTTGTGT GACGATCCGA ATATGCGTCA TTCATTCAGT CTTTGTGTGC TCAGTAAGCT AAAACCGCAC AGGGTATCAC TCTTGCAGCC GATCCAACGC TACATATGCG ATGAAGAAGA GGGTGGCGCA AATCTGCTTT CTTTTTGCTT GTGTACAATC AATGCTTTCT TTGACAATGA TCTCCTTATG GCAACCACGA CGTTTTTGAA GAACGCGAAA GGATCGTTTG GTTTGTGTGT CACTTCATCA ATAGACGCTC ATCGGCAAAT TTGTTTAGCT GCTCGCGGGC AAAAGGTGAG AAATGTCCTG TTGTTTTAAG TCGTCTTTAC CACCAATAGC GCTCATCACA GTGTTTTTCA TTGACCATGT ACAGATGTCC ATTGCATTTT ACCCGTCTAA AGGCCTCGTT TGCTACGGAT CGGAGCAGGC GTCAGTCAAA GCAGGTATGA ATACAGCATT CCCAGGTGCT GTAGACGAAT TAGGAACATC TCGTGGTGAC ATTGACGCCG ACGTTCTGCG GCTAGATTTG GACGATCTGG GCGGCGAAAT CATGTGCCTC GATTGGGGTG GCAAAGCTTT TCGGAGTCCT GCAGTTTCTC TACCACATAG GCACTTGATT GAACACGTGC TCATGAAAGG GTCTGTCAAG GTAGTATTGG TACAGGAGTC TAAGACAACC ATGCAACGCG CTCAACTGTT CCATCGTATG ACACGCCTAT CACGAAATCC ACTCATCAAG ATGTTAGCTG AAGAATCCAA AGACCTCGTC CTTTCCGACA TAAACGATAC TCCACGTGCA TGCCAAGCGA TTCAAGACGA CTGGGACTCA AATCAAACTG GGAAAAGCAT GAATCGCCTA ACAGCTTACA ATTTGTCGTG TTGTTTGCGT AAACGACTTG AAATGCATTT AAATGGACCG GTCCACAATC GAGCAGTCGA CATTTTGTTA ACTGGATGCG AAGTTTCTCT TTGGTTAGCT GAGCAGTTTG CGACTGATCT TCAAAAGGCC TTTCCCAGGC TCCGAACTAA GGCCATCAGC AGCAATAAGC TCCTGGGACT GTATGGGCAG GAACTTGCGG TGCCGTCAAT TGGGTTCCCG TATTCTCCCC GTACATACGA CCTAAACGAT GCCATTGTTG TGATTGTGAG CCACAGCGGT GGAACATTTG CTCCGTTAGC TTGCTCGAAT CTGCTTCAAA GCACAACCAA AAATATTTTT GTCGTTACGT CCGAATGGGA TACGCAGATT GGCAAGCAGC TCCGTACTAT GGACGGCCTT CTGGACGCTA GCTGGGACCA TCTTTTCAGT AGTCGAATAT TTTCGACGGA GGTGGGAATG CGCCCAGCAG AGCCCTGCTC TGTCACAGTG GCGGCTACCC AACAGCTCCT CACAAATTTG TTTCAGTATA TCTCTGCAGT TATCCTCAGT GATGAAAGAT TCCGTCAAGT AGTTGCTGCG ACGATTTCTG AGCAGGACCT GAGGATTTTA GAGAAGTGCA ATAGAGAGAA CATTGCTGCT CTTTCCGACA TTGTTGGAGT CAACGAGTGC GGTTTTGCAA TCAAAAAGCA AATTTCGGCA GAGATTGACC TTAGGAAAGC TGGCAATCTT TGGGCAGAGC ATGTCCTCGA AAACGCGAAA GCTTACATTA TGACTTTTGT ATACATTTTT GTGACCGTTG TTTCAGGCTA CCATCTCATC TACGCAATTT CTTATGCATG TGGTCTAGAC GATACAAGCA ATTTTGTCCA CCTCATTCGT GTTCTTGATG CTGCAATCTA TTTCTGGTTA CCACAGATCA ATGTCGCTCT CTTGCGATTG TTTCAGAGTC GTGAACTGCT TCACCGGATG GGTTGTCGCA CTGTCGCCAT TGCTGATATA CCTTGGGTTG CGCAATCGGC CGAAGCATTC CTGAGTAAAA TTTTTGCGTG CTCCTACAGC ATTGCAGGAA TCAACGTGCT CAGCGGCAAT CCCAATGATC ACTTTGTCCA TCGTCATACG CACAGAGTTG TCCGCGGGGC ACTGGTCCTC TGTGGAAGGC CCGATGGTCG GCTTTCTGCA TTGGCAACGG CGGAAGCCAC TGTGTGCTTA TCTGTCAATC AAGCAAGTTC TATTCAAAGT TTAGGCGGTA CATGTGAAAG TATTACAGTG GGTCACAATC CCTTTAAGCT GCCCTTGACA AAATTTGGAA TCTTTCTCAA AAGCAATCGC CCGCAGTTTC TCTGCGAACG GATGCTGGTT GAGAAAGATG CTGAAGGAAA AAGGAATGAA GATGATTTGG CGCCTAGCCC GGTGGATCAA AGCTTACAAT GTGAGGGCGT AAAGAACTTT ACACAGGATG GGCAAGAACG CCATTCAGAG CTGAATATGG CACTCAGTTC CAGTGTACAC TTACTCCCGC AGAAAACGCG GTCCGCATCA GCTTTGCTGG GAGCATACAC GAACATTGCG GAACAAGCTG ATAGAAATCG TGGATGCAAT GGATCATCTG AACATAGCTC CATAGACGAA GTACTTTATT CTGCCATTCA AGAGCGATCT TGGTCAAATG AGGGCAAGAA GCTTTTTGAA GCCCTAGATG TGAACAGCGA TGGCCTGCTC AGCGAGGATG AAGTCATTGA TGGACTTTAC AGGCTTGAAA CTCTTTTTCT AGAAGACCAC ACTCGGGCAA TGTTTAAAGT TGCAGACGGC AATTCTAGTG GGCATGTGGA TTTTGACGAG TTTCTAAAGC TGCTTGACCT AGCCGATATA GATGGTGATA TAAAGGTTCC CTCCGCGAGT CGAGACGAGC GCGGGAACAT TCGGATAGAG CCCAGTCACG AGGAGTACTT TGGAGAAACA TTGCGAAAAT TCAATGCAGG CAAGACGCAA AACAATGTCG AATTTCGTTT GGCTCATAGT CAGCATTTCA GCCAAGAGTT GTACGAGAGC CGCATTGCAT CGCTGCAAAG GTTTGTCTCA ATGACTGTGA TTTTCCATCA GATGGGCAGG CGCGTCGAAC AATTCTTTGA AACTATTTCA TTTGGGTTGC TAGGATACCG TATGGACAGG ACGCACAGTA TAATGAGAAT TGCAACCACT GCCTCCCCAA TCAGTGGAGC CGACGTACGC CACCGCATGA ATCAATTACA GCTCCAAAGT AAGATTCATT ATTCAATTCA TGTAATTTCA ATGGCTTACT TGCGATATCG TGCTCGCAAA CAATCATGCT ATCCTGAACC CGACAAGTGA
|
Protein sequence | MEGRKSRSLS WTLPLWYAFF GACHEFAHLL TAVILGLGRD VWVEGITTVL GRALIERHCV LPGLEDSNAS DFNVGVVRHI GWFFSVLLAF GVARTAIPTS KEVRFASFVT ALEALSTDLF SLNVLPILTK INSSGSISSV LFCGNFGIIL LHQAWLSDNG KSAMDILERM IKVTMMRGAQ SGGVVTFQPL KNSLRGMRSR VVNKKRTDLS VLVRKEVERD AIRSVSRPFP ESFVQSFSGH TRFATSSKAS LDGTHPHLWT PASNRRLYNF SVRRSGPHQY MPETTKVENY ITHNGDFDFY IVNGQTFDLE VIQKWLPVVT GFPMPSSVDS CAVAGVIDLL RTQGRFGLSA RCAICFGLST SKMEEELFSF PTYEQFEQIG LVFEEAMDVM LQTTTFSTLC DDPNMRHSFS LCVLSKLKPH RVSLLQPIQR YICDEEEGGA NLLSFCLCTI NAFFDNDLLM ATTTFLKNAK GSFGLCVTSS IDAHRQICLA ARGQKMSIAF YPSKGLVCYG SEQASVKAGM NTAFPGAVDE LGTSRGDIDA DVLRLDLDDL GGEIMCLDWG GKAFRSPAVS LPHRHLIEHV LMKGSVKVVL VQESKTTMQR AQLFHRMTRL SRNPLIKMLA EESKDLVLSD INDTPRACQA IQDDWDSNQT GKSMNRLTAY NLSCCLRKRL EMHLNGPVHN RAVDILLTGC EVSLWLAEQF ATDLQKAFPR LRTKAISSNK LLGLYGQELA VPSIGFPYSP RTYDLNDAIV VIVSHSGGTF APLACSNLLQ STTKNIFVVT SEWDTQIGKQ LRTMDGLLDA SWDHLFSSRI FSTEVGMRPA EPCSVTVAAT QQLLTNLFQY ISAVILSDER FRQVVAATIS EQDLRILEKC NRENIAALSD IVGVNECGFA IKKQISAEID LRKAGNLWAE HVLENAKAYI MTFVYIFVTV VSGYHLIYAI SYACGLDDTS NFVHLIRVLD AAIYFWLPQI NVALLRLFQS RELLHRMGCR TVAIADIPWV AQSAEAFLSK IFACSYSIAG INVLSGNPND HFVHRHTHRV VRGALVLCGR PDGRLSALAT AEATVCLSVN QASSIQSLGG TCESITVGHN PFKLPLTKFG IFLKSNRPQF LCERMLVEKD AEGKRNEDDL APSPVDQSLQ CEGVKNFTQD GQERHSELNM ALSSSVHLLP QKTRSASALL GAYTNIAEQA DRNRGCNGSS EHSSIDEVLY SAIQERSWSN EGKKLFEALD VNSDGLLSED EVIDGLYRLE TLFLEDHTRA MFKVADGNSS GHVDFDEFLK LLDLADIDGD IKVPSASRDE RGNIRIEPSH EEYFGETLRK FNAGKTQNNV EFRLAHSQHF SQELYESRIA SLQSGADVRH RMNQLQLQSK IHYSIHVISM AYLRYRARKQ SCYPEPDK
|
| |