Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45249 |
Symbol | |
ID | 7200263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 603868 |
End bp | 608538 |
Gene Length | 4671 bp |
Protein Length | 1421 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179250 |
Protein GI | 219116911 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTCGT TCGGTAGCTT TGATCAGATT GAAGGATCGG ACGCGATTCC GCGCTACGAT CAAATGGGTC GATTGCTTTC GCCAGACGAA CGCGAACAAA TTGCTCGTAG GAAAGGTTTG TGCATGCGCT GTGGAATGAA AACGCATCAG GGTGCCTTCA AACGTCCCTT GACGGACGAT AACTGTTACA AAGGAACCTG CATTCGTTGC AACCCCAATG CCGTCCCTAA ACGTGTGTTG GAGTCATGGA ATCTCAAAAA TCGACCGGCC AATGTGCAGT TGGTGAGTGG AGCGGCACAG GCCAATGCTG TGGCTTCTGG CATTACTCTG CATGGGAAAC ATCTTTTGAA AAAAGCCACA CGCTCAGTTA TGGCTGCGGG GCGAGCTACC AATAAAAATG GACGAGTGAT TCCTACAGGA CCTTCGACAA CAATCGCATT CACGAGTGAA CATTCAACAT CTTCCAACTT GAAAACGGGT TCTGGTACGA GTTCAGAAGC GCAGTCTTCA CAGACTCCAT TACCCCAAAA ATCAGCGCCT CTGCTACATG CTCAAGATCT GCATCAACCC TGGATCGAAG GAAAAAGGCC ATCAGCCCAT ATACCCAGCC TGCACTCGGA ATCAAGCAGT ATGCGCCAAA CGTCGACATC GACTATTGGA AAACTCAGTC CGACTGTTTC CGACCGATCC TTGTACAGTG CAGAGTCATT TGCAAGTTCC GCCAGTCACG TGTCAACTAA ATGTGCTGCT TCTGGATGCG GAATTGGCAC TAACAATGGT AATTGTGGCC GCGAACCAAG TGTGGGTGCC GACCAAAATG CGAACGATGA TTTGATGGCA TCGGAGTACC GTCAAAAGCC TGCCAAATCG GCTGGTCTTT CCAATAGCGT ACGATTGAAT TCGTCGCGTA ATGATCTAGA TCTAAGCTCT CCTTCCAACA TCCGTTCTAA GTATCTCTCA AACAGATCAC ACAGCAAGAT GATAGAAGAC GACTGGACCG GGGGTTCTCA GGAAATGACT CATCATACAA TCATCGAAAC TTTGAGAGCG GCCCGAGCTG ATCCAGTAAA ACTCCGTCAA GCTCTACACG TGTTGCGAAA TGGCCTCGAA GTGGATTTGG ACGGAAGCTT AATTATTATT GCAAGAGATA TTTTGTCCCA GTATGTTTTT GATCCAAGCA TTGCCGATGC CGCGTGTGGG GCTGTCTGGA GAATTACCAC TCTGGAGGAT TCGCTTAAAT CCCTGGCGAT CGAATCAGGA ATCGTTGGAC TCCTAGTCGA TGCCCTGAAA GCGCATCATG AGGATGCAAC ACTTTGTGAA TGGGCACTCG GTGCGTTAAC AACGCTTGCG TGTAGTCCCT TCACGAAAAG CGACTTGGCG AAGACTGGAG TGATTGAAGC TGTTTTGGAA TTGCTCGACT TTCATCAAAA TTCGGCCGGA ATCTTGGAAT GGACATCCCG GTGCTTGCAC AACCTCGTAC ATCAATACGT GGTACTTGAT GTTGAAGCTG CCGAGGAGGA AATGCAAGCG CAAATCAAAA AGAATATTTC CAGCATCATT GAAGCCAACG GCGTTTCTAC GCTCCTTAAC GCAATGAAGC TTCATGCTAC CGAGCCAATT GCGCTGCTAT GGGCGACAAA GCTCATGTGG CGTCTTTTCG GTCGTAAGGA AGAAAGTTCC ACTGTGCGAG TCTTATTTCA ATTGCGTCAG GACGGATTCG TTCCTCTCTC CACAAAGCTG CTACGACAGC AGTCTACGAG CAGTGAACTT TTCCAACTTA TATCTCGATT AGTCTGTATG TTGTTGTTGA AGATGGATGA TGGAGCGTTG TATGAAACTG CCTCGGTTGC CATGCCGTCT ATTGTTCGTC AAATGGAAGA GTTCAAAGAC GACGAAACGT TGCAAGAGGC CGGATGTCGA CTGCTTTGCG CTCTCAGCAC CGGTGGCGAA TCAGTACAAG AAAATTTAAA GGAAGCGGAT GGAATCTTGG CAATTGTGAA GACTATGGAG CGTCTTCCTG AAAATTTGAT GCTATTGACA AGAGCTGGCT GGTGCTTATG GCGGCTATCC GCCAATCCCT CGCTGTTTGA TTCAGGCCTT GTGGAGAAAT CCTTACAAGC ATTGAGCAGT GCTATGGACA GCCACAGGGA CTCTGTCGAT CTATTGGTGT CGACGTGTGG ATTCTCGAGA AATAGTACCA TGGTAGATGG CGTGTCGCCT ACTGCTTATC CTTTAGATGT TATTTTTCGG TGGTTGACAA CGGAGGGGCA AGCAAGCCAT TATAAAGTAC AGGCATCACA AGCTTTGCAT ACTTTATCGA GAAAATACAA CGATTTCTTG CATCTGCTCA ACGAAAGCGT CGGTATCCCA AGGATGATTG CATCTCTTCG CGATCCACAA TCTTGTTGCC GTATTGATAT TTGCTGCATT CTGACAGTCC TGGCCAGAAA TTCAGAAGAC TCACGCCAAA TGCTCGCATC TGCTGATGTT GTCCAGACAG CTTTAGCAGA GCTCGCCGAA ACAAATGATC TCAACTGGAA GGCTTGTCTC CTGCGACTCG TTTCTACAAT TTTGGTGTCA GAATGTGCCC TACAGATTGA CGTTCCTATA CATGCTATTC AAACAGCAAT TGGTGGAGTG GAAGGGAGCA CATTTGATCC TTCCTTAAGC GAGTTAGCGT GCATATGTGT TCGCAACCTG CTTTTGACAC CTAATTCACG CGTTGGTGTC GAAGGGTTGG TGAAAGCTAT GACAGACACT ATTGATACAT GTGCCGTCTC CGACAGCCTC TGTATAGAGG CGTGCTATGT AATTTGGGCC CTGACCTCAA AGTACTCCGA TCGGAATCCA TCCGAATTGT CTGCTATGAT GACGTCTTTA ATTGGACTTA TGGGTAAGTT TATGGAACCG CTTAATCTTG AGATCCAGTC CGCTGCAGCT GGCACTCTAG CTAGTGTACT CGCATCTATA GTACGCTCTC CTATCCCTTT GAAGGTTCAA GATGTTGAAG TGGTCATATC GGTCATTTAT AAGATCATCG ATACTAAGCC CGGAGCATCC GAAGCAATTG AACATCTACT CGCCGTTTTG TGGAATTGCT GCCTTGTGGA CGAGAATACT CTCGTTCAAG GTGGTGTAGT TGTCGCAGTT ATCGATACTA TGGTCGACAA CGAAAGCAAT TTGCAAATTC AGGAGCGTGG TTGCACAATC CTCGCATTGC TTGCATCTGC GGAGAATTTG GAGGTAAACT TTAGCATCGC TGAAACCAAT GGTGTCGAGC TGCTGGTTAG TGCGCTTGCC GTATTCGGGG ACAACGTAAA TGTCACGTTG CAGGCTTGTA AATGCTTCTC ATATTTGAGT ATCGATCCAG AGCTACGTGT GATGATCGTG GCCCAAGGAG GTCTCCGTCT GGTTGTTGTT GCGATAACGT CGAATCCAGA TAATGCCGAA CTAGTGGGTT TCGCTTGCTC TACCCTCTTA AATTTGACCT TTGACGCTGA AGTATCTGCA TACATTGGAT CGGGGATCGT GGACGCAATT GTACAGACTA TGACTGGTCA CTTAAAGTCA GCGCTTTTAC AAGAAACAGG ATTGGGAATT TTACAAAATA TATCTATGCG CGGTCCGGAT GAGAAGGCAC GCATTGCTGA AGCTGGAGGA GTCGAAGCTG TCGTTTCAGT ACTTAGGGAA CATATTCGGT TACCGTCCGT TGTTGAACGT GGGCTTGCAA CGTTGTGGAG TTTGGCAGTT CTTGACGAAA ATCAGATACG AGTTGCGAAT GCTGACGGCA TCAATCTTGT GGTCAACTGC ATGATGGCCC TGATCGAATA TGAACGAGTG CAAAAGCAAG GCTGCGGGTG TCTATGCGCA CTGGCAGGTG ATTCAACCAG TAAAGTTTTG CTTCGCAATG CCGGTGGATT GGATGCAATA GTTTTTGCCA TGTGGGCTCA TTTCAACAAA AGTGGAGTTC AAAAAGAAGG ATGCAGAGCA ATATCAAATC TTGTACATGA TCCCGGAACG AATGAGATAA TGCTAGTATC AGAGACTGAG GTTGGGGCGA TACTATCTGC TATGAGAAGA TTTCCTTCAG TGGCCGATTT GCAGATGCAT GCTTGCTACT CTCTTCGAAA CCTTACACTA TCTGTGGACA ATGTAGCTGT CGTACTTGGA AGTGCGGACG ACATCCGCGA GCTCGTAGCC AAGGTTTCCT TACGAAACCC CGAATGCAGT CCAATCGTCA ACCAAATACT TTCTCATTTT GGATAAAAGG TTGTCGCTAG TAAAAAGAAA AGACTCAAGT GTAGACTAAA CGCGCCCGTT TGCTCTTACG TTGTCTTCAG TGGAAAAACG TCACCTAATT GGGTCACTTG CGGCGCGTAG GGATGTACCT TTCTTTCTCC GGTTGGAGAG CCTTCACGAC TGGTAGGTGG GGTGACAATG CTCTTCTTGC TATGTGCACC AGTGTTGACT GTTTTGTCCC AATGATCAAG AAGTTGTGGG CACTTTGGAG ACACTATCTC TTTTGAGCAC GACGTGTCGC GTTCCAATCC TTTCCTGGTT CGCTTACGGA GGAATCCTGT GTATTTTCTC TGGATCGAAA TGAACAGAAT TGATATAAAA CTAATCATGC ACAGTGTTCC GATCATAAAG TCAATGATTG TGCGGTTTTG G
|
Protein sequence | MDSFGSFDQI EGSDAIPRYD QMGRLLSPDE REQIARRKGL CMRCGMKTHQ GAFKRPLTDD NCYKGTCIRC NPNAVPKRVL ESWNLKNRPA NVQLVSGAAQ ANAVASGITL HGKHLLKKAT RSVMAAGRAT NKNGRVIPTG PSTTIAFTSE HSTSSNLKTG SGTSSEAQSS QTPLPQKSAP LLHAQDLHQP WIEGKRPSAH IPSLHSESSS MRQTSTSTIG KLSPTVSDRS LYSAESFASS ASHVSTKCAA SGCGIGTNNG NCGREPSVGA DQNANDDLMA SEYRQKPAKS AGLSNSVRLN SSRNDLDLSS PSNIRSKYLS NRSHSKMIED DWTGGSQEMT HHTIIETLRA ARADPVKLRQ ALHVLRNGLE VDLDGSLIII ARDILSQYVF DPSIADAACG AVWRITTLED SLKSLAIESG IVGLLVDALK AHHEDATLCE WALGALTTLA CSPFTKSDLA KTGVIEAVLE LLDFHQNSAG ILEWTSRCLH NLVHQYVVLD VEAAEEEMQA QIKKNISSII EANGVSTLLN AMKLHATEPI ALLWATKLMW RLFGRKEESS TVRVLFQLRQ DGFVPLSTKL LRQQSTSSEL FQLISRLVCM LLLKMDDGAL YETASVAMPS IVRQMEEFKD DETLQEAGCR LLCALSTGGE SVQENLKEAD GILAIVKTME RLPENLMLLT RAGWCLWRLS ANPSLFDSGL VEKSLQALSS AMDSHRDSVD LLVSTCGFSR NSTMVDGVSP TAYPLDVIFR WLTTEGQASH YKVQASQALH TLSRKYNDFL HLLNESVGIP RMIASLRDPQ SCCRIDICCI LTVLARNSED SRQMLASADV VQTALAELAE TNDLNWKACL LRLVSTILVS ECALQIDVPI HAIQTAIGGV EGSTFDPSLS ELACICVRNL LLTPNSRVGV EGLVKAMTDT IDTCAVSDSL CIEACYVIWA LTSKYSDRNP SELSAMMTSL IGLMGKFMEP LNLEIQSAAA GTLASVLASI VRSPIPLKVQ DVEVVISVIY KIIDTKPGAS EAIEHLLAVL WNCCLVDENT LVQGGVVVAV IDTMVDNESN LQIQERGCTI LALLASAENL EVNFSIAETN GVELLVSALA VFGDNVNVTL QACKCFSYLS IDPELRVMIV AQGGLRLVVV AITSNPDNAE LVGFACSTLL NLTFDAEVSA YIGSGIVDAI VQTMTGHLKS ALLQETGLGI LQNISMRGPD EKARIAEAGG VEAVVSVLRE HIRLPSVVER GLATLWSLAV LDENQIRVAN ADGINLVVNC MMALIEYERV QKQGCGCLCA LAGDSTSKVL LRNAGGLDAI VFAMWAHFNK SGVQKEGCRA ISNLVHDPGT NEIMLVSETE VGAILSAMRR FPSVADLQMH ACYSLRNLTL SVDNVAVVLG SADDIRELVA KVSLRNPECS PIVNQILSHF G
|
| |