Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49526 |
Symbol | |
ID | 7195857 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 512484 |
End bp | 519542 |
Gene Length | 7059 bp |
Protein Length | 1633 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184267 |
Protein GI | 219128116 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGCGTATTG CGAACGATAC CTTCCTTCCG TTACCCAAGT AAGCAATTCG TGTTCGTCCT GTGTGAATGT GCATGTGTGT TTGTATGTGT TTGTTAGTGT GTGTATGGGG GGAGGGAGAG GAAAGTAAGC ACAAGGGAGG CGCCTTCCAT CGCAAAGAAC GGGTCAAAGG GAGCGCCAGA GTTCGAATGT GTGAGAAAAA GAGAGAGATG CCGAGCATTC GTACAAACTA ATGTTCGGTT GCTTGGTGGA AGCACCAAGG GTTGCACGCA GTGCTGCCGT GCTTTGCTGT AGTTCTATTT GGTACACGGC GCACGTTTGC CACCCGACAC TATATCTCAC GCAAGAGCAC CTCTTTGCTT GCTTGTTCGT TTCTGTTAGG GATGACCACG GCGCCGCTCA CTGCAGCTCA ATCTTCTCCC TTGACTATAG CCAATACGAC GACAACGACA ATCAATAGCA AGCCTCTCGT GGATTCTCTT ACCAATACTG TTCCAGAGAC AGTCCAATCC TCTGGACGAG ACGGTAATTC CTCAAACAGT GAGACCAACA AAAGCAACAG CAACAGGACT CCCACTAAAT TTGCTGCGTT GTCCGAAGTG CCGCATCGAC GCTCGGCATT GGTGTCTACG GCTCACTCCG TCACGTCGAA ATCCAAACAG AATTCTACTC CACCGGTAGA CATGGCAACG GCGTCCGGAT CAGCCAGTCA AGGCTCGCAC GGCGAAAACA CGGGACGCTG GACCGCGGAA GAACACCGCT TGTTCTTACA GGGGTTGGAA CAGCATGGCA AGGGATGGAA GAAAATCGCG TCGCTCATCA AGTCGCGAAC CGTCGTACAG ATTCGGACGC ACGCCCAGAA GTACTTTCAG AAATTGGCCA AGGCTCGCCA AAATGGGGAA GAAGGCGATG TCGCCATGGA AGGTCGCGGT GGCGTGGCTT CCATTACCTC CGTCTCGACA ACTGCTGTTT TACCCAAGCG ACGTCGCCAG ACAACCGGAA CAAAACGCAA GGCCATTCAA TCCGTCGTGG CTTCCGCCCA GCGGCAAGGC AAGAAACTTG CCGCCGCAAA GACGAATCCT ACTCGACACC ATCCCTTGCC GCCGCCCCTA CCAACGGTCG CCCCCGCACT CGCGCATTAC ACTCTCCCCA GTACTGCGAT GATGGCCAAA AACGGCACCG CAGTGAAGGA AGAATTCGTC TCGCCCACCA ATCTTTCAGG ACCGGCCCTA GAAGATTCAT TGTAAGTCGT CCCCACCGTC GCATCGTGGT AGCCTTTCGT CTTCGTGGGT TGGAAAGCTG ACCCATTCCA TTCTCTTCTT TCGCAGATTC CGCTTCTTAA CCCCGGTTCC GGTATCGGAA CCACCGCTCA ACGAAGTAGC TCGTCAAGCC GGTGCCAACC CGATTTCTCT CCCCACCGAC AACCCAAGCT CTATTCCAAC GGTGGGTGCA GGAGAAATCT CGCCCACGGG AGTTTCAGAT TTGATGCTTT ACCCCTCGTG GACAGACTCA AAAGAGCCAC CTTCTTGGTA CAGCAAGGGC GCCGACATTG ACGCATTGCT CGATATGGGG GATTCGTTGG ACTGGTTGGA CGACACGGGG GATTTGAACG AGTCATATGT ACCACCCGTC GTGGACACAG CAATGGCCGC TCCAGAACCG CACACGACCT TTCACAGGTA CTCCGATCTG GGACATTCAA AGGGACTTCA CAGTACCAGT GTGACGTCTC TGCCACATGT CGATTCCAAC GCAAATGTGG AATCCGTTGT GCCGCCACTT CCCTCCATAT TCGATGGAGC CCCCGACTCG GGAGAGCATC TTGAGACCAC GGAAGGGATG GTACCTTCCA ACAGTACTTC TCACTTGGCG GATGAAATCG ACGACAGTGA AGGCATACAC GAACACCTAC AAGTATTTGA CAGTCCTTTG GAGGAGAACG ACTTCGTATN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNTACTGG GGCGACTCTT CAGTTCTTTA CCAAACGATA GAAAGCCCAA GCATCGGATC TCTCGCACAA ACACACGCAC TCGTTTACGG CCTGTCCGCT TCGACTCACC ATGGGCACGA TAGTATCTAC AGTGGGTCGG AGCGCAGAGA CAGATTTTTT CGTCACCGAG TCTCTACAGT TCCTACTGTC TTGGAACGAG GGACCAAGAC GGCAAACGCG GCGGATCCAT GATCCAACAC TCCTTGACGG AATTGACAGT AACGAGAATA TTCTCGACAC GACAGGTACA ACTTGCTCGG TTCTGAATAA TATTGTTCGT GATACAGTTA CTGTTAGTTC GGGTAGTTCG GGTAGCATTG GTACAGTGCA ATAGATACAT ACAATTGCGT TCGAATAACC CTATCCTCTA CGTAGGTCTC TCACGATTGT GGTTCGTGTC TGGAATGTAG ACTTGACTGT TCGTAGTATT GGTAGAGGGA ACTTGGTACC AATCAATACT CTAGTAGTAG TGGTAGTGGT AGTAGGGATG GGCGCGAACC GGACCGACGG GCGTGAGTTC GAAAAATCCC TCTCTCTGTT GGGTGACTCT TCCCATTCCT TCCAACGTTA ACCACAACAA CAACAACAAC GCTGTATCTG TATTGCTGGT TGGATACGCA ACTACTACTA CTACCACACG TAGTCGCCCC CATCACAATG CCAACGACCA ATTCGCCAGT CGAAAGCCGA ACGTTGTCGT GGACGGATCT CGGGTTGGAC ACGGACGAAG ATCACCCGCG TCGTTTGTTG CGTGTGCGGG ACGACGTGAT TGCCTACGGC GGCGACGAAG GCACACTGGT ACGCTTACCT TTCGTCACAA CGAGCAACGC CCAAAATGCC GACACGGGGT CCACGCGACC CTTGGCCGTG CGTCGCTTTG ATGAAGACGC CATACGTGCC GTCGCAGTCT CGGACGACGG AACCCGCGTT GCCGTCGGAA CGGATAGCGG TGCCACTCTT TTCTACCGTT ACGAGTTGGA TGGACACGTA GTAGACGCAC CTGGAAAGGG GCTCGTCTCC CGACACGGAT TTGTCACGCA CGACAGTGAC GACAACAACA ACAACAACAA CAACTCCCAC CAGAAACCCT CGGCAGACTT GTTTGGATCG CAGCCCGACG CCCTCGCCTT TGTCCCACAG CAACGTCCCG GGGAAGTCGT CCGTCACGGA CCCGTCTTTG ACGCTCCTGT ACGGCAACTC CTCTTTCTCC CCGACTCGCA TTTCCTCGCC ATTGCCACGG AAGCCGGATT GGCCGTTGTT TCCACCGATA CCGACAGCGG CATTGGTGGT GGCAGTCTGG ACACTAACCA CAACGAGAAC GTCAACCACC ACAACGTCAA ATACCTCCAC CGGGAAGCCC AAACGGCACA CGACGAATCC GGCATACGCG GACTCGCCCT CTGGCAAGCA AAGGACTGTC GTATACTCTC CTCACTCGCC ATGGACGGGC GTCTCTGTCA CTGGGATGTC TCTGCTCCCA CTCCCACACT CTGGAAACTA CTGCACCGCG AGACAGTACC GACCGTTACC AAGCCCGACC TGGGCGAAAT GCTCGGTGCC GATGCCTGGG ATCGATCCAC CATCCCCGTC GCCCATTCCC ACGAAAGCAT ACTCTTTTTG CCCGGAGAAA CCTACGTACA GGCGCGTCGC TACCGCAACC ACACCTGGGA ACTCCTACAG TCCCCTACCG GGGCCACCAA TACTACCGAC AAAGTACAGG GACACATTGA AGCCATTGTC GCCATGGCCC CGGCACCCAA CCCTCGAGAT CCGTACCTCG TCACCAGTGG ACGCGACGGA CGAGTCGTCC TCTGGAAACT ACAGTACTCT CATCACGACA ACAACAACAA CGACAACAAT CCAAACGACA ATGGTGACGG GCACATTGTC TTTCAAAAAC AAATCCTCCA GACGGATTCC GCCCCAACTC ATTTGTTGTG GACACTGGAC CAACCGACGC AAACGGAACG TCTCGACATG GTGACCGCCG ACGGACACTG GACTACTCTG GTAGGACGCG ACCAGATTGC TCCGGCCTGT CCAACCACTG CAGTGACCCA AGAGATCTCC CTCCCACACC GCCAATCAGC CGATTCCGTG CGGGAAAAAG AGAAGGAACA TGACGCAGAC TCGGACGACA GCGTTGATGA CTTTTCTTCG AACAAACCTT CCACACACCA AAAGAATCCG TTTGTGGACG ACGAGGCGGA GGACGACAAC GATGACGATA CGCTCGATAC GGCCTCGCGT GGAAAACTGG AGACGACCTC ACCAACGGAC AAGCGCGCCT CCAATCTTAA CAGCAGCGCT CTCGAAGAAC ACCACAATGA TCTAGACGAC GACTCCATCG GTGACGATGA CGACTCCTTC CACAACATTC CGACTCTCAC CACGCGACAT TCCGATTCGA TCCAGTGGCC TGAACCACAA CCCGCCTTTG GTCCTTCTTC CACATCGCTT GAATTGACTC GCCGCTTTTT GTGCTGGAAT CACATTGGGT CCGTTACGTT TCTTCGAGGA CAGGCCGGCA TCAACCGCAG CACGATCGAC ATTCACTTTA CGGACTCGGC ATTTCGTCGG CCCGTTTCCT TCACCGATAA TATGGGCTTC ATTCTGGGGT CCCTGGGGGA AGACGGCGGA ATATTCGCCA CCGACTTGGC GGAAGACGAG GATATTGATG AGGAGGACGA CGATATGGAC GGCTTGAACG TGTCGGCTGC TACCAAGGCC GCCGTCAAAC GTTCGCGCAA GGGTCCTTCG AACAAACCGA CCGGGTCGAG CATTTACTTT CATCGCTTCG AAACGTTCGG ATCCTTACGC GACAAGGATT GGTACTTGAC GCTCCCAGAT GGGGAGCGGG CTTTGGGGTG TGCGTCCGGT GAAGGATGGG CCGCCGTCGT AACGAGTCGC CGTTTCTTGC GGCTCTTTTC TTCGGGCGGC AATCAAGGAG AGGTGCTTTG GCTGAACGGC CACCCCGTCA CCATGGCTGG ACGGGGACGG TTCGTCGCGG TGGTATATCA CGAAAGTACA CCGTTACCAG ATGGAACACA AAAACTCGGA TACTTGGTGT TGGATGCGAT GGCGAATCGC GTAGTTGCCA AGGGGCCAGT GTCATGTATT AGCGGTGCAT CGACTCTTTC ATGGTTGGGG TTCAGCAATG ATGGATCTCT GCTGGCCATG GATTCGGATG GTATGCTGTC AATGTTGGTT TGCGCATCAT CCTTGGATGC GGAGGGACCG ACGGAAAAAC ACTGGGAATG GATGCCAATG CTGGACACGG TGGGGTTACG TAAATCCCGG GACGATTCCT TCTGGCCGGT CACAGTTTAT GACGGAAAGT TGGTGTGCGT CCCGCTCAAG GGTGGGATGA AGCATCCTGA TGCGGTGCGC CGTCCCGTCA CGGCCGCTCT CGGCTTTCGT CTTCCCCTGG CCCGGGGTCC TTTGACCAAG ACGCACACGT TGGAAGAGCT TGCGGTGCGC GCCGCGATTG CGCTAGGGCA GAAAAAGGCA ATTCACGAGA TTAGCCGGGA AGGCGACGAG GACGACGAGG ACTTTGAAAA AGAATACCGT TCCCTTTCGG CCCAAGTGGT ACGTTATCAT CGTTTTTGTG GGTGTTTGTG TGAATGACGT GCGAGGGTGT GCAGGGGAGA GGCCATCCAC GCTACACTGA TTGTCCGTCT CTAATAAGCT TTGTACGGTG TGCTTTGGCG ATTTCAACAG GACAAGGTCA CGCTAAAAAT GTTTGCAGCA ATCGCGGAAG CCGGTAAATT GGAGCGCGCT TTGGATTTGG TGGAGCGTTT GCATTTGGAA AAGAGCTACG ACTTGGCCAT GACGATTGGC GACCGGCACC GCAAACTTGT CGATTTGATC GAAGAGGCCA AGGATCGCAA GTTTGGAGAT CCGGGATCGC ACCAAGCCGA GTTTACGACC AAAGCGGAAT CCCCGAACTA TCAACGCCCC CGCATCTCTC CAGATTCGGC TGGGGCAAAA CGCAGTCTTG ACGATGAGGA CGAGGACGTC CGAAGCCGTC TCGTGCGTCG CAAACCAACG TTTGCCTAA
|
Protein sequence | MTTAPLTAAQ SSPLTIANTT TTTINSKPLV DSLTNTVPET VQSSGRDGNS SNSETNKSNS NRTPTKFAAL SEVPHRRSAL VSTAHSVTSK SKQNSTPPVD MATASGSASQ GSHGENTGRW TAEEHRLFLQ GLEQHGKGWK KIASLIKSRT VVQIRTHAQK YFQKLAKARQ NGEEGDVAME GRGGVASITS VSTTAVLPKR RRQTTGTKRK AIQSVVASAQ RQGKKLAAAK TNPTRHHPLP PPLPTVAPAL AHYTLPSTAM MAKNGTAVKE EFVSPTNLSG PALEDSLFRF LTPVPVSEPP LNEVARQAGA NPISLPTDNP SSIPTVGAGE ISPTGVSDLM LYPSWTDSKE PPSWYSKGAD IDALLDMGDS LDWLDDTGDL NESYVPPVVD TAMAAPEPHT TFHRYSDLGH SKGLHSTSVT SLPHVDSNAN VESVVPPLPS IFDGAPDSGE HLETTEGMKA QASDLSHKHT HSFTACPLRL TMGTIVSTVG RSAETDFFVT ESLQFLLSWN EGPRRQTRRI HDPTLLDGID SNENILDTTV APITMPTTNS PVESRTLSWT DLGLDTDEDH PRRLLRVRDD VIAYGGDEGT LVRLPFVTTS NAQNADTGST RPLAVRRFDE DAIRAVAVSD DGTRVAVGTD SGATLFYRYE LDGHVVDAPG KGLVSRHGFV THDSDDNNNN NNNSHQKPSA DLFGSQPDAL AFVPQQRPGE VVRHGPVFDA PVRQLLFLPD SHFLAIATEA GLAVVSTDTD SGIGGGSLDT NHNENVNHHN VKYLHREAQT AHDESGIRGL ALWQAKDCRI LSSLAMDGRL CHWDVSAPTP TLWKLLHRET VPTVTKPDLG EMLGADAWDR STIPVAHSHE SILFLPGETY VQARRYRNHT WELLQSPTGA TNTTDKVQGH IEAIVAMAPA PNPRDPYLVT SGRDGRVVLW KLQYSHHDNN NNDNNPNDNG DGHIVFQKQI LQTDSAPTHL LWTLDQPTQT ERLDMVTADG HWTTLVGRDQ IAPACPTTAV TQEISLPHRQ SADSVREKEK EHDADSDDSV DDFSSNKPST HQKNPFVDDE AEDDNDDDTL DTASRGKLET TSPTDKRASN LNSSALEEHH NDLDDDSIGD DDDSFHNIPT LTTRHSDSIQ WPEPQPAFGP SSTSLELTRR FLCWNHIGSV TFLRGQAGIN RSTIDIHFTD SAFRRPVSFT DNMGFILGSL GEDGGIFATD LAEDEDIDEE DDDMDGLNVS AATKAAVKRS RKGPSNKPTG SSIYFHRFET FGSLRDKDWY LTLPDGERAL GCASGEGWAA VVTSRRFLRL FSSGGNQGEV LWLNGHPVTM AGRGRFVAVV YHESTPLPDG TQKLGYLVLD AMANRVVAKG PVSCISGAST LSWLGFSNDG SLLAMDSDGM LSMLVCASSL DAEGPTEKHW EWMPMLDTVG LRKSRDDSFW PVTVYDGKLV CVPLKGGMKH PDAVRRPVTA ALGFRLPLAR GPLTKTHTLE ELAVRAAIAL GQKKAIHEIS REGDEDDEDF EKEYRSLSAQ VDKVTLKMFA AIAEAGKLER ALDLVERLHL EKSYDLAMTI GDRHRKLVDL IEEAKDRKFG DPGSHQAEFT TKAESPNYQR PRISPDSAGA KRSLDDEDED VRSRLVRRKP TFA
|
| |