Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46170 |
Symbol | |
ID | 7201251 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 468751 |
End bp | 471998 |
Gene Length | 3248 bp |
Protein Length | 1069 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180644 |
Protein GI | 219119783 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAGGG AGACCAAACG CCAGAGCGTA GCTGATAACA CACAAAAGGA TATAACGTAT CGATATGAAG TATCGTCACT TCAAAACAGA ACATCGGAAA ATGCTCAACG TGATTACATT TCGGCCTATG GGAAGAGTCA GAACGATGTA ATCTCCCCAA CAGCAGTTCA TCTGATTTCA AGGGTCTCAA CAGATCCGAG TAATTCTGTG CATAGTGATC TGCCGTCGTG CATGGGCCCG AGTCCGAACT TATCGGTATC TGGAAGTTTG TCTCGTAATC ACACTTCCGC CGAGTCTGAT TCTTCTCCCT ATTTTGAATC TTCCACGCCG CTGATGCCCT ATTCTTTGTC AAAACCTCAC CCTAAGGGGG AAACCAATTC GTCGGAAGCA TGCGCCAGCA ACGCAACTGG TATGGTGTGT TCTGTGCTCC CGGAACCAAA CCGCTCCAGG TGGCGACAAA CCTTTGGGCC GAGCCACGGG ACCGATCAAA CGGCTCTAGA GAACGTCAGT CTTAAAGCAC CGCAGCGTGC TTTTAAAAAC TCATCACGTG ATCAGTTTGG TGCCGTGGGC GACGATGATG ATATTTGCTC ATCTTCTGAC GAGGAAAATG GAATGATTGC AATTGGTATT CCCGATAAGA TTCAATCACG CGCTAAAGAA AAGATGCAAC AAAGCAATAT CTCCCAAACT ATCGCTGAGG AACTAGCGCG ACAAGAGCAT GAGGCAAAGT TAAAGAAGAA CCAGCTTCCA GCCAAAGCAA TCGTCACACG GCGCGTCACG GAACGCAAAC TGGGCGCTTC GAAGGATCCT CTATCACAGA ATCCCATGAT TGGAGAGAAA GATCACGTAC TCCAACCCGG CTCGGTTGTT GCTCGAGCCG GTTCAGAATC GCATCGTACC ATCATGGAAG AAAAAAGCCT GGCTCGTGGT ACGCCAACCC GTGAATTGTC GCTGATGCCA GGCGCATTCC GAGCCACGAC ACGTAGTGAC TTGGACCAAA AGAGTGCAGA GCGCGGAGTG GTGCGTCCCA TTTCATCCTC TTCGCGGAAC ATGAAGGCGC GTCGATTAAA TCGGTTAAAT GGACGTCAGA ACCATCAAAT TGAGGCTAGC GTGAATGAGG ACGATCAGCG CCGTCTGACA CATTCGTACA GCTCCTCGGA CGAGTCGGCT GTCTTGCTTT TGCCCATACG TACCAAATCC TCTGATTCCA TCAAGAGTGA TGTAGATTCG AGTAATCATT CTACCCGCGC CAGAGCCCGT TCCCGTTTCC ACCGCATGCG ACAAAAGTCT ATGGATTCTT CGGCTCACAG CTCAGATGGA TCTACAGTAG TACCTGCTTC AATTCGGGAT CTTGTCTCAC TGCGCAGCAT GGAAGAGACC ACTAAATTGC AGCGACACGA TTCTGGGCCT TCTTTGGCTC CGGCGACAAG TGGTTGTGTC GTTCCAGGAG CAACAGAGTT TATTGCCGCA GTACGTCACG AAAAGGAGAA TGGTCCATCG CTGGCGCCAG CAAGAGCCAT AAGGATTTCT GGCGAGTGCA ATGTAGAGGG CTCAAAACAA AGTGTGAAAA TATACGGACC GGTGTTGGCT TCTGGATTTG TTCCGGATCA AGTCGCCTTG ACTCCAGGTA TGATGGATCT GACAGGCCAA GAATGTTATC CAGATGAGGA TGAGGATGAT ACGATTGAAG CACAAGCTGG TCTTCCAGTA TTGATCCCAG GTGCGTTTGC GATTGAAGGT ATGGAATCGT CTCACACGGC TACCTCTAGG CACAACTCTG TGGTGGACAC GCAAAGTTTT TCGGAAGCTG AAGAAGTATA TGGGGAAATC GAAGAAGATC AAGCAGATAC AGAAATTTTC TTGGAGCCTT CCCCGGACGA CACACCGCCT CTCGTAGCCG AATTACACGA AGAAGTTGTT GTAGACGGAG CGGTCCTTGA AGAACACGGT GAGGACGATC CAAAGCAACG ACATAGGCTG CGTCTTTTTC AGGCAATGGC TTTTTTTTGT TCGGTTGTGG CAGTTACCCT CATTGCAGTT AGTGTTTCTG GGGTTTTCCA ACCAGATCAA GCTGGTCCAC AGAAGACAGC GCCGAAAATT TCCGGTTGGT TAGCGGCGGG TGAGGAACTT TTCGGATCTA CAGAGGAGGC GCAAGTTTTA TTTGGTACAT CCATTGGAAT GTCAGGAGAC GGTTTCATTC TTGCCGTGAG TTCTCCGGGA TGGGATAATT CCTCAACAGA GCTCAACGTC GGACAAGTGC AAGTTTTTTC TGGAGCAGAC ACCTTTAACG GGACCCAGTG GGATAATGTT GTCACTTTAG AAGGTCCAGG CTCGAGTGAA GATGAAAAGA CTTCCATAGC CATGTCTAGT GATGGTAGAC GGCTGGTAGT TGGTTATCCC TCTTTCAATA GTGGAACAGT ACAAGTTTTT GAAGATCGCG GTCGTGGATG GAGTCCTTAT GGGGGAGTCG AGCGTATGGA AAGAGATGGT GAAAATATTT GGTTTGGACA TGCCGTAGAT ATCAGCGCAG ATGGAGATGT TCTGGCAGTC GGCGCACCAC TCAGAAACTC TCTTGCGGGA GAACAGAGTG GAGCAGTTCG TGTCTTTCGG TCGTCCAACA CACGTTGGAT TCAAATTGGA TCCGATATTT TGGGCGAATC CATGAATGAC TTTGTAGGCT GGTCTTTGGC ACTCAACTCA CAAGACGGGT CACGCGTCGC TGTCGGTGGG CCAGTTGCTC GAGATGAGCG TGGGATTGTG CGCATATACG ATTGGGATGG CTCAACCTGG AAGCAAATCG GGGAAACTCT GACTGGGATC AATATCTTGA GTAGATTTGG ATCATCTGTT TCACTATCAG GGAACGGACA AGTGCTTGCA ATTGGTGCTC GAGGTACTGC GTTCGAACCT GGGGAGGTCC GTGTTTATCG AGAGATCGAC AATGCTTGGG TCACAGACAA TATCTTTAGC GGACTGGAGC CAAGCGAAGG ATTCGGGACA ACCGTGTCTC TTTCAAAAGA TGGTAATGTT CTCGCGATTG GCATCCCTCA GAATAACGAA TTTGGCAACG GCAGTGGTTC GGTGCAGGTG TGGAAATACT ATGATGATCA GAAGGCTTGG AAACAGGAAG GCACCAATAT TGGCGGATCC GAGGGGAGCG CGTTTGGGTC GGCTGTCGCA CTTTCCGCAG ACGGTTTTCG GGTAGGCGTT GGATCCCCAC TTGCAACGTT TGATGGCAGT GTAGCTAA
|
Protein sequence | MDRETKRQSV ADNTQKDITY RYEVSSLQNR TSENAQRDYI SAYGKSQNDV ISPTAVHLIS RVSTDPSNSV HSDLPSCMGP SPNLSVSGSL SRNHTSAESD SSPYFESSTP LMPYSLSKPH PKGETNSSEA CASNATGMVC SVLPEPNRSR WRQTFGPSHG TDQTALENVS LKAPQRAFKN SSRDQFGAVG DDDDICSSSD EENGMIAIGI PDKIQSRAKE KMQQSNISQT IAEELARQEH EAKLKKNQLP AKAIVTRRVT ERKLGASKDP LSQNPMIGEK DHVLQPGSVV ARAGSESHRT IMEEKSLARG TPTRELSLMP GAFRATTRSD LDQKSAERGV VRPISSSSRN MKARRLNRLN GRQNHQIEAS VNEDDQRRLT HSYSSSDESA VLLLPIRTKS SDSIKSDVDS SNHSTRARAR SRFHRMRQKS MDSSAHSSDG STVVPASIRD LVSLRSMEET TKLQRHDSGP SLAPATSGCV VPGATEFIAA VRHEKENGPS LAPARAIRIS GECNVEGSKQ SVKIYGPVLA SGFVPDQVAL TPGMMDLTGQ ECYPDEDEDD TIEAQAGLPV LIPGAFAIEG MESSHTATSR HNSVVDTQSF SEAEEVYGEI EEDQADTEIF LEPSPDDTPP LVAELHEEVV VDGAVLEEHG EDDPKQRHRL RLFQAMAFFC SVVAVTLIAV SVSGVFQPDQ AGPQKTAPKI SGWLAAGEEL FGSTEEAQVL FGTSIGMSGD GFILAVSSPG WDNSSTELNV GQVQVFSGAD TFNGTQWDNV VTLEGPGSSE DEKTSIAMSS DGRRLVVGYP SFNSGTVQVF EDRGRGWSPY GGVERMERDG ENIWFGHAVD ISADGDVLAV GAPLRNSLAG EQSGAVRVFR SSNTRWIQIG SDILGESMND FVGWSLALNS QDGSRVAVGG PVARDERGIV RIYDWDGSTW KQIGETLTGI NILSRFGSSV SLSGNGQVLA IGARGTAFEP GEVRVYREID NAWVTDNIFS GLEPSEGFGT TVSLSKDGNV LAIGIPQNNE FGNGSGSVQV WKYYDDQKAW KQEGTNIGGS EGSAFGSAVA LSADGFRCS
|
| |