Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33072 |
Symbol | |
ID | 7197053 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1451461 |
End bp | 1454903 |
Gene Length | 3443 bp |
Protein Length | 872 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177837 |
Protein GI | 219112171 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.988083 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATTTC CATTTATTCT TGACGGTTTT CAACAACAAG CCGTTGTTCG ATTAGAACGA TCTGAGTCTG TTTTTGTTGC CGCTCATACT TCGGCGGGGA AAACCGTTGG TGAGTTGACG TTGCGTTCAA GCAAGGTGTT CTCTTTTGTG GATTTCTCAC AGTCAGTGCA ACACTCTATT TACTACTTCT CCAAGTTGCG GAATACGCCG TGGCCTTGGC GAAGCAGCGT GGGACGCGCT GTGTATACAC GTCTCCAATC AAAGCCTTAA GTAACCAAAA GTTTCGCGAC TTCTCGTTGA AGTTCGGTGC GGAGAATATT GGTTTGATTA CTGGAGATCT ACAGGTCAAC GCAGACGACT CAACCTGTTT GATCATGACG GTAAGTGAAT AACAATTTCG AGCGGTCTCT ACCGCATCTT TTTTAATTTC ATGGTTCTTA CATGAACTGT AAATTGTGCA GACTGAAATT TTGCGGTCTA TGCTTTATCG AGGGGCCGAT TTAGTTCGCG ACATTGAGTT CGTGGTTTTC GACGAAGTAA GTCAAAGAGC GATTTCCTGC CAACATTATA TATATGTAGT TGGCCTTATA CGATTCTTCA CCTCGATAGG TACATTATGT CAATGATACC GAGCGAGGAG TTGTTTGGGA GGAAGTAATT ATAATGTTGC CCTCTTACGT GAACTTGATT TTCTTGTCGG CGACTACACC AAATACCTTG GAATTCTCAG ATTGGATTGG ACGGACTAAA CGAAAGCCGG TGTTCGTTAT TAAGACAGAC TACCGACCGG TTCCCTTATC GTTTAATTTG TGGGCAGGTC TTAAGCTTCA TACCGTAATG GAGGGTCGAG ATGGATTTCT TGAGAGAGGA TTTGCGTCAG CCGCGAACGC GCTCCTCCCT GCGTCCGCTA GAGACTCAAA AAAAACTAAG AGCGAATCCA AGGGTCGCCC GCCCGCTAAG ACTGCAATCG GCTCAAAACA GATGGCATGG CAAGCCCAGG GAACCAAGCA AAACTGGATG TCCCTTGTGC GCTTTTTGGA CAGAGAAAAT ATGACTCCAA CCGTTGTGTT CTCGTTTTCG AAGAAAAAGT GTGAGGAGAT TTCTATCATG TTGCAATCGC TTGATCTAAA TACGGCTAAG GAACGAGGTG CTGTCCAAGG TTTTACGCTG CAGACGGTGG CTCGTCTCTC CAAGAATGAT TCGAATCTAC CTCAGGTAGT AATGGTATGT GAGATGGTGC AACGGGGAAT CGGTATTCAC CATGGGGGTC TTCTCCCGAT ACTGAAAGAA ATGGTCGAGA TATTGTTTGC CAAATCACTG GTGAAAATTC TTTTTGCAAC TGAGACATTC GCCATGGGTG TGAATATGCC TGCACGCAGT GTAGTTTTCA ACAGTGTTCG GAAGCATGAC GGCAAGCAGT TTCGTCAACT CGAGCCTGGA GAAATAACGC AAATGGTAAG CTGACCTTCC GTTGACCTTT ACTACTCCAC CACACTCGTC ACTAAGATTA TCCGTTTCGT TTTTGAAAGG CCGGTCGCGC CGGGCGTCGT GGACTGGACA AAGTAGGCAC TGTGATTATA TGCTGTTTCG GCGAAACACC TCCACCGCAA CCTATGTTAA AGCAAATGTT GACTGGGTCA TCGACAAGAT TAAACAGTCG CTTTCGACTC ACATACAACA TGATTTTGAA CCTACTAAGG GTCGAGGAAA TGAGCGTTGA ATCAATGATC AAACGGTCAT TCTCCGAGTT TGCTACACAA CGAGCCCTGA CTACGAACGA CTTTCCCCAG TTGCTGACTC GAGGGATCAG AGCACTAGAG AAGTTGGAAG AAACTTACAA AGTAGAGGCA GCAAGTCGTA TTGGGTCTGA GGACGTGGAA GAGTATTTCT CGACTTGCAG CGAAATCCTT TCAATAACCG AACGTCTACT GACAAATGTG AGAGACACGG AGGCAGCATC GTTCGAAGGC ATTCTACAAA AGGGTAGAAT CGTTTTGATT TCGGCCTGTC GAGAGCTTGG TGCGGTCAGA GCCCCCGCAC TTGTACTTAA ATCGCCCTCG TTGTCTTCGA AACTGACTCC AAACGTAAAC GCTCGTACCG ATAGATCCAA TAGCAAGGAA GTACTTGTTG TTTGCCTTGT CTTACTACCA AGCAGTTACA TCGCATGCCA AAGTGACATC AATAAAAAGC CAGGGACAGT CGGCTACGTT GGGTTGACGC GCAGCCGTCA TTTTTCTATC AGGAAAATAC GGGTTGGACA GATCCTTCTG GTCTCTTCGC AGAAGTGTAA TGTTGACACG ACTTCTATTC TAAGGGAAGA GCACAGCTGT CTTGGGGATC CACGATTCAA TGCGACGTCA TTTCTAGCTC CAGCTCAAGC AACAGAGAAT CCTTTTGCTG GAATGAAAAC GCGGGGCAAA AAGGGGGCCT CAATGGACAA CAAAAGGGGA TCTGGCACTG CAAAAGCAGA TGAAGAAGTC GAGAAAGTCC TAGACTCGTT AATGGAAGCG GAAAGAGCAG AACTTTGTGA TTCTGGTGTA CCATTGTTAG ACCTCCGTGA CTTTTTGAAA CGGGGGGACA GTGTCTTACG ATCTCGACAG CTGTTCGGCC GACTTGAAGC TGAATTGGAT CAAATGCGGA ACTACGAAAT CCATCGCCAT CCCAGCCTCG AATCGATGTA CTCTACTGTG GAACGAAAAG AGAGTTTAAG AAGCAAGGTG AATACTTTGC GCCATCTTTT GTCGAATGAG TCGTTACAAC TTTTCCCAGA TTTCCTTCAG CGAAAAGCAG TACTTCGCAA ACTTGGATAT ATCGACGAGA AAGAAACCGT GTCCATCAAA GGACGCGTCG CTTGTGAAAC AAACACCTGC GAGGAGCTGA TTGTGACTGA GCTGGTTTTT GAAGGGCTCT TGAACGAACT CGATCCAGAA GAGATTGTCG CCGTCCTTAG TGCTCTAGTT TTTCAGGAGA AAGGCAAGGA AACTTCATTG AGCGTCGAAC TTCCTGAAAG ATTAATTGTT GCTGTGAGCA AATGAAGACA ATAGCATTAA ATCTGGGCCG TATTCAAAAG GATGTGGGCT TAGACATAGA TCCTGCTGAA TACAGCGAAA GCTCGCTCAA CTTCGGTCTC GTTCATGTCG TCTACGAATG GGCACTCGGA GTCCCTTTTA AAAGCATTTG CGACTTGACT GACGTTCAAG AAGGTTCTAT TGTTCGAAGC ATTACCCGCT TGGACGAGCT TTGCCGTGAA GTCCGAAATT GTGCACGAGT GGTTGGAAAT CCTACTCTGT ACAGAAAACT GGAAGCCGCA AGCATGGTGA GTATCTATGG AAGTTCGTAA ATGTTTGCAA AGATCTCGGC GAAACAACTC ATGCCTTCTC GTTTAAATGC CGTAGACAAT CAAGCGCGAC ATTGTGTTTG CTTCGAGTCT ATATGTGAGC TAG
|
Protein sequence | MTFPFILDGF QQQAVVRLER SESVFVAAHT SAGKTVVAEY AVALAKQRGT RCVYTSPIKA LSNQKFRDFS LKFGAENIGL ITGDLQVNAD DSTCLIMTTE ILRSMLYRGA DLVRDIEFVV FDEVHYVNDT ERGVVWEEVI IMLPSYVNLI FLSATTPNTL EFSDWIGRTK RKPVFVIKTD YRPVPLSFNL WAGLKLHTVM EGRDGFLERG FASAANALLP AMAWQAQGTK QNWMSLVRFL DRENMTPTVV FSFSKKKCEE ISIMLQSLDL NTAKERGAVQ GFTLQTVARL SKNDSNLPQV VMVCEMVQRG IGIHHGGLLP ILKEMVEILF AKSLVKILFA TETFAMGVNM PARSVVFNSV RKHDGKQFRQ LEPGEITQMA GRAGRRGLDK VGTVIICCFG ETPPPQPMLK QMLTGSSTRL NSRFRLTYNM ILNLLRVEEM SVESMIKRSF SEFATQRALT TNDFPQLLTR GIRALENRIG SEDVEEYFST CSEILSITER LLTNVRDTEA ASFEGILQKG RIVLISACRE LGAILLVSSQ KCNVDTTSIL REEHSSPAQA TENPFAGMKT RGKKGASMDN KRGSGTAKAD EEVEKVLDSL MEAERAELYL RDFLKRGDSV LRSRQLFGRL EAELDQMRNY EIHRHPSLES MYSTVERKES LRSKVNTLRH LLSNESLQLF PDFLQRKAVL RKLGYIDEKE TVSIKGRVAC ETNTCEELIV TELVFEGLLN ELDPEEIVAV LSALVFQEKG KETSLSVELP ERLITIALNL GRIQKDVGLD IDPAEYSESS LNFGLVHVVY EWALGVPFKS ICDLTDVQEG SIVRSITRLD ELCREVRNCA RVVGNPTLYR KLEAASMTIK RDIVFASSLY VS
|
| |