Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43097 |
Symbol | |
ID | 7196876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2026885 |
End bp | 2029163 |
Gene Length | 2279 bp |
Protein Length | 671 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176892 |
Protein GI | 219110281 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCAGACTCAC CTCATCGGAC GGACTGGACG GTACTTGCTC CTCGGAAACA GGATACACCG TCCTTTTGGC AAGTGACGGT TACTTGACGC CAATTGCTCA CATTTAGGTT AACTTAGAAG CCATTGGAAT TTGAAATTAT AGGAAGGCTT TTCATTAACA TCGGAAGATT TTTCCTTTTC CTCCCACTGA TCCAAGTGCC TTTGTCTTTT TGATTTCATG GCGGCAATCA TTGGCAATTC ATTACTTGTA CCTCGGCGGC TGGTTCTTAT CGCAATGGCT GTCTCGGTTC TATTGTTCGT TCTCCGACCC GTGAGTTCTT TTTCCTTTAG ACCCGTGGGA CGGTCTTATG CCGCGGCGAT TACGCAAAGC AATGGTCTAC GACGGAGCAC ACGGGGGCAC GCTGTCGGTG TGTCGATTTC CCGGAGGGAC CCCGATGCGA TGCCCTCTCG CCTCTTTTCG TCGTCAACGG ACAGCAACAA GGAAACAAAG GCGTCAGTAG AAGAGCAGAT CAAAGTGAAA GGAGACGAGA TTCGAGCGCT CAAGGAATCT GGAGCAGACA AATCAACCGT TGCCCCTTTA ATTGACGAAT TGCTCGCTCT CAAAGCAAAG CTTGATCCTT CTATTCTCGA ACCCCCAAAA AAGGCACCGA AAGCCCAAAC GCAGCCAAAA AAGCAACAGA ATCAATCCGG TAAGAGGGAG AACGATGATT CTGATTTTAT CACTGCTCGT GAAGTGGATT ATTCGAAATG GTACAACGAT ATCGTTCGCG TCACTGGCCT CGCTGAAACT TCACCAGTTC GTGGCTGCAT GGTAATCAAG CCGTGGGGCA TGTCACTCTG GGACCGTGTT CGTACCGAGC TTGACGCCAA AATTCAAGCA CATGGTGCCG AAAATGCTTA CTTCCCTTTG CTCATACCCC AATCCTTCCT TTCCAAAGAA GCCGAACATG TTGACGGTTT CGCCAAGGAA TGCGCCGTAG TCACCCATCA CCGTCTCACG ACTAACCCAG ACGGCAGCGG TTTAATGGTC GACCCCGAAG CGGCTCTCGA GGACCCCCTC ATTGTTCGTC CCACTTCCGA GACCATGATT TGGTACATGT TCCGCAAATG GATCGTCTCC CACCGTGACT TGCCACTCAA AATCAACCAG TGGGCCAACG TAATGCGTTG GGAAATGCGG ACCCGACCGT TTCTGCGGAC TTCTGAATTT TTGTGGCAGG AGGGACACAC AGCCCACGCG ACACGGGATG GAGCCATTGC GGATGCCCAA GCTATGCTTG ATAACTATGC CACATTGTGC GAAGATTTGC TGGCCATGCC GGTAGTACGC GGTGTGAAGA GTCCATCGGA GCGCTTCGCA GGCGCCGAAG ATACGTATAC AATTGAAGCC TTGATGCAGA ATGGTTGGGC CTTACAGTCC GGGACATCGC ACTTTCTGGG GCAGTCTTTT GGTAAGGCCT TTAACGTGAC GTTCCAGGAC GAGAATGGTA CGCAGCAAGA TGTGTGGGGG ACCAGCTGGG GTGCCTCCAC TCGATTAATT GGTGCTCTTA TCATGACGCA TTCGGATGAC GCTGGTTTGG TCTTACCACC GAAAGTAGCT CCAGCTCAGG TTATAATTGT CCCAATCCCT CCAAAAAAGG ACGACGCAGA GACGAAACAA GCCATGGATA TTGCTATGAA TCAATTGACG GCAAGCTTGA AAGCTGAAGG TTTGCGCTTC AAGGTGGACG ATCGTGATTT CGTCCGCAGT GGTGCAAAGT TCTTTGAATG GGAACGCAAA GGTGTGCCTC TGCGTATTGA AATTGGACCA CGAGACGTTC GCAACAACGT CTGCGTCTTC AAGTACCGTG CGGGTGAGAA TGCTGACGAA AAGCAAACGA TTCCGCTGTC CGAAGCGGCG GCATCCGCAA CGGCCGGCTT GAAAAGTATG CAGCAAGACT TGTTGGAAGC GGCAAAAGCG AGATTGACCA ATGGAATTAC GACGGACACG ACATACGAAG AAATGAGAAC GTTTTTGGAA GCCGACGAGG CGTCCGAGTA TTCTGGGAAG GGTCTGTTCT TGGTGCCGTG GAAGTGTGAC GCCGAGAACG AAAACAAGAT CAAAGAGGAA TGCAAGGCTA CTATTCGATG CTACCCGCTT GACGCAAACA AGCAAGGTTT GCACCAAGGC AAGAAATGTT TCTATAGCGG CCATGACGCA ACGCACATGG CACTGTTTGG AAGGGCGTTT TAGCGCTAGA AAGGAATTAT AACAAGACTG TAAATCGCAT CGTGCTAAC
|
Protein sequence | MAAIIGNSLL VPRRLVLIAM AVSVLLFVLR PVSSFSFRPV GRSYAAAITQ SNGLRRSTRG HAVGVSISRR DPDAMPSRLF SSSTDSNKET KASVEEQIKV KGDEIRALKE SGADKSTVAP LIDELLALKA KLDPSILEPP KKAPKAQTQP KKQQNQSGKR ENDDSDFITA REVDYSKWYN DIVRVTGLAE TSPVRGCMVI KPWGMSLWDR VRTELDAKIQ AHGAENAYFP LLIPQSFLSK EAEHVDGFAK ECAVVTHHRL TTNPDGSGLM VDPEAALEDP LIVRPTSETM IWYMFRKWIV SHRDLPLKIN QWANVMRWEM RTRPFLRTSE FLWQEGHTAH ATRDGAIADA QAMLDNYATL CEDLLAMPVV RGVKSPSERF AGAEDTYTIE ALMQNGWALQ SGTSHFLGQS FGKAFNVTFQ DENGTQQDVW GTSWGASTRL IGALIMTHSD DAGLVLPPKV APAQVIIVPI PPKKDDAETK QAMDIAMNQL TASLKAEGLR FKVDDRDFVR SGAKFFEWER KGVPLRIEIG PRDVRNNVCV FKYRAGENAD EKQTIPLSEA AASATAGLKS MQQDLLEAAK ARLTNGITTD TTYEEMRTFL EADEASEYSG KGLFLVPWKC DAENENKIKE ECKATIRCYP LDANKQGLHQ GKKCFYSGHD ATHMALFGRA F
|
| |