Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41294 |
Symbol | |
ID | 7199180 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | - |
Start bp | 36212 |
End bp | 38694 |
Gene Length | 2483 bp |
Protein Length | 810 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185318 |
Protein GI | 219130325 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCCCGA GCACGGAAAA GTTGCCCCCG CAGCTATGCC TTACCCTGCT TGAGTCCTCT GTTCGTGATG TTTCGGAACT CCGTCAAGTC AACACTACCG CAAATCTAGA TTTAGCTAAA GGGGGGTCTC CCATCAACTA TGAGAATTAC CTAAGCCTAC TTCTTGCTGC TGCAACTTTG TATGACAAAG GTAACAACCT TTCCAATTCT CGTAACCCAA AAGCCAAGCG TAGTGCTTTT GTTACTGAAA CCTCCTTTCC AGACGATGAT TACGGCGTCG ATTACGACAT TGACTTGTCC CCGTCCATCC TTTACGAGGC AAATGTCCAC AACCGTCGAG CAGGCGACCA GAATCGGGCC CGCCAGACCA ATGTCAATCG CGAACGTCCC TATATTCCCC GTGAGATGTG GGATAAACTG TCCGACGATG CGAAAGAAAT TCTTCGTGGC ATGTCGTCTC CTCAAGAAGG CAATGCATCG GCCAACAGTA AATCATCCTC TGCATTCCAC GCCAACTCCC ATTCCCTAAC TGATACGGGA CACTCTCCTT CAACGGACGA ATCGTTGCAC AAAGACGACA ACGACAAATT CCACGATTGT GGGAATGACA CGGAATTGCT TGCACATCTC ACTGATCGTT CAAGTACTAT GGCACATGGA GACATTCGAA AAGTTCTCGC TTCGGCTTCC TCCTGTAAGC AGAATCCCAC GAGCTCACTA CAGTCCAACA TGCTCGAGTA CAGTATTTCC CGGCACGCCG TTACTGGGAC AACATCCTCC CTCATTGACA GAGGTGCAAA CGGTGGACTC GCTGGGAATG ATGTTAAAAT CCTGAACAAG ACAGGTCGTT TTGCTAGCAT CACTGGTATC AATGACCATA CCCTGCCTGA TTTAGATATC GTCACCGCTG CTGGACTCGT TGAATCCCAG AACGGACCTA TCATTGTCAT ACTTCACCAG TATGCACACC ATGGGAAAAG TAAAACAATT CATTCTAGTG CGCAACTTGA ATACTACAAG AACGTTGTCG AAGACCGTTC TCGGGTTCTG GGGGGCAAAC AGCGTATCGT AACTCTAGAT GACTACGTTA TTCCTCTTCA CGTTCGCCAA GGACTGGCTT ACATGGACAT GCGCCCACCT TCGGATACCG AATTTGACAC ACTTCCGCAT GTTGTTCTTA CTTCCGATGT GGACTGGGAT CCGTCTATCA TTGACAATGA AATTGATCTC GTCACAGACT GGCATGATGC CGTCCAGGAC CTCCCCGGCG ATCTGTACGT TGAACCTCGC TTCAATTCAA CCGGGGAATA CCGACACAGA CACGTTGCCA ATTATGACAT TTTTTCGTCG CCCGAATTGG TCGATCCATC CACGGCTATT GGCAATATAC TTTCGTCAAA CAAGCATGAT ATGACCCGCA ATGCCCACAA TTACGAAGCT TTGCGCCCTT GTCTTGGTTG GATCTCTGCC GACACAGTTC GGAAGACCAT CTTGGCCACC ACACAATTCG CTCGCGAGGT TTATAATGCA CCTATGCGTA AGCACTTCAA GTCTCGTTTA CCGGCACTTA ATGTTCATCG TCGCAATGAA GCTGTCGCTA CCGATACCAT TTGGTCGGAC ACGCCTGCTG TTGATAATGG CGCTAAATTT GCACAACTAT TTGTTGGTAG ACGGTCGCTT GTCACCGACA TTTATCCTAT GAAAACCGAC AAAGAGTGTG TTAATGCTCT TGAAGACAAT ATTCGTCATT GTGGCGCCAT GGATAAGCTC ATCAGTGATC GTGCCAAGGC CGAAGTCAGC AAGAAGGTTT CTGATATTAC CCGTGCTTAC CACATTGATC AATGGCAAAG CGAGCCAAAT CACCAGCACC AAAATTATGC TGAACGCCGC ATTGCAACTG TCGAAGCAAA TGCGAATAAA ATCCTAAACA AAACCGGTGC ACCCAATTCT ACATGGTTAT TGTGTGTTTC CTACATTTGT TATTTGTTCA ATCATTTGGC ACATGAGTCT TTACACGATC GTACTCCCCT TGAAGTCCTC AACGGTAGTA CCCCTGATAT TAGCGTACTC CTTCAATTTC ATTTTTGGGA ACCGATCTAC TATCGACTAG AAGATCCTAC TTTTCCTTCC GATGGGACTG AAAAGAAAGG CCGCTTTGTT GGAATTGCTG ATTCCGTTGG TGATGCTCTT ACCTATAAGA TACTCACCAA TGACTCCCAC AAGATCCTTT TCCGATCCAG TGTCCGCTCT GCGTTGAAAC CTAGTGAAAC CAATTTGCGT CTTGAACCAC ATGAAGGGGA GAGTCCTCCT AAGCCTATCA ACTTCATTAA GTCGCGTAGA ACTGAGGACG AAAATTCTTA TGCCCTCCAC ACGCTACCTG GTTTCACCCC GGACGATCTC ATCGGACGCA CGTTTTTAAC TGGCACCCAG GACAATGGGG AGCGTTTCCG TGCACGTATT GCCAGAAAAA TCCTCGATCC TGA
|
Protein sequence | MVPSTEKLPP QLCLTLLESS VRDVSELRQV NTTANLDLAK GGSPINYENY LSLLLAAATL YDKGNNLSNS RNPKAKRSAF VTETSFPDDD YGVDYDIDLS PSILYEANVH NRRAGDQNRA RQTNVNRERP YIPREMWDKL SDDAKEILRG MSSPQEGNAS ANSKSSSAFH ANSHSLTDTG HSPSTDESLH KDDNDKFHDC GNDTELLAHL TDRSSTMAHG DIRKVLASAS SCKQNPTSSL QSNMLEYSIS RHAVTGTTSS LIDRGANGGL AGNDVKILNK TGRFASITGI NDHTLPDLDI VTAAGLVESQ NGPIIVILHQ YAHHGKSKTI HSSAQLEYYK NVVEDRSRVL GGKQRIVTLD DYVIPLHVRQ GLAYMDMRPP SDTEFDTLPH VVLTSDVDWD PSIIDNEIDL VTDWHDAVQD LPGDLYVEPR FNSTGEYRHR HVANYDIFSS PELVDPSTAI GNILSSNKHD MTRNAHNYEA LRPCLGWISA DTVRKTILAT TQFAREVYNA PMRKHFKSRL PALNVHRRNE AVATDTIWSD TPAVDNGAKF AQLFVGRRSL VTDIYPMKTD KECVNALEDN IRHCGAMDKL ISDRAKAEVS KKVSDITRAY HIDQWQSEPN HQHQNYAERR IATVEANANK ILNKTGAPNS TWLLCVSYIC YLFNHLAHES LHDRTPLEVL NGSTPDISVL LQFHFWEPIY YRLEDPTFPS DGTEKKGRFV GIADSVGDAL TYKILTNDSH KILFRSSVRS ALKPSETNLR LEPHEGESPP KPINFIKSRR TEDENSYALH TLPGQWGAFP CTYCQKNPRS
|
| |