Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_32455 |
Symbol | |
ID | 7196611 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2515319 |
End bp | 2518686 |
Gene Length | 3368 bp |
Protein Length | 1107 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176994 |
Protein GI | 219110485 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCCCG CCACCCGGCA AATGACGAGT GCAGCCGTCT ATGCCCACCT TTTGGACAAC GTACTTCTTC TTCCCCAAGG GCATCCTATC CGCCTCAGTT TTGAGCAACA AGGATATGAA TCGGCTGATG ATCTTCTGTG TATTTTTGAG AATGAACTTG AGTCTCTTGG ATACACTCCT TCTGTCCTTC CCGACGGCCT GGAAAACCCG CCAACTATAC CCCTTCTCAT GGCGCACCGA CAGATCATAC GTCATTTCTT GCGCTGGCAG GCATCTTTGG AACGACAAAA GGGGACACCT TTGAAGAACT CCGAGCTTGT TGCACTTAAC AATGAAGATT TTGTCCTTTA CCGTCGCTCA GCCCTTGGTC AAGTCTCGAC AGCAACTGCA CCGGTTAATG CTTCCCCAAC TGTCCAGAGC CCCATAGGAA AGACACGTTC GGCTGTCGAG GACTTCAAGC GTGGGATCAA ACGTGACAAA ACTCACTATC CCGTGCTTAA AGATGATCGG TACTGGGACA ACTTTTATCG GTCGTTTGTT GTTACTGCCG TAACACATAA CGTTGACAAA GTTCTAGATC CGACGTACAT CCCTACCGAT CCCTTGGAGA AATCCCTTTT TGAAGAGCAG AACAAGTTTG TATATTCTGC TCTAGAGCAT ACTCTCCAGA CGGACATGGG CAAGAACATT GTACGAGAGC ATAGTTTCGA CTTCAATGCC CAGGAAGTTT TCCGTAAGGT TGTGAAACAC TACACAGAGT CCGCTAGCGC GAAGATTAGT TCGTCTACTA CCCTGGGATA CCTTACAACT GCAAAGTACG GATCGTCATG GACTGGCACA GCAGAAGGTT TTATTCTTCA CTGGAAAAAT CACTTGCGCA TCTACAATGA CACTGTTCCT GCTGGTGAAC AGCTTCCTCA GCAACTATGC CTTAGTCTTT TGGAGAATGC TGTTCATGAT GTACCTGAGC TTCGACAGGT AAAAATCACT GCAACTCTTG ACTTAGCAAA GGGAGGTAAT CCTATTAGCT ATGGTGGTTA TCTCAGTCTA CTACTCGCAT CGGCATCGCT CTACGACAAC GGCAATAATC TATCTAATTC TCGTAGTGGC AAGAACAAGC GCAACATCTA TGCTAATGAA CTAGAGTACA ATCCGATGGA TTTTGAGAGT AAACCGGATG TAGACTATGA TATAGATGTG TCGCCTACCG CAATCTACGA AGCCAATGCT CATGCCCGTA ACAGCAGTTT CCGGAATCGT AGTCCGGCAA CTAATCGCGA GCGACCTTAC ATCCCTCGTG AAATGTGGAA CCTACTCTCC GACGATGCCA AAGCCATCCT CCAAGGCTTA ATAGCCCCCG GGAAGCAGGC CCCGTTGAAT AATAGTTCGC CACACCAATC GTTGCAGGCC AATACGCACG ATACCATTGG CGCGGAACAA ATCACAACGG ACACCTTCCA TGATTGCGCA CCCGAAACTG AATTGCTTGC CCACCTGACT GAGCGTGTTA GTCACATGAG CGACGGCGAC ATACGTAAGG TACTTGCCGC ATCTCGTGAT GGTCCCGCCT ATGATGAGCC CACACCACTG CAATCTAACG TACTTCAATA TCAAGTGTCT CGTCACAACG TCATTGAAAC TACGGCAGCC CTCGTCGACC GTGGAGCCAA TGGAGGTCTT GCCGGCAGTG ATGTCATGGT CTTGCATAAA ACAGGTCGTT CTGCAACCAT CACAGGTATC AATGATCATA CCTTGTCCGA TTTGGACATT GTCACCGCTG CTGGCTACAC TGAATCCCAA AATGGCCCCA TCATTCTCAT TATGAACCAA TACGCCCATT TGGGACAGGG TAAAACTATC CACTCCAGTG CACAGCTTGA ACACTATCGC AACCATGTCG AAGACCGTTC CCGTACTGTA GGAGGTAACC AGCGAATTGT AACATTGGAT GACTACATCA TCCCATTGCA CATTCGACAA GGACTCGCGT ACATGGATAT GCGGCGTCCT ACCGACAAGG AACTTGCGAC CCTTCCACAC GTTGTCCTAA CCTCCGACGT CGACTGGGAT CCCTCCGTAC TTGACCACGA AATTGATCTC GCAACCTCTT GGTATGATGA CAAATATGAT TTGCCTCAAT CACCTTACGT TGAACCACGT TTTGACCATA CAGGCAAATA CCTCCATTGT CACATTTCCC TTTGCAACCA TCGCGATGAC GTTGTTGACC GTGTATTATA TTGCCAACAG CACCTCGTCA CGAAAAATGT GCAAGATTAT GAGGCCCTTC GTCCGTGTTT TGGATGGGTC TCTGCTGAAA CCGTTCGCAA GACCATCATG GCGACCACGC AGCATGCACG CGAAGTATAT AACGCTCCGT TACGCAAACA TTTTAAGTCT CGCTTTCCCG CTCTAAATGT ACACCGTCGT AATGAACCAG TTGCTACCGA TACCATTTGG TCCGACACCC CTGCTGTCGA TAATGGTGCT AAATTTGCAC AACTTTTCGT TGGTCGACGC TCCCTTGTCA CCGACGCTTA CCCCATGAAA ACTGACAAAG AATTCGTCAA TACCCTTGAG GACCATATCC GTTACCGGGG TGCCATGGAC AAATTGATTA GCGATCGTGC CCAGGTTGAA ATCAGCAAAA AGGTCACCGA TATTACACGC GCATATAATA TCGACCAGTG GCAAAGTGAA CCAAACCATC AACACCAAAA CTTTGCCGAA CGTCGTATTG CCACTATCGA GGCTAATACC AACAACATTC TCAATCTTTC CGGTGCCCCT GATTCCGCCT GGTTACTTTG CGTGACATAT GTTTGTTATG TTTTCAACCA TTTGGCACAT GAATCCCTAG ATAACCGCAC TCCCCTTGAA GTCCTCACCG GCTCCACGCC TGATATCAGT GTTCTCCTTC AGTTTCATTT TTGGGAACCG GTCTATTATA AGCTCGAAAA TGCGACATTT CCTTCTGGTG GTACCGAACA ACAAGGACGT TTTGTTGGCA TCGCCGACTC CGTCGGCGAC GCTCTCACTT ATAAGATCCT TACCCACACC ACCAACCGCA TTCTTCATCG CTCTAGTGTC CGTTCTGCGA CCATTCCCGG ACAAACCAAC CTACGCCTTA CGCCACAGGA TGGGGAGAGT GGTCCTAAAC CCATCAACTT TATCAAGTCG CGTAGAACCG AAAACAAAAA TTCCTATGCC ATTAAGGAGT TGCCTGGTTT CACACCTGAT GACCTTATAG GTCGTACGTT CCTCACCGAC ACTCGGGATG ATGGGGAGCG TTTGAAGGCA CGAATCACGC GGAAAATATT GGACCCAGAC AAGCCCTCGG ATGTAAAGTT CCTTGTCGAA ATCAATGA
|
Protein sequence | MVPATRQMTS AAVYAHLLDN VLLLPQGHPI RLSFEQQGYE SADDLLCIFE NELESLGYTP SVLPDGLENP PTIPLLMAHR QIIRHFLRWQ ASLERQKGTP LKNSELVALN NEDFVLYRRS ALGQVSTATA PVNASPTVQS PIGKTRSAVE DFKRGIKRDK THYPVLKDDR YWDNFYRSFV VTAVTHNVDK VLDPTYIPTD PLEKSLFEEQ NKFVYSALEH TLQTDMGKNI VREHSFDFNA QEVFRKVVKH YTESASAKIS SSTTLGYLTT AKYGSSWTGT AEGFILHWKN HLRIYNDTVP AGEQLPQQLC LSLLENAVHD VPELRQVKIT ATLDLAKGGN PISYGGYLSL LLASASLYDN GNNLSNSRSG KNKRNIYANE LEYNPMDFES KPDVDYDIDV SPTAIYEANA HARNSSFRNR SPATNRERPY IPREMWNLLS DDAKAILQGL IAPGKQAPLN NSSPHQSLQA NTHDTIGAEQ ITTDTFHDCA PETELLAHLT ERVSHMSDGD IRKVLAASRD GPAYDEPTPL QSNVLQYQVS RHNVIETTAA LVDRGANGGL AGSDVMVLHK TGRSATITGI NDHTLSDLDI VTAAGYTESQ NGPIILIMNQ YAHLGQGKTI HSSAQLEHYR NHVEDRSRTV GGNQRIVTLD DYIIPLHIRQ GLAYMDMRRP TDKELATLPH VVLTSDVDWD PSVLDHEIDL ATSWYDDKYD LPQSPYVEPR FDHTGKYLHC HISLCNHRDD VVDRVLYCQQ HLVTKNVQDY EALRPCFGWV SAETVRKTIM ATTQHAREVY NAPLRKHFKS RFPALNVHRR NEPVATDTIW SDTPAVDNGA KFAQLFVGRR SLVTDAYPMK TDKEFVNTLE DHIRYRGAMD KLISDRAQVE ISKKVTDITR AYNIDQWQSE PNHQHQNFAE RRIATIEANT NNILNLSGAP DSAWLLCVTY VCYVFNHLAH ESLDNRTPLE VLTGSTPDIS VLLQFHFWEP VYYKLENATF PSGGTEQQGR FVGIADSVGD ALTYKILTHT TNRILHRSSV RSATIPGQTN LRLTPQDGES GPKPINFIKS RRTENKNSYA IKELPGFTPD DLIGRTNHAE NIGPRQALGC KVPCRNQ
|
| |