Gene Information |
Locus tag | PHATRDRAFT_41507 |
Symbol | |
ID | 7199339 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | + |
Start bp | 35120 |
End bp | 38487 |
Gene Length | 3368 bp |
Protein Length | 1107 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185474 |
Protein GI | 219130653 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCCCG CCACCCGGCA AATGACGAGT GCAGCCGTCT ATGCCCACCT TTTGGACAAC GTACTTCTTC TTCCCCAAGG GCATCCTATC CGCCTCAGTT TTGAGCAACA AGGATATGAA TCGGCTGATG ATCTTCTGTG TATTTTTGAG AATGAACTTG AGTCTCTTGG ATACACTCCT TCTGTCCTTC CCGACGGCCT GGAAAACCCG CCAACTATAC CCCTTCTCAT GGCGCACCGA CAGATCATAC GTCATTTCTT GCGCTGGCAG GCATCTTTGG AACGACAAAA GGGGACACCC TTGAAGAACT CCGAGCTTGT TGCACTTAAC AATGAAGATT TTGTCCTTTA CCGTCGCTCA GCCCTTGGTC AAGTCTCGAC AGCAACTGCA CCGGTTAATG CTCCCCCAAC TGTCCAGAGC CCCATAGGAA AGACACGTTC GGCTGTCGAG GACTTCAAGC GTGGGATCAA ACGTGACAAA ACTCACTATC CCGTGCTTAA AGATGATCGG TACTGGGACA ACTTTTATCG GTCGTTTGTT GTTACTGCCG TAACACATAA CGTTGACAAA GTTCTAGATC CGACGTACAT CCCTACCGAT CCCTTGGAGA AATCCCTTTT TGAAGAGCAG AACAAGTTCG TATATTCTGC TCTAGAGCAT ACTCTCCAGA CGGACATGGG CAAGAACATT GTACGCGAGC ATAGTTTCGA CTTCAATGCC CAGGAAGTTT TCCGTAAGGT TGTGAAACAC TACACAGAGT CCGCTAGCGC GAAGATTAGT TCGTCTACTA CCCTGGGATA CCTTACAACT GCAAAGTACG GATCGTCATG GACTGGCACA GCAGAAGGTT TTATTCTTCA CTGGAAAAAT CACTTGCGCA TCTACAATGA CACTGTTCCT GCTGGTGAAC AGCTTCCTCA GCAACTATGC CTTAGTCTTT TGGAGAATGC TGTTCATGAT GTACCTGAGC TTCGACAGGT AAAAATCACT GCAACTCTTG ACTTAGCAAA GGGAGGTAAT CCTATTAGCT ATGGTGGTTA TCTCAGTCTA CTACTCGCAT CGGCATCGCT CTACGACAAC GGCAATAATC TATCTAATTC TCGTAGTGGC AAGAACAAGC GCAACATCTA TGCTAATGAA CTAGAGTACA ATCCGATGGA TTTTGAGAGT AAACCGGATG TAGACTATGA TATAGATGTG TCGCCTACCG CAATCTACGA AGCCAATGCT CATGCCCGTA AGAGCAGTTC CCGGAATCGT AGTCCGGCAA CTAATCGCGA GCGACCTTAC ATCCCTCGTG AAATGTGGAA CCTGCTCTCC GACGATGCCA AAGCCATCCT CCAAGGCTTA ATAGCCCCCG GGAAGCAGGC CCCGTTGAAT AATAGTTCGC CACACCAATC GTTGCAGGCC AATACGCACG ATACCATTGG CGCGGAACAA ATCACAACGG ACACCTTCCA TGATTGCGCA CCCGAAACTG AATTGCTTGC CCACCTGACT GAGCGTGTTA GTCACATGAG CGACGGCGAC ATACGTAAGG TACTTGCCGC ATCTCGTAAT GGTCCCGCCT ATGATGAGCC CACACCACTG CAATCTAACG TACTTCAATA TCAAGTGTCT CGTCACAACG TCATTGAAAC TACGGCAGCC CTCGTCGACC GTGGAGCCAA TGGAGGTCTT GCCGGCAGTG ATGTCATGGT CTTGCATAAA ACAGGTCGTT CTGCAACCAT CACAGGTATC AATGATCATA CCTTGTCCGA TTTGGACATT GTCACCGCTG CTGGCTACAC TGAATCCCAA 
AATGGCCCCA TCATTCTCAT TATGAACCAA TACGCCCATT TGGGACAGGG TAAAACTATC CACTCCAGTG CACAGCTTGA ACACTATCGC AACCATGTCG AAGACCGTTC CCGTACTGTA GGAGGTAACC AGCGAATTGT AACATTGGAT GACTACATCA TCCCATTGCA CATTCGACAA GGACTCGCGT ACATGGATAT GCGGCGTCCT ACCGACAAGG AACTTGCGAC CCTTCCACAC GTTGTCCTAA CCTCCGACGT CGACTGGGAT CCCTCCGTAC TTGACCACGA AATTGATCTC GCAACCTCTT GGTATGATGA CATATATGAT TTGCCTCAAT CACCTTACGT TGAACCACGT TTTGACCATA CAGGCAAATA CCTCCATCGT CACATTTCCC TTTGCAACCA TCGCGATGAC GTTGTTGACC GTGTATTATA TTGCCAACAG CACCTCGTCA CGAAAAATGT GCAAGATTAT GAGGCCCTTC GTCCGTGTTT TGGATGGGTC TCTGCTGAAA CCGTTCGCAA GACCATCATG GCGACCACGC AGCATGCACG CGAAGTATAT AACGCTCCGT TACGCAAACA TTTTAAGTCT CGCTTTCCCG CTCTAAATGT ACACCGTCGT AATGAACCAG TTGCTACCGA TACCATTTGG TCCGACACCC CTGCTGTCGA TAATGGTGCT AAATTTGCAC AACTTTTCGT TGGTCGACGC TCCCTTGTCA CCGACGCTTA CCCCATGAAA ACTGACAAAG AATTCGTCAA TACCCTTGAG GACCATATCC GTTACCGGGG TGCCATGGAC AAATTGATTA GCGATCGTGC CCAGGTTGAA ATCAGCAAAA AGGTCACCGA TATTACACGC GCATATAATA TCGACCAGTG GCAAAGTGAA CCAAACCATC AACACCAAAA CTTTGCCGAA CGTCGTATTG CCACTATCGA GGCTAATACC AACAACATTC TCAATCTTTC CGGTGCCCCT GATTCCGCCT GGTTACTTTG CGTGACATAT GTTTGTTATG TTTTCAACCA TTTGGCACAT GAATCCCTAG ATAACCGCAC TCCCCTTGAA GTCCTCACCG GCTCCACGCC TGATATCAGT GTTCTCCTTC AGTTTCATTT TTGGGAACCG GTCTATTATA AGCTCGAAAA TGCGACATTT CCTTCTGGTG GTACCGAACA ACAAGGACGT TTTGTTGGCA TCGCCGACTC CGTCGGCGAC GCTCTCACTT ATAAGATCCT TACCCACACC ACCAACCGCA TTCTTCATCG CTCTAGTGTC CGTTCTGCGA CCATTCCCGG ACAAACCAAC CTACGCCTTA CGCCACAGGA TGGGGAGAGT GGTCCTAAAC CCATCAACTT TATCAAGTCG CGTAGAACCG AAAACAAAAA TTCCTATGCC ATTAAGGAGT TGCCTGGTTT CACACCTGAT GACCTTATAG GTCGTACGTT CCTCACCGAC ACTCGGGATG ATGGGGAGCG TTTGAAGGCA CGAATCACGC GGAAAATATT GGACCCAGAC AAGCCCTCGG ATGTAAAGTT CCTTGTCGAA ATCAATGA
|
Protein sequence | MVPATRQMTS AAVYAHLLDN VLLLPQGHPI RLSFEQQGYE SADDLLCIFE NELESLGYTP SVLPDGLENP PTIPLLMAHR QIIRHFLRWQ ASLERQKGTP LKNSELVALN NEDFVLYRRS ALGQVSTATA PVNAPPTVQS PIGKTRSAVE DFKRGIKRDK THYPVLKDDR YWDNFYRSFV VTAVTHNVDK VLDPTYIPTD PLEKSLFEEQ NKFVYSALEH TLQTDMGKNI VREHSFDFNA QEVFRKVVKH YTESASAKIS SSTTLGYLTT AKYGSSWTGT AEGFILHWKN HLRIYNDTVP AGEQLPQQLC LSLLENAVHD VPELRQVKIT ATLDLAKGGN PISYGGYLSL LLASASLYDN GNNLSNSRSG KNKRNIYANE LEYNPMDFES KPDVDYDIDV SPTAIYEANA HARKSSSRNR SPATNRERPY IPREMWNLLS DDAKAILQGL IAPGKQAPLN NSSPHQSLQA NTHDTIGAEQ ITTDTFHDCA PETELLAHLT ERVSHMSDGD IRKVLAASRN GPAYDEPTPL QSNVLQYQVS RHNVIETTAA LVDRGANGGL AGSDVMVLHK TGRSATITGI NDHTLSDLDI VTAAGYTESQ NGPIILIMNQ YAHLGQGKTI HSSAQLEHYR NHVEDRSRTV GGNQRIVTLD DYIIPLHIRQ GLAYMDMRRP TDKELATLPH VVLTSDVDWD PSVLDHEIDL ATSWYDDIYD LPQSPYVEPR FDHTGKYLHR HISLCNHRDD VVDRVLYCQQ HLVTKNVQDY EALRPCFGWV SAETVRKTIM ATTQHAREVY NAPLRKHFKS RFPALNVHRR NEPVATDTIW SDTPAVDNGA KFAQLFVGRR SLVTDAYPMK TDKEFVNTLE DHIRYRGAMD KLISDRAQVE ISKKVTDITR AYNIDQWQSE PNHQHQNFAE RRIATIEANT NNILNLSGAP DSAWLLCVTY VCYVFNHLAH ESLDNRTPLE VLTGSTPDIS VLLQFHFWEP VYYKLENATF PSGGTEQQGR FVGIADSVGD ALTYKILTHT TNRILHRSSV RSATIPGQTN LRLTPQDGES GPKPINFIKS RRTENKNSYA IKELPGFTPD DLIGRTNHAE NIGPRQALGC KVPCRNQ
|
| |
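The derived fields in this record (Gene Length, GC content) can be cross-checked against the coordinates and the gene sequence. The sketch below is a minimal illustration, not the database's own method. It assumes Start bp and End bp are 1-based and inclusive, which is consistent with the reported 3368 bp, and that the sequence is pasted as shown here, with bases grouped by spaces. The function names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Strip the spaces and line breaks used to group bases in the record."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates taken from the Gene Information table above.
print(gene_length(35120, 38487))  # 3368
```

Passing the full Gene sequence field to `gc_fraction` gives a value to compare with the reported 48%. Since the record marks the gene as spliced, the genomic span (3368 bp) is expected to be longer than three times the protein length.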