Gene Information |
Locus tag | PHATRDRAFT_18241 |
Symbol | AP3mu |
ID | 7197467 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 900013 |
End bp | 901410 |
Gene Length | 1398 bp |
Protein Length | 416 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178028 |
Protein GI | 219112553 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATCCC TGTTCATTCT TTCACCCACG GGAGAAGTAT TGATTGAACG TCACTTTCGT GGCGTTGTGA CTTCTCGATC TGTTTGCGAA ACCTTTTGGG AGCGAGCGTC CGAGTCGGTC AATCACCACG GCGGGCTTTC CTCCGCGACG AGCCTGCTGA CGTCGCTGCA CTACGATAGC GTACCTCCCG TAATGGAGGT TCCAGAATCT GACCAAGGAA CACTCTACGT TATATCCATT CTACGCGAAG GCCTCAGCTA TTTGGCCGTC TGTCCAGCCG AAGTCAGTCC GCTTCTTATT ATTGAATTCT TGCAACGAAT CGCCAATATC TTTGTCGAGT ACTTTGGACC TCCGGCGGAC GAATCCGCCA TAAAAGACAA TTTTTCTACC GTTTATCAGC TAATCGAAGA GATGGTTGAC TTTGGATGGC CGTTAACAAC GGAACCCAAC GCGCTCAAGG CCATGATTCG TCCACCCACG GTGATGAGCA AACTGTTGCA ATCATCGACG ACCGTCAGTG ACGAATTGCC GTCGGGAACG ATTAGTAACA TTCCCTGGCG CGCCGCAAAC GTACACTACA CACAAAACGA AATTTATATG GACATTGTGG AGGAGGTTGA CGCGATTGTA AACGCTTCTG GCGCGGTTGT GTCGTCGGAC GTTAGCGGGT CGATTCAATG TCAATCACAC CTGTCCGGTG TTCCGGATCT GTTGCTAACG TTCAAAGAGC CGGATCTGAT TGACGACTGC AGCTTTCATC CTTGTGTACG CTACGCTCGA TTCGAAAACG ACAAAGTGGT TTCCTTCGTC CCGCCGGACG GTAATTTCGA GCTCATGCGA TACCGCATAC ATCCGGAGCG AGCACGCAAT TTTAGTCCTC CGGTATACTG CCATCCGCAA TGGTCATATA GCTCCTCAAC GGATGCGTCA CAAAGCATAA CATCTGAGCG ACCTACCAAA AACGGCCGTA TAGCGCTACA AGTTGGTGTC ACAACTTTGA GCAGTTTGGT GTTTTCGGCG TCAAGAAAGG GCCCCCTGCA GGTTGAAGAA GTGGCTGTAC TGATTCCGTT TCCTAAACAG ACACGAACGA CTGCTGGGTT TCAGGTCAAT ATTGGTTCGG TCATGTATGA TGAAGCCGCC AAAGTTGCCC GCTGGACGCT CGGCAAGATG GATGCGTCTA GAAAAGCGAC CTTGTCGTGT ACTTTTACAG CCCTGACAAG CAACGACGAA GAAATCACAT CCTCCATACC CAATGTATCG CTCACTTGGA AGATTCCGCT AGCATCCGTA TCGGGATTGT CCGTCAGTGG TCTCTCCGTC ACTGGAGAGT CCTACAGACC ATACAAAGGT GTACGGAACG TTACCAAGTC GGGCCTATTT CAAGTGCGGT GTAGCTGA
|
Protein sequence | MQSLFILSPT GEVLIERHFR GVVTSRSVCE TFWERAVPPV MEVPESDQGT LYVISILREG LSYLAVCPAE VSPLLIIEFL QRIANIFVEY FGPPADESAI KDNFSTVYQL IEEMVDFGWP LTTEPNALKA MIRPPTVMSK LLQSSTTVSD ELPSGTISNI PWRAANVHYT QNEIYMDIVE EVDAIVNASG AVVSSDVSGS IQCQSHLSGV PDLLLTFKEP DLIDDCSFHP CVRYARFEND KVVSFVPPDG NFELMRYRIH PERARNFSPP VYCHPQWSYS SSTDASLVFS ASRKGPLQVE EVAVLIPFPK QTRTTAGFQV NIGSVMYDEA AKVARWTLGK MDASRKATLS CTFTALTSND EEITSSIPNV SLTWKIPLAS VSGLSVSGLS VTGESYRPYK GVRNVTKSGL FQVRCS
|
| |
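The derived fields in the record above (Gene Length, GC content) can be cross-checked from the coordinates and the gene sequence. The following is a minimal sketch, not the database's own method. It assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported 1398 bp. The function names `gene_length` and `gc_percent` are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record: Start bp 900013, End bp 901410.
print(gene_length(900013, 901410))  # 1398

# First two 10-base blocks of the gene sequence, as a short illustration.
print(gc_percent("ATGCAATCCC TGTTCATTCT"))
```

Passing the full Gene sequence field to `gc_percent` and rounding the result gives a value that can be compared against the reported GC content.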