Gene Information |
Locus tag | PHATRDRAFT_45284 |
Symbol | AP4mu |
ID | 7199938 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 733928 |
End bp | 735629 |
Gene Length | 1702 bp |
Protein Length | 470 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179278 |
Protein GI | 219116967 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
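The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed Gene Length, and assumes one stop codon follows the 470 codons of the protein. The helper names `span_length` and `coding_length` are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 733928
END_BP = 735629
PROTEIN_LENGTH_AA = 470


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding bases needed for a protein; counting one stop codon is an assumption."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
non_coding = gene_len - cds_len

print(gene_len)    # 1702, matches the Gene Length field
print(cds_len)     # 1413
print(non_coding)  # 289
```

Under these assumptions, 289 bp of the span do not code for protein, which is what one would expect for a record marked as spliced: the span can include introns and any untranslated sequence.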
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000814146 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAGCA ATTTTTTCGT CTTGTCACCG CGGGGCGACA CGATTCTCGC CAAACAGTAT CGGGTCGACA ACCTCAAACA GAGCGCACAC GAACGATCGC ACGTGGAAGC ACTCTTTCGA AAAATCAAAT TCTGGGACGA CTTCGCGACG AGCGAAGCCG AAGAAGCGCA AGCGGAGAAA GCTCGGCAGG ATAACGGCAA GTCTAAACAT CAACAACAAC AATTTGAGAA CAATGATCAG CAAAAGAGAA TGGGCGACGC CCCGCCGGTA TTTCTCATGC CCGATGGATT GACATACTTT CACGTCAAGC GTAATGGTCT GATCTTTGGA GCGTCGACCG CGAGGAACGT TAGTCCAAAT ACGGTAGTCG AGGTCAGTAC ATGTTCGTCT GGTCTTTGGC GTTGTCGGAA GCATTCTTCG GAATAGACGG AAATGTAGAC GAGTATTCTG TAATTGCATG GACTAACATA ACCTTTGTTG CTTACTTGTT CCGCTTTCTC GGTCGCCACT ACCAGTTACT ATCAACGATT GCTCGAATAT TCAAGGACTA CTGTGGACTC TTGTCGGAAG AAGCCTTGCG CAAGAATTTT ATTTTATGCT ACGAGCTCCT CGACGAAATG ATTGACTTTG GCTACCCCCA GGTCACGCGG ACGGAAAATC TCAAAAGTTT TGTTTATAAC GAACCCATTG TGGTGGACCA CGTCGCGAAC ACGGGCACCA TGATCAACCC CAAGACAGCT TCCGCCAATG CCGTACACAA ACCCGTTATT TCGTCGGTCC ACGAAAACGG ACGCAAGTCG GGCCTCAACA ACAACCAAAA GAACGAAATC TTTGTCGATA TTCTCGAGCG TCTTAATGTA CTCTTCTCCA ACAACGGCTA CGTGCTCAAC AGCACAATCG ACGGCTGCAT TCAGATGAAA TCGTATCTGG CAGGGAATCC AGAACTACGG GTTGCGCTGA ACGAAGACTT GTCCATTGGA AAAGATTCCC GGTACAATGG TGTTGCGGTT GACGACATGA ATTTCAACGA CTGTGTTAAT CTTTCCGAGT TTGATTCCTC GCGAACAATA TCATTCATCC CGCCCGATGG CGAATTTATC GTGCTCAATT ACCGCATTAC CGGAGAATTC AATACTCCAT TTCGTATTTT TCCGTCGATT GAAGAAACCG AACCCAACAA AATTGAGATT GTTGTTCTCA TTCGCGCCGA AATGCCGAAC AATCATTTTG GAGCAAACGT GTCGGTCGAA ATTCCCGTCC CCCACTGTAC TACGTCCGCC TCATGTAGTC TTGTTTCGGC ACCGGGCACC GGACATGCGC ACGCCGAGCT AGTGGCCACT GAAGGCAAGA TTGTGTGGAC CATGAAGAAG TTTCCTGGCG GCGGAGAACA AACGATGCGG GCCAAAGTGT CGCTCAGCAA GCCCTGTACC ACGGCGATCC GGAGAGAAAT CGGACCTATC AATATGTGCT TTGAGATTCC CATGTACAAC GTTTCTAATT TGCAAGTGCG CTATTTGCGA GTAGCAGAAA ACATGGTCGG CTACACACCG TACCGTTGGG TTCGCTACGT AACGCAGTCC AGTTCCTACG TTTGCCGAGT GTAATACGCT CGCAAAGCGT CAGGTGGGGT TAGTCTGGGA TCGAGAAAGA GCAGTAACTA GTATCGAAAC ACGACTAAGC TAGATTCCGA ATTACACTCT TG
|
Protein sequence | MISNFFVLSP RGDTILAKQY RVDNLKQSAH ERSHVEALFR KIKFWDDFAT SEAEEAQAEK ARQDNGKMGD APPVFLMPDG LTYFHVKRNG LIFGASTARN VSPNTVVELL STIARIFKDY CGLLSEEALR KNFILCYELL DEMIDFGYPQ VTRTENLKSF VYNEPIVVDH VANTGTMINP KTASANAVHK PVISSVHENG RKSGLNNNQK NEIFVDILER LNVLFSNNGY VLNSTIDGCI QMKSYLAGNP ELRVALNEDL SIGKDSRYNG VAVDDMNFND CVNLSEFDSS RTISFIPPDG EFIVLNYRIT GEFNTPFRIF PSIEETEPNK IEIVVLIRAE MPNNHFGANV SVEIPVPHCT TSASCSLVSA PGTGHAHAEL VATEGKIVWT MKKFPGGGEQ TMRAKVSLSK PCTTAIRREI GPINMCFEIP MYNVSNLQVR YLRVAENMVG YTPYRWVRYV TQSSSYVCRV
|
| |
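The sequences in this record are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be normalized before computing its length or GC fraction is shown below. The function names `normalize` and `gc_fraction` are hypothetical, and the example uses only the first three blocks of the gene sequence, so the value it prints is not the GC content of the full gene.

```python
def normalize(seq_field: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the Gene sequence field, used as a small example.
example = "ATGATAAGCA ATTTTTTCGT CTTGTCACCG"

seq = normalize(example)
print(len(seq))                    # 30
print(seq[:3])                     # ATG
print(round(gc_fraction(seq), 3))  # 0.367
```

Applying the same two functions to the full Gene sequence field would allow a comparison with the listed Gene Length and GC content values.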