Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54730 |
Symbol | AP4beta |
ID | 7202532 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 489032 |
End bp | 491741 |
Gene Length | 2710 bp |
Protein Length | 805 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181569 |
Protein GI | 219122474 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.442884 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGACTAGAA AAAAGAGTGT TAGGACCAAC GCAGGTCCTT CTCTTTCGTC TTTTTCGGAT CGGATAGCGG ATACATCGTC TAACGCGTTC GTTCGTTCGT TCTGGCTTTC ATCTAGAATA GTGTATACGA CGGGCCTCCA CTCTGGCACG TACCGACAAA CAAGCTATGT CGACCCCGTA CGCTGGCCAA GCCCCACCGC CTCCGGGCGG GGCTACGGCA ACAGCGGGTG TTCCCGATTC GTACTTTACC GAATCGCGCA AGGGTGAAAT CAACGAACTA CGAACCCTGT TGCGAGCCTT TGCCACCGAA CGGGATCCGC AACGCAAACG AGATATTATT AAAAAAGTCA TTGCCTATAT GACGCTTGGC ATTGACGTAT CGCGACTCTT TTCCGAAATG ATGTAAGTTG ACATAGAACA CTCTGCGTCG GTTTTTGCAC TGCTCGTCGC TGGAACTACG TTTGTTTCTA ACACTGTAGA AATCCATTCC TTCTCTTGTT GGTGTGGTCA TTTGTGTGGA AACAGGATGG CGATTGAAAC ACGCGATCTC GTTATCAAAA AGATGGTCTA TTTGTACTTG ACCAACTACG CCCGCACCCA TCCGGATCTG GCACAAATGT GCACCAACAC CTTGCAGAAA GACTGCGGCA ACGAGGATCC CATGGTCCGT GGCCTCGCCT TGCGGGCCCT CTGCGGTCTT AATCTCCCCC AAATGGTCGA ATACATTAGT GAACCGTTGC GCCGGGCCTT GACCGATGGA CACGCCTACG TGCGCAAAAC AGGAGTCATG GGGATCCTCA AATTGTACCA TTTGGACCCG GATGGCTTCC ACGAAGCAAA CTTTGTCGAT ATTCTCTACG ACATGTTGCG GGATCCCGAC GCCAGTGTCA TCACGAACTG TATCATCGTG CTCAACGAAG TTATGCAAAA ATCTCCCAAC GGTGGCATGG CGATTAATCG GGCCATTATG TTGCACCTGC TCAACCGGAT TCACGAGTTT AACGAATTCG CCAAGGTGCA AGTGCTGGAA CTCGTTCCGC GCTATATTCC CGCCAACGAA GACGAAGGCT TCCAAATTAT GAATCTGCTT GATCCCGTCT TGCGCACATC TTCTAGTGGA GCCGTTGTGG CAACCGTGCG GGCCTTTCTG AGTCTTTCGG ATACCCTGGA CGACGGATCT GAAGCAATGA AGCGGCAAAT TGTGGCCCGC GTCAAGGCGC CACTCGTGAC GCAAATATCC TCTGGGTCGT CCGAAATTAT GTACACCCTA CTCAAACACG TCGACACCTT GACGACCATT TGTCCGGGAG TCTTCGACGA CGAATACCGA CAATTTTATG TACGCTACAA CGAACCCACG CACGTAAAGT ACCTGAAAGT TGCCATTCTA CCCCGCATGG CGAACCCGGA CACGGCGCCC GACATTGTTT CCGAATTGGC CGAAATGGTC CACGATCGCA ACACCAAGCT GTCTCGCGCA GCCGTCGTAT CTATGGGCCG GATCGCGTGT AGTGGTAACG GTGGAGCGGG TGCCGCTGAG AGCATTGCAC GCCGACTCGT GGAACTGATG GATTCCGGAA CCGATCATAT TGCGAGCGAA GCCGCTACTG CTTTGACTCT CATGGTCCGC AAAGAACCTT CCATCAAGAC GCTAGTGGCA CCACCTCTAG TGCGATCGCT CAAGTATATT GCAGAGTCGT CTGGTAAGGC TAGTACCATT ATCTTGTTGG GCGAATGCGG AGAGCTCGTC ACCGAAGCCC CTTACGCACT GGAAAAATTG ATTGACACCT ACGATGACAT CCACGATGTG AACATCAAGA TTGCACTTTT GACAAGCACT GTGAGATTGT TTTTTATGCG ACCTCCGGAA GTTCAGCGCA TGCTTGGACG GTTATTGGCG GTAGCAACAG ATGATGTCTC GTCGCAGGAT TTGCACGATA GGGCTTTGAT GTATTATCGG ATGCTGCAAT CTGGCGCCGA CCCACACACA TTGGAACGCG TCGTTCGAAC TAGTACGGTG GTCGCGCAGG GTGTGAGTTT CGCCGAAGAA GACGACTCGG AACTTCGCAA GGAACTCATG GAAGAATTCA ATACGCTGTC AATCATTTAC GGCAAACCAT CAGTCAACTT TATTGCGCCC GAATTCCAGG TCAAATACAA AAAAATGCCA GACGAGCACC CTCTGGCACC TGGAGAAACG GGTTCATTTG TTGCCCCGCC GGTGGCACCC GTCCCTGCAG CGAGTATTCC CGTCGTCTCA AACGATGTAG ACTTGTTGGG ATTTGGTGAT GCTGAACCGA TGGCCGCACC AGCACCGATG GCATCCAACA ATCTCACACT GAGTGCTTCG GCGTCCATGA CCGGCGGAGA ATACCAAAGC CAGTGGGGAT CGGTTTCGGA TGCCGATGCG ACAGTGTCTG ATGTTCCCCT GCGAGCTTTG CCGTCCTCAA CAGATGAGAT TGAAAATGTC CTGGCCGCCG TAAATATCAT GACCATGGCA AGTGGGGAAT TGCCAAACGA ATTCAAATTC TTTTTGTACG CAAAAGACAG TTCGAGCGGT GCAACAATTA TGATTCAAAC AAACGTGGAC AAGTCTTCTC CGTCGGATGC CTGTATGATT GCTACGATAA AGATTGGTGG GCCGTGTCTG GACCCTACAG GATTGGCTGA ACAGGTCATT CAAATAATGC GTATGCAACT ATCCTAGTAG
|
Protein sequence | MSTPYAGQAP PPPGGATATA GVPDSYFTES RKGEINELRT LLRAFATERD PQRKRDIIKK VIAYMTLGID VSRLFSEMMM AIETRDLVIK KMVYLYLTNY ARTHPDLAQM CTNTLQKDCG NEDPMVRGLA LRALCGLNLP QMVEYISEPL RRALTDGHAY VRKTGVMGIL KLYHLDPDGF HEANFVDILY DMLRDPDASV ITNCIIVLNE VMQKSPNGGM AINRAIMLHL LNRIHEFNEF AKVQVLELVP RYIPANEDEG FQIMNLLDPV LRTSSSGAVV ATVRAFLSLS DTLDDGSEAM KRQIVARVKA PLVTQISSGS SEIMYTLLKH VDTLTTICPG VFDDEYRQFY VRYNEPTHVK YLKVAILPRM ANPDTAPDIV SELAEMVHDR NTKLSRAAVV SMGRIACSGN GGAGAAESIA RRLVELMDSG TDHIASEAAT ALTLMVRKEP SIKTLVAPPL VRSLKYIAES SGKASTIILL GECGELVTEA PYALEKLIDT YDDIHDVNIK IALLTSTVRL FFMRPPEVQR MLGRLLAVAT DDVSSQDLHD RALMYYRMLQ SGADPHTLER VVRTSTVVAQ GVSFAEEDDS ELRKELMEEF NTLSIIYGKP SVNFIAPEFQ VKYKKMPDEH PLAPGETGSF VAPPVAPVPA ASIPVVSNDV DLLGFGDAEP MAAPAPMASN NLTLSASASM TGGEYQSQWG SVSDADATVS DVPLRALPSS TDEIENVLAA VNIMTMASGE LPNEFKFFLY AKDSSSGATI MIQTNVDKSS PSDACMIATI KIGGPCLDPT GLAEQVIQIM RMQLS
|
| |