Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_1550 |
Symbol | APC6 |
ID | 7196023 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 784481 |
End bp | 785935 |
Gene Length | 1455 bp |
Protein Length | 381 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176654 |
Protein GI | 219109801 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.058598 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTGGAGAGG CACGTTCTCA GTTGGGTGAC GGCTCCCGAG CTGCTGTTTT TTGGAAACTA GCCCTTCGTC TTGATGCGGC ATGTCAGGCT GCATGGAATA GGTTACAAGA TTTTCATCTT TTGACGTCAT CCGAATGCCA TGACCTTGTT TTGAGCTTAC ATTTCGGAGA GAAGGCGTGG TTGCGGGATT TGTACATGGC CAGCATCGCG CTTACACCTC AGGATACAGA GAAGCCAGAT ACCGAAGGCT CAGGACCGTC ATGCGACGCC ACGGACACTT ACAAATTTAT CCCACAAAAT AATGCGGCGA ATTCCTCCTT TACTATGCAT GGCGCTGGGA ACTTGGACGC AAGCTCTATT CAGTTAGCTT CTCCAATTTT CACGTTTCCT ACGTCTACCA ACGCGACGAC GTCCAAGCCT ACCGCTGACG ATTCGTGGAA ACCGGGCACG GGGATCTCCA AGATCCAGCA AGACGTCGAT CATGCTTTCC GCAAGCTCTG GAGCGACTAC AAACTAAATC ATTCACCCCA AGTTCTGGCC ATGGCAGCCC GGCGTGCATA TCGCCAGTAC GATTGGTCTG GGGCGCTCAA GCACTGCCAA GCGTTGGCAG TGCTAGACCC GTCAATGGCC GGCGCCGCTT ACTGCTACGT AGCCACGCTC GTAATTCTTG GGCACAAGCG TGTTTTGTTC CGTCTCGCCC ACGAATGGGT CGACGCCAAC CCTAAAGCGG CCTGTGCCTG GTTTGCTGTC GGTGCTTACT ACTTCTGCTG CGAGCGTTAT CATGTCGCAC AGCGCCATTT TTGTCGGGCG ACGCGCCTCG ATTCGTCTTG TGCAGAGGCG TGGATTGCCT TTGGCTGTGC CTTTTCAGCT TGCGATGAAT CTGACCAAGC ACTGGCATCC TATCGAGCCG CCCACCGTCT TAGCCCTGGG GAGCATACGT CGCTATTGTA CATGGGAATG GAGTACGCTC GAACCAATCA CAAAGTCTTG GCAGAATATT TTTTACAATC GGCCTGGGCA TCGAGCGGTG GTGATCCGCT GTGTTCGCAT GAATTGGGAG TATTGTATGC ACAGAAAGGT CAACACGAAA AAGCAGTTTT TTGGTTTCAC CGGACGTTGC GAGTGGTTGT GGCTACTACA AGCGGTGGGA ATGAAGAAGA CATCAAACGG TCTCGGTCCA TGCAGGAGTG TTTGGACCTT TGTCAGGATC CCTACTGGGA GGCGACTCTG TATAGCTTAG GACACTCGTA CCGCAAGATG CGACAGTATG AAGTGGCAGC CTCTTGTTTC GATCGATGTA CCGCATTGTG TCCCATGAAG TTCTCAACAT ACTCTGCCCT TGGTTTAACA AAGCACTTGA ATGGTGATGT GGATGGCGCT ATTGACCTCT ACCATAAGGC GCTATGCTAT CGACCGGACG ATTCTTTTAG TACAGAGATG TTAAAAAGAG CTTTG
|
Protein sequence | RGEARSQLGD GSRAAVFWKL ALRLDAACQA AWNRLQDFHL LTSSECHDLV LSLHFGEKAW LRDLYMASIA LTPQDTEKPD TEGSGPDYKL NHSPQVLAMA ARRAYRQYDW SGALKHCQAL AVLDPSMAGA AYCYVATLVI LGHKRVLFRL AHEWVDANPK AACAWFAVGA YYFCCERYHV AQRHFCRATR LDSSCAEAWI AFGCAFSACD ESDQALASYR AAHRLSPGEH TSLLYMGMEY ARTNHKVLAE YFLQSAWASS GGDPLCSHEL GVLYAQKGQH EKAVFWFHRT LRCLDLCQDP YWEATLYSLG HSYRKMRQYE VAASCFDRCT ALCPMKFSTY SALGLTKHLN GDVDGAIDLY HKALCYRPDD SFSTEMLKRA L
|
| |