Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48767 |
Symbol | |
ID | 7195043 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 192923 |
End bp | 194021 |
Gene Length | 1099 bp |
Protein Length | 314 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183312 |
Protein GI | 219126120 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.977253 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGGT CCAAGACCGC CGCAAGCGAC GTGCAGCAAC AACGCTGGCA AAAAACCCAG GACGAGGACA CATTGCACTC CCTGTTTGAG AGGAAAAAAG CATCCATTCC CAAGAGTATC TGCAATCAGA CACTTTATTT GTACACTTTT AGCAAAAATA ACTCTTCATC TCATATACAG TTTGGAAATG TTGACGGCTC GTACTTTCTT ACGCTCGACC TTGTCGCTTC CGCAGCAGCG TCATGCTTGG GAGTCCACCG GAGTCCGCAC CTTGTCATCC AAAAAGCCCG TGGCTCCGGC AACGGCGAAA GCAGCGGTGG AGGAAGATGT AATTTCTCCG GTAACTGTGC GTCGTGCGAA AATGTCCAAA ATTGATAGGA ACATGGTGTT CGGTACTACG GGAAAGTTCC TCGCTCCCGT CCTACCAGAG AATCCGGCGG AAATCTCCGC GCTGGACCCT GCCGATCAAG GCCACCGGCT CAAAATGGAC GGGACCGCCC GGGTCGTTAT GATTCGCCAG GAAAGGGCTT CGAATCGCCA ATCCCCACTC AAACACGAAA AGTATTGGCG TATCTTTTTC TACGAAGACG GCATGGTGGC GGAAAAGTGG ACGAATTCGC TCATGGGTTG GACTTCGAAC GGCGATCCCT ACCAATCAGC GCCACCGTTG ATCTTCCCCA ACGCGGCCGA TGCGGTGTAC TTTGCAAAAA AGCGGGGTTG GAATTTTGTA GTCAAGCAAC CCATCATGCG TGACCCACGC GAGGATGGTG CTCAGTACCA AGACAACTTT TTGCCCCTGG CCGTGGCGGC CAGGGTCCAA AAGGAAGGTG TGTCGTGCGA TCAGTGGGCC CGGGATCACG CCGGTACATC CAGCTACTTC CGCCCGCTCA AATACCACGG TGATGGCCTC GTGCCACAGC ACGGACCCAA CGGAAACGCC CCGATCGCCA AACACGTTCC CGGCTACTAC AAGTTGCGAT AAACTTTTTA CAATGCTGCA CTGCACCCAA CCGGTAGGTT TGAAACAAAA GTACTGGAAT TGAGGAAGGA GGGTAGAGGC GTAGGAATGA ATGAAAAATA AGAAATTGTT GCGTAATGG
|
Protein sequence | MKRSKTAASD VQQQRWQKTQ DEDTLHSLFE RKKASIPKTK ITLHLIYSLE MLTARTFLRS TLSLPQQRHA WESTGVRTLS SKKPVAPATA KAAVEEDVIS PVTVRRAKMS KIDRNMVFGT TGKFLAPVLP ENPAEISALD PADQGHRLKM DGTARVVMIR QERASNRQSP LKHEKYWRIF FYEDGMVAEK WTNSLMGWTS NGDPYQSAPP LIFPNAADAV YFAKKRGWNF VVKQPIMRDP REDGAQYQDN FLPLAVAARV QKEGVSCDQW ARDHAGTSSY FRPLKYHGDG LVPQHGPNGN APIAKHVPGY YKLR
|
| |