Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_8926 |
Symbol | |
ID | 7196864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1971725 |
End bp | 1972771 |
Gene Length | 1047 bp |
Protein Length | 349 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176883 |
Protein GI | 219110263 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.304945 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATA CAGCCGAGCA GATATCGATC GATGATGTTT CGAAGGAGTT GCGCGTCGTA CGTCAAGAAT TTGGAGAAAA AATGGAGCGC GTCACTTCGG TTGCCGACGC GGAAGTCATC CGTCGGGAGT ATCTTGGCAA GAAAGGACCC ATCAACAAAG CGATGGGATA TATGAGACTA CTACCAAACG AAGACAAGCC CAAGTTGGGT GCGGTTGTCA ATGAAATTAA GGAAGCCCTG GAAACAACCA TGACGGAACG CATGGATGCT TTGAAGGTAG CGGAGATCGA AGCGGCGATG GAGTTGGAGC GGATCGATGT CACGCAACCG GGTTTATGGA ATTCGCCAGA TATCGGGAGA CGCCACCCTC TTAGTATTAC AATGGAAAAG GCGGTGGATA TTTTCACCAA GTTGGGATAC GATACTGTTA CCGGCTGTGC GGATTCTCCC GAAATCGAAA ACGATTACTA TTGCTTTGAA GCCCTCAACT GCCCCAAAGA TCATCCCGCT CGTGATATGC AGGATACTTT CTATCTAACG GAGGATCTTG AACTCATGCT TCGAACACAC ACTTCGGCGG TACAGATTCG CCAGTTGGAA AAACGCAAAC CTCCGCTCCG TATCGTGGCT CCCGGACGCG TTTATCGAAA AGACGATATT GATGCCACCC ATTCCCTTAT GTTTCACCAA GTGGAGATCT TGGCATTGGA AAAGCGCGGC GAGCTCAATC TCGGACATTT GAAAGGAACG GTGGAGCATT TCCTTCAAAA TATGTTCGGA CCCAATATTA AAGTCCGTTT CCGTGGTAGT TACTTTCCGT TTACGGAGCC CAGTATGGAA GTCGACGTCT TTTTTCGTGG CAAATGGTTG GAAGTGTTAG GATGTGGTAT GGTGGATCCA AACGTCTTGG AAATGGCCGG AATCGACCCG AACGAATACT CGGGATTCGC CGCCGGCTTT GGAGTGGAGC GCTTCGCGAT GGTAATTCAC GGCATTACCG ATCTGCGCGA GTTTTACAAG AATGACAAAC GGTTCTTACA ACAGTTT
|
Protein sequence | MSDTAEQISI DDVSKELRVV RQEFGEKMER VTSVADAEVI RREYLGKKGP INKAMGYMRL LPNEDKPKLG AVVNEIKEAL ETTMTERMDA LKVAEIEAAM ELERIDVTQP GLWNSPDIGR RHPLSITMEK AVDIFTKLGY DTVTGCADSP EIENDYYCFE ALNCPKDHPA RDMQDTFYLT EDLELMLRTH TSAVQIRQLE KRKPPLRIVA PGRVYRKDDI DATHSLMFHQ VEILALEKRG ELNLGHLKGT VEHFLQNMFG PNIKVRFRGS YFPFTEPSME VDVFFRGKWL EVLGCGMVDP NVLEMAGIDP NEYSGFAAGF GVERFAMVIH GITDLREFYK NDKRFLQQF
|
| |