Gene Information |
Locus tag | PHATRDRAFT_45993 |
Symbol | |
ID | 7201055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 889364 |
End bp | 890556 |
Gene Length | 1193 bp |
Protein Length | 322 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180136 |
Protein GI | 219118738 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
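The Gene Length value can be cross-checked against the Start bp and End bp rows. A minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention; `span_length` is a hypothetical helper name, not part of any database tool):

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp rows of this record.
print(span_length(889364, 890556))  # 1193
```

Under that assumption the result agrees with the 1193 bp reported in the Gene Length row.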
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00551878 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGATG CCAAGAGGAA CACTACAGCG TCTTCTTCGG AAGAAGAATC GTCTTCGGAC GACGACAGTT TGGCATTGGA AGGGGTCGTT GCCCGGCATC CTGACGCCCC CTCCTCGTCG GACGATGACG ACGAAGGAGA AGAAGAACCT TGTGAGCCCA CGGCTGACGA AACGGCCAAT CAAAAAGACT TTGGCACTGA CAAACGCAAG GGAGCAACGA CAAGGTCGTC CAAAGTTCCA CCACCCAAGC GGAAAAAATC CAACGAATCC GAAACAATGC AAGTCGAATT TACCTTTCAC GATATGGACG ACAAGTTCTT TCACGGACTC AAATCCCTCC TCCACAACAG CTGCACCACT TACCAACCAC ATTCGTCGGA GCTGGTAGAC TTGATGATAA ACAATATTGC CGTGGGGACC GTCCTGTCGA CACAAGGCGA TACGGAGAAT AACGTCTACG GCTTTGCCTC TGTCCTCAAC ATTACCGAGC ATCAGCAATC GCCCGCCATA CAGCAGCTTC AACGATTTTG TCTCGACGGA TGCCCGGCCG ATCGGAGTGC AGAACTAGAG GTCGTCTTGT CGGGAAAAAC CAAACGGCCC GCCGGCTTTG TCTTGCACGG TCGCATGCTC AATCTGCCTT TGGAAATTGT CGAAGTCTTG CAGCAGCAAC TCGTCTTGGA TATGGATTGG GCCGTCGAAC ACGCCGAGGG CGGTGTTGAC GCGCGCAAAG CCCTCGACTT TGGAGTCTTT CTCCGACTCG CTCCCTGTCA AAAGGACAAC ACGGGAGCAC TGGTCTATCG CTTTTTTGAC GACGAAGTGT TGGCGGGCCA AGCAGACTTT AGTTTCCTGG TGGAAGCTCC CGCCAGCTAT TCCAAGGAAG ACAAAAATTA CGTATCCGTG ATTGTTCTCA CCAAAACAGG ACACCGTGCC GCCATGAAAG ATCTAGCCAA GCTCATTCAC GGAAGATGAA GGCATGATCA TGCAACGTGT CTTGTTCATG AATGAGGGAC ACGCTCCCTT TATCGATAGC CAATACCCTA CACCGACAAA CCGTAAAACC AAGTTTTTTA CAAAATTGTA CCGCATACAC GTTGTTTCTG TTTGGACAAA GTTCAGCTAC GATTCAGAGT TTAACAATAA ATTCGTACAA TTCGCGGATT CCTCTCTACT GCCCATCAAA CTAATTCCAT ATT
|
Protein sequence | MPDAKRNTTA SSSEEESSSD DDSLALEGVV ARHPDAPSSS DDDDEGEEEP CEPTADETAN QKDFGTDKRK GATTRSSKVP PPKRKKSNES ETMQVEFTFH DMDDKFFHGL KSLLHNSCTT YQPHSSELVD LMINNIAVGT VLSTQGDTEN NVYGFASVLN ITEHQQSPAI QQLQRFCLDG CPADRSAELE VVLSGKTKRP AGFVLHGRML NLPLEIVEVL QQQLVLDMDW AVEHAEGGVD ARKALDFGVF LRLAPCQKDN TGALVYRFFD DEVLAGQADF SFLVEAPASY SKEDKNYVSV IVLTKTGHRA AMKDLAKLIH GR
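The start of the protein sequence can be reproduced from the start of the gene sequence. The sketch below is only an illustration: the Translation table row is empty in this record, so the standard genetic code is an assumption, and `CODON_TABLE` and `translate` are hypothetical names introduced here.

```python
# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Codons are enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate DNA in frame 1; whitespace between 10-base groups is ignored."""
    clean = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(clean) - 2, 3):
        residue = CODON_TABLE.get(clean[i:i + 3], "X")  # X marks an unknown codon
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the Gene sequence row.
print(translate("ATGCCGGATG CCAAGAGGAA CACTACAGCG"))  # MPDAKRNTTA
```

The printed residues match the first group of the Protein sequence row. Passing the full Gene sequence value with `to_stop=True` would stop at the first in-frame stop codon.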