Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49671 |
Symbol | |
ID | 7198154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 368620 |
End bp | 370213 |
Gene Length | 1594 bp |
Protein Length | 454 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184353 |
Protein GI | 219128298 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.245553 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTTAGAGGAA CTCACCCCCG TCCACACACA AACAACGCTT CTACAGGACG ACAAAAACTC GGATTGTGAC GCCAAGCACA CCCTCACCAA TTGAAGAGAC TCACAGTCAA GCGTTGCCTT TGCTTCTTTT CTTGTAGCCC CCATCGCTAC TGTTCATCAT GTCCAACAGT CTTTACGATG ACGGCTCGTA CAATGCCTTC CGCGTCCCTT TCTACCGCAC ACGCAAGTTT TTCATCGGAG TCACTCTTTC GGTCATGCTA CTGATTGCGA TCGTTGCCGT GGTAGTTTCC GGGAACAAGG GATCCGAGTT TGCAGCGCTA GGAAATTCCT CCACCCCTCG TGTTCCGGAA ATAGAAGTCT CGGAATCTAC TCTTCAAGAA AACGAAGCGG AACTCGGAGC CGCCCTGATC CAACTTTACG ATCGATTGGA CATTCCGTGG AATGGACTAT ACGAAGACGC TACTCCTCAG GGCAGGGCCT TGCAAGCTGT GGCGGGTACA AAGCTCTACG CTTCGTTGGA CCGCGTTCGT AGTGTCCAGC GCTACGCCCT CGGAGTCTTT TACTACTCTA CCTTTGCGGT TGCGCATCCG TACTTGGAAG CCGAGGACAC CCGTCCCTGG GGTTCCAGTG ATTTCTGGAT GAGCAGTACT CCCGAGTGTG AGTGGGAGGG CATCACGTGT GATGACAGTG GTCGTGTCGC AGCGATCGAT TTGTCAAGCA ACTATCTGTC CGGGACGCTT CCACTGGAAC TTGCACTTTT GGACAAGCTT GTGGGTCTGA ATCTGGCAAA CAACTATATA TTCGGGGAAG GTGCCAGCAA CGACGTCTGG AGCTATTTGC CAAACCTACA GGACCTTATG ATGGACGATA ACTTTGTGAT TGCCACCACG GGACTACCAT CCCAAATGAA GAGCTTGGAA TCGATTCAGA AGTTGTCTGT TTCTTACAAT CTACTGCAGG GTGTTCTGGA TGGCGAGATT ATCGGCAATA TGCAGCGTCT GTCACATCTG GAAGTGGAAT CAAACTATAT TTCTGGTGAA CTTCCCGTGG AACTTGGAAC TCTGCCCGAT TTGGTCTACT TTTACATCCG CCGTAACAGT CTTTCGTTCA ACTTGAACAA GCTCATCGTG CCGAACCGAT TTCCAAAGAT CTTCGCGTTG TGGCTCGACT CCAACCCTAT TACGGGAACG ATTCCTAGCG AGATTGGAAC GCTGACCACA CTCACATCCT TCAGCCTCAC CAATGCCACT TTGACGGGGA AAATTCCCTC CGAAATGGGC AATTTGGCCA AAATGAAGCG TTGCTGGTTG TACGACAATG CGTTGACTGG GACAATCCCA CAAGCTTTGT CGAGTTGGGT CGACTTGCAA GTTTTGGAAG TGTCTGGAAA CAACTTTGTC GGAGACATGC CTCAAGGTGT GTGCGACGCC ATTACGGCCT CAGACTACCA GTTCAAAACC TTATCAGCGG ATTGTACCCG TATCGCCTGC GAAGGATGTT GTACGGAATG TGAAAACAGT TAAACGACAT TTTTTTGGGT AGAAAGCAAT ACCCTTTTAT ATTTTAACAC TTGAGATTAT GTTTACTTTG TCTC
|
Protein sequence | MSNSLYDDGS YNAFRVPFYR TRKFFIGVTL SVMLLIAIVA VVVSGNKGSE FAALGNSSTP RVPEIEVSES TLQENEAELG AALIQLYDRL DIPWNGLYED ATPQGRALQA VAGTKLYASL DRVRSVQRYA LGVFYYSTFA VAHPYLEAED TRPWGSSDFW MSSTPECEWE GITCDDSGRV AAIDLSSNYL SGTLPLELAL LDKLVGLNLA NNYIFGEGAS NDVWSYLPNL QDLMMDDNFV IATTGLPSQM KSLESIQKLS VSYNLLQGVL DGEIIGNMQR LSHLEVESNY ISGELPVELG TLPDLVYFYI RRNSLSFNLN KLIVPNRFPK IFALWLDSNP ITGTIPSEIG TLTTLTSFSL TNATLTGKIP SEMGNLAKMK RCWLYDNALT GTIPQALSSW VDLQVLEVSG NNFVGDMPQG VCDAITASDY QFKTLSADCT RIACEGCCTE CENS
|
| |