Gene Information |
Locus tag | PHATRDRAFT_41562 |
Symbol | |
ID | 7199400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | + |
Start bp | 175042 |
End bp | 176607 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185495 |
Protein GI | 219130697 |
COG category | |
COG ID | |
TIGRFAM ID | |
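The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(175042, 176607)
print(length_bp, protein_length(length_bp))  # 1566 521
```

Both values agree with the Gene Length and Protein Length rows of the record.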
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0981287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCAAT TTAAAGTTAC AGCTTCCATT CTGCTCAGTG CTTTGCTTTT GCAAGGATCG CAGGCTAAGT TTTTCTCGGA GGACTCTGAG GTCCTCACTG TCAATCGTGT ACCAAAGATT AGCAAAAGTG GCAACAAAAG CCAACCTCTA AGGGAACCGG AATCGGTGCC TTCGTCGTCT TTTTCGGGCG GTTTTATTAC AGCCGAAAAG GAAGTTTCGA CTGCCTTCCG AGACCGTACC AGTACCGCTT TTGGCCGAGT CTCCTGGGGA AACAATACTC TCAAACGGAG CAAAAGGACA AAGAGTGCCA AAGGCTCCGA CCCAGGAAGC TCAGATTCAA CTCCGGGCAG CGAGATTCTA ACTGTCATTC GTCAACGGCA AAAACGAGGG GGTAAGGGTT CGAGCAAGGA TGACAACGCT TTACGAACGA TTGAGCCAAA AGAGAGCCCC ACTCCGTCAC CGGTGGACCT TTCCACTTTT TCTCCCGGTA TGATCGACGA AACGCCCCAA CCCAGCCCCG AAGAAACTTT TGCCCCCACA TTGACTGGAA CGACGCCTGT ACCAAGCATA ATTCAGACAA ATCCTGGAGT ACTGGCAACA CCATTTCCTA CTGACGAAGA TGCCTTGCCG ACGCCTTTTC CTACATTCTT TCCAACTGCG AACGAAAACC CTTTCCCTAC CATTCCACAA AGTACATTCT TGCCCACTCC CACACCTCTA TGCTTCGAGT CATCGCTGGA ATTGCAATTC GCGGTAGACG AGTATTTGCT AGACAGCAGT CCCGACACGG AAGTGGCGTT CTTCTACGGG CATCCGATGG AAGAGTGGTG CGTCTCCAGC ATCGTGGACT TCAGCAACCT TTTCTCTGCC TTTCGGAATT CGGACACCAG TACCTTCAAT GAGCCTCTGA ATGGTTGGGA TATGTCGAGT GCCGAAACAC TCGAAAACAT GTTCGCGGGG GCCGAAAGCT TCGACCAACC CCTGTTTGAT TGGGATACGT CCAACGTGTC TACTATGACT CGAGCGTTTA GTGGAGCAGA AAGTTTCAAC AGCGACATAC GAGCTTGGGA TACCTCAAAC GTCCTAGATA TGCAAGCAAT GTTTGCGGGA GCTATCAGCT TTAATGGCAA TATTGCTTCT TGGGACATTC GAAATGTGGA GAATTTGTCT TTCATGTTCG CTGAAGCAAC GAGCTTTGCT GGGGATTTAT CGCAATGGGA ACCACTCAGT GCCATTTCAA TGGTGCAAAT GTTTCTCGGT GCTAGCTCAT TCAACAGCGA TATTTCGAGA TGGGACGTAT CGGCAGTCGA ATTATTCTCG AGTATGTTCA ACGAAGCGAT TTCCTTCAAC CAAGACATAT CTGGATTCGA TTTGTCGAGT GCGACCAATT TGGACCGTAT GATGTTCATG GCAGAGTCCT TTAGCCAAGA TGTATGCAAC TGGGGTTCTA CGCTTGACCC ATTTTTGGCT CCGTTCGAAG TTTTTCAAGG CACGGATTGC CCAAACGTCT CGGATCCCAG CCTTGATAAC ACCCCTCCAG GTCCCCTTTG CTTTCAATGT TCGTAG
|
Protein sequence | MAQFKVTASI LLSALLLQGS QAKFFSEDSE VLTVNRVPKI SKSGNKSQPL REPESVPSSS FSGGFITAEK EVSTAFRDRT STAFGRVSWG NNTLKRSKRT KSAKGSDPGS SDSTPGSEIL TVIRQRQKRG GKGSSKDDNA LRTIEPKESP TPSPVDLSTF SPGMIDETPQ PSPEETFAPT LTGTTPVPSI IQTNPGVLAT PFPTDEDALP TPFPTFFPTA NENPFPTIPQ STFLPTPTPL CFESSLELQF AVDEYLLDSS PDTEVAFFYG HPMEEWCVSS IVDFSNLFSA FRNSDTSTFN EPLNGWDMSS AETLENMFAG AESFDQPLFD WDTSNVSTMT RAFSGAESFN SDIRAWDTSN VLDMQAMFAG AISFNGNIAS WDIRNVENLS FMFAEATSFA GDLSQWEPLS AISMVQMFLG ASSFNSDISR WDVSAVELFS SMFNEAISFN QDISGFDLSS ATNLDRMMFM AESFSQDVCN WGSTLDPFLA PFEVFQGTDC PNVSDPSLDN TPPGPLCFQC S
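The protein sequence can be checked against the gene sequence by translating it codon by codon. The Translation table field in this record is empty, so the sketch below assumes the standard genetic code (NCBI translation table 1). `translate` is a hypothetical helper written for this example, and only the first three 10-base blocks and the last 15 bases of the gene are used as input.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Codons are enumerated in TCAG order, matching the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks of the record can be pasted as-is.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGGCGCAAT TTAAAGTTAC AGCTTCCATT"))  # MAQFKVTASI
# Last 15 bases of the gene sequence (in frame, since 1566 is divisible by 3).
print(translate("TTTCAATGT TCGTAG"))  # FQCS
```

The two outputs match the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to the same function should reproduce all 521 residues under the same assumption.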