Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50365 |
Symbol | |
ID | 7199145 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | + |
Start bp | 90710 |
End bp | 92974 |
Gene Length | 2265 bp |
Protein Length | 707 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185281 |
Protein GI | 219130248 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.899784 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCTTG CTGCTGCTGC ACTCTCAATT GCATCCCCAG GCGGTAAGAA AGCTCCCGTT TCATTTCGAA CCGCAATCGG TCGCAAATCA CTAGAACGCG ATGCGCTGGA TGCGATCATG ATGATGAAAG GCTGCGAAAA ATCTATTTCC CCACCGTCGT TTACTAGTTC ATCTACGATT GTCGGCACTT CCCCGGTATT GGGTGATTCA ACACCACATA AACGCTCGTT TCAGGCTTTG GATGTGTTAG TGAACGCGGT TGGTTTTCAT CGAGAACCAA ATGACCAAAT GAAGCAGACG AAGATGTGTC CATCCCAACT TGCAAAACGT CGCGCTCGTC GTAAGAAACT ACGGGAAGAG CGCAAATCTC GCCTAGGTCC CTGGACAAAA AAAGAAGACA GAGCTTTAGA CAACGCAGTG AAAAGCGTTA ACAAGCTTCC TTCATTTTCG TGGACGCAAG TGTCGAAACT TGTACCTCAT AGAGACAATG AGCAGTGTCG TAATCGGTGG CGCTACCACG TTAATCCGGA ACTATGCAAT GATCCCTTTA CTCCAGAGGA ATGGCGATTT TTCTTGGAAT ACCAGCGTCA GTACGGAAAT CGGTGGACAG AATTGGGTCA GAAGATGAAA GGACGGTAAG TGTATAATTC AAACATATAC GAGAATGCCT ATGCAAAGAC TAACGCTATT GTTTCTCCTC GTTCAGTTCA CGAACGTTCA TCGGAAACCA GTGGCAGGCT GCGAAACGGA CAATCATAAG TGCCCTCAAA GCGCGACGCA AGGCAAACAT GCCAATTGTA GACGAAGATG GTTCTGTTGA CTATCAGTCC GACTTCGATT TTTGCCTCCA GTCGGTCATT GAGCGATACA GCAAAGACGG TTCTCCAAAG AAAAACTTCT CCGACAACGA GGACAACAGT GACGGGGACA CGTCTACCAC AGAATCTCTT TCGGATACAG GAGAAAGAGA AGTCTCATTA AAAGTCAAGA CGAAGCGATC TGGCGCAGAA ACTGTGATCA AACAAGGCTG GCGAAAGTCT GGGAGTTCAG AGTATACTGC AACAAGCGAT AAAACAGGAG CTCAAATTCA AGAATCTTCC GCTTGGACAA AAGTAGAAGA TTCATACTTG GCACAAATGG CACCTGGAGT CGGTGATAAT CCAGCTAAAT GGTCCGACAT AGCGCGACGT TTCTCACGTC GCAGTGGAAG AGAGTGCCGC GATCGTTGGA TGCAACATCT CTCGGATTTA TCTGTAGATA GGATGTCTTC GACTGTACCA TTTGAAATGC GCCCGTACAC TGCCGCTACG CGACAGGGTA TGGCACCCAA AGAGAATGGA TTTCACAGTC AAGACTACAA ATGGGATGGC GAGACGACGT CGAATCGTTG GGCTAGTCTC AAGCGCCTTG TTGTTGGTCA ACCAGTATCC AACAAACGAA TAAAACTGGA TCTCCCAAAG GGGAATTGGT CAGCCCCCCT TCCGTACCAA CTAAAAAACG CACAAACATC TTACGTCCCC ATGGCCCCTC GTCAGCATCG CGACAACGTT TGGCTCCCTG AGTCAAAGCC GGATTGCGTA ATTAACAATT CAGAAGTGTC CAGGACTAGT AAGCAGGGTA GCACATTCCG TGCGGCTCTC GATACAAGAA CTGTCAAGGT GCTGGAGTCG GAGCCGGAGC CCCGCCATTC AGAAATTGAT TCCATGATTT ATAACCGGCG CTTACCGAAA CCTTGGACAT CGGACGAAGA CGAAGTACTT CGTGAAAGTA TGAATGTCTT CAAAGATGAG GAGCATTGCA TGCAAAAAGT TGCTGAACGC CTCCCTAGTC GTTGTCCAAA GCAATGCCAA CAAAGATGGG TTTGTCATTT ACAGCCTGGA TTAAATAAAG ACCCACTGAG CAAAGAAGAA ATCCGTACGC TTTTGGAATG GCAACGCAAA TTGGGTAACA AATGGACTTT AATAGGTCAA GCCCTTGACA ATAGGTAAGG GTGGCTATAT TTGTGGTGCA TGAATGCTTA ACGCTGTTGG CTAACATCAT AAATATATAC CCAGACCGAA AAATTTTATT GCCAATCGCT GGTATCATCG AATCCGACCT TCACTTTTTC GATACATTAC AGAGAAACAC GGAATTTCTC ATTCTCAGAT TCAGAATAAA GATGGATCAA TTGATTATCA TGGTGATTTT GAGGGTGCTG TAGAATTTGT CGTCAAAGAC TTGCTCTTTG GAACTCGAAT CGGGAGTCAC CCAGGAAAGC TATAA
|
Protein sequence | MMLAAAALSI ASPGGKKAPV SFRTAIGRKS LERDALDAIM MMKGCEKSIS PPSFTSSSTI VGTSPVLGDS TPHKRSFQAL DVLVNAVGFH REPNDQMKQT KMCPSQLAKR RARRKKLREE RKSRLGPWTK KEDRALDNAV KSVNKLPSFS WTQVSKLVPH RDNEQCRNRW RYHVNPELCN DPFTPEEWRF FLEYQRQYGN RWTELGQKMK GRSRTFIGNQ WQAAKRTIIS ALKARRKANM PIVDEDGSVD YQSDFDFCLQ SVIERYSKDG SPKKNFSDNE DNSDGDTSTT ESLSDTGERE VSLKVKTKRS GAETVIKQGW RKSGSSEYTA TSDKTGAQIQ ESSAWTKVED SYLAQMAPGV GDNPAKWSDI ARRFSRRSGR ECRDRWMQHL SDLSVDRMSS TVPFEMRPYT AATRQGMAPK ENGFHSQDYK WDGETTSNRW ASLKRLVVGQ PVSNKRIKLD LPKGNWSAPL PYQLKNAQTS YVPMAPRQHR DNVWLPESKP DCVINNSEVS RTSKQGSTFR AALDTRTVKV LESEPEPRHS EIDSMIYNRR LPKPWTSDED EVLRESMNVF KDEEHCMQKV AERLPSRCPK QCQQRWVCHL QPGLNKDPLS KEEIRTLLEW QRKLGNKWTL IGQALDNRPK NFIANRWYHR IRPSLFRYIT EKHGISHSQI QNKDGSIDYH GDFEGAVEFV VKDLLFGTRI GSHPGKL
|
| |