Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50277 |
Symbol | |
ID | 7199037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 168391 |
End bp | 170203 |
Gene Length | 1813 bp |
Protein Length | 542 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185140 |
Protein GI | 219129952 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGCGCCAAGA AAGAACAAAA ATCACTCCGA AGCGAATATC CTTCTCTTCT TCTTTTCACT TGTTTCGTTG CTAACAGCGC GAACCAAAAT AAAATAATGG CAGATAAAGA CCGGTACATA GTAGCTCAGG AGGAGACAGG ATATGGCTAT TTCGATGCCG AAGGTGAGAG GGAACTTTTC CAATCAAAGT CTCAAGACGA CTTTCCTTTG ACATGGCGAG GCGATCCTGC CGAGAACTGT GCAGATTGGA CAGTTGTCGT GGTTACAAAT GAGCTTAAGG CAGCATCCTA TCATGTGCAT AAAAACATCA TGTGTTTTGG ATCGAGAGCA AGCAAATTCT TCTCAAGGAC TATGTTAAAT AAGACTCCAA GCAAGAAACG AAGCAAACGC CAGCAATTGT CGACCACAAA AGTTGAACTC AATCAGCAAG ATGCAGACAA TTTCCCTCTG TTGCTAGACT TCATTTATGC ACCATGCGGG AAAATGACGT CCGGTGGGAC TGTCTTAACG GCGGCTTCGA CATTGACTTC CCCGTCGTTG TTGCCTGTTG TGAATGAAGA TGACGGTTCT TCTTCATACA CGGTTGAAAA CATTTCAACA AACAATGCAG TATCCTTACG TCATTTAGCA AAAAAATTTG AGATAGAGGC TCTGGTCCTT GCAGTGAACA AGTTTATACA GCGAGACTTA AATTTCAAAA CTGGACCAGC TTACCTCTGC GCGGCTTCTG AATACAAAGA TGAGCGTCTT CTGGAGTCAT CACGACGTCT CTGTGCAGAG AACTTTGAAC AAATTGACAT GAGAGAATTA ATCAGACTTC CTTTGTGCCT TTTCCGCGTT GTTGTTCGGT GTCTGGAGTC TTTCGAAAGG GACAACAAAA GTTTGAGCTG CTTCCTTAGC GAGGTCGTAT GCCGTTATTT GGAAAAACAT CAAAAAGAAA TAAATGGTGA GCTACTTTTG GAACTCACTG ATTCGCTCGT CATGCCATAT ATTGCACCCG AGGCAGCTAT CGGTTTCACA TCATTGATCA AAGAGCTAAA AACTGAAAAT GCTGAAAAGC ATTTTCATGC GTTAAGCAAT CTTAGTCGCC GATGCGCGCA ATCTGTTGTT AAAGAATACG GCTGGACCGA CTTCTCGGTA AGCGCAGCTG TTGACGAGTA CTTGGGTCAC TCTAAAGACT TCAAATTTGA TGATCACGAG GGATGGCGCG TTGATAGTCT TCTATTCGCG ACCAGTTTTG CAGCCGCATT GGAGCAGGCG CAAGATGACT ACGAAGAAGT ATTGGTGGAG CAGGAGCGAC TTTCATGCAT GGTTGGCGCT CTTCACCAGA CAGTGAGCAT GATGGAGGTT TTAAATTCGA AGAAGGACGA ACATATGACG AAACAGCAAC GGGCTATTGA TGAAGCGAAG AAGCAGATCT TGGGTCTCAA GGACCAAATC AGTATGATCA GTCAGCAACA ACTACAGCGA ACTTCGATTT ACCCTGCCCT TTACGAGCAA ACGGATACCG AAATTCAAAT GGGTTCGTCA TTGGACTTTC GTCACGAGAA GGAGCATCGG GATGCACCGC CGCCGTTGCT TTCGTATAGC GAGGAAACAT GGTCTCCGGT TAAGGATTTG GTATCGCCGT GTCAAGTTGG ATTGGACGTT CAACAAAATA AGAGCAGGAG AAAGCAAGAG TTGAGAACGA AGGCTGAAAT GAGATCGCGA AGTATGCTTG TATAGCTTCT ACGGATCGAC CAGACAAAAA TATTAGACGC GGAAAGAATT TTAGGCGGTG CCATACAGCA GCTATAAACG CAAACTTGAT TTC
|
Protein sequence | MADKDRYIVA QEETGYGYFD AEGERELFQS KSQDDFPLTW RGDPAENCAD WTVVVVTNEL KAASYHVHKN IMCFGSRASK FFSRTMLNKT PSKKRSKRQQ LSTTKVELNQ QDADNFPLLL DFIYAPCGKM TSGGTVLTAA STLTSPSLLP VVNEDDGSSS YTVENISTNN AVSLRHLAKK FEIEALVLAV NKFIQRDLNF KTGPAYLCAA SEYKDERLLE SSRRLCAENF EQIDMRELIR LPLCLFRVVV RCLESFERDN KSLSCFLSEV VCRYLEKHQK EINGELLLEL TDSLVMPYIA PEAAIGFTSL IKELKTENAE KHFHALSNLS RRCAQSVVKE YGWTDFSVSA AVDEYLGHSK DFKFDDHEGW RVDSLLFATS FAAALEQAQD DYEEVLVEQE RLSCMVGALH QTVSMMEVLN SKKDEHMTKQ QRAIDEAKKQ ILGLKDQISM ISQQQLQRTS IYPALYEQTD TEIQMGSSLD FRHEKEHRDA PPPLLSYSEE TWSPVKDLVS PCQVGLDVQQ NKSRRKQELR TKAEMRSRSM LV
|
| |