Gene Information |
Locus tag | PHATRDRAFT_29217 |
Symbol | |
ID | 7203235 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 662451 |
End bp | 664684 |
Gene Length | 2234 bp |
Protein Length | 406 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182273 |
Protein GI | 219123940 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCGAGGTTC CCATTGCGTG TGTTTCGAGA CCATTCATTT GCCAATCTTT CCACCTTGTC ACAGTCCTAC CTCCATTCCT TTTGGAAGTA GTCGCCCGAA AAGATTGAGA CAGAACGCGT CGGACAAAGC AGTCAAACAT GGTGGACGAG AAAGAACTGA GCGATGCCGA ATCCGTAGGC AGTGAAGCCG AAGAAGAGGC GGAAGAAGTT ACGGATCTTT CCAGTAGGTA TGTGCGACGA ATGGTGAAGC GACAGGGTGC AAGGCAACGT CGAAGTGCAG ATTACAGGGT CGGAAAGTCT TGTATGCGGA AGAGCTTTAC TCACGCTTGG ATTTTTCTTT GGCTTTCTCC GATTGCAAAC TCACCGTTTA CAGCGACGTT TGTACCAAAT ACCAAGAAGC CGCCAAGATT GTCAATCTCG CCTTACAAGG CCTGGTCTCG CAGTGCGTTC CTGGTGGGAC CATACTCGAT ATTTGCGAGT TTGGGCAAAC AATCATTACC ACGCAGTCAG CTAAACTCTA TACCAAAAAG GTCAACGGAC AAGTCGTGGA TCGGGGAGTG GCGTTCCCCG TTTGCATTTC CGTCAACGAC ATTGTCTGCA ATCATTCGCC CTTGCCAAAC GAAGAACGGG TAAGTGCCGT CCCATGTGGT ATAGAGAAAA ATGGCGTACC TGCCGTACAA AATAACCTGT GGTGGTCGAC ACAAGTATGC CACGATTGCT TGCCGACCGT ATCGCTCAAA ACGATACCGT TCATTTCCTC CCCACTTTCT ACGGAAAGGA GGAGGAAGGG CGGGCTTTGA CTGGACTGCA GACCCGTAGT AGAGAACTGC ACAGGAAGAC GAATGGACGA GGATTGTGTG CTTTGGTCGT ATCACTCGTG TACTGCTGCG CGTCTGCTTG TCACCGCGCG CACCGAGTCG GTCCCGAAAC GCCTCTCTCT CGAGCCGAAT GTGACCGATC GCTGCCGCGT AGACCCTCTG TTCCCATGGT CACAGGACGG TGGCGTGATG GGTCGATGCA TTCCACAAAT CCGCTCTCCC CCATTAACGA TTACGTGCCC GAGTACCCCC ATCCATCCCA CACACACACT TTGTGTCTCA CACGCTGATT TCTTCTTTTG CTGTTTCCTC TATTGTACAT ACACACCCAC GGAAACCGGC CGCAGCCTGC ACTCAAAGCC GGAGACATTG TCAAGATGGA TCTTGGGTGC CACATTGACG GTTACATTGC GGTAGCCGCA CACACGTGTG TTGTCCCGGA ATCCCCCGAT ACCCCACCCA CGCTGGAAGA CGCCCAAGTC ACGGGCAACG TGGCTGTAGC CGCGTACAAC GCCATGCTAG TGGCCGCCGC TACCATTGCG GCCGGAAAGA AGAATACGGA CGTCACCAAG GCCGTCGAAC GGGTCGCTCA AGCCTACGGT GTCACCCCTA TTAGCTCGGT CCGTATGCAC CAAATGAAAC GTTACGTCCT CGACGGTGTT AAGGAAGTCG CGCTCAAGGA ACCCACGGCG GAAGAAATTG CCACTGAAGA ACGCTTGCCC GAATGTACCT TTGAACAGAA CGAAGTCTAC GCTGTCGACG TGGCCATGAG CACCGGCGAC GGCAACGCTC GTCCGGGAGA TTTGCGCACT ACCGTTTTCA AACGCAACGT AGAGCATCAG TATTCCCTCA AAGTGCAGGC CTCGCGTCAG CTTTTGGCCG AAGTGGACAG CAAATTTCCC ACCATGCCCT TTACACTCCG ACACTTGTCC GACGTTCGTA AGGCGCGATT GGGGATCCCT GAGTGCGTCT CGCACGGGTT 
GTTGACACCC TACCCGTCCC TGCACGATCA TTCTGGAACC GTGGCGCATT TCAAATGCAC CGTCCTATTG TTGCCTTCGG GAACGATCCG GGTGACAGGT CTTGAAAAAC CGGAATACTT CCAAACGTCC ACGGCACCGG ATGAGGAGAC CGTCAAGGTT TTGCAACAAC TCGAGGAAGA AGCGGCTAAA AAAGCGGCCC GGAAGGCGGC CAAGAAAAAC AAGAAAAAGA GCAAGAAATA AAGCGCGCGA GTCTGACAAA GTCCCCTTTG ATATGGCTCG GAAGGTAAGG CGACACGCGA CGGAATGTAG AGGCGAGTGG AAGGCATCCA AAGCTAACTT TCGTCAACGG CTTATCTGGC GACGGAAAGA AAGTAAATAA AGAAGCTGCA TCCAACGAAT TATGTTTTAT TAGCGAAATA CGCTTGCGTT GTCG
|
Protein sequence | MVDEKELSDA ESVGSEAEEE AEEVTDLSSS DVCTKYQEAA KIVNLALQGL VSQCVPGGTI LDICEFGQTI ITTQSAKLYT KKVNGQVVDR GVAFPVCISV NDIVCNHSPL PNEERPALKA GDIVKMDLGC HIDGYIAVAA HTCVVPESPD TPPTLEDAQV TGNVAVAAYN AMLVAAATIA AGKKNTDVTK AVERVAQAYG VTPISSVRMH QMKRYVLDGV KEVALKEPTA EEIATEERLP ECTFEQNEVY AVDVAMSTGD GNARPGDLRT TVFKRNVEHQ YSLKVQASRQ LLAEVDSKFP TMPFTLRHLS DVRKARLGIP ECVSHGLLTP YPSLHDHSGT VAHFKCTVLL LPSGTIRVTG LEKPEYFQTS TAPDEETVKV LQQLEEEAAK KAARKAAKKN KKKSKK
|