Gene Information |
Locus tag | PHATRDRAFT_24547 |
Symbol | |
ID | 7196343 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 842611 |
End bp | 844295 |
Gene Length | 1685 bp |
Protein Length | 506 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177173 |
Protein GI | 219110843 |
COG category | |
COG ID | |
TIGRFAM ID | |
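
The Gene Length value above follows from the Start bp and End bp coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed numbers). The helper names `gene_length` and `min_cds_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_cds_length(protein_aa: int) -> int:
    """Coding bases needed for a protein: 3 per residue plus one stop codon."""
    return 3 * (protein_aa + 1)


length = gene_length(842611, 844295)
coding = min_cds_length(506)

print(length)            # 1685
print(coding)            # 1521
print(coding <= length)  # True
```

The strand does not affect the length calculation. The 164 bases not accounted for by the coding requirement are non-coding sequence; the record alone does not say how they are divided between introns and untranslated regions.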
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.9636 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTAAATCCTC GACGACGGAT GGGCAGGCGA CGATGGAAGC GTGTTGCGCA TGGGATTTGC ACATTTGGTT ATATCAAGAC AACAACTTTG ATGAACTTTT GACGGAACGA GTCGTAATTC CGGCCGCTTA CCTTTCTATG CAACAAGCGA ATCAACTGCT GCAAGATATG GAAGAGAACG AAGTTGTATT GGTCACACTA TACACACGTT GGCGGCCCCA GTACAATCCC GCGATCTTGC TAATTTGGGC TCTGGGAGTA TCAGTCGCGG CATTAGCTGC CTATCTTTCC GCCGGCGACT ACCATGACTA TATTCGTAGA GTGCTGCGCC GGCAAGAACG TCACCGGCAA GGTATAGATA CAACTACTTC CTCAAGAACG AAAGTGAACG ATGGCGTTGA ACGCTCGGCG TCGTCGGCCA GGGCTCCTCC TGAAGATATG GAATTAACTG CCGCCCACGC ACTTGGCTTC ATCATTATGG CCAGCTCGAG TTTATTGGTT TTGTTCTATT TCAAGATCTA TGGCATTGTC AAAGTTTTTT ACTCGATGGG TTGCAGTAAG GCTGTCAGTC AGGTAGTTGT GGATCCTTTT TTGAAACGGC TAATGAAGAA ATTTCGAGTT CGCAATCAAA TCATTTGGCG AACAAATACA GAAGATTTTG GTGATATATC ATTGCGTGAC ATTATGGCGC ACGTCATTGG ATTCACTTTG GGTCTCTCTT GGTTGATCAT CGCTTTTGTG GCGCGGGACC CGGGATCAAT TACTTTTTTC TGGATCATGC AAGATATTTT TGGTACCTGC ATGTGTGTCA TGTTCCTGCA GGTTATCAAA CTCAATAGCA TTCGAGTCGC GGCAATTTTG CTGGTTGTAG CTTTTTTCTA CGATATCTTT TTTGTCTTCG TGACGCCGTT GCTGTTTCAA GGCAAGTCTG TCATGATTAC AGTGGCGACA AGTGGTGGAC CACCGACGGC GGATCCTCTA TGGTGTGAAA AATACCCCAA CGACGCCAAC TGCCAGGGAG GAAATCCGTT ACCCATGCTG TTAACTATCC CTCGCCTGTT TGATTTCGAA GGTGGTTCAA GTCTGCTTGG ATTAGGAGAT ATCGTCTTGC CTGGTTTGTT ACTGAGTTTC GCCGCCCGTT TTGATGCCGC TAAGCGAATG ATGGGCGTTA TGGGCGGCGG TAGTGGTAGC TTGACATCTT ATCATTGCCA AGAACGGCGC TACTGCTGCA GCTGTGGATT GTGCAGTGGG GGATACTTTC CTCCCATGGT GGCAGCCTAT GCGGTAGGTC TACTCATGGC AAACATGGCT GTACAAATCA TGCATATGGG GCAGCCTGCG TTGCTGTATT TAGTTCCTTG CTGTTTGGGA ACCATGGTAT ACATGGGATG GCGCAGGAAC GAACTATCAG AACTCTGGGA TATTTCCAAA GTTATACGGT CCGCCGATAA TACTTTGTAC GGGGATTACT ATTCGTCAGG GCCATCAACA ATGGCAACTT CCCACGATCG TCACGCTCCA TTACCGCAAG ATGACGACGA ACCTGGTATT GCTGTACAGA CTGTGCCATC CGCATTGGAC GATACGGGCA GTGCTCCATT TCTTCCCGAA AACGATCCTT GAGTGACGTG AAGCAATAAT TGTGCGTAAA ATAAAAAAAT CGATTAAAAA TGTTTGCGTT ATTCT
Protein sequence | MEACCAWDLH IWLYQDNNFD ELLTERVVIP AAYLSMQQAN QLLQDMEENE VVLVTLYTRW RPQYNPAILL IWALGVSVAA LAAYLSAGDY HDYIRRVLRR QERHRQGIDT TTSSRTKVND GVERSASSAR APPEDMELTA AHALGFIIMA SSSLLVLFYF KIYGIVKVFY SMGCSKAVSQ VVVDPFLKRL MKKFRVRNQI IWRTNTEDFG DISLRDIMAH VIGFTLGLSW LIIAFVARDP GSITFFWIMQ DIFGTCMCVM FLQVIKLNSI RVAAILLVVA FFYDIFFVFV TPLLFQGKSV MITVATRNPL PMLLTIPRLF DFEGGSSLLG LGDIVLPGLL LSFAARFDAA KRMMGVMGGG SGSLTSYHCQ ERRYCCSCGL CSGGYFPPMV AAYAVGLLMA NMAVQIMHMG QPALLYLVPC CLGTMVYMGW RRNELSELWD ISKVIRSADN TLYGDYYSSG PSTMATSHDR HAPLPQDDDE PGIAVQTVPS ALDDTGSAPF LPENDP
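
A GC content figure like the one listed under Gene Information can be computed from a sequence printed in space-separated blocks of ten. The sketch below is one straightforward way to do it, assuming GC content means the percentage of G and C among all bases. How the database rounds the value or treats ambiguous bases is not stated in this record. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence shown above, used as a small example.
first_blocks = "GTAAATCCTC GACGACGGAT"
print(gc_percent(first_blocks))  # 50.0
```

Passing the full Gene sequence string to the same function gives the whole-gene value, which can then be compared with the percentage in the record.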